Circuit breakers for LLM calls: stop cascading failures
How circuit breakers prevent LLM outages from cascading through your agent. The 3 states, the failure window, and the 50-line implementation.
Loading...
Expert articles on building production AI systems, covering system design, architecture, observability, and real-world implementations.
57 posts in total
How circuit breakers prevent LLM outages from cascading through your agent. The 3 states, the failure window, and the 50-line implementation.
How to manage context windows in production AI agents. The 4 strategies that keep long sessions bounded without losing critical context.
How coding agents remember context across sessions. The memory store, the recall pattern, and the 3 kinds of memory every agent should keep.
How to design custom tools for coding agents that are not read, edit, or bash. The naming rules, the schema patterns, and the 3 custom tools that pay off.
Why ripgrep is the right code search tool for coding agents. The speed, the flags, and the wrapper that turns rg into a safe agent tool.
How to design an edit tool a coding agent can use without wrecking files. The string-match rule, the dry-run pattern, and the 50-line safe edit function.
How to structure tool integrations so an agent stays clean as it grows. The registry pattern, schema-first design, and the 80-line dispatcher that scales.
How coding agents run shell commands safely. The bash tool design, the denylist trap, and the sandbox that keeps rm -rf away from your home directory.
How to build a coding agent with Claude in 100 lines of Python. Tool use, file editing, bash, and the loop that ties them together. No frameworks.
How a coding agent loop actually works. The minimal while loop that turns a chatbot into Claude Code, and the 4 states every agent cycles through.
Master end-to-end system design for production AI chatbots. Learn streaming architecture, memory management, RAG integration, rate limiting, circuit bre...
Learn how to architect a production-ready local financial AI agent that keeps data private and guarantees mathematical accuracy. Deep dive into hybrid a...