47% OFFYearly Pro
$30/mo$16/mobilled yearlyGet Pro
Topic

AI Agents

Explore our latest articles and insights about AI Agents.

Explore posts

86 posts in total

AI Engineering in Practice

pip vs uv vs poetry for Python AI services

Which Python dependency manager should you use for production agent services in 2026? The install speed, lockfile story, and Docker build times compared.

PythonAI Agents+3
Read post
9 min
AI Engineering in Practice

Retry patterns for LLM API errors in production

How to build retry logic that handles rate limits, timeouts, and transient failures without burning money. The backoff rules and the 3 errors you must not retry.

AI AgentsError Handling+3
Read post
8 min
LLM Engineering

Pydantic output structuring for RAG agent plans

How to use Pydantic models to force your RAG planner LLM to return structured steps. The schema, the retry loop, and why plain JSON prompts break in production.

RAGPydantic+3
Read post
8 min
LLM Engineering

Query rewriting in RAG with LLMs: the rewrite loop

How LLM-powered query rewriting fixes vague user questions before retrieval. The prompt, the multi-query fan-out, and when rewriting hurts more than helps.

RAGLLM+3
Read post
8 min
AI Engineering in Practice

Real-time agent debugging with Langfuse traces

How to debug a live agent incident using Langfuse traces. The search patterns, the 5-minute workflow, and the post-mortem that catches the root cause.

ObservabilityAI Agents+3
Read post
8 min
AI Engineering in Practice

Agent cost optimization from trace data

How to use Langfuse trace data to find where your agent burns tokens. The 4 queries, the cost-per-user view, and the 50 percent savings patterns.

ObservabilityAI Agents+3
Read post
9 min
AI Engineering in Practice

Langfuse + Grafana: agentic AI monitoring

How to combine Langfuse traces with Grafana dashboards for agent monitoring. The integration, the panels, and the alerting that catches real problems.

ObservabilityAI Agents+3
Read post
8 min
AI Engineering in Practice

Prometheus performance analysis for agentic AI systems

How Prometheus metrics surface performance bottlenecks in agentic AI. The 3 queries, the alert rules, and the dashboard that finds hot loops fast.

ObservabilityAI Agents+3
Read post
8 min
AI Engineering in Practice

Stress testing agentic AI systems beyond the laptop

How to stress test an agentic AI service before it ships. Concurrency, tokens, latency budgets, and the load profile that simulates real traffic.

AI AgentsProduction AI+3
Read post
8 min
LLM Engineering

Automated evaluation pipelines for agentic AI systems

How to wire eval pipelines into CI so every agent change is scored automatically. The nightly job, the regression gate, and the dashboard that matters.

EvaluationAI Agents+3
Read post
8 min
LLM Engineering

Dynamic evaluation metric loading in Python

How to load evaluation metrics dynamically in a Python eval pipeline. The registry pattern, entry points, and the test override that makes CI fast.

EvaluationPython+3
Read post
8 min
LLM Engineering

LLM judges: enforcing reasoning with explicit rationales

Why LLM judges without explicit reasoning drift, and how chain-of-thought rationales make their scores defensible. The prompt, the parser, the trust.

EvaluationLLM+3
Read post
9 min

Weekly Bytes of AI

Technical deep-dives for engineers building production AI systems.

Architecture patterns, system design, cost optimization, and real-world case studies. No fluff, just engineering insights.

Unsubscribe anytime. We respect your inbox.