Choosing the LLM judge for evaluation pipelines
How to pick the LLM that grades your LLM. The cost-quality tradeoffs, the calibration check, and why a weaker judge is sometimes the right call.
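The calibration check the teaser refers to is conventionally done by scoring a small human-labeled set with each candidate judge and comparing chance-corrected agreement. A minimal sketch, assuming the article's body (not included here) follows that pattern; the function names and label lists are illustrative, and the judge's verdicts would in practice come from a model call:

```python
from collections import Counter

def agreement(judge_labels, human_labels):
    """Fraction of items where the judge's verdict matches the human label."""
    matches = sum(j == h for j, h in zip(judge_labels, human_labels))
    return matches / len(human_labels)

def cohens_kappa(judge_labels, human_labels):
    """Chance-corrected agreement: more honest than raw accuracy
    when one label dominates the eval set."""
    n = len(human_labels)
    p_o = agreement(judge_labels, human_labels)
    jc, hc = Counter(judge_labels), Counter(human_labels)
    # Expected agreement if the judge guessed according to its own label marginals
    p_e = sum((jc[k] / n) * (hc[k] / n) for k in set(jc) | set(hc))
    if p_e == 1.0:
        return 1.0
    return (p_o - p_e) / (1 - p_e)

# Illustrative data: human verdicts vs. a candidate judge's verdicts (1 = pass, 0 = fail)
human = [1, 1, 0, 1, 0]
judge = [1, 1, 0, 0, 0]
print(agreement(judge, human))    # raw agreement
print(cohens_kappa(judge, human)) # chance-corrected agreement
```

Running each candidate judge through this check is what makes the cost-quality tradeoff concrete: a cheaper judge whose kappa against human labels is close to the expensive one's is usually the right call.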
Explore our latest articles and insights about LLMs.
28 posts in total
How to test a RAG pipeline for hallucinations systematically. Adversarial prompts, the out-of-scope set, and the metric that catches confabulation.
How to fact-check RAG answers with a second LLM pass that verifies every claim against the retrieved context. The prompt, the rejection rule, and the loop.
How LLM-powered query rewriting fixes vague user questions before retrieval. The prompt, the multi-query fan-out, and when rewriting hurts more than helps.
How to filter irrelevant retrieved chunks with a cheap LLM call before the final answer. The prompt, the batch pattern, and the 40 percent noise reduction.
How to use Langfuse trace data to find where your agent burns tokens. The 4 queries, the cost-per-user view, and the 50 percent savings patterns.
Why LLM judges without explicit reasoning drift, and how chain-of-thought rationales make their scores defensible. The prompt, the parser, the trust.
How to build an LLM-as-a-judge evaluation framework for agentic AI. The prompt, the rubric, the bias controls, and the loop that catches regressions.
How circuit breakers prevent LLM outages from cascading through your agent. The 3 states, the failure window, and the 50-line implementation.
How to survive LLM provider outages with Tenacity retries and fallback models. The retry policy, the fallback chain, and the 60-line pattern.
How to manage context windows in production AI agents. The 4 strategies that keep long sessions bounded without losing critical context.
How to add chain-of-thought reasoning to a RAG pipeline. The prompt, the parsing, and the cases where CoT beats a straight answer by a wide margin.