Topic

Data Processing

Explore our latest articles and insights about Data Processing.

13 posts in total

LLM Engineering

Query anonymization for RAG bias mitigation

How to strip names, roles, and demographics from queries before retrieval to reduce RAG bias. The redaction pipeline and the 3 leakage traps to avoid.

RAG, Guardrails, +3 more · 9 min read

LLM Engineering

Testing and evaluating RAG pipelines end to end

How to test a RAG pipeline like real software. Unit, integration, and eval tests that catch regressions before they ship. The 3-layer test strategy.

RAG, Evaluation, +3 more · 8 min read

LLM Engineering

Query rewriting in RAG with LLMs: the rewrite loop

How LLM-powered query rewriting fixes vague user questions before retrieval. The prompt, the multi-query fan-out, and when rewriting hurts more than it helps.

RAG, LLM, +3 more · 8 min read

LLM Engineering

LLM-based content filtering for RAG pipelines

How to filter irrelevant retrieved chunks with a cheap LLM call before the final answer. The prompt, the batch pattern, and the 40 percent noise reduction.

RAG, LLM, +3 more · 8 min read

LLM Engineering

Retriever k-value tuning for RAG: the right top-k

How to pick the right k value for your RAG retriever. The 3-step tuning process, the failure modes of k=3 and k=20, and the sweet spot in between.

RAG, Vector Databases, +3 more · 8 min read

LLM Engineering

Combining vector stores in RAG: multi-source retrieval

How to combine multiple vector stores in one RAG pipeline. The merge pattern, the deduplication rule, and when multi-source beats a single index.

RAG, Vector Databases, +3 more · 8 min read

LLM Engineering

FAISS vector stores in production RAG

How to use FAISS for production RAG. Index types, persistence, memory trade-offs, and the 4 settings that decide if FAISS beats a managed vector DB.

RAG, Vector Databases, +3 more · 8 min read

LLM Engineering

Automated evaluation pipelines for agentic AI systems

How to wire eval pipelines into CI so every agent change is scored automatically. The nightly job, the regression gate, and the dashboard that matters.

Evaluation, AI Agents, +3 more · 8 min read

LLM Engineering

Dynamic evaluation metric loading in Python

How to load evaluation metrics dynamically in a Python eval pipeline. The registry pattern, entry points, and the test override that makes CI fast.

Evaluation, Python, +3 more · 8 min read

LLM Engineering

LLM judges: enforcing reasoning with explicit rationales

Why LLM judges without explicit reasoning drift, and how chain-of-thought rationales make their scores defensible. The prompt, the parser, the trust.

Evaluation, LLM, +3 more · 9 min read

LLM Engineering

LLM-as-a-judge: production evaluation framework for agents

How to build an LLM-as-a-judge evaluation framework for agentic AI. The prompt, the rubric, the bias controls, and the loop that catches regressions.

Evaluation, LLM, +3 more · 9 min read

LLM Engineering

RecursiveCharacterTextSplitter: a deep dive for RAG

How RecursiveCharacterTextSplitter works, why it beats naive chunking, and the separator order that makes or breaks retrieval quality.

RAG, LLM, +3 more · 11 min read
