47% OFFYearly Pro
$30/mo$16/mobilled yearlyGet Pro
Topic

Vector Databases

Explore our latest articles and insights about Vector Databases.

Explore posts

10 posts in total

LLM Engineering

Retriever k-value tuning for RAG: the right top-k

How to pick the right k value for your RAG retriever. The 3-step tuning process, the failure modes of k=3 and k=20, and the sweet spot in between.

RAGVector Databases+3
Read post
8 min
LLM Engineering

Combining vector stores in RAG: multi-source retrieval

How to combine multiple vector stores in one RAG pipeline. The merge pattern, the deduplication rule, and when multi-source beats a single index.

RAGVector Databases+3
Read post
8 min
LLM Engineering

FAISS vector stores in production RAG

How to use FAISS for production RAG. Index types, persistence, memory trade-offs, and the 4 settings that decide if FAISS beats a managed vector DB.

RAGVector Databases+3
Read post
8 min
LLM Engineering

Choosing an embedding model for RAG

How to pick an embedding model for production RAG. The 5 criteria that matter, the benchmarks that lie, and the migration cost nobody warns you about.

RAGEmbeddings+3
Read post
11 min
LLM Engineering

Hybrid retrieval in RAG: vector + graph search

How hybrid retrieval combines vector search and graph traversal in RAG. The when, the why, and the 60-line fusion that beats either alone.

RAGVector Databases+3
Read post
11 min
AI Engineering in Practice

The 'brain' of RAG: a guide to embeddings & vector databases

Understand how embeddings and vector databases work under the hood. Learn how computers translate text meaning into numbers and search millions of docum...

RAGVector Databases+1
Read post
10 min
AI Engineering

How to choose your vector database

Compare vector databases for RAG systems. Learn when to use Chroma, Qdrant, Weaviate, pgvector, Pinecone, and Vespa based on performance, scale, and dev...

RAGVector Databases+1
Read post
13 min
AI Engineering

Vector databases and embeddings: the brain of RAG

Learn how embeddings and vector databases power RAG systems. Understand semantic search, cosine similarity, metadata filtering, and choose between open-...

RAGVector Databases+1
Read post
8 min
AI Engineering

Splitting techniques for RAG: the art of the right chunk

Master document chunking for RAG systems. Learn fixed-size, recursive, semantic, and content-aware splitting techniques to improve retrieval quality and...

RAGVector Databases+1
Read post
7 min
AI Engineering

Retrieval-augmented generation (RAG): giving LLMs an open book

How retrieval works in production. Learn how RAG solves LLM limitations by connecting models to external documents. Master chunking, embeddings, vector dat

RAGVector Databases+1
Read post
9 min

Weekly Bytes of AI

Technical deep-dives for engineers building production AI systems.

Architecture patterns, system design, cost optimization, and real-world case studies. No fluff, just engineering insights.

Unsubscribe anytime. We respect your inbox.