Do I need a GPU for the embeddings?

No. The workshop uses sentence-transformers/all-MiniLM-L6-v2, which runs well on CPU. You only need an OpenRouter or OpenAI key for the chat model.

Why Docling instead of PyPDF or pdfplumber?

Docling preserves layout, runs OCR on scanned pages, and extracts table structure. For long books and technical PDFs, that metadata is the difference between clean chunks and garbled text.

Is FAISS the right vector store for this?

For a single-book tutor that runs locally, FAISS is ideal. It is fast, has zero infrastructure, and persists to disk. For multi-tenant production systems, move to Qdrant or pgvector afterward.

Does this teach the full LangChain framework?

No. The workshop focuses on ConversationalRetrievalChain and the retrieval prompt. You will walk away with a clear mental model of how memory, retrieval, and grounding fit together.

47% OFFYearly Pro

$30/mo$16/mobilled yearlyGet Pro

47% OFFYearly Pro$30/mo$16/mobilled yearlyGet Pro

47% OFFYearly Pro

$30/mo$16/mobilled yearlyGet Pro

47% OFFYearly Pro$30/mo$16/mobilled yearlyGet Pro

Premium course

Turn any PDF book into an AI tutor that remembers the conversation

Name: Long document RAG with conversational memory
Price: 24 USD
Availability: InStock

Stop shipping RAG demos that forget the previous turn. Build a conversational tutor over a full book with chapter-aware chunking, local embeddings, a persistent FAISS index, and memory that actually follows the thread.

Enroll Preview curriculum

Still deciding? Ask first.

Message a mentor about fit, prerequisites, or where to start. Replies come on WhatsApp, usually within a day.

Curriculum fit, prerequisites, or where to start
Honest answer, no pressure to enroll

Engineers are learning here from

NVIDIAMICROSOFTGRABWISEPIPEDRIVEBOLTGLIA

Turn any PDF book into an AI tutor with chapter-aware chunking, semantic retrieval, and multi-turn conversational memory. Build a document QA system that handles follow-up questions without losing context.

Build a conversational AI tutor over long PDFs with chapter-aware chunking and memory-backed retrieval.

What you'll ship

Real projects, not toy demos.

Parse long PDFs with Docling, including OCR and table structure detection
Chunk prose with a recursive splitter tuned for chapter boundaries
Generate free local embeddings with HuggingFace sentence transformers
Persist and reload a FAISS vector store for instant restarts
Wire a ConversationalRetrievalChain that follows up across turns
Return grounded answers with source chunks the user can verify

What you'll learn

You finish able to:

Explain why conversational RAG needs history-aware query rewriting
Parse long PDFs with Docling while preserving chapter and table structure
Tune a recursive text splitter for long-form prose
Build and persist a FAISS index using free local embeddings
Wire a ConversationalRetrievalChain that handles follow-up questions
Return source documents so learners can verify every answer

Curriculum

From raw PDF to a tutor that remembers the conversation.

01
Long-doc RAG vs chat RAG
Understand why adding chat history to retrieval changes the problem, and see the shape of a conversational retrieval chain
3 lessons
02
Docling loader
Parse long PDFs with OCR and table structure, convert to markdown with chapter headings, and chunk for retrieval
3 lessons
03
Local embeddings and FAISS
Embed chunks locally with HuggingFace sentence-transformers, build a FAISS index, persist it to disk, and reload fast
3 lessons
04
ConversationalRetrievalChain
Wire the chain that rewrites follow-ups, retrieves grounded chunks, and refuses to hallucinate
3 lessons
05
Interactive tutor
Wrap the chain in a REPL, handle follow-ups across turns, and finish the workshop with a final checkpoint
3 lessons

Who it's for

Is this for you?

Backend engineers

who shipped a basic RAG demo and watched it break on the second question in a conversation

AI builders

working with long technical documents, textbooks, or manuals that need chapter context to answer well

Self-learners

who want a local PDF tutor that actually remembers what was asked earlier in the session

FAQ

Common questions.

Do I need a GPU for the embeddings?
No. The workshop uses sentence-transformers/all-MiniLM-L6-v2, which runs well on CPU. You only need an OpenRouter or OpenAI key for the chat model.
Why Docling instead of PyPDF or pdfplumber?
Docling preserves layout, runs OCR on scanned pages, and extracts table structure. For long books and technical PDFs, that metadata is the difference between clean chunks and garbled text.
Is FAISS the right vector store for this?
For a single-book tutor that runs locally, FAISS is ideal. It is fast, has zero infrastructure, and persists to disk. For multi-tenant production systems, move to Qdrant or pgvector afterward.
Does this teach the full LangChain framework?
No. The workshop focuses on ConversationalRetrievalChain and the retrieval prompt. You will walk away with a clear mental model of how memory, retrieval, and grounding fit together.

Pricing

Unlock this course with Pro.

One subscription unlocks every paid course and workshop replay. Pick yearly or monthly.

Unlock with Pro

$30$16/mo

You save 47% with regional pricing

Billed annually. Cancel anytime.

This course plus every paid course
Workshop replays in your library
New releases the day they ship

Still deciding?

After this course:

Conversational retrieval is the baseline for every serious tutor. Start here.

Enroll

Long document RAG with conversational memory

From $16/mo with Pro

47% OFFYearly Pro

$30/mo$16/mobilled yearlyGet Pro

47% OFFYearly Pro$30/mo$16/mobilled yearlyGet Pro

47% OFFYearly Pro

$30/mo$16/mobilled yearlyGet Pro

47% OFFYearly Pro$30/mo$16/mobilled yearlyGet Pro

Premium course

Turn any PDF book into an AI tutor that remembers the conversation

Enroll Preview curriculum

Still deciding? Ask first.

Message a mentor about fit, prerequisites, or where to start. Replies come on WhatsApp, usually within a day.

Curriculum fit, prerequisites, or where to start
Honest answer, no pressure to enroll

Engineers are learning here from

NVIDIAMICROSOFTGRABWISEPIPEDRIVEBOLTGLIA

Build a conversational AI tutor over long PDFs with chapter-aware chunking and memory-backed retrieval.

What you'll ship

Real projects, not toy demos.

Parse long PDFs with Docling, including OCR and table structure detection
Chunk prose with a recursive splitter tuned for chapter boundaries
Generate free local embeddings with HuggingFace sentence transformers
Persist and reload a FAISS vector store for instant restarts
Wire a ConversationalRetrievalChain that follows up across turns
Return grounded answers with source chunks the user can verify

What you'll learn

You finish able to:

Explain why conversational RAG needs history-aware query rewriting
Parse long PDFs with Docling while preserving chapter and table structure
Tune a recursive text splitter for long-form prose
Build and persist a FAISS index using free local embeddings
Wire a ConversationalRetrievalChain that handles follow-up questions
Return source documents so learners can verify every answer

Curriculum

From raw PDF to a tutor that remembers the conversation.

01
Long-doc RAG vs chat RAG
Understand why adding chat history to retrieval changes the problem, and see the shape of a conversational retrieval chain
3 lessons
02
Docling loader
Parse long PDFs with OCR and table structure, convert to markdown with chapter headings, and chunk for retrieval
3 lessons
03
Local embeddings and FAISS
Embed chunks locally with HuggingFace sentence-transformers, build a FAISS index, persist it to disk, and reload fast
3 lessons
04
ConversationalRetrievalChain
Wire the chain that rewrites follow-ups, retrieves grounded chunks, and refuses to hallucinate
3 lessons
05
Interactive tutor
Wrap the chain in a REPL, handle follow-ups across turns, and finish the workshop with a final checkpoint
3 lessons

Who it's for

Is this for you?

Backend engineers

who shipped a basic RAG demo and watched it break on the second question in a conversation

AI builders

working with long technical documents, textbooks, or manuals that need chapter context to answer well

Self-learners

who want a local PDF tutor that actually remembers what was asked earlier in the session

FAQ

Common questions.

Do I need a GPU for the embeddings?
No. The workshop uses sentence-transformers/all-MiniLM-L6-v2, which runs well on CPU. You only need an OpenRouter or OpenAI key for the chat model.
Why Docling instead of PyPDF or pdfplumber?
Docling preserves layout, runs OCR on scanned pages, and extracts table structure. For long books and technical PDFs, that metadata is the difference between clean chunks and garbled text.
Is FAISS the right vector store for this?
For a single-book tutor that runs locally, FAISS is ideal. It is fast, has zero infrastructure, and persists to disk. For multi-tenant production systems, move to Qdrant or pgvector afterward.
Does this teach the full LangChain framework?
No. The workshop focuses on ConversationalRetrievalChain and the retrieval prompt. You will walk away with a clear mental model of how memory, retrieval, and grounding fit together.

Pricing

Unlock this course with Pro.

One subscription unlocks every paid course and workshop replay. Pick yearly or monthly.

Unlock with Pro

$30$16/mo

You save 47% with regional pricing

Billed annually. Cancel anytime.

This course plus every paid course
Workshop replays in your library
New releases the day they ship

Still deciding?

After this course:

Conversational retrieval is the baseline for every serious tutor. Start here.

Enroll

Long document RAG with conversational memory

From $16/mo with Pro