coding, RAG, knowledge base, retrieval augmentation, optimization, vector database

RAG Knowledge Base Q&A Tuning Expert

Optimize your RAG (retrieval-augmented generation) pipeline to improve answer accuracy and recall

23 views · 3/14/2026

Role: RAG Pipeline Optimization Expert

You are a senior engineer specializing in Retrieval-Augmented Generation systems.

Context

I have a RAG-based Q&A system, but the answer quality is unsatisfactory. Help me diagnose and optimize it.

My current setup:

Document type: [describe your docs, e.g., technical documentation, PDFs, web pages]
Embedding model: [e.g., text-embedding-3-small]
Vector store: [e.g., Pinecone, Chroma, FAISS]
Chunk size: [e.g., 512 tokens]
Top-k retrieval: [e.g., 5]
LLM: [e.g., GPT-4]

Problems I am seeing:

[Describe issues: irrelevant chunks retrieved, hallucinations, missing context, etc.]

Please analyze and recommend:

Chunking strategy: Optimal chunk size, overlap, and semantic chunking approaches
Embedding optimization: Model selection, fine-tuning options, hybrid search (BM25 + vector)
Retrieval improvements: Re-ranking, query expansion, HyDE, multi-query retrieval
Prompt engineering: System prompt design for grounded answers with citations
Evaluation framework: Metrics (faithfulness, relevance, recall) and how to measure them
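
To make the hybrid search item above (BM25 + vector) concrete, here is a minimal sketch of one common way to merge the two result lists: reciprocal rank fusion. This is an illustration, not the only fusion method. The function name `reciprocal_rank_fusion`, the document IDs, and the constant `k=60` are assumptions chosen for the example; the two input lists stand in for whatever your BM25 index and vector store actually return.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranked list.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. k=60 is a commonly used default, not a tuned value.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical results of a BM25 search and a vector search for one query.
bm25_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([bm25_hits, vector_hits])
print(fused)  # ['doc_b', 'doc_a', 'doc_d', 'doc_c']
```

Rank-based fusion avoids having to normalize BM25 scores and cosine similarities onto a shared scale, which is why it is often a reasonable first choice before trying weighted score blending.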

Provide specific, actionable recommendations with code examples where applicable.
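
As an example of the kind of code-backed recommendation requested here, the following is a minimal sketch of measuring retrieval recall at k over a small labeled set. It covers only the recall metric; faithfulness and relevance usually need human labels or an LLM-based judge and are not shown. The function name `recall_at_k`, the chunk IDs, and the `eval_set` structure are hypothetical and would be replaced by your own labeled queries.

```python
def recall_at_k(retrieved, relevant, k):
    """Fraction of the relevant chunk IDs that appear in the top-k retrieved IDs."""
    relevant_set = set(relevant)
    if not relevant_set:
        raise ValueError("relevant must contain at least one chunk ID")
    top_k = set(retrieved[:k])
    return len(top_k & relevant_set) / len(relevant_set)


# Hypothetical labeled queries: what the retriever returned (in rank order)
# and which chunks a human marked as containing the answer.
eval_set = [
    {"retrieved": ["c1", "c7", "c3", "c9", "c2"], "relevant": ["c3", "c4"]},
    {"retrieved": ["c5", "c6", "c8", "c1", "c0"], "relevant": ["c5"]},
]

scores = [recall_at_k(q["retrieved"], q["relevant"], k=5) for q in eval_set]
mean_recall = sum(scores) / len(scores)
print(scores, mean_recall)  # [0.5, 1.0] 0.75
```

Tracking this number before and after each change (chunk size, top-k, re-ranking) gives a simple way to check whether a change actually improved retrieval.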