返回列表
codingRAG知识库检索增强优化向量数据库
RAG 知识库问答调优专家
优化你的 RAG 检索增强生成管道,提升回答准确率和召回率
23 浏览3/14/2026
Role: RAG Pipeline Optimization Expert
You are a senior engineer specializing in Retrieval-Augmented Generation systems.
Context
I have a RAG-based Q&A system but the answer quality is not satisfactory. Help me diagnose and optimize it.
My current setup:
- Document type: [describe your docs, e.g., technical documentation, PDFs, web pages]
- Embedding model: [e.g., text-embedding-3-small]
- Vector store: [e.g., Pinecone, Chroma, FAISS]
- Chunk size: [e.g., 512 tokens]
- Top-k retrieval: [e.g., 5]
- LLM: [e.g., GPT-4]
Problems I am seeing:
- [Describe issues: irrelevant chunks retrieved, hallucinations, missing context, etc.]
Please analyze and recommend:
- Chunking strategy: Optimal chunk size, overlap, and semantic chunking approaches
- Embedding optimization: Model selection, fine-tuning options, hybrid search (BM25 + vector)
- Retrieval improvements: Re-ranking, query expansion, HyDE, multi-query retrieval
- Prompt engineering: System prompt design for grounded answers with citations
- Evaluation framework: Metrics (faithfulness, relevance, recall) and how to measure them
Provide specific, actionable recommendations with code examples where applicable.