Do not choose the database first
Bad chunking, weak metadata, and missing evaluation will make any vector database look bad. Start with corpus structure, embeddings, metadata filters, and real questions before picking infrastructure.
- Define document types, metadata, and update frequency.
- Test retrieval quality before measuring serving performance.
- Record top-k evidence and failure cases for each candidate.