📖 The AI Tool Bible

The RAG stack that actually scales

Thirteen pieces (embedding, vector store, retrieval, eval, framework) that hold up past a prototype.

The RAG hello-world looks easy. The production RAG stack is a different beast: you need an embedding model, a vector store, a retrieval framework, a reranker, and an eval harness that catches regressions before users do. These thirteen tools cover every layer. Pick one from each layer; don't pick two from the same one.
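To make the layers concrete, here is a minimal sketch of how they fit together, using only the Python standard library. Everything in it is a stand-in assumed for illustration: `embed` is a bag-of-words counter rather than a real embedding model, `VectorStore` is an in-memory list rather than any of the databases below, `rerank` is a term-overlap score rather than a trained reranker, and `recall_at_k` is one simple metric an eval harness might track. None of these names come from the tools in this collection.

```python
import math
from collections import Counter

def embed(text):
    # Stand-in embedding (assumption): bag-of-words counts, not a real model.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

class VectorStore:
    # Hypothetical in-memory store; a production stack would use a real vector DB.
    def __init__(self):
        self.rows = []  # each row is (doc_id, text, vector)

    def add(self, doc_id, text):
        self.rows.append((doc_id, text, embed(text)))

    def search(self, query, k):
        q = embed(query)
        return sorted(self.rows, key=lambda r: cosine(q, r[2]), reverse=True)[:k]

def rerank(query, candidates, k):
    # Stand-in reranker (assumption): fraction of query terms found in the document.
    terms = set(query.lower().split())
    def score(row):
        return len(terms & set(row[1].lower().split())) / max(len(terms), 1)
    return sorted(candidates, key=score, reverse=True)[:k]

def recall_at_k(store, labeled_queries, k):
    # Eval harness in miniature: share of queries whose expected doc is in the top k.
    hits = 0
    for query, expected_id in labeled_queries:
        top = rerank(query, store.search(query, k=k * 2), k=k)
        hits += any(doc_id == expected_id for doc_id, _, _ in top)
    return hits / len(labeled_queries)

store = VectorStore()
store.add("pricing", "serverless pricing is billed per million reads")
store.add("hybrid", "hybrid search combines vector and keyword retrieval")
store.add("eval", "an eval harness catches retrieval regressions")

labeled = [("how is pricing billed", "pricing"), ("what is hybrid search", "hybrid")]
score = recall_at_k(store, labeled, k=1)
```

Running `recall_at_k` on a fixed labeled set after every change to the embedding, store, or reranker is the kind of regression check the eval layer is meant to provide; swapping any stand-in for a real tool should not change the shape of the pipeline.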

13 tools in this collection

Pinecone

Featured
RAG · Hosted vector DB (not an LLM)
8.8

Managed vector database for production-scale similarity search.

Freemium · Free starter; serverless pay-as-you-go from $0.33/1M reads · managed vector DB · production RAG

Weaviate

RAG · Hosted vector DB (not an LLM)
8.4

Open-source vector DB with hybrid search and modules.

Freemium · Free open-source; cloud from $25/mo · self-hosted RAG · hybrid search

Chroma

RAG · Hosted vector DB (not an LLM)
8.1

Embedded, developer-friendly vector store for Python.

Freemium · Free open-source; Chroma Cloud paid · prototyping · embedded RAG

Vespa

RAG · Hosted search engine (not an LLM)
8.2

Yahoo's open-source search engine with vector + sparse retrieval.

Freemium · Free open-source; Vespa Cloud paid · large-scale search · ranking

LlamaIndex

Featured
RAG · BYO (Claude / GPT / open)
8.7

Data framework for connecting LLMs to your data.

Freemium · Free open-source; LlamaCloud paid · RAG · data ingestion

LangChain

RAG · BYO (any major LLM)
8.3

The broad LLM application framework — chains, agents, retrievers.

Freemium · Free open-source; LangSmith paid · general LLM apps · RAG

Feast

RAG

Open-source feature store that serves consistent features to ML training and online inference, with RAG vector search built in.

Free · Free, open source (Apache 2.0); self-hosted · feature-store · rag-retrieval

RAGFlow

RAG · Multi-model

Open-source RAG engine with deep document parsing, hybrid search, and visual agent orchestration.

Freemium · Free tier; Starter $29/mo; Pro $129/mo; Enterprise custom · document-qa · enterprise-search

Humata.ai

RAG · Multi-model

Chat-with-your-documents RAG tool with citation-backed answers across uploaded PDFs and files.

Freemium · Free (60 pages); Expert $9.99/mo; Team $49/user/mo; Enterprise on request · document-qa · research-summarization

Exa

RAG · Proprietary neural + keyword search

Web search API built for AI agents, with structured outputs and token-efficient highlights.

Freemium · Free playground; paid usage-based plans; enterprise on request · agent-web-search · rag-retrieval

NotebookLM

RAG · Gemini 2.5

Google's source-grounded research notebook that turns your documents into chats, briefs, and AI-hosted podcasts.

Freemium · Free tier; Plus via Google One AI Premium ($19.99/mo) or Workspace add-on · document Q&A · research synthesis

Scite

RAG · Multi-model

AI research assistant that grades citations as supporting, contrasting, or mentioning across 1.6B citation statements.

Freemium · Free tier; Personal ~$20/user/mo ($12 annual); Organization custom · literature-review · citation-analysis

Cohere

RAG · Command, Embed, Rerank, Transcribe (proprietary)

Enterprise-grade LLM platform built for private, secure, and customizable deployment.

Enterprise · Free trial API keys; production via usage-based API pricing or enterprise contracts · enterprise-rag · semantic-search