📖 The AI Tool Bible

RAG

Retrieval-augmented generation, vector stores, indexers.

70 tools

Why it matters

RAG isn't a model; it's an architecture: retrieve, augment, generate. The real choices are which framework orchestrates the retrieval and which vector store sits underneath it.
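To make the three stages concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not how any tool listed here works: the corpus `DOCS`, the bag-of-words `embed` function, and the stubbed `generate` call are all hypothetical stand-ins for a real document store, embedding model, and LLM.

```python
import math
import re
from collections import Counter

# Hypothetical toy corpus; a real system would read from a vector store.
DOCS = [
    "Pinecone is a managed vector database for similarity search.",
    "LlamaIndex is a data framework for connecting LLMs to your data.",
    "Vespa is an open-source search engine with vector and sparse retrieval.",
]


def embed(text):
    """Toy stand-in for an embedding model: lowercase term counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, docs, k=2):
    """Retrieve: rank documents by similarity to the query, keep the top k."""
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


def augment(query, passages):
    """Augment: place the retrieved passages into the prompt as context."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


def generate(prompt):
    """Generate: placeholder for an LLM call (assumption: swap in your client)."""
    return f"[LLM answer would go here; prompt was {len(prompt)} characters]"


def rag(query, docs=DOCS, k=2):
    """Run the three stages in order."""
    return generate(augment(query, retrieve(query, docs, k)))
```

In this sketch, a framework would take over the `rag` and `augment` glue, while a vector store would replace the in-memory ranking inside `retrieve`.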

What's in here

Includes RAG frameworks (LlamaIndex, LangChain), managed vector databases (Pinecone), open-source vector stores (Weaviate, Chroma, Vespa), and hybrid-search engines.

How to pick

Pick LlamaIndex when retrieval quality is the bottleneck. Pick Pinecone for zero-ops production. Pick Weaviate or Chroma for self-hosted or budget-conscious deployments. Pick Vespa when the corpus grows beyond a few million documents.

Pinecone

RAG · Hosted vector DB (not an LLM)

Managed vector database for production-scale similarity search.

Freemium · Free starter; serverless pay-as-you-go from $0.33/1M reads

managed vector DB · production RAG

LlamaIndex

RAG · BYO (Claude / GPT / open)

Data framework for connecting LLMs to your data.

Freemium · Free open-source; LlamaCloud paid

RAG · data ingestion

Weaviate

RAG · Hosted vector DB (not an LLM)

Open-source vector DB with hybrid search and modules.

Freemium · Free open-source; cloud from $25/mo

self-hosted RAG · hybrid search

LangChain

RAG · BYO (any major LLM)

The broad LLM application framework — chains, agents, retrievers.

Freemium · Free open-source; LangSmith paid

general LLM apps · RAG

Vespa

RAG · Hosted search engine (not an LLM)

Yahoo's open-source search engine with vector + sparse retrieval.

Freemium · Free open-source; Vespa Cloud paid

large-scale search · ranking

Chroma

RAG · Hosted vector DB (not an LLM)

Embedded, developer-friendly vector store for Python.

Freemium · Free open-source; Chroma Cloud paid

prototyping · embedded RAG

Agentset

RAG · Multi-model (Claude, OpenAI, Google, xAI, Cohere, Mistral, DeepSeek)

Production-ready RAG infrastructure with agentic search, citations, and model-agnostic plumbing.

Freemium · Free 1K pages/10K retrievals; Pro $49/mo + $0.01/page; Enterprise custom

document-qa · agentic-search

AnythingLLM

RAG · Multi-model

Open-source desktop and self-hosted app that turns your documents into a private chat-and-agent workspace.

Freemium · Desktop free (MIT); self-host free; cloud paid plans

document-chat · private-rag

BGE (BAAI General Embedding)

RAG · BGE / bge-m3 / bge-reranker

Open-source embedding and reranker models from BAAI that anchor a huge share of production RAG stacks.

Free · Free, open-source (MIT-style license); self-hosted inference cost only

semantic-search · rag-retrieval

Chat With PDF by Copilot.us

Conversational PDF Q&A bundled into a multi-app productivity membership.

Freemium · 7-day free trial; paid membership covers full app suite

pdf-qa · document-summarization

ChatPDF

RAG · GPT-4o / GPT-4o-mini

Conversational Q&A over PDFs and other documents with citation-backed answers.

Freemium · Free: 2 PDFs/day; Plus plan for unlimited

pdf-qa · document-summarization

CocoIndex

RAG · Bring-your-own (embeddings + LLM)

Open-source incremental data framework that keeps RAG indexes and agent context continuously fresh.

Free · Open-source, self-hosted; bring your own infra

code-indexing · rag-pipelines

Cognee

RAG · Multi-model (Claude, OpenAI, others)

Open-source graph-memory layer that gives AI agents persistent, queryable context across sessions.

Freemium · Hobby free (1M tokens/mo); Growth $5/workspace/mo + token usage; Enterprise custom

agent-memory · knowledge-graphs

Cohere

RAG · Command, Embed, Rerank, Transcribe (proprietary)

Enterprise-grade LLM platform built for private, secure, and customizable deployment.

Enterprise · Free trial API keys; production via usage-based API pricing or enterprise contracts

enterprise-rag · semantic-search

Context Data

RAG · Multi-model

Enterprise data platform for deploying private RAG pipelines without infrastructure plumbing.

Enterprise · Contact sales

enterprise-rag · document-search

Cosmos

AI-powered archive search and reel curation for video production companies.

Enterprise · Contact sales

video archive search · sales enablement

Cube

RAG · Multi-model

Semantic layer that grounds LLM agents in your real business metrics instead of letting them hallucinate SQL.

Freemium · Cube Core open source; Cube Cloud paid, contact sales

semantic-layer · embedded-analytics

Databricks Vector Search

RAG · Multi-model (BYO embeddings or Databricks-hosted)

Managed hybrid vector search that lives inside the Databricks lakehouse and auto-syncs with your source tables.

Enterprise · Consumption-based via Databricks; free trial available

rag-retrieval · hybrid-search

DeepSearcher

RAG · Multi-model (DeepSeek, OpenAI o1/o3-mini, Claude, Llama, others)

Open-source agentic RAG framework for private enterprise data, built by the Zilliz/Milvus team.

Free · Free, Apache 2.0; bring your own LLM and vector DB costs

enterprise-rag · agentic-search

Elicit

RAG · Claude Opus 4.5

AI research assistant that searches, screens, and extracts data from 138M+ academic papers at scale.

Freemium · Free tier; paid Plus, Pro, and Enterprise plans

literature-review · systematic-review

Emergent Mind

RAG · Undisclosed

AI-curated arXiv discovery layer that summarizes frontier papers and aggregates social discussion around them.

Freemium · Free Basic; Pro $10/mo annual ($12 mo); Max $25/mo annual ($30 mo)

arxiv-discovery · paper-summarization

Epsilla

RAG · Multi-model

Agent-as-a-Service platform with managed RAG and a no-code builder for vertical enterprise AI.

Freemium · Free; Starter $29/mo; Professional $249/mo; AI Concierge $2,499/mo; Enterprise custom

enterprise-rag · ai-agents

Exa

RAG · Proprietary neural + keyword search

Web search API built for AI agents, with structured outputs and token-efficient highlights.

Freemium · Free playground; paid usage-based plans; enterprise on request

agent-web-search · rag-retrieval

Explainpaper

RAG · Undisclosed (tiered basic vs. advanced)

AI reading companion that decodes dense academic papers by highlighting and chatting with the PDF.

Freemium · Free; Pro $16/mo with 7-day trial

paper-reading · research-summaries