📖 The AI Tool Bible

Best AI tools for RAG frameworks

42 tools in the RAG category, filtered to RAG frameworks.

Pinecone

Featured
RAG · Hosted vector DB (not an LLM)
8.8

Managed vector database for production-scale similarity search.

Freemium · Free starter; serverless pay-as-you-go from $0.33/1M reads
Tags: managed vector DB, production RAG

LlamaIndex

Featured
RAG · BYO (Claude / GPT / open)
8.7

Data framework for connecting LLMs to your data.

Freemium · Free open-source; LlamaCloud paid
Tags: RAG, data ingestion

Weaviate

RAG · Hosted vector DB (not an LLM)
8.4

Open-source vector DB with hybrid search and modules.

Freemium · Free open-source; cloud from $25/mo
Tags: self-hosted RAG, hybrid search

LangChain

RAG · BYO (any major LLM)
8.3

The broad LLM application framework — chains, agents, retrievers.

Freemium · Free open-source; LangSmith paid
Tags: general LLM apps, RAG

Chroma

RAG · Hosted vector DB (not an LLM)
8.1

Embedded, developer-friendly vector store for Python.

Freemium · Free open-source; Chroma Cloud paid
Tags: prototyping, embedded RAG

Agentset

RAG · Multi-model (Claude, OpenAI, Google, xAI, Cohere, Mistral, DeepSeek)

Production-ready RAG infrastructure with agentic search, citations, and model-agnostic plumbing.

Freemium · Free 1K pages/10K retrievals; Pro $49/mo + $0.01/page; Enterprise custom
Tags: document-qa, agentic-search

AnythingLLM

RAG · Multi-model

Open-source desktop and self-hosted app that turns your documents into a private chat-and-agent workspace.

Freemium · Desktop free (MIT); self-host free; cloud paid plans
Tags: document-chat, private-rag

BGE (BAAI General Embedding)

RAG · BGE / bge-m3 / bge-reranker

Open-source embedding and reranker models from BAAI that anchor a huge share of production RAG stacks.

Free · Free, open-source (MIT-style license); self-hosted inference cost only
Tags: semantic-search, rag-retrieval

CocoIndex

RAG · Bring-your-own (embeddings + LLM)

Open-source incremental data framework that keeps RAG indexes and agent context continuously fresh.

Free · Open-source, self-hosted; bring your own infra
Tags: code-indexing, rag-pipelines

Cognee

RAG · Multi-model (Claude, OpenAI, others)

Open-source graph-memory layer that gives AI agents persistent, queryable context across sessions.

Freemium · Hobby free (1M tokens/mo); Growth $5/workspace/mo + token usage; Enterprise custom
Tags: agent-memory, knowledge-graphs

Cohere

RAG · Command, Embed, Rerank, Transcribe (proprietary)

Enterprise-grade LLM platform built for private, secure, and customizable deployment.

Enterprise · Free trial API keys; production via usage-based API pricing or enterprise contracts
Tags: enterprise-rag, semantic-search

Context Data

RAG · Multi-model

Enterprise data platform for deploying private RAG pipelines without infrastructure plumbing.

Enterprise · Contact sales
Tags: enterprise-rag, document-search

Databricks Vector Search

RAG · Multi-model (BYO embeddings or Databricks-hosted)

Managed hybrid vector search that lives inside the Databricks lakehouse and auto-syncs with your source tables.

Enterprise · Consumption-based via Databricks; free trial available
Tags: rag-retrieval, hybrid-search

DeepSearcher

RAG · Multi-model (DeepSeek, OpenAI o1/o3-mini, Claude, Llama, others)

Open-source agentic RAG framework for private enterprise data, built by the Zilliz/Milvus team.

Free · Free, Apache 2.0; bring your own LLM and vector DB costs
Tags: enterprise-rag, agentic-search

Epsilla

RAG · Multi-model

Agent-as-a-Service platform with managed RAG and a no-code builder for vertical enterprise AI.

Freemium · Free; Starter $29/mo; Professional $249/mo; AI Concierge $2,499/mo; Enterprise custom
Tags: enterprise-rag, ai-agents

Exa

RAG · Proprietary neural + keyword search

Web search API built for AI agents, with structured outputs and token-efficient highlights.

Freemium · Free playground; paid usage-based plans; enterprise on request
Tags: agent-web-search, rag-retrieval

Feast

RAG

Open-source feature store that serves consistent features to ML training and online inference, with RAG vector search built in.

Free · Free, open source (Apache 2.0); self-hosted
Tags: feature-store, rag-retrieval

Firecrawl

RAG

Web scraping and crawling API that returns LLM-ready markdown, JSON, or structured data from any URL.

Freemium · Free 1,000 credits/mo; paid Hobby/Standard/Growth tiers; Scale/Enterprise annual
Tags: web-scraping, rag-ingestion

GaliChat

RAG

No-code AI chatbot builder that trains on your website content for support and lead capture.

Freemium · Free tier (no credit card); paid plans for advanced features
Tags: customer-support, lead-generation

Haystack

RAG · Multi-model

Open-source Python framework from deepset for building production RAG pipelines and LLM agents.

Freemium · Open-source free; deepset Enterprise Support and AI Platform via sales
Tags: rag, agents

HelixDB

RAG

Unified graph-and-vector database built for AI agent memory and GraphRAG.

Freemium · Open-source core; managed cloud pricing on request
Tags: agent-memory, graphrag

Kotaemon

RAG · Multi-model (OpenAI, LlamaCPP, any OpenAI-compatible endpoint)

Open-source RAG UI for chatting with your own documents, locally or self-hosted.

Free · Free, open-source (MIT-style); self-hosted infrastructure costs only
Tags: document-qa, private-rag

LanceDB

RAG

Open-source multimodal lakehouse and vector database built for AI training and retrieval at petabyte scale.

Freemium · Open-source free; LanceDB Cloud and Enterprise via contact sales
Tags: vector-search, rag

Langchain-Chatchat

RAG · Multi-model (GLM-4, Qwen2, Llama 3, etc. via Xinference/Ollama/LocalAI/FastChat)

Self-hostable RAG and agent framework that wires LangChain to any local open-source LLM and a knowledge base.

Free · Apache-2.0 open source; self-hosted, infra costs only
Tags: private-knowledge-base, offline-rag

MaxKB

RAG · Multi-model

Open-source enterprise RAG and agent platform with built-in workflow engine and multi-LLM support.

Freemium · Community edition free (GPLv3); paid enterprise edition
Tags: enterprise-knowledge-base, customer-support-bots

OpenDataLoader PDF

RAG

Open-source PDF parser built for RAG pipelines, with reading-order detection, table extraction, and bounding-box citations.

Freemium · Free (Apache 2.0); enterprise tier for PDF/UA export and visual editor
Tags: pdf-parsing, rag-preprocessing

PageIndex

RAG

Vectorless reasoning-based retrieval for long documents, with traceable, auditable answers.

Freemium · Free Try Now tier; enterprise pricing on request
Tags: document-qa, long-pdf-retrieval

Pathway

RAG · Multi-model

Live data framework for production RAG and streaming ETL pipelines in Python.

Freemium · Community free (BSL 1.1, 8GB/4 cores); Scale and Enterprise tiers with license key
Tags: live-rag, streaming-etl

Perplexity AI

RAG · Multi-model (Sonar, GPT-4 class, Claude, Gemini)

Conversational answer engine that cites its sources by default.

Freemium · Free tier; Pro $20/mo or $200/yr; Enterprise from $40/user/mo; Sonar API usage-based
Tags: ai-search, research

PostgresML

RAG · Multi-model (Llama, Mistral, open-source embeddings)

PostgreSQL extension that runs embeddings, vector search, and LLM inference inside your database.

Freemium · Open-source self-host free; managed cloud usage-based with $100 free credits
Tags: vector-search, rag

PrivateGPT

RAG · Multi-model (BYO local LLM)

Production-ready, air-gapped RAG framework for querying your documents with local LLMs.

Freemium · OSS free; Zylon enterprise contract (contact sales)
Tags: private-rag, chat-with-documents

Quivr

RAG · Multi-model (OpenAI, Anthropic, Mistral, Gemma)

Open-source RAG framework for building custom AI assistants over your own documents in a few lines of Python.

Free · Open source (pip install quivr-core); pay only for LLM/vector-store usage
Tags: document-qa, custom-knowledge-base

RAGFlow

RAG · Multi-model

Open-source RAG engine with deep document parsing, hybrid search, and visual agent orchestration.

Freemium · Free tier; Starter $29/mo; Pro $129/mo; Enterprise custom
Tags: document-qa, enterprise-search

RAGs by LlamaIndex

RAG · Multi-model (OpenAI, Anthropic, Replicate, HuggingFace)

Open-source Streamlit app that builds a custom RAG pipeline from a natural-language brief.

Free · Free, MIT-licensed; bring your own model/API keys
Tags: natural-language-rag-builder, document-qa

Rivestack

RAG · OpenAI embeddings (auto-embeddings)

Managed Postgres with pgvector on dedicated NVMe, pitched as a cheaper RAG backend than Pinecone or Supabase.

Freemium · Free shared tier; Solo $15/mo, Starter $35, Growth $59, Scale $99 (EU Central)
Tags: rag-backend, vector-search

Singlebase Cloud

RAG · Multi-model

AI-native Firebase alternative bundling document DB, vector DB, auth, storage, and built-in AI services.

Freemium · Free tier available; paid plans scale with usage
Tags: vector-search, rag-apps

Superduper

RAG · Multi-model

Enterprise AI agent orchestration that brings RAG and agents to your existing data stack without migration.

Enterprise · Free trial on Snowflake Marketplace; enterprise self-hosted pricing on request
Tags: in-database-rag, agent-orchestration

TurboVec

RAG

Rust-powered vector index with 2-4 bit TurboQuant compression for SIMD-accelerated RAG search.

Free · Free, MIT licensed
Tags: vector-search, rag

UltraRAG

RAG · Multi-model (MiniCPM-Embedding-Light, AgentCPM-Report, BYO LLM)

Low-code, YAML-driven RAG pipeline orchestrator with a visual UI for building and demoing retrieval systems.

Free · Open source; self-hosted
Tags: rag-pipelines, knowledge-base-qa

Vanna.ai

RAG · Multi-model (Anthropic, OpenAI, Gemini, Ollama)

Open-source text-to-SQL agent that learns your schema and writes queries against your real warehouse.

Freemium · Open-source free; paid cloud tier for hosted admin features
Tags: text-to-sql, natural-language-bi

You.com

RAG · Multi-model

Web search and research APIs purpose-built for LLMs and AI agents.

Freemium · Free trial; enterprise pricing on request
Tags: web-search-api, agent-grounding

Yuxi

RAG · Multi-model

Open-source AI agent platform that fuses agentic RAG with knowledge graphs on a LangGraph runtime.

Free · Free, MIT-licensed self-host
Tags: agentic-rag, knowledge-graphs