RAG
Retrieval-augmented generation, vector stores, indexers.
70 tools
RAG isn't a model; it's an architecture: retrieve, augment, generate. The choices that matter are which framework orchestrates retrieval and which vector store sits underneath.
Includes RAG frameworks (LlamaIndex, LangChain), managed vector databases (Pinecone), open-source vector stores (Weaviate, Chroma, Vespa), and hybrid-search engines.
Pick LlamaIndex when retrieval quality is the bottleneck. Pick Pinecone for zero-ops production. Pick Weaviate or Chroma for self-hosted or budget-conscious deployments. Pick Vespa when you scale beyond a few million docs.
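For readers new to the pattern, the retrieve, augment, generate loop can be sketched in a few lines of plain Python. This is a toy illustration only, not how any tool listed here implements it: `embed` is a hypothetical stand-in that counts words instead of calling an embedding model, `DOCS` is an in-memory list standing in for a vector store, and `generate` is a placeholder where an LLM call would go.

```python
import math
import re
from collections import Counter

# Toy corpus; in a real stack these would be chunks held in a vector store.
DOCS = [
    "Pinecone is a managed vector database.",
    "Chroma is an embedded vector store for Python.",
    "Vespa combines vector and sparse retrieval.",
]

def embed(text):
    """Hypothetical stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query, docs, k=1):
    """Retrieve: rank documents by similarity to the query and keep the top k."""
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

def augment(query, passages):
    """Augment: place the retrieved passages in the prompt ahead of the question."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

def generate(prompt):
    """Generate: placeholder for an LLM call (assumption: swap in a real client)."""
    return f"[model output for a {len(prompt)}-character prompt]"

def rag(query, docs=DOCS, k=1):
    """Run the three stages in order."""
    return generate(augment(query, retrieve(query, docs, k)))

if __name__ == "__main__":
    question = "Which store is embedded for Python?"
    print(retrieve(question, DOCS))
    print(rag(question))
```

In this framing, the frameworks in the list mostly replace `retrieve` and `augment` with something smarter, while the vector stores replace `DOCS` and the similarity ranking.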
Pinecone
Featured: Managed vector database for production-scale similarity search.
LlamaIndex
Featured: Data framework for connecting LLMs to your data.
Weaviate
Open-source vector DB with hybrid search and modules.
LangChain
The broad LLM application framework — chains, agents, retrievers.
Vespa
Open-source search engine, originally built at Yahoo, with vector + sparse retrieval.
Chroma
Embedded, developer-friendly vector store for Python.
Agentset
Production-ready RAG infrastructure with agentic search, citations, and model-agnostic plumbing.
AnythingLLM
Open-source desktop and self-hosted app that turns your documents into a private chat-and-agent workspace.
BGE (BAAI General Embedding)
Open-source embedding and reranker models from BAAI that anchor a huge share of production RAG stacks.
Chat With PDF by Copilot.us
Conversational PDF Q&A bundled into a multi-app productivity membership.
ChatPDF
Conversational Q&A over PDFs and other documents with citation-backed answers.
CocoIndex
Open-source incremental data framework that keeps RAG indexes and agent context continuously fresh.
Cognee
Open-source graph-memory layer that gives AI agents persistent, queryable context across sessions.
Cohere
Enterprise-grade LLM platform built for private, secure, and customizable deployment.
Context Data
Enterprise data platform for deploying private RAG pipelines without infrastructure plumbing.
Cosmos
AI-powered archive search and reel curation for video production companies.
Cube
Semantic layer that grounds LLM agents in your real business metrics instead of letting them hallucinate SQL.
Databricks Vector Search
Managed hybrid vector search that lives inside the Databricks lakehouse and auto-syncs with your source tables.
DeepSearcher
Open-source agentic RAG framework for private enterprise data, built by the Zilliz/Milvus team.
Elicit
AI research assistant that searches, screens, and extracts data from 138M+ academic papers at scale.
Emergent Mind
AI-curated arXiv discovery layer that summarizes frontier papers and aggregates social discussion around them.
Epsilla
Agent-as-a-Service platform with managed RAG and a no-code builder for vertical enterprise AI.
Exa
Web search API built for AI agents, with structured outputs and token-efficient highlights.
Explainpaper
AI reading companion that decodes dense academic papers by highlighting and chatting with the PDF.