📖 The AI Tool Bible

AI tools tagged Powered By Llama

48 tools matching this tag.model

All tags →

LlamaIndex

Featured
RAG · BYO (Claude / GPT / open)
8.7

Data framework for connecting LLMs to your data.

Freemium· Free open-source; LlamaCloud paidRAGdata ingestion

Together AI

Featured
Fine-tuning · Llama / Mistral / Qwen / DeepSeek and others
8.6

Fine-tune & serve open-weight models (Llama, Mistral, DeepSeek).

Paid· Pay-per-token; fine-tuning per-tokenopen modelsfine-tuning

Replicate

Fine-tuning · Thousands of community + first-party models
8.5

One-API platform for running and fine-tuning open-source models.

Paid· Pay-per-second of GPU timemodel hostingfine-tuning

LangChain

RAG · BYO (any major LLM)
8.3

The broad LLM application framework — chains, agents, retrievers.

Freemium· Free open-source; LangSmith paidgeneral LLM appsRAG

Continue

Coding · BYO (any OpenAI-compatible API + Ollama for local)
7.9

Open-source, self-hostable VS Code/JetBrains AI assistant.

Free· Free / open-source; you pay model costsself-hostedopen source

AI Dungeon

Writing · Multi-model (rotating open-weights: Mixtral, Llama, Hermes variants)

AI-powered text adventure platform where the story is generated turn-by-turn by an LLM.

Freemium· Free tier; paid tiers roughly $10-$30/mo (Adventurer / Champion / Mythic)interactive-fictionai-roleplay

Agenta

Evaluation · Multi-model

Open-source LLMOps platform for prompt engineering, evaluation, and observability in one workspace.

Freemium· Open-source self-host free; managed cloud has free tier plus paid plansprompt-engineeringllm-evaluation

AgenticSeek

Agents · Bring-your-own local LLM (Ollama / llama.cpp compatible)

Open-source local-first AI agent that browses the web, writes code, and runs tasks without sending anything to the cloud.

Free· Free and open source; optional SerpApi key for enhanced searchlocal-ai-agentautonomous-web-browsing

AnythingLLM

RAG · Multi-model

Open-source desktop and self-hosted app that turns your documents into a private chat-and-agent workspace.

Freemium· Desktop free (MIT); self-host free; cloud paid plansdocument-chatprivate-rag

Arthur

Evaluation · Multi-model

Open-source toolkit for testing, tracing, and monitoring production AI agents.

Freemium· Open-source (MIT) + free SaaS tier; paid/enterprise plans on requestagent-evaluationprompt-management

AstrBot

Agents · Multi-model (OpenAI, Anthropic, Gemini, DeepSeek, Ollama, Dify, Coze)

Open-source agentic AI assistant that bridges chat platforms like Telegram, Discord, and QQ with any LLM and a 1000+ plugin ecosystem.

Free· Free, open-source (AGPL-3.0); self-hosted, you pay your own LLM API costs.chatbotgroup-chat-assistant

BGE (BAAI General Embedding)

RAG · BGE / bge-m3 / bge-reranker

Open-source embedding and reranker models from BAAI that anchor a huge share of production RAG stacks.

Free· Free, open-source (MIT-style license); self-hosted inference cost onlysemantic-searchrag-retrieval

BentoML

Agents · Multi-model

Open-source framework and managed platform for serving and scaling AI models in production.

Freemium· OSS free (Apache 2.0); managed Bento cloud has free tier + usage-based pricingmodel-servingllm-inference

Browser Use

Agents · Multi-model (BYO LLM; Claude in hosted Box)

Open-source browser automation harness and cloud platform for LLM agents that drive real websites.

Freemium· Free open-source library; cloud usage-based with free tier and enterprise plansweb-automationscraping

Browser Use Web UI

Agents · Multi-model

Gradio web UI for running browser-use AI agents in a real or persistent Chrome session.

Free· Free, open-source; bring your own LLM API keysbrowser-automationweb-agents

Chainlit

Agents · Multi-model

Open-source Python framework for building production-grade conversational AI interfaces in minutes.

Free· Open-source (Apache 2.0); optional paid Literal AI observability tierchatbot-uiagent-frontend

Cherry Studio

Agents · Multi-model

Open-source desktop AI client that wires 300+ LLMs into one chat, knowledge-base, and agent workspace.

Free· Free and open source; bring your own API keysmulti-model chatlocal knowledge base

CompassRank

Evaluation · Multi-model

Public leaderboard from the OpenCompass project ranking open and closed LLMs across 100+ benchmarks.

Free· Free leaderboard; OpenCompass toolkit is Apache 2.0 open sourcellm-benchmarkingmodel-selection

Databricks Vector Search

RAG · Multi-model (BYO embeddings or Databricks-hosted)

Managed hybrid vector search that lives inside the Databricks lakehouse and auto-syncs with your source tables.

Enterprise· Consumption-based via Databricks; free trial availablerag-retrievalhybrid-search

DeepSearcher

RAG · Multi-model (DeepSeek, OpenAI o1/o3-mini, Claude, Llama, others)

Open-source agentic RAG framework for private enterprise data, built by the Zilliz/Milvus team.

Free· Free, Apache 2.0; bring your own LLM and vector DB costsenterprise-ragagentic-search

ElizaOS

Agents · Multi-model (BYO via plugins)

Open-source agentic OS for building and orchestrating multi-agent swarms across Discord, Telegram, X, and onchain.

Free· Free, open-source (self-hosted); bring your own model API keysmulti-agent-swarmssocial-bots

Epsilla

RAG · Multi-model

Agent-as-a-Service platform with managed RAG and a no-code builder for vertical enterprise AI.

Freemium· Free; Starter $29/mo; Professional $249/mo; AI Concierge $2,499/mo; Enterprise customenterprise-ragai-agents

GPTLocalhost

Writing · Bring-your-own (Ollama, LM Studio, llama.cpp, Foundry Local, etc.)

Run local LLMs directly inside Microsoft Word without sending text to the cloud.

Freemium· Free tier (512-char limit); paid monthly subscription and lifetime license availableprivate draftingoffline writing assistant

Groq

Coding · Multi-model (Llama, Mixtral, Gemma, Qwen, Whisper)

Custom-silicon LPU inference platform serving open models at GPU-trouncing latency via an OpenAI-compatible API.

Freemium· Free API key with rate limits; per-token paid tiers; enterprise contractslow-latency inferencevoice agents

Haystack

RAG · Multi-model

Open-source Python framework from deepset for building production RAG pipelines and LLM agents.

Freemium· Open-source free; deepset Enterprise Support and AI Platform via salesragagents

Hermes One

Agents · Multi-model (BYO via OpenRouter/OpenAI/Anthropic/Gemini/xAI/Ollama)

Open-source desktop AI agent with a self-improving learning loop and multi-platform messaging connectors.

Free· Free, MIT-licensed; you pay your own model inference costsautonomous-agentschat-ops

IntelliBar

Writing · Multi-model (GPT-4o, Claude 3.5, Gemini, o1, Llama, DeepSeek, Ollama)

Native macOS menu-bar client that talks to every major LLM with your own API keys.

Freemium· Free evaluation + one-time license; you pay model providers directly via your own API keysmulti-model-chatmenu-bar-assistant

Jan

Writing · Multi-model (local open-weights + OpenAI/Claude/Gemini via API)

Open-source desktop ChatGPT alternative that runs local LLMs and routes to cloud providers from one app.

Free· Free and open source; bring-your-own keys for cloud modelslocal-llm-chatprivate-ai-assistant

KNIME

Agents · Multi-model (OpenAI, Anthropic, Gemini, Ollama)

Visual node-based data science platform with built-in connectors for OpenAI, Anthropic, Gemini, and local LLMs.

Freemium· Free open-source desktop; Team and Business Hub plans paiddata-pipelinesllm-workflows

Katonic AI

Agents · Multi-model (2,600+ via AI Gateway)

Sovereign enterprise platform for building, deploying, and governing AI agents on your own infrastructure.

Enterprise· Contact sales for quote; no public pricingenterprise-agentson-prem-llm

Kiln AI

Evaluation · Multi-model

Open-source workbench for building, evaluating, and fine-tuning AI agents across 190+ models.

Freemium· Free Individual tier; Team (request access); Enterprise (custom)llm-evaluationfine-tuning

Kilo Code

Coding · Multi-model (500+ via BYO keys or routing)

Open-source agentic coding assistant for VS Code, JetBrains, and the terminal with bring-your-own-key routing across 500+ models.

Freemium· Free tier; Kilo Pass subscription; BYO-keys with zero markupai-pair-programmingcode-review

Kotaemon

RAG · Multi-model (OpenAI, LlamaCPP, any OpenAI-compatible endpoint)

Open-source RAG UI for chatting with your own documents, locally or self-hosted.

Free· Free, open-source (MIT-style); self-hosted infrastructure costs onlydocument-qaprivate-rag

LLM by Datasette

Coding · Multi-model

A CLI and Python library for running prompts against any LLM provider and logging everything to SQLite.

Free· Free and open source (Apache 2.0); pay underlying model providers separatelycli-promptingprompt-logging

LLaMA Factory

Fine-tuning · Multi-model (LLaMA, Mistral, Qwen, Gemma, Phi, LLaVA, ChatGLM, Yi)

Open-source, no-code WebUI for fine-tuning 100+ open LLMs with LoRA, QLoRA, DPO, and PPO.

Free· Free, open-source (Apache-2.0); self-hostedlora-fine-tuningqlora

LM Studio

Agents · Multi-model (gpt-oss, Qwen3, Gemma, DeepSeek-R1, Llama, others)

Desktop app for discovering, downloading, and running open-weight LLMs locally with an OpenAI-compatible server.

Freemium· Free for personal and commercial use; paid LM Studio for Work / Enterprise tierlocal-llm-inferenceprivate-chat

LMQL

Coding · Multi-model (OpenAI, Hugging Face Transformers, llama.cpp)

A query language for LLMs that bolts types, templates, and constraints onto prompting.

Free· Free and open source (Apache-style); self-host or use with your own model API keysconstrained-decodingstructured-output

LangExtract

RAG · Multi-model (Gemini, GPT-4/4o, Ollama-hosted local models)

Google's open-source Python library for LLM-driven structured extraction from unstructured text, with source-grounded outputs.

Free· Library is free (Apache-2.0); LLM API costs depend on chosen backendstructured-extractiondocument-parsing

Langchain-Chatchat

RAG · Multi-model (GLM-4, Qwen2, Llama 3, etc. via Xinference/Ollama/LocalAI/FastChat)

Self-hostable RAG and agent framework that wires LangChain to any local open-source LLM and a knowledge base.

Free· Apache-2.0 open source; self-hosted, infra costs onlyprivate-knowledge-baseoffline-rag

Langflow

Agents · Multi-model

Open-source visual builder for LangChain-style AI agents and RAG pipelines.

Freemium· Open-source free; hosted free tier + paid enterprise via DataStaxagent-prototypingrag-pipelines

Langfuse

Evaluation · Model-agnostic

Open-source LLM observability, prompt management, and evaluation in one platform.

Freemium· Free self-host & Hobby tier; Core $29/mo, Pro $199/mo, Enterprise $2,499/mollm-observabilityprompt-management

LibreChat

Writing · Multi-model (OpenAI, Anthropic, Google, AWS Bedrock, Azure, Ollama, and others)

Open-source, self-hostable ChatGPT-style frontend that brings every major LLM provider under one roof.

Free· Free and open source; self-hosted (you pay model providers for API usage)multi-model chatself-hosted chatgpt

LiveBench

Evaluation · Multi-model

Contamination-free LLM benchmark that refreshes its questions monthly to keep frontier models honest.

Free· Free and open source; self-hosted evaluation runnerllm-benchmarkingmodel-selection

Llama

Fine-tuning · Llama 4 (Maverick, Scout), Llama 3.3/3.2/3.1

Meta's open-weight LLM family covering 1B mobile models up to 405B frontier and natively multimodal 10M-context Llama 4 variants.

Freemium· Weights free under Llama Community License; partner API inference ~$0.19-$0.49 per 1M tokensself-hosted-llmfine-tuning

Llama 3

Writing · Llama 3 / 3.1 (8B, 70B, 405B)

Meta's open-weights LLM family that put serious frontier-adjacent models in everyone's hands.

Free· Weights free under Meta Llama Community License; inference cost via self-hosting or 3rd-party providerschatlong-context reasoning

LocalAI

Writing · Multi-model (llama.cpp, diffusers, whisper, etc.)

Self-hosted OpenAI-compatible API for running LLMs, image, and audio models on your own hardware.

Free· Free and open source (MIT)local-llm-inferenceopenai-api-replacement

MCP Toolbox for Databases

Agents · Multi-model

Open-source MCP server that wires AI agents and IDEs straight into 50+ production databases with auth, pooling, and observability baked in.

Free· Free and open source (Apache 2.0); self-hostedagent-database-accessmcp-server

Manifest

Agents · Multi-model

Open-source LLM router that fans your agent traffic across providers and your existing AI subscriptions.

Freemium· Open-source self-host is free; managed cloud in early accessllm-routingcost-control