📖 The AI Tool Bible

Best AI tools for hosted fine tuning

24 tools in the Fine-tuning category, filtered to hosted fine tuning.

All Fine-tuning

Together AI

Featured
Fine-tuning · Llama / Mistral / Qwen / DeepSeek and others
8.6

Fine-tune & serve open-weight models (Llama, Mistral, DeepSeek).

Paid· Pay-per-token; fine-tuning per-tokenopen modelsfine-tuning

Modal

Fine-tuning · Infrastructure (any model you can host)
8.7

Serverless GPUs and infra for training & serving ML.

Freemium· $30/mo free credits; pay-as-you-go GPU ratesserverless GPUfine-tuning

Replicate

Fine-tuning · Thousands of community + first-party models
8.5

One-API platform for running and fine-tuning open-source models.

Paid· Pay-per-second of GPU timemodel hostingfine-tuning

Lamini

Fine-tuning · Lamini (built on open base models)
7.7

Memory-tuning platform for grounding LLMs in your facts.

Paid· Enterprise / contact salesenterprise FTfactual recall

CoreWeave

Fine-tuning

AI-native GPU cloud built for large-scale training, fine-tuning, and inference on NVIDIA hardware.

Enterprise· Contact sales; Capacity Plans with reserved GPU commitmentsmodel-trainingfine-tuning

FedML

Fine-tuning · Bring-your-own (PyTorch, Hugging Face)

Distributed training, fine-tuning, and serving platform with federated learning roots.

Freemium· Open-source library free; managed GPU usage pay-as-you-gofine-tuningdistributed-training

Fireworks AI

Fine-tuning · Multi-model (DeepSeek, Qwen, GLM, Kimi, Gemma, Minimax, others)

Production inference and fine-tuning platform for open-source LLMs, tuned for speed and enterprise economics.

Freemium· Free signup credits; pay-per-token from ~$0.14/M in; enterprise reserved capacity on requestllm-fine-tuningserverless-inference

Forefront

Fine-tuning · Multi-model (Mistral-7B, Mixtral, Phi-2)

Fine-tune and serve open-source LLMs on your own data without managing GPUs.

Paid· Usage-based per token (e.g. Phi-2 $0.0006/1k, Mixtral $0.004/1k)fine-tuningopen-source-llms

H2O AutoML

Fine-tuning · H2O-3 (GBM, XGBoost, GLM, DRF, Deep Learning, Stacked Ensembles)

Open-source automated machine learning that handles feature engineering, model selection, and stacked ensembling out of the box.

Free· Free and open-source (Apache 2.0); paid Driverless AI sold separatelyautomltabular-ml

Hugging Face AutoTrain

Fine-tuning · Multi-model (Hugging Face Hub)

No-code fine-tuning and training pipeline that spins up state-of-the-art models on the Hugging Face Hub.

Paid· Per-minute billing based on hardware tier; self-hosted OSS version is freellm-fine-tuningtext-classification

LLaMA Factory

Fine-tuning · Multi-model (LLaMA, Mistral, Qwen, Gemma, Phi, LLaVA, ChatGLM, Yi)

Open-source, no-code WebUI for fine-tuning 100+ open LLMs with LoRA, QLoRA, DPO, and PPO.

Free· Free, open-source (Apache-2.0); self-hostedlora-fine-tuningqlora

Lambda

Fine-tuning

On-demand NVIDIA GPU cloud built specifically for training, fine-tuning, and serving large AI models.

Paid· Pay-by-the-minute: A100 from $1.29/hr, H100 SXM from $3.99/hr, B200 from $6.69/hr; clusters priced by quotellm-trainingfine-tuning

Llama

Fine-tuning · Llama 4 (Maverick, Scout), Llama 3.3/3.2/3.1

Meta's open-weight LLM family covering 1B mobile models up to 405B frontier and natively multimodal 10M-context Llama 4 variants.

Freemium· Weights free under Llama Community License; partner API inference ~$0.19-$0.49 per 1M tokensself-hosted-llmfine-tuning

Ludwig

Fine-tuning · Multi-model (PyTorch + HuggingFace Transformers)

Declarative, YAML-driven deep learning framework for fine-tuning LLMs and multi-modal models without writing training loops.

Free· Free, Apache 2.0 open sourcellm-fine-tuningmulti-modal-training

OpenPipe

Fine-tuning · Llama, Mistral, Qwen and other open-weight base models

Fine-tuning and reinforcement learning platform for turning expensive prompts into cheap, fast, task-specific models.

Freemium· Free tier available; usage-based pricing for training and hosted inference; enterprise plans on requestllm-cost-reductionfine-tuning

Optuna

Fine-tuning

Open-source Python framework for automated hyperparameter optimization across any ML stack.

Free· Free and open source (MIT)hyperparameter-tuningml-experiment-tracking

Paperspace Gradient

Fine-tuning · Bring-your-own (PyTorch, TensorFlow, Hugging Face)

End-to-end MLOps platform with GPU notebooks, training jobs, and model deployment, now folded into DigitalOcean.

Freemium· Free notebook tier; paid Pro/Growth plans + per-second GPU billingmodel-trainingfine-tuning

Ray Tune

Fine-tuning

Open-source Python library for distributed hyperparameter tuning at any scale.

Free· Open-source (Apache 2.0); managed via Anyscale offers a $100 starting credithyperparameter-tuningdistributed-training

RunPod

Fine-tuning · Bring-your-own (any open-weight or custom model)

On-demand GPU cloud and serverless inference platform built specifically for AI workloads.

Paid· Pay-per-second GPU rental; H100 from ~$1.89/hr, consumer GPUs from ~$0.20/hrllm-fine-tuninggpu-rental

Together AI Fine-tuning

Fine-tuning · Multi-model (any Hugging Face open-source model)

Managed fine-tuning platform for open-source LLMs and vision models with LoRA, full fine-tuning, and RL support.

Paid· Usage-based; cost estimator in-product, no public price listllm-fine-tuningvision-fine-tuning

Unsloth

Fine-tuning · Llama, Mistral, Gemma, Qwen, GLM (multi-model)

Open-source LLM fine-tuning toolkit with custom kernels that train 2-30x faster and use up to 90% less VRAM.

Freemium· Free open-source; Pro and Enterprise contact saleslora-finetuningqlora

Velda

Fine-tuning

Serverless GPU orchestration that runs AI training and batch jobs without Docker or Kubernetes.

Freemium· Free monthly credits on Velda Cloud; Enterprise contact salesdistributed-trainingbatch-inference

W&B Sweeps

Fine-tuning

Hyperparameter optimization from Weights & Biases with Bayesian search and Hyperband early stopping.

Freemium· Free for personal use; team and enterprise tiers via W&Bhyperparameter-tuningbayesian-optimization

vLLM

Fine-tuning · Multi-model (open-weight LLMs: Llama, Qwen, DeepSeek, Mistral, Gemma, Phi, etc.)

Open-source high-throughput inference engine for serving LLMs with PagedAttention and continuous batching.

Free· Free and open-source (Apache 2.0); self-hosted infrastructure costs applyllm-servingself-hosted-inference