📖 The AI Tool Bible

Best AI tools for GPU infrastructure

9 tools in the Fine-tuning category, filtered to GPU infrastructure.

Together AI

Fine-tuning · Llama / Mistral / Qwen / DeepSeek and others

Fine-tune & serve open-weight models (Llama, Mistral, DeepSeek).

Paid · Pay-per-token; fine-tuning per-token

Tags: open models, fine-tuning

Modal

Fine-tuning · Infrastructure (any model you can host)

Serverless GPUs and infra for training & serving ML.

Freemium · $30/mo free credits; pay-as-you-go GPU rates

Tags: serverless GPU, fine-tuning

CoreWeave

AI-native GPU cloud built for large-scale training, fine-tuning, and inference on NVIDIA hardware.

Enterprise · Contact sales; Capacity Plans with reserved GPU commitments

Tags: model-training, fine-tuning

FedML

Fine-tuning · Bring-your-own (PyTorch, Hugging Face)

Distributed training, fine-tuning, and serving platform with federated learning roots.

Freemium · Open-source library free; managed GPU usage pay-as-you-go

Tags: fine-tuning, distributed-training

Lambda

On-demand NVIDIA GPU cloud built specifically for training, fine-tuning, and serving large AI models.

Paid · Pay-by-the-minute: A100 from $1.29/hr, H100 SXM from $3.99/hr, B200 from $6.69/hr; clusters priced by quote

Tags: llm-training, fine-tuning

OpenPipe

Fine-tuning · Llama, Mistral, Qwen and other open-weight base models

Fine-tuning and reinforcement learning platform for turning expensive prompts into cheap, fast, task-specific models.

Freemium · Free tier available; usage-based pricing for training and hosted inference; enterprise plans on request

Tags: llm-cost-reduction, fine-tuning

Paperspace Gradient

Fine-tuning · Bring-your-own (PyTorch, TensorFlow, Hugging Face)

End-to-end MLOps platform with GPU notebooks, training jobs, and model deployment, now folded into DigitalOcean.

Freemium · Free notebook tier; paid Pro/Growth plans + per-second GPU billing

Tags: model-training, fine-tuning

RunPod

Fine-tuning · Bring-your-own (any open-weight or custom model)

On-demand GPU cloud and serverless inference platform built specifically for AI workloads.

Paid · Pay-per-second GPU rental; H100 from ~$1.89/hr, consumer GPUs from ~$0.20/hr

Tags: llm-fine-tuning, gpu-rental

vLLM

Fine-tuning · Multi-model (open-weight LLMs: Llama, Qwen, DeepSeek, Mistral, Gemma, Phi, etc.)

Open-source high-throughput inference engine for serving LLMs with PagedAttention and continuous batching.

Free · Free and open-source (Apache 2.0); self-hosted infrastructure costs apply

Tags: llm-serving, self-hosted-inference