Llama 4 Scout
Long-window open-weight model that handles large document sets at a low price point.
Open-weight models used by more developers than any other.
Meta's Llama 4 series are the most widely deployed open-weight AI models in the world. Llama 4 Maverick delivers frontier-class performance for free — self-hosted or via Groq, Together AI, and Fireworks.
Every Meta AI model in the directory, ranked by overall capability score.
Long-window open-weight model that handles large document sets at a low price point.
Per 1 million tokens. Updated when providers change prices.
| Model | Input / 1M | Output / 1M | Context | Speed |
|---|---|---|---|---|
| Llama 4 Scout Budget | $0.08/1M | $0.30/1M | 512K | Fast |
| Meta: Llama 3.2 11B Vision Instruct Budget | $0.24/1M | $0.24/1M | 131K | Fast |
| Meta: Llama 3.1 70B Instruct Budget | $0.40/1M | $0.40/1M | 131K | Fast |
| Llama 4 Maverick Budget | $0.15/1M | $0.60/1M | 256K | Fast |
| Meta: Llama 3 70B Instruct |
Head-to-head comparisons for the most-searched questions.
Newsletter
Pricing changes, new releases, and ranking shifts — straight to your inbox.
No spam. Useful updates only. Affiliate disclosures always clearly labeled.
Llama 4 Maverick is Meta's most capable model — it delivers frontier-class coding and writing performance for free. Llama 4 Scout is Meta's faster, smaller model optimised for high-throughput applications and edge deployments.
Yes — Llama 4 Scout and Llama 4 Maverick are open-weight models available under the Llama community license. You can run them locally via Ollama, or use them free via Groq, Together AI, Fireworks, and other hosted providers.
Llama 4 Maverick is competitive with GPT-5.4 on most tasks — at zero API cost. For the highest coding quality, Claude Opus 4.7 and Claude Sonnet 4.6 still lead. For a free open-weight model that runs anywhere, Llama 4 Maverick is the strongest option available.
Llama 3.2 11B Vision Instruct is Meta's open-weight multimodal model capable of understanding both text and images at an extremely low price point. It handles image captioning, visual question answering, and document analysis alongside standard text tasks.
Meta's Llama 3.1 70B Instruct is a open-weight large language model with 70 billion parameters, fine-tuned for instruction following across coding, reasoning, and general-purpose tasks. It offers a strong balance of capability and cost at $0.40/1M tokens for both input and output.
Flexible open-weight model for teams that want control, portability, and solid general-purpose performance.
Meta's Llama 3 70B Instruct is a 70-billion parameter open-weight language model fine-tuned for instruction following, representing Meta's most capable publicly available model at the time of release. It excels at general reasoning, coding assistance, and structured text tasks with strong multilingual support.
Llama 3.1 8B Instruct is Meta's smallest production-ready open-weight model, optimized for fast, low-cost inference on everyday language tasks. It delivers surprisingly capable instruction-following for its size, making it a go-to for high-volume, cost-sensitive deployments.
Llama 3 8B Instruct is Meta's compact open-weight instruction-following model, optimized for efficiency and accessibility at extremely low cost. It handles everyday text tasks like summarization, Q&A, and light coding at a fraction of the price of frontier models.
Llama 3.2 1B Instruct is Meta's smallest production language model, designed for lightweight text tasks with an extremely low cost footprint. It excels at simple instruction-following, text classification, and on-device or edge deployment scenarios.
Llama Guard 4 12B is Meta's specialized safety classification model designed to detect and filter harmful content in LLM inputs and outputs. It's purpose-built for content moderation pipelines, not general-purpose text generation.
Llama Guard 3 8B is a specialized safety classifier built on Meta's Llama 3 architecture, designed to detect and categorize harmful or policy-violating content in both user inputs and model outputs. It is purpose-built for content moderation pipelines, not general-purpose text generation.
| $0.51/1M |
| $0.74/1M |
| 8K |
| Balanced |
| Meta: Llama 3.1 8B Instruct Budget | $0.02/1M | $0.05/1M | 16K | Very fast |
| Meta: Llama 3 8B Instruct Budget | $0.03/1M | $0.04/1M | 8K | Very fast |
| Meta: Llama 3.2 1B Instruct Budget | $0.03/1M | $0.20/1M | 60K | Very fast |
| Meta: Llama Guard 4 12B Budget | $0.18/1M | $0.18/1M | 164K | Very fast |
| Llama Guard 3 8B Budget | $0.48/1M | $0.03/1M | 131K | Very fast |