Llama 4 Scout

Name: Llama 4 Scout
Brand: Meta
Price: 0.3 USD

Meta · Active · Updated May 18, 2026

Meta's efficient open-weight model optimized for fast inference, easy deployment, and high-throughput production workloads.

Pricing Compare

Input Price

$0.30/M

per million tokens

Output Price

$1.00/M

per million tokens

Context Window

262,144

tokens

Max Output

16,384

tokens

Technical Specifications

Provider	Meta
Release Date	April 1, 2026
Pricing Type	per token
Input Price	$0.3.00 / 1M tokens
Output Price	$1.00 / 1M tokens
Cached Input	—
Context Window	262,144 tokens
Max Output	16,384 tokens
Input Modalities	text, image
Output Modalities	text
Status	active
Availability	api, enterprise
Latency	very fast
Rate Limit	5,000 RPM
Pricing URL	View official pricing →
Docs URL	View documentation →

Capability Scores

Coding

Reasoning

Math

Image

Speed

Overview

Llama 4 Scout is Meta's efficiency-optimized open-weight model, designed for high-throughput production environments where speed and cost efficiency are paramount. Despite its smaller size, Scout offers a 256K context window and strong enough reasoning capabilities for most real-world applications. Like all Llama models, it is fully open for self-hosting and customization.

Pros

+Fully open-weight with permissive licensing
+Fast inference (speed: 92/100) for production workloads
+256K context window at a budget-friendly price
+Easy to deploy on consumer-grade hardware

Cons

−Lower benchmark scores than Maverick variant
−Not suitable for complex reasoning or coding tasks
−No audio or image output capabilities

Use Cases

High-volume content generation and classification

Self-hosted chat applications and customer service

Fine-tuned domain-specific deployments

Frequently Asked Questions about Llama 4 Scout

How much does Llama 4 Scout cost?

Llama 4 Scout costs $0.3 per million input tokens and $1 per million output tokens.

What is the context window of Llama 4 Scout?

Llama 4 Scout has a 262,144 token context window, with a maximum output of 16,384 tokens.

Is Llama 4 Scout good for coding?

Llama 4 Scout scores 74/100 on coding benchmarks.

What modalities does Llama 4 Scout support?

Llama 4 Scout supports text, image input and text output.

← Back to all models