GPT-5.4
TrendingOpenAI · Active · Updated May 20, 2026
OpenAI's latest frontier model with groundbreaking reasoning capabilities, extended context, and enhanced multimodal understanding.
Technical Specifications
| Provider | OpenAI |
| Release Date | March 15, 2026 |
| Pricing Type | per token |
| Input Price | $2.5.00 / 1M tokens |
| Output Price | $15.00 / 1M tokens |
| Cached Input | $0.25 / 1M tokens |
| Context Window | 262,144 tokens |
| Max Output | 32,768 tokens |
| Input Modalities | text, image, audio, video |
| Output Modalities | text, image, audio |
| Status | active |
| Availability | api, web_app, mobile_app |
| Latency | fast |
| Rate Limit | 8,000 RPM |
| Pricing URL | View official pricing → |
| Docs URL | — |
Capability Scores
Overview
GPT-5.4 marks a major generational leap for OpenAI, introducing a 256K context window, native video input, and significantly improved reasoning capabilities over GPT-4o. Built on a new architecture, it delivers a 50% improvement on complex coding tasks and maintains the multimodal versatility that made GPT-4o popular. GPT-5.4 is the most balanced frontier model available, combining top-tier performance with broad modality support.
Pros
- +True multimodal — text, image, audio, and video input
- +256K context window — double that of GPT-4o
- +Top-tier coding (92/100) and reasoning (90/100) scores
- +32768 max output tokens for long-form generation
Cons
- −Premium pricing at $12/$40 per million tokens
- −Slightly slower than lightweight models for simple tasks
- −Video input support still maturing in real-world applications
Compare with Alternatives
Claude 4 Sonnet wins for coding and reasoning; GPT-5.4 wins for speed, multimodal versatility, and value.
GPT-5.4 wins for ecosystem and multimodal output; Gemini 2.5 Pro wins for context window and pricing value.
GPT-5.4 wins on raw performance and multimodal capabilities; Llama 4 Maverick wins on openness, privacy, and customization freedom.