DeepSeek V4 Flash

DeepSeek · Active · Updated May 20, 2026

DeepSeek's fastest and most cost-efficient model, optimized for high-volume, low-latency applications with strong reasoning capabilities.

Input Price: $0.14 per million tokens
Output Price: $0.28 per million tokens
Context Window: 262,144 tokens
Max Output: 8,192 tokens

Technical Specifications

Provider: DeepSeek
Release Date: March 1, 2026
Pricing Type: per token
Input Price: $0.14 / 1M tokens
Output Price: $0.28 / 1M tokens
Cached Input: $0.03 / 1M tokens
Context Window: 262,144 tokens
Max Output: 8,192 tokens
Input Modalities: text, image
Output Modalities: text
Status: active
Availability: api, web_app
Latency: very fast
Rate Limit: 10,000 RPM
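
The 10,000 RPM rate limit listed above works out to one request every 6 ms on average. The sketch below shows one way a client might pace its own calls to stay under that ceiling. The `Pacer` class and `min_interval_seconds` helper are hypothetical names introduced for illustration, not part of any DeepSeek SDK, and the sketch assumes the limit is enforced as an even per-minute budget, which this page does not confirm.

```python
import time

# Rate limit copied from the specification list on this page.
RATE_LIMIT_RPM = 10_000


def min_interval_seconds(rpm: int = RATE_LIMIT_RPM) -> float:
    """Average spacing between requests that keeps a client at or below rpm."""
    if rpm <= 0:
        raise ValueError("rpm must be positive")
    return 60.0 / rpm


class Pacer:
    """Hypothetical client-side pacer: spaces calls evenly under an RPM limit.

    Assumption: even spacing is acceptable. A real service may allow bursts
    or count requests differently.
    """

    def __init__(self, rpm: int = RATE_LIMIT_RPM,
                 clock=time.monotonic, sleep=time.sleep):
        self._interval = min_interval_seconds(rpm)
        self._clock = clock
        self._sleep = sleep
        self._next_at = clock()  # earliest time the next request may start

    def wait(self) -> None:
        """Block until the next request is allowed, then reserve its slot."""
        now = self._clock()
        if now < self._next_at:
            self._sleep(self._next_at - now)
            now = self._next_at
        self._next_at = now + self._interval
```

Calling `Pacer().wait()` before each request is enough for a single-threaded client. A multi-threaded client would need a lock around `wait`.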

Capability Scores

Coding: 82/100
Reasoning: 78/100
Math: 80/100
Speed: 95/100

Overview

DeepSeek V4 Flash trades some peak capability for speed and cost. It scores 78/100 on reasoning and 95/100 on speed, and is priced at $0.14 per million input tokens and $0.28 per million output tokens. With a 262,144-token (256K) context window and support for both text and image input, it suits production workloads that need quality and throughput together.

Pros

  • Very fast inference (95/100 speed score), suited to real-time applications
  • Competitive reasoning (78/100) at $0.14 per million input tokens
  • 256K context window (262,144 tokens) exceeds many competitors
  • Image input support for multimodal use cases

Cons

  • Coding performance trails frontier models (82 vs 90+)
  • Lower raw intelligence ceiling than the V4 Pro variant
  • Text-only output, with no audio or image generation

Use Cases

  • Real-time chat and customer service at scale
  • High-throughput content generation and classification
  • Cost-sensitive production AI pipelines

Frequently Asked Questions about DeepSeek V4 Flash

How much does DeepSeek V4 Flash cost?
DeepSeek V4 Flash costs $0.14 per million input tokens and $0.28 per million output tokens. Cached input is $0.028 per million tokens.
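
To make the per-token pricing concrete, here is a minimal cost estimator using the three rates quoted in this answer. The function name `estimate_cost` is hypothetical, and the sketch assumes cached tokens are a subset of the input tokens and are billed at the cached rate instead of the standard input rate. Check the official pricing before relying on that.

```python
# Prices in USD per one million tokens, copied from the answer above.
INPUT_PER_M = 0.14
OUTPUT_PER_M = 0.28
CACHED_INPUT_PER_M = 0.028


def estimate_cost(input_tokens: int, output_tokens: int,
                  cached_input_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption: cached_input_tokens is the part of input_tokens served from
    cache, billed at the cached rate in place of the standard input rate.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached tokens cannot exceed total input tokens")
    uncached = input_tokens - cached_input_tokens
    total = (uncached * INPUT_PER_M
             + cached_input_tokens * CACHED_INPUT_PER_M
             + output_tokens * OUTPUT_PER_M)
    return total / 1_000_000
```

For example, one million uncached input tokens plus one million output tokens comes to $0.14 + $0.28 = $0.42 under these assumptions.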
What is the context window of DeepSeek V4 Flash?
DeepSeek V4 Flash has a 262,144 token context window, with a maximum output of 8,192 tokens.
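
The two limits can be combined into a simple prompt budget check. This sketch assumes that output tokens count against the same 262,144-token window as the prompt, which this page does not state. The helper names `max_prompt_tokens` and `fits` are hypothetical.

```python
# Limits copied from the answer above.
CONTEXT_WINDOW = 262_144
MAX_OUTPUT = 8_192


def max_prompt_tokens(reserved_output: int = MAX_OUTPUT) -> int:
    """Largest prompt that still leaves reserved_output tokens for the reply.

    Assumption: prompt and output share one context window.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT:
        raise ValueError("reserved_output must be between 0 and MAX_OUTPUT")
    return CONTEXT_WINDOW - reserved_output


def fits(prompt_tokens: int, reserved_output: int = MAX_OUTPUT) -> bool:
    """Return True if the prompt fits alongside the reserved output budget."""
    return 0 <= prompt_tokens <= max_prompt_tokens(reserved_output)
```

Reserving the full 8,192 output tokens leaves 253,952 tokens for the prompt under this assumption.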
Is DeepSeek V4 Flash good for coding?
DeepSeek V4 Flash scores 82/100 on coding benchmarks, which trails frontier models scoring 90 or higher.
What modalities does DeepSeek V4 Flash support?
DeepSeek V4 Flash accepts text and image input and produces text output.