deepseek-v4DeepSeek V4 Pro
Open-weight flagship — 80.6% SWE-Bench at 1/20th the cost of Claude Opus 4.7
DeepSeek V4 Pro is the leading open-weight frontier model in 2026 — released April 24, 2026 as a 1.6T-parameter Mixture-of-Experts (49B active) with a 1M-token context window, MIT-licensed. It hits 80.6% on SWE-Bench Verified (within 0.2 points of Claude Opus 4.6) and 93.5 on LiveCodeBench (the highest of any model), at $1.74/$3.48 per MTok — roughly 1/20th of Opus 4.7's output price.
Pricing
| Rate | List price | Anvat effective | Savings |
|---|---|---|---|
| Input | $1.74 | $1.22 | 30% |
| Output | $3.48 | $2.44 | 30% |
| All prices per million tokens (MTok). List = provider direct. Anvat effective = 30% discount applied. | |||
Pricing verified 2026-06 · See full Anvat pricing
Strengths
- 80.6% SWE-Bench Verified — within 0.2 points of Claude Opus 4.6
- 93.5 LiveCodeBench Pass@1 — highest of any model
- 1M-token context window standard, no separate pricing tier
- MIT-licensed weights — self-host on vLLM/SGLang/TGI
- MoE architecture (1.6T total / 49B active) keeps inference cost low
- FP4 + FP8 mixed precision halves memory footprint
Where it underperforms
- Trails Opus 4.7 + GPT-5.5 by ~2-3 points on hardest reasoning (HLE, HMMT)
- Trails GPT-5.4 by 7 points on Terminal-Bench 2.0
- Text-only — no native multimodal (no vision, no audio)
- Interleaved-thinking pattern requires agent framework updates
- Tool ecosystem (LangChain etc.) still primarily Anthropic/OpenAI-shaped
Use cases this model is the right pick for
- High-volume coding tasks where cost per request is the binding constraint
- Open-weight requirements (regulatory, audit, on-prem option)
- Long-context analysis at 500K-1M tokens (full codebase, large legal docs)
- Pairing with closed frontiers in router patterns (V4-first, escalate on hard tasks)
- Bulk generation (content drafting, summarisation, translation)
Benchmarks
coding
SWE-Bench Verified
80.6% (within 0.2pt of Opus 4.6)
coding
LiveCodeBench Pass@1
93.5 (highest of any model)
coding
Codeforces Rating
3206 (ahead of GPT-5.4 xHigh)
agentic
MCPAtlas Public
73.6 (#2 after Opus 4.6)
reasoning
MRCR 1M long-context recall
83.5
agentic
Toolathlon
51.8 (ahead of Gemini 3.1 Pro)
Benchmark numbers self-reported by provider; verify against the latest publisher documentation before quoting.
Quickstart
Same wire format as direct provider APIs — your existing SDK code keeps working. Point at api.anvat.app/v1 and use your Anvat key.
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.anvat.app/v1",
apiKey: process.env.ANVAT_API_KEY,
});
const response = await client.chat.completions.create({
model: "deepseek-v4",
max_tokens: 4096,
messages: [
{ role: "user", content: "Implement a Trie data structure in TypeScript with full test coverage." },
],
});Try DeepSeek V4 Pro — 30% off list
Same model, same quality, same wire format — at the discounted Anvat effective rate. $2 free credit on signup, no card required.
Get a key →Related
All models
Full catalog of frontier models on Anvat
Claude API pricing in 2026
Full breakdown + cost-cutting strategies
Cheap Claude API in 2026
Four legitimate ways to cut your bill
Cost calculator
Estimate monthly spend on this model
Use cases
Claude Code, Cursor, coding agents, RAG setup guides
Anvat pricing
Full price list across all models