DeepSeek1M contextdeepseek-v4

DeepSeek V4 Pro

Open-weight flagship — 80.6% SWE-Bench at 1/20th the cost of Claude Opus 4.7

DeepSeek V4 Pro is the leading open-weight frontier model in 2026 — released April 24, 2026 as a 1.6T-parameter Mixture-of-Experts (49B active) with a 1M-token context window, MIT-licensed. It hits 80.6% on SWE-Bench Verified (within 0.2 points of Claude Opus 4.6) and 93.5 on LiveCodeBench (the highest of any model), at $1.74/$3.48 per MTok — roughly 1/20th of Opus 4.7's output price.

Pricing

Rate	List price	Anvat effective	Savings
Input	$1.74	$1.22	30%
Output	$3.48	$2.44	30%
All prices per million tokens (MTok). List = provider direct. Anvat effective = 30% discount applied.

Pricing verified 2026-06 · See full Anvat pricing

Strengths

80.6% SWE-Bench Verified — within 0.2 points of Claude Opus 4.6
93.5 LiveCodeBench Pass@1 — highest of any model
1M-token context window standard, no separate pricing tier
MIT-licensed weights — self-host on vLLM/SGLang/TGI
MoE architecture (1.6T total / 49B active) keeps inference cost low
FP4 + FP8 mixed precision halves memory footprint

Where it underperforms

Trails Opus 4.7 + GPT-5.5 by ~2-3 points on hardest reasoning (HLE, HMMT)
Trails GPT-5.4 by 7 points on Terminal-Bench 2.0
Text-only — no native multimodal (no vision, no audio)
Interleaved-thinking pattern requires agent framework updates
Tool ecosystem (LangChain etc.) still primarily Anthropic/OpenAI-shaped

Use cases this model is the right pick for

High-volume coding tasks where cost per request is the binding constraint
Open-weight requirements (regulatory, audit, on-prem option)
Long-context analysis at 500K-1M tokens (full codebase, large legal docs)
Pairing with closed frontiers in router patterns (V4-first, escalate on hard tasks)
Bulk generation (content drafting, summarisation, translation)

Benchmarks

coding
SWE-Bench Verified
80.6% (within 0.2pt of Opus 4.6)
coding
LiveCodeBench Pass@1
93.5 (highest of any model)
coding
Codeforces Rating
3206 (ahead of GPT-5.4 xHigh)
agentic
MCPAtlas Public
73.6 (#2 after Opus 4.6)
reasoning
MRCR 1M long-context recall
83.5
agentic
Toolathlon
51.8 (ahead of Gemini 3.1 Pro)

Benchmark numbers self-reported by provider; verify against the latest publisher documentation before quoting.

Quickstart

Same wire format as direct provider APIs — your existing SDK code keeps working. Point at api.anvat.app/v1 and use your Anvat key.

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.anvat.app/v1",
  apiKey: process.env.ANVAT_API_KEY,
});

const response = await client.chat.completions.create({
  model: "deepseek-v4",
  max_tokens: 4096,
  messages: [
    { role: "user", content: "Implement a Trie data structure in TypeScript with full test coverage." },
  ],
});

Try DeepSeek V4 Pro — 30% off list

Same model, same quality, same wire format — at the discounted Anvat effective rate. $2 free credit on signup, no card required.

Get a key →