DeepSeek V3

State-of-the-art 671B Mixture of Experts model delivering GPT-4 class performance at a fraction of the cost. Excellent for general purpose AI tasks with 64K context length.

Deploy Model View API Docs

Model Specifications

Parameters671B (37B active)

ArchitectureMixture of Experts (MoE)

Context Length64K tokens

Experts256 total, 8 active

LanguagesEnglish, Chinese, Code

LicenseDeepSeek License

Why Choose DeepSeek V3

State-of-the-Art Performance

Matches GPT-4 class models on most benchmarks.

Cost Efficient

MoE architecture provides excellent cost-performance ratio.

Pricing

Serverless API

Pay per token with auto-scaling

₹20 /1M tokens input · ₹40 /1M tokens output

Auto-scaling
No minimum
99.9% uptime
Rate limits apply

Get Started

Recommended

Use Cases

General Chat

Versatile conversational AI for a wide range of topics.

Code Generation

Generate and debug code across multiple languages.

Content Writing

Ready to Deploy DeepSeek V3?

Get GPT-4 class performance at a fraction of the cost.

Deploy Now View All Models