← All Models
DeepSeek V4 Flash
284B MoE (13B active) Min 40GB VRAM Full 80GB VRAM codingresearchreasoningagentic
DeepSeek's 2026 preview frontier model. In Ollama it is cloud-only as of May 2026, with a 284B MoE architecture, 13B active parameters per token, and a 1M-token context window.
Run with Ollama
ollama run deepseek-v4-flash:cloud