• My Feed
  • Home
  • What's Important
  • Media & Entertainment
Search

Stay Curious. Stay Wanture.

© 2026 Wanture. All rights reserved.

  • Terms of Use
  • Privacy Policy
Tech/Software
DeepSeek launches V4‑Pro, V4‑Flash 1 million‑token window

24 April 2026

—

News

Jasmine Wu

DeepSeek released V4‑Pro and V4‑Flash in April 2026, opening a 1 million‑token context window and publishing full model weights. The Chinese AI lab positions itself as a cost‑efficient alternative to closed commercial systems, claiming frontier‑level performance at significantly lower prices.

A million tokens translates to roughly 750,000 words of usable memory. That headroom is enough to ingest several codebases, research papers, or transcripts in a single session. For developers building long‑form coding assistants or multi‑turn dialogue systems, the capacity eliminates truncation errors that plague shorter‑context models. The capacity alone doesn't guarantee reasoning quality, but it does remove a structural bottleneck.

V4‑Pro deploys 1.6 trillion parameters, activating 49 billion per forward pass. DeepSeek says this mixture‑of‑experts design delivers results comparable to leading closed models (GPT‑4 and Claude 3.5 Sonnet) across code generation, mathematical reasoning, and nuanced instruction‑following. V4‑Flash, with 284 billion total parameters and 13 billion active, aims to match the reasoning speed of those larger systems while consuming fewer compute resources.

The technical report and model weights are published on Hugging Face, and the full methodology is detailed in the accompanying whitepaper. Independent benchmarks will clarify whether the self‑reported metrics hold in production.

API access runs $1.74 per million input tokens and $3.48 per million output tokens for V4‑Pro. V4‑Flash costs $0.14 input and $0.28 output per million tokens, making it viable for high‑volume applications where cost per call determines feasibility. For context, those Flash prices undercut most commercial offerings by an order of magnitude, though latency and quality trade‑offs remain to be measured under load.

The DeepSeek web app labels V4‑Pro as "Expert" and V4‑Flash as "Instant," letting users experiment without writing code. For production environments, the public API supports direct integration, and the open weights permit local deployment or fine‑tuning on proprietary datasets. DeepSeek also announced plans to connect the models with tools such as Claude Code, OpenClaw, and OpenCode, though no firm release timelines were provided.

Releasing the full architecture and trained parameters allows developers to adapt the models for specialized domains. Legal reasoning, medical diagnostics, scientific literature synthesis, and other areas become accessible without waiting for vendor roadmaps. It also invites scrutiny: researchers can audit the training process, test for bias, and verify the reported capabilities. That transparency is both a competitive signal and a philosophical stance, treating intelligence as infrastructure rather than proprietary advantage.

The immediate question is whether DeepSeek's self‑reported performance survives real‑world stress tests. Developers evaluating the models should run task‑specific benchmarks, measure latency under expected query volumes, and compare total cost of ownership against alternatives. The million‑token window is headline‑grabbing, but the more revealing metric will be how gracefully the model reasons across that entire span, and whether the open‑weight approach accelerates or fragments the current AI landscape.

Mobile Bottom Test Banner

What is this about?

  • News/
  • Jasmine Wu/
  • Tech/
  • Software/
  • DeepSeek/
  • LLM/
  • V4/
  • Open-Weight Models/
  • Million-Token Context Window

Feed

    Xbox Drops Microsoft Gaming Label, 2022 Rebrand Reversed

    Microsoft’s Xbox division announced in 2026 that it will retire the “Microsoft Gaming” brand introduced in 2022 and revert to the classic Xbox name. The shift signals a strategic focus on daily active users across PC, mobile, and emerging markets, backed by the upcoming Project Helix and an expanded push into China.

    Xbox Drops Microsoft Gaming Label, 2022 Rebrand Reversed
    Jordan McAllisterabout 3 hours ago

    Honor's six‑fan Win H9 cuts temps by 12.5%

    Honor unveiled the Win H9 gaming laptop in China on April 22, noting a six‑fan system that cuts temperatures 12.5% versus the older two‑fan design. The flagship ships with an RTX 5070 Ti, Intel Core Ultra 9 290HX, 32 GB RAM, 1 TB SSD, and a 300‑Hz 16:10 DCI‑P3 display. Prices begin at 18,000 yuan for the top model and 11,500 yuan for the base, with no confirmed launch outside China.

    Honor's six‑fan Win H9 cuts temps by 12.5%
    Carter Brooksabout 3 hours ago

    Open AI Models Prove Value in Samsung‑Noôdome Pilot

    At Samsung’s Innovation Campus, mathematician Arutyun Avetisyan and AI researcher Svetlana Yun presented Noôdome‑backed data showing lightweight, open‑source AI models can trim development sprints by three days and cut defect rates by about 15%. They warned that oversight and talent development are essential to avoid security and bias risks while scaling AI.

    Adrian Vegaabout 3 hours ago

    UAE aims to automate half of its services by 2028

    The UAE government has launched a two‑year directive that will see autonomous AI agents handle requests, recommendations, and real‑time decisions across more than 30 ministries. Oversight rests with Vice President Sheikh Mohammed bin Rashid Al Maktoum, and a task force led by Minister Mohammad Al Gergawi, with pilots slated for health and transportation, and rollout by late 2027.

    Rachel Steinabout 20 hours ago
    Apple’s April 9 Telega App Removal Sparks Antitrust Probe

    Apple’s April 9 Telega App Removal Sparks Antitrust Probe

    Russia’s antitrust watchdog warns Apple’s removal may breach competition law

    Rachel Stein1 day ago
    Game Pass Ultimate May Get Full‑Time Discord Nitro

    Game Pass Ultimate May Get Full‑Time Discord Nitro

    Xbox and Discord hint at extended Nitro perks for subscribers

    Carter Brooks1 day ago
    OpenAI launches always‑on AI agents for Workspace

    OpenAI launches always‑on AI agents for Workspace

    Agents replace Custom GPTs, running continuously to automate business workflows

    Jasmine Wu1 day ago
    Meta Starts Recording U.S. Staff Screens to Train AI

    Meta Starts Recording U.S. Staff Screens to Train AI

    Meta logs employee screenshots as it plans a 10% staff cut

    Rachel Stein1 day ago
    Razer launches Atlas Pro glass pad at 1.9 mm thickness

    Razer launches Atlas Pro glass pad at 1.9 mm thickness

    It claims the title of the world’s thinnest glass mouse pad

    Carter Brooks1 day ago
    Loading...
Tech/Software

DeepSeek launches V4‑Pro, V4‑Flash 1 million‑token window

24 April 2026

—

News

Jasmine Wu

DeepSeek released V4‑Pro and V4‑Flash in April 2026, opening a 1 million‑token context window and publishing full model weights. The Chinese AI lab positions itself as a cost‑efficient alternative to closed commercial systems, claiming frontier‑level performance at significantly lower prices.

A million tokens translates to roughly 750,000 words of usable memory. That headroom is enough to ingest several codebases, research papers, or transcripts in a single session. For developers building long‑form coding assistants or multi‑turn dialogue systems, the capacity eliminates truncation errors that plague shorter‑context models. The capacity alone doesn't guarantee reasoning quality, but it does remove a structural bottleneck.

V4‑Pro deploys 1.6 trillion parameters, activating 49 billion per forward pass. DeepSeek says this mixture‑of‑experts design delivers results comparable to leading closed models (GPT‑4 and Claude 3.5 Sonnet) across code generation, mathematical reasoning, and nuanced instruction‑following. V4‑Flash, with 284 billion total parameters and 13 billion active, aims to match the reasoning speed of those larger systems while consuming fewer compute resources.

The technical report and model weights are published on Hugging Face, and the full methodology is detailed in the accompanying whitepaper. Independent benchmarks will clarify whether the self‑reported metrics hold in production.

API access runs $1.74 per million input tokens and $3.48 per million output tokens for V4‑Pro. V4‑Flash costs $0.14 input and $0.28 output per million tokens, making it viable for high‑volume applications where cost per call determines feasibility. For context, those Flash prices undercut most commercial offerings by an order of magnitude, though latency and quality trade‑offs remain to be measured under load.

The DeepSeek web app labels V4‑Pro as "Expert" and V4‑Flash as "Instant," letting users experiment without writing code. For production environments, the public API supports direct integration, and the open weights permit local deployment or fine‑tuning on proprietary datasets. DeepSeek also announced plans to connect the models with tools such as Claude Code, OpenClaw, and OpenCode, though no firm release timelines were provided.

Releasing the full architecture and trained parameters allows developers to adapt the models for specialized domains. Legal reasoning, medical diagnostics, scientific literature synthesis, and other areas become accessible without waiting for vendor roadmaps. It also invites scrutiny: researchers can audit the training process, test for bias, and verify the reported capabilities. That transparency is both a competitive signal and a philosophical stance, treating intelligence as infrastructure rather than proprietary advantage.

The immediate question is whether DeepSeek's self‑reported performance survives real‑world stress tests. Developers evaluating the models should run task‑specific benchmarks, measure latency under expected query volumes, and compare total cost of ownership against alternatives. The million‑token window is headline‑grabbing, but the more revealing metric will be how gracefully the model reasons across that entire span, and whether the open‑weight approach accelerates or fragments the current AI landscape.

What is this about?

  • News/
  • Jasmine Wu/
  • Tech/
  • Software/
  • DeepSeek/
  • LLM/
  • V4/
  • Open-Weight Models/
  • Million-Token Context Window

Feed

    Xbox Drops Microsoft Gaming Label, 2022 Rebrand Reversed

    Microsoft’s Xbox division announced in 2026 that it will retire the “Microsoft Gaming” brand introduced in 2022 and revert to the classic Xbox name. The shift signals a strategic focus on daily active users across PC, mobile, and emerging markets, backed by the upcoming Project Helix and an expanded push into China.

    Xbox Drops Microsoft Gaming Label, 2022 Rebrand Reversed
    Jordan McAllisterabout 3 hours ago

    Honor's six‑fan Win H9 cuts temps by 12.5%

    Honor unveiled the Win H9 gaming laptop in China on April 22, noting a six‑fan system that cuts temperatures 12.5% versus the older two‑fan design. The flagship ships with an RTX 5070 Ti, Intel Core Ultra 9 290HX, 32 GB RAM, 1 TB SSD, and a 300‑Hz 16:10 DCI‑P3 display. Prices begin at 18,000 yuan for the top model and 11,500 yuan for the base, with no confirmed launch outside China.

    Honor's six‑fan Win H9 cuts temps by 12.5%
    Carter Brooksabout 3 hours ago

    Open AI Models Prove Value in Samsung‑Noôdome Pilot

    At Samsung’s Innovation Campus, mathematician Arutyun Avetisyan and AI researcher Svetlana Yun presented Noôdome‑backed data showing lightweight, open‑source AI models can trim development sprints by three days and cut defect rates by about 15%. They warned that oversight and talent development are essential to avoid security and bias risks while scaling AI.

    Adrian Vegaabout 3 hours ago

    UAE aims to automate half of its services by 2028

    The UAE government has launched a two‑year directive that will see autonomous AI agents handle requests, recommendations, and real‑time decisions across more than 30 ministries. Oversight rests with Vice President Sheikh Mohammed bin Rashid Al Maktoum, and a task force led by Minister Mohammad Al Gergawi, with pilots slated for health and transportation, and rollout by late 2027.

    Rachel Steinabout 20 hours ago
    Apple’s April 9 Telega App Removal Sparks Antitrust Probe

    Apple’s April 9 Telega App Removal Sparks Antitrust Probe

    Russia’s antitrust watchdog warns Apple’s removal may breach competition law

    Rachel Stein1 day ago
    Game Pass Ultimate May Get Full‑Time Discord Nitro

    Game Pass Ultimate May Get Full‑Time Discord Nitro

    Xbox and Discord hint at extended Nitro perks for subscribers

    Carter Brooks1 day ago
    OpenAI launches always‑on AI agents for Workspace

    OpenAI launches always‑on AI agents for Workspace

    Agents replace Custom GPTs, running continuously to automate business workflows

    Jasmine Wu1 day ago
    Meta Starts Recording U.S. Staff Screens to Train AI

    Meta Starts Recording U.S. Staff Screens to Train AI

    Meta logs employee screenshots as it plans a 10% staff cut

    Rachel Stein1 day ago
    Razer launches Atlas Pro glass pad at 1.9 mm thickness

    Razer launches Atlas Pro glass pad at 1.9 mm thickness

    It claims the title of the world’s thinnest glass mouse pad

    Carter Brooks1 day ago
    Loading...