DeepSeek unveils V3.1 model with hybrid reasoning and lower prices

Cryptopolitan, August 21, 2025, 17:01

Chinese startup DeepSeek has released an updated model that it claims outperforms its widely recognized R1 across core benchmarks. In a WeChat post on Thursday, the AI company said the new version, V3.1, responds to queries faster and signals its entry into AI agent development.

DeepSeek added that the model uses a hybrid reasoning architecture with both thinking and non-thinking modes, and that it offers improved agent capabilities and stronger performance in tool use and task execution.

DeepSeek provides a “Deep Thinking” button to switch between modes

So far, DeepSeek’s official app and website have already been updated to V3.1, allowing users to toggle between thinking and non-thinking modes via the “Deep Thinking” button, similar to how Anthropic’s hybrid models like Opus and Sonnet work.

Reportedly, V3.1 also outperforms R1 on benchmarks such as SWE and Terminal-Bench, and it is more efficient in its thinking. According to Artificial Analysis, the model scored 60 points on its intelligence index in reasoning mode, just above R1's 59. Still, the underlying architecture remains the same, with 671 billion total parameters, of which 37 billion are active.

In line with that higher efficiency, it uses slightly fewer tokens than R1 in reasoning mode. The new model, however, trails slightly behind Alibaba’s latest model and OpenAI’s open-source reasoning model, GPT-OSS, in performance. It also lacks function calling in reasoning mode, which is considered a major constraint in agentic workflows.

The startup had first announced the new model on Tuesday, though it was only available on Hugging Face at the time. A separate statement added that the version had been tailored to run on next-generation Chinese-made AI chips. 

The company has now unveiled a new pricing plan for its upgraded V3. Effective Sept. 6, the plan raises some charges, eliminates evening discounts, and reduces costs in certain applications.

DeepSeek set its API pricing for input tokens at $0.07 per million for cache hits and $0.56 per million for cache misses, with output tokens at $1.68 per million. The rates sharply undercut competitors: Gemini 2.5 Pro costs $10 per million output tokens ($15 for longer prompts), OpenAI’s GPT-5 also costs $10, and Anthropic’s Claude Opus 4.1 goes as high as $75.
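To make these rates concrete, the following Python sketch estimates the bill for a workload and compares output prices. It is only an illustration under stated assumptions: the per-million rates are the ones quoted above, while the names `TokenPricing`, `estimate_cost`, and `output_price_ratio` and the sample token counts are hypothetical and not part of any DeepSeek tooling.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class TokenPricing:
    """USD per one million tokens; defaults are the rates quoted in the article."""
    input_cache_hit: float = 0.07
    input_cache_miss: float = 0.56
    output: float = 1.68


def estimate_cost(
    cache_hit_tokens: int,
    cache_miss_tokens: int,
    output_tokens: int,
    pricing: TokenPricing = TokenPricing(),
) -> float:
    """Return the estimated cost in USD for the given token counts."""
    total = (
        cache_hit_tokens * pricing.input_cache_hit
        + cache_miss_tokens * pricing.input_cache_miss
        + output_tokens * pricing.output
    )
    return total / TOKENS_PER_MILLION


def output_price_ratio(
    competitor_output_price: float,
    pricing: TokenPricing = TokenPricing(),
) -> float:
    """How many times more a competitor charges per million output tokens."""
    return competitor_output_price / pricing.output


if __name__ == "__main__":
    # Hypothetical workload: 2M cached input, 0.5M uncached input, 0.3M output tokens.
    cost = estimate_cost(2_000_000, 500_000, 300_000)
    print(f"Estimated cost: ${cost:.3f}")

    # Competitor output prices (USD per million tokens) as quoted in the article.
    competitors = {"Gemini 2.5 Pro": 10.0, "GPT-5": 10.0, "Claude Opus 4.1": 75.0}
    for name, price in competitors.items():
        print(f"{name}: {output_price_ratio(price):.1f}x the quoted output rate")
```

The comparison covers output tokens only, since those are the only competitor figures the article gives; a full comparison would also need each provider's input rates.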

Analysts expected DeepSeek to release R1’s successor earlier this year

DeepSeek first rattled Silicon Valley with its low-cost and powerful R1 AI model launch in January. The model has since stayed at the forefront of China’s accelerating AI push, challenging US firms such as OpenAI.

Market observers, however, are still waiting for the follow-up to R1, a possible R2 model, which many had expected to launch earlier this year. Local reports have hinted that the delay stems from founder Liang Wenfeng’s insistence on perfecting the model. He also manages his profitable High-Flyer Asset Management business at the same time.

As previously reported by Cryptopolitan, DeepSeek has delayed the launch of its R2 AI model after encountering persistent technical issues with Huawei’s Ascend processors. Following the success of its R1 model in January, DeepSeek was encouraged by Chinese authorities to adopt Huawei chips instead of US-made Nvidia products. However, the company ran into significant problems during the training phase of its R2 model.

Sources familiar with the matter said DeepSeek had to rely on Nvidia chips for training while using Huawei’s Ascend processors only for inference. Industry insiders note that Chinese chips, including Huawei’s, often lag behind Nvidia in inter-chip connectivity, software support, and overall stability.

Huawei sent engineers to DeepSeek’s offices to help adapt the model. Still, the start-up could not complete a successful training run on Ascend hardware even with on-site assistance. Originally slated for a May release, the R2 model’s launch has been postponed due to these hardware challenges.

While some Chinese media outlets speculate that the new model could launch in the coming weeks, DeepSeek founder Liang Wenfeng has voiced internal frustration over its progress, urging the team to take the necessary time to develop a model that preserves the company’s competitive edge.

Meanwhile, industry heavyweights including Alibaba and Tencent continue to release updates briskly, with Alibaba’s Qwen models attracting a particularly strong following.
