
DeepSeek unveils V3.1 model with hybrid reasoning and lower prices

Cryptopolitan, August 21, 2025, 17:01

The Chinese startup DeepSeek has released an updated model that it claims outperforms the widely recognized R1 across core benchmarks. In a Thursday WeChat post, the AI company said the new version, V3.1, responds to queries faster and marks its entry into AI agent development.

DeepSeek added that the model supports a hybrid reasoning architecture, having both thinking and non-thinking modes, improved agent capabilities, and stronger performance in tool use and task execution.

DeepSeek provides a “Deep Thinking” button to switch between modes

DeepSeek’s official app and website have already been updated to V3.1, allowing users to toggle between thinking and non-thinking modes via the “Deep Thinking” button, similar to how Anthropic’s hybrid models like Opus and Sonnet work.

Reportedly, V3.1 also outperforms R1 on benchmarks such as SWE and Terminal-Bench, as well as in thinking efficiency. Moreover, according to Artificial Analysis, the model scored 60 points on its intelligence index in reasoning mode, just above the 59 scored by R1. Still, the underlying architecture remains the same, with 671 billion total parameters and 37 billion active.

The efficiency gain shows up in token usage: the model uses slightly fewer tokens than R1 in reasoning mode. It is, however, slightly behind Alibaba’s latest model and OpenAI’s open-source reasoning model, GPT-OSS, in performance. It also lacks function calling in reasoning mode, which is considered a major constraint in agentic workflows.

The startup had first announced the new model on Tuesday, though it was only available on Hugging Face at the time. A separate statement added that the version had been tailored to run on next-generation Chinese-made AI chips. 

The company has also unveiled a new pricing plan for the upgraded model. The plan raises some charges, eliminates evening discounts, and reduces costs in certain applications, effective Sept. 6.

DeepSeek set API pricing for input tokens at $0.07 per million for cache hits and $0.56 per million for cache misses, with output tokens at $1.68 per million. The rates sharply undercut competitors: Gemini 2.5 Pro costs $10 per million output tokens ($15 for longer prompts), OpenAI’s GPT-5 is also $10, and Anthropic’s Claude Opus 4.1 goes as high as $75.
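To make the quoted rates concrete, here is a minimal Python sketch that estimates the cost of a workload from the per-million-token prices above and compares the output price with the competitor figures cited in this article. The function names (`estimate_cost`, `output_price_ratio`) and the split of input tokens into cache hits and cache misses are illustrative assumptions, not part of any DeepSeek API; actual billing may differ.

```python
# Prices in USD per one million tokens, as quoted in the article.
PRICE_INPUT_CACHE_HIT = 0.07
PRICE_INPUT_CACHE_MISS = 0.56
PRICE_OUTPUT = 1.68

TOKENS_PER_PRICE_UNIT = 1_000_000

# Competitor output prices (USD per million tokens) cited in the article.
COMPETITOR_OUTPUT_PRICES = {
    "Gemini 2.5 Pro": 10.0,
    "GPT-5": 10.0,
    "Claude Opus 4.1": 75.0,
}


def estimate_cost(cache_hit_tokens: int, cache_miss_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload (hypothetical helper, not an official API)."""
    total = (
        cache_hit_tokens * PRICE_INPUT_CACHE_HIT
        + cache_miss_tokens * PRICE_INPUT_CACHE_MISS
        + output_tokens * PRICE_OUTPUT
    )
    return total / TOKENS_PER_PRICE_UNIT


def output_price_ratio(competitor_price: float) -> float:
    """Return how many times more expensive a competitor's output tokens are."""
    return competitor_price / PRICE_OUTPUT


if __name__ == "__main__":
    # Example workload: 1M cached input, 1M uncached input, 1M output tokens.
    print(f"Estimated cost: ${estimate_cost(1_000_000, 1_000_000, 1_000_000):.2f}")
    for name, price in COMPETITOR_OUTPUT_PRICES.items():
        print(f"{name}: {output_price_ratio(price):.1f}x the V3.1 output price")
```

Under these figures, a workload of one million tokens in each category comes to about $2.31, and the cited competitor output prices are roughly 6 to 45 times the V3.1 output rate.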

Analysts expected DeepSeek to release R1’s successor earlier this year

DeepSeek first rattled Silicon Valley with its low-cost and powerful R1 AI model launch in January. The model has since stayed at the forefront of China’s accelerating AI push, challenging US firms such as OpenAI.

Market observers, however, are still waiting for the follow-up to R1, a possible R2 model, which many had expected to launch earlier this year. Local reports have hinted that the delay in the launch stems from founder Liang Wenfeng’s insistence on perfecting the model. At the same time, he also manages his profitable High-Flyer Asset Management business. 

As previously reported by Cryptopolitan, DeepSeek has delayed the launch of its R2 AI model after encountering persistent technical issues with Huawei’s Ascend processors. Following the success of its R1 model in January, DeepSeek was encouraged by Chinese authorities to adopt Huawei chips instead of US-made Nvidia products. However, the company ran into significant problems during the training phase of its R2 model.

Sources familiar with the matter said DeepSeek had to rely on Nvidia chips for training while using Huawei’s Ascend processors only for inference. Industry insiders note that Chinese chips, including Huawei’s, often lag behind Nvidia in inter-chip connectivity, software support, and overall stability.

Huawei sent engineers to DeepSeek’s offices to help adapt the model. Still, the start-up could not complete a successful training run on Ascend hardware even with on-site assistance. Originally slated for a May release, the R2 model’s launch has been postponed due to these hardware challenges.

While some Chinese media outlets speculate that the new model could launch in the coming weeks, DeepSeek founder Liang Wenfeng has voiced internal frustration over its progress, urging the team to take the necessary time to develop a model that preserves the company’s competitive edge.

Meanwhile, industry heavyweights including Alibaba and Tencent continue to release updates briskly, with Alibaba’s Qwen models attracting a particularly strong following.
