Highest Drop of 97.5%! Tencent Cloud Large Model Price Cut, Now Fully Equal to Official Factory Price

To further lower the innovation threshold for enterprises and developers, the Tencent Cloud Intelligent Agent Development Platform officially announced a major pricing adjustment on Tuesday. Starting at 0:00 Beijing Time on June 3, the platform will significantly reduce the prices of the entire V4 series of large models under DeepSeek.

Core model prices drop dramatically

In this price adjustment, the main model DeepSeek-V4-Pro has seen a huge reduction of 75% in both the input and output costs for inference, significantly reducing users' daily call costs. More notably, the cache hit cost of this model has dropped by as much as 97.5%, offering high cost-effectiveness for high-frequency reuse scenarios.

Additionally, the lightweight and agile DeepSeek-V4-Flash model has also received corresponding discount adjustments. The cache hit cost of this model has been reduced by 90%, allowing developers to enjoy more competitive computing costs when building low-latency, high-concurrency intelligent applications.

Seamless integration with the latest technological achievements

As the technical star of this price cut, the DeepSeek-V4 series of large models attracted industry attention from the moment of its release, thanks to its mixture-of-experts (MoE) architecture. This series of models has a total parameter count of 1.6 trillion and natively supports ultra-long context processing capability of up to 1 million tokens.

The recent price reduction by the Tencent Cloud Intelligent Agent Development Platform not only complements its strategy of ending the free public test and moving towards formal commercialization, but also directly brings the call cost level to that of DeepSeek's permanent price reduction policy. This move is bound to further intensify competition in the domestic cloud computing power market during the stage of large model application deployment.

Tencent Hy3 Programming Evaluation Released: Parameters Are Only One Fifth of Competitors, Code Ability is on Par with DeepSeek-V4-Pro

SuperCLUE released the programming special evaluation of Tencent Hy3 language model, comparing it with DeepSeek-V4-Pro and others. Hy3 uses MoE architecture, with a total parameter count of 295B and only 21B activated, supporting a 256K context length, and is claimed to be the strongest in the Mix. The results show that despite far fewer parameters, its performance was surprisingly excellent. Evaluated from four dimensions, it balances performance and cost, focusing on real coding scenarios for Chinese programmers.

Ping An Bank Collaborates with Tencent Cloud and China UnionPay to Launch AI Smart Computing Card, AI-Themed Credit Card to Be Released Next Month

Tencent Cloud, in collaboration with Ping An Bank and China UnionPay, has launched the "AI Smart Computing Card" debit card for individual users, integrating basic financial services with exclusive computing power benefits. It addresses the pain points of switching between multiple platforms and repeated recharges, enabling AI computing power consumption, tool usage, and daily financial transactions through a single card. The card provides instant access to the UnionPay Cloud Computing Platform and is expected to open for applications in July.

Tencent Cloud ADP 4.0 Launch: Claw Mode Enables Agents to Be Created with a Single Sentence and Integrated into the System Instantly

Tencent Cloud launched the ADP 4.0 version at the 2026 AI Industrial Applications Conference, upgrading it into an enterprise-level AgentOps platform. The new version introduces the Claw mode, supporting Agentic Loop, and connects agents from construction, connection to distribution and governance through components such as Connector, Skills, Knowledge Base, MCP, and Agent Portal throughout their entire lifecycle. This initiative aims to lower the threshold for enterprises to build agents and promote the large-scale industrial application of agents.

Price Drops 75%! DeepSeek V4 Announces Permanent Discount, Ranks First in Global AI Cost-Effectiveness

DeepSeek announced that its flagship large model V4-Pro will have a permanent 75% price reduction, converting the previous limited-time 2.5-fold discount into a permanent pricing strategy. Third-party evaluations show that the model has topped the global AI cost-effectiveness ranking due to this price cut, outperforming U.S. competitors and highlighting the absolute advantages of Chinese AI in terms of cost and efficiency.

DeepSeek Announces Permanent 75% Price Reduction for V4-Pro Model API, Setting a New Low in Global Large Model Pricing

DeepSeek officially announced that the API price of its DeepSeek-V4-Pro model will be permanently reduced to one-quarter of the original price after the limited-time promotion ends on May 31, 2026. Previously, the model had already introduced a full set of API price adjustments on April 26, with input cache hit prices reduced to one-tenth of the launch price, combined with a limited-time 2.5 discount. This adjustment makes the low price a standard, setting a new record for global large model API pricing.