Leading domestic AI company DeepSeek has been making waves again. A year after the release of its R1 model, its technical moves have once more stirred up the developer community. According to recent GitHub commit records, the updated FlashMLA code from DeepSeek contains a large number of identifiers pointing to an unknown model, "MODEL1".

In this codebase spanning hundreds of files, "MODEL1" appears alongside the existing V3.2 version, suggesting that it is not simply an iteration of the current architecture but likely an entirely new model series. Technical details reinforce this reading: compared with the V3 series, the new architecture takes different design approaches to key-value (KV) cache layout, sparse processing logic, and decoding support for the FP8 data format, changes that typically signal new gains in computational efficiency and GPU memory usage.
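To see why an FP8 KV cache matters for memory, consider a back-of-the-envelope sizing sketch. The function and all model dimensions below are hypothetical, chosen only for illustration, and are not DeepSeek's actual configuration; the point is simply that storing cached keys and values in FP8 (1 byte per element) halves the footprint relative to FP16 (2 bytes per element):

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len, batch, bytes_per_elem):
    # Two cached tensors (K and V) per layer, each shaped
    # [batch, seq_len, num_kv_heads, head_dim].
    return 2 * num_layers * batch * seq_len * num_kv_heads * head_dim * bytes_per_elem

# Illustrative (hypothetical) model shape -- not a real DeepSeek configuration.
fp16 = kv_cache_bytes(num_layers=32, num_kv_heads=8, head_dim=128,
                      seq_len=4096, batch=1, bytes_per_elem=2)
fp8 = kv_cache_bytes(num_layers=32, num_kv_heads=8, head_dim=128,
                     seq_len=4096, batch=1, bytes_per_elem=1)
print(f"FP16 KV cache: {fp16 / 2**20:.1f} MiB, FP8 KV cache: {fp8 / 2**20:.1f} MiB")
# → FP16 KV cache: 512.0 MiB, FP8 KV cache: 256.0 MiB
```

In practice FP8 caching also involves per-tensor or per-block scaling factors and kernel support on the GPU side, which is exactly the kind of machinery the updated FlashMLA code appears to target.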

Earlier reports said DeepSeek plans to release a flagship model called DeepSeek V4 around the Lunar New Year of 2026, with a focus on stronger coding capabilities. Taken together with the company's two recent official papers, on "optimized residual connections (mHC)" and an "AI memory module (Engram)", observers widely speculate that "MODEL1" is the engineering implementation of this cutting-edge research.