Breaking the Barrier of Running Large Models on Mobile Phones: M6 Intelligence Collaborates with Tsinghua University to Open-Source New Edge-Side Product BitCPM-CANN

Recently, MBLab has jointly released and open-sourced the latest breakthrough in low-bit large model training with Tsinghua University and the OpenBMB open-source community — BitCPM-CANN. This achievement was completed natively on Huawei's Ascend platform, marking a key step forward in the lightweight and engineering implementation of edge-side AI large models.

Releasing Six Times the Memory Benefit to Break Hardware Limitations

The open-sourced BitCPM-CANN includes four model sizes: 0.5B, 1B, 3B, and 8B. When compared item by item with full-precision models of the same size, it performs exceptionally well. Compared to traditional BF16 precision, this model can release about six times the memory benefit during inference, significantly lowering the hardware requirements for running large models.

For the mobile industry, the six times memory benefit means that large models with 8B parameters, which previously required very high configuration levels, can now run smoothly on mainstream flagship phones. This extreme release of memory space will directly accelerate the popularization and commercial application of edge-side AI technology on mobile devices.

High Ability Retention Rate Confirms Engineering Reproducibility

While reducing the model size, BitCPM-CANN still maintains an extremely high performance level, with its model ability retention rate successfully maintained between 90% and 97.2%. The ability retention rates of the three main model sizes have reached 95.7% to 97.2%, and even the smallest 0.5B model has a retention rate exceeding 90%.

This impressive evaluation result systematically proves that the low-bit training approach has strong scalability and engineering reproducibility. MBLab has built a complete low-bit training foundation based on the relevant core technologies, covering the entire engineering system including environment adaptation, support for 32K long sequences, and fused operators, thus laying a solid public infrastructure for future low-bit training work on Ascend.

Mianbi Intelligence Jointly Launches China's First 1.58-bit Large Model BitCPM-CANN with Tsinghua University

Mianbi Intelligence, in collaboration with Tsinghua University and the OpenBMB open-source community, released and open-sourced China's first ternary (1.58-bit) large model, BitCPM-CANN, trained on the Huawei Ascend platform. This model achieves a breakthrough in low-bit training, with native development across the full chain from quantization operators to training algorithms. It offers four versions ranging from 0.5B to 8B parameters, showcasing....

Human Game Experience Upgraded! Free and Open-Source AI Chess Engine Maia 3 Officially Released

The Maia Chess team released the open-source chess engine Maia 3, trained on 250 million human games, with an Elo rating of approximately 1800 points, an increase of nearly 300 points from the previous version. The engine is free and open-source, supports local deployment, and focuses on simulating human decision-making patterns, promoting the popularization of AI chess engines.

Factitious Illusion Rate Drops to 3.3%! Baichuan Intelligence to Release New Medical Large Model Baichuan-M4

At Tsinghua University's 'AI Medical New Paradigm' forum, Baichuan Intelligent CEO Wang Xiaochuan released the next-generation medical model 'Baichuan-M4' and AI family doctor 'Baixiaoyi'. The model topped three authoritative benchmarks, solving the 'factual hallucination' problem in medical AI, marking a breakthrough in precision and application form of AI in the medical vertical domain.....

InHand Intelligence Launches the Small-Sized Humanoid Robot XMAN-L1: Lightweight Interaction Expert Starts Working

On May 26, Keenon Robotics launched the compact humanoid robot XMAN-L1, 136cm tall, designed for lightweight interactive scenarios. As a core product of the 'general+specialized' matrix, it integrates mainstream large models for anthropomorphic upgrades in commercial service, combining high flexibility with compact hardware performance.....