The Ma Yi Team Discovers: Fine-tuning Multimodal Large Models Leads to Catastrophic Forgetting

As GPT-4 was released, multi-modal large models (MLLM) became a hot topic. The team led by Ma Yi proposed the EMT framework to evaluate catastrophic forgetting in MLLMs after fine-tuning. Experiments revealed that while fine-tuning MLLMs improved performance on the fine-tuning dataset, it also led to a decline in performance on other datasets. During the fine-tuning process, MLLMs generated hallucination texts related to the fine-tuning dataset, overlooking the original issues. This research provides a framework and benchmarks for subsequent work, and further optimization is needed in model design and training techniques. The Ma Yi team conducted the first systematic evaluation of catastrophic forgetting in MLLMs, balancing trade-offs between different capabilities.

Latest Evaluation of Multimodal Large Models Released! Gemini-3-Pro Ranks First with a Major Gap, Doubao and SenseTime Lead the Domestic Group, Qwen3-VL Becomes the First Open-Source Model to Achieve High Scores

According to the latest SuperCLUE-VLM rankings, Google's Gemini-3-Pro scored 83.64 points, showing a significant lead, particularly in visual understanding and reasoning. Domestic models performed outstandingly, with SenseTime's SenseNova V6.5Pro and ByteDance's Doubao ranking second and third, showcasing China's rapid progress in the multimodal field. The evaluation covers three core capability dimensions.

Copyright Issues Hinder AIGC Development, Multimodal Large Models Become the Future Mainstream, High-Quality Datasets Gain Value

OpenAI and Microsoft are being sued by news agencies for copyright infringement. Multimodal large models are becoming the mainstream trend in the field of large models. OpenAI is reportedly actively negotiating with publishers on AI policies while high-quality dataset copyright issues are receiving attention. CITIC Publishing collaborates with large model companies for language training, and Visual China holds a core advantage in the era of AIGC.

Study: Global AI Chipset Market to Exceed $700B with 31.8% CAGR

According to TMR Research, the global artificial intelligence chipset market size is expected to exceed $700 billion, with a compound annual growth rate of 31.8% from 2022 to 2031. The article discusses the development trends, application areas, and key players in the artificial intelligence chipset market, which is highly timely and valuable for readers interested in the artificial intelligence chipset market.

IBM Research: How AI & Automation Protect Businesses from Data Breaches

IBM's report provides sufficient evidence that artificial intelligence, automation, and threat intelligence can address data breaches throughout the lifecycle, reduce costs, and provide stronger evidence. The research found that integrating artificial intelligence and automation into security operations teams can reduce the lifecycle of data breaches by 33% and costs by 33.6%. However, currently, only 28% of enterprises widely apply artificial intelligence and automation. Many enterprises rely on legacy systems, which are easily bypassed by attackers. The significance of this article lies in emphasizing the effectiveness of artificial intelligence and automation in improving cybersecurity and calling on enterprises to widely adopt these technologies to protect data security.

Google's AGI Robot Breakthrough: 54 - Member Team's 7 - Month Work, High Generalization and Reasoning 解释：核心关键词为“谷歌AGI机器人”（Google's AGI Robot）和“新成果”（Breakthrough），标题简洁地概括了主要内容，以动词开头，符合英文习惯，且长度在规定范围内。

The robotics research team at Google DeepMind recently released a robotics project called RT-2. This project took 7 months to develop and uses a large model for training. RT-2 has capabilities such as symbol understanding, reasoning, and human recognition, and can think and complete tasks based on human instructions. By combining the large model with the robot's operational capabilities, RT-2 can accomplish tasks that involve logical leaps, such as from 'extinct animals' to 'plastic dinosaurs'. The results of this project performed well in various sub - category tests, with performance up to three times that of the previous generation of robot models. This research result demonstrates the potential of large models in robotics research and is expected to drive the development of robots in the future.

The Ma Yi Team Discovers: Fine-tuning Multimodal Large Models Leads to Catastrophic Forgetting

Related Recommendations

Latest Evaluation of Multimodal Large Models Released! Gemini-3-Pro Ranks First with a Major Gap, Doubao and SenseTime Lead the Domestic Group, Qwen3-VL Becomes the First Open-Source Model to Achieve High Scores

Copyright Issues Hinder AIGC Development, Multimodal Large Models Become the Future Mainstream, High-Quality Datasets Gain Value

Study: Global AI Chipset Market to Exceed $700B with 31.8% CAGR

IBM Research: How AI & Automation Protect Businesses from Data Breaches