Kunlun Tech: Multi-Modal Large Model Has Entered Experimental Training Phase


On June 29, 2025, the Alibaba International AI Team officially released the new multi-modal large model **Ovis-U1**, marking another major breakthrough in the field of multi-modal artificial intelligence. As the latest masterpiece of the Ovis series, Ovis-U1 integrates multi-modal understanding, image generation, and image editing functions, demonstrating powerful cross-modal processing capabilities, providing new possibilities for developers, researchers, and industry applications. This is a detailed report on Ovis-U1 by AIbase. Ovis-U1
The latest release from the Alibaba team, mPLUG-Owl3 is a general-purpose multi-modal large model, with its core capability being the understanding of long image sequences. By introducing a hyper attention module, mPLUG-Owl3 can efficiently process visual and language information, achieving in-depth understanding and communication of multi-modal data such as images and videos. This model has made significant breakthroughs in inference efficiency, image processing capabilities, and the application of multi-modal knowledge, particularly in video understanding, where it can 'watch' a 2-hour movie in 4 seconds and accurately answer related questions.
The Million Experts Mixture model proposed by Google DeepMind, a revolutionary study that has taken a significant step forward in the Transformer architecture. Imagine a model capable of sparse retrieval from a million mini-experts - doesn't that sound a bit like a science fiction novel plot? Yet, this is the latest research achievement from DeepMind. The core of this research is a highly parameter-efficient expert retrieval mechanism, which separates the computation cost from the parameter coun
Recently, online marketplace Etsy shared the latest updates on the sales of products generated by artificial intelligence, and announced the continuation of its plan to "support artists through the development of art." The platform will allow sellers to disclose their use of artificial intelligence in the descriptions of their item listings, selling artworks sourced from original prompts or AI tools.Etsy acknowledges the inevitable progress and integration of artificial intelligence tools (inclu
Magnific has once again found a new growth point. This time, they have launched a Photoshop plugin that allows users to directly use their image enlargement and other features within Photoshop.This plugin is undoubtedly tailored specifically for Photoshop users, especially the professional ones who are usually more open to paying for services. The perfect combination of scenarios is commendable for Magnific's precise layout. Below is a detailed tutorial on how to use the Magnific PS plugin:1. Si