SenseTime has officially launched its new lightweight multi-modal intelligent agent model — SenseNova 6.7-Lite. Designed to meet the needs of "real-world streams," the model uses a multi-modal architecture to directly understand complex layouts, document structures, and financial charts, achieving an integrated "see, think, act" process and improving the success rate of tasks such as data analysis, in-depth research, and PPT generation.

Technically, SenseNova 6.7 Flash-Lite eliminates the intermediate visual layer, achieving a significant leap in agent capabilities with fewer parameters. This innovation has led to numerous SOTA (State Of The Art) results in authoritative benchmark tests. In addition, the model significantly reduces token consumption during the reasoning process, decreasing by 60% compared to traditional text-only agents. At the same time, SenseNova 6.7 Flash-Lite can provide millisecond-level feedback, meeting the needs of high-frequency interactions.
Currently, this model is widely applied across multiple industries such as finance, manufacturing, healthcare, and education, and it possesses five core capabilities, including action decision-making, autonomous arrangement of tool chains, perception in noisy environments, self-correction and recovery, and long-term memory consistency. These capabilities bring higher reliability and execution stability to real-world workflow scenarios, redefining the standards for lightweight multi-modal intelligent agent models.
To encourage developers to use this model, SenseTime has also announced a limited-time open SenseNova Token Plan, offering free quotas in the first month where developers can refresh 1500 call quotas every 5 hours. Meanwhile, the entire SenseNova-Skills line of office skills is also open-sourced, providing developers with more tools and resources.
Key Points:
🌟 SenseNova 6.7 Flash-Lite adopts a lightweight design, enhancing the capabilities of multi-modal agents.
⚡ Token consumption has been reduced by 60% compared to traditional models, meeting the needs of high-frequency interactions.
💼 The model supports multiple industries and offers a limited-time free usage plan to help developers.
