Seven Major AI Transformation Trends to Watch in 2024


On January 30, 2026, Google Maps launched walking and cycling navigation features based on the Gemini assistant, expanding AI voice interaction from driving scenarios to non-motorized travel. The feature aims to provide a "co-pilot"-style real-time voice assistance throughout the journey, allowing users to query location and traffic information via voice, solving the problem of不方便查看手机 during walking or cycling.
The first full-modal real-time interactive visual language model in China, VisualGPT, was launched in Qingdao. Users can upload images and videos and directly select areas to ask questions. The model provides answers, code, or 3D environments within seconds. It also opens up an intelligent agent training platform and computing resources, pushing AI interaction into a new stage of visual interface instant interaction.
AI search engine Perplexity officially launched its advertising program in the US market this week, creating a new model for search advertising. This highly anticipated AI company has successfully attracted several advertising giants, including Indeed, Whole Foods, Universal McCann, and PMG. Unlike traditional search advertising, Perplexity has innovated an advertising format that integrates into the AI interaction process. Advertisers can naturally incorporate questions in relevant contexts, such as how to use Indeed to optimize job search strategies.
Recently, World Labs, a startup co-founded by AI industry leader Fei-Fei Li, performed impressively in its latest round of funding, successfully raising $1 billion, which has drawn widespread attention from the industry. The funding round featured a strong lineup, with Autodesk being an important investor, injecting $200 million into World Labs. Additionally, well-known companies and investment institutions such as Andreessen Horowitz, NVIDIA, and AMD have also participated, providing significant momentum for the development of this startup.
Google DeepMind officially launched its latest AI music generation model, Lyria 3. The model is now available as a beta version (Beta) in the Gemini app and is freely accessible to global users aged 18 and above. The most remarkable feature of Lyria 3 is its full-scenario creation capability. Even users with no musical background can easily generate music through three methods. Users just need to input natural language prompts, such as 'a cheerful reggae song suitable for a beach party' or 'an epic electronic music about space exploration'.