In today's large language model (LLM) landscape, deep search has become the "trump card" of top-tier intelligent agents. This competition, however, has long been dominated by industry giants with deep resources: conventional development pipelines are resource-intensive, spanning pre-training, continued pre-training (CPT), supervised fine-tuning (SFT), and reinforcement learning (RL).
A research team from academia recently released its latest achievement in this space.

The team proposed three core optimization strategies for data synthesis: first, scaling up the knowledge graph to provide a richer exploration space; second, substantially expanding the toolkit to extend the agent's functional boundaries; and finally, applying strict low-step filtering so that the training data consists of concise, efficient trajectories (a minimal sketch of this filtering step follows).
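To make the third strategy concrete, the sketch below shows one plausible form of low-step filtering: keep only synthesized trajectories that reach a correct answer within a small number of steps. The `Trajectory` schema, the `low_step_filter` helper, and the `max_steps` threshold are hypothetical illustrations under assumed data shapes, not the paper's actual pipeline.

```python
# A minimal sketch of "low-step filtering" for synthesized agent trajectories.
# All names and thresholds here are assumptions for illustration.
from dataclasses import dataclass
from typing import List

@dataclass
class Trajectory:
    question: str
    steps: List[str]        # tool calls / reasoning steps the agent took
    answer_correct: bool    # whether the final answer matched the reference

def low_step_filter(trajectories: List[Trajectory], max_steps: int = 6) -> List[Trajectory]:
    """Keep only correct trajectories solved in few steps, so the SFT data
    teaches concise, efficient tool use rather than meandering searches."""
    return [
        t for t in trajectories
        if t.answer_correct and 0 < len(t.steps) <= max_steps
    ]

if __name__ == "__main__":
    data = [
        Trajectory("Who wrote X?", ["search('X')", "read(doc1)"], True),
        Trajectory("Who wrote Y?", ["search('Y')"] * 15, True),   # too many steps
        Trajectory("Who wrote Z?", ["search('Z')"], False),       # wrong answer
    ]
    kept = low_step_filter(data)
    print(f"kept {len(kept)} of {len(data)} trajectories")  # kept 1 of 3
```

The filtering criterion trades coverage for quality: discarding long or incorrect trajectories shrinks the dataset but biases it toward efficient search behavior.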
Experimental data shows that

Notably, this is the first search agent from a purely academic team to reach state-of-the-art (SOTA) performance using SFT alone, at a comparable model scale and architecture. The team has officially open-sourced the model weights.
Paper URL: https://arxiv.org/pdf/2605.04036
