Artificial Analysis has released version 2.0 of its speech-to-text benchmark (AA-WER v2.0). The results show ElevenLabs and Google leading the field in audio transcription.

On the core word error rate (WER) metric, ElevenLabs' Scribe v2 ranked first with an error rate of just 2.3%. Google's Gemini 3 Pro followed closely at 2.9%. Notably, Google did not train Gemini specifically for transcription; its strong showing comes entirely from the model's general multimodal capabilities.
Other mainstream models performed as follows:
Mistral Voxtral Small: ranked third with an error rate of 3.0%.
Google Gemini 3 Flash: a solid showing at 3.1%.
OpenAI Whisper Large v3: the most widely used open-source model landed mid-pack at 4.2%.
Bottom tier: Alibaba's Qwen3-ASR Flash (5.9%), Amazon's Nova 2 Omni (6.0%), and Rev AI (6.1%) trailed the field.
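
For readers unfamiliar with the metric, WER is the minimum number of word-level substitutions, deletions, and insertions needed to turn a model's transcript into the reference transcript, divided by the number of words in the reference. The Python sketch below illustrates that standard definition only; it assumes simple whitespace tokenization and does not reflect the benchmark's specific text-normalization rules (casing, punctuation), which are not detailed here.

# Minimal sketch of a standard WER computation: word-level
# Levenshtein distance divided by reference length. Illustrative
# only; not Artificial Analysis's exact scoring pipeline.
def wer(reference: str, hypothesis: str) -> float:
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i  # deleting all remaining reference words
    for j in range(len(hyp) + 1):
        dp[0][j] = j  # inserting all remaining hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(
                dp[i - 1][j] + 1,         # deletion
                dp[i][j - 1] + 1,         # insertion
                dp[i - 1][j - 1] + cost,  # substitution or match
            )
    return dp[len(ref)][len(hyp)] / len(ref)

# One substitution ("sat" -> "sit") and one deletion ("the"):
# 2 errors over 6 reference words, i.e. roughly 33% WER.
print(wer("the cat sat on the mat", "the cat sit on mat"))

By this definition, a 2.3% score means roughly one word-level error per 43 words of reference text, which is why differences of a few tenths of a percent separate the leaders here.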

In the specialized AA-AgentTalk test of voice-assistant commands, the ranking held: ElevenLabs' Scribe v2 and Google's Gemini 3 Pro led with error rates of 1.6% and 1.7%, respectively, showing high reliability on short, direct voice interactions.
