Amid the continuous advancement of global AI technology, Google released the new TranslateGemma translation model series on January 15. This series is based on its latest Gemma3 architecture and offers three parameter sizes: 4B, 12B, and 27B, supporting translations across 55 core languages and also featuring multimodal image translation capabilities. This means users can not only translate text but also translate text within images, achieving seamless language communication.

According to Google's introduction, the launch of TranslateGemma is not just a technological iteration but also a significant performance leap. In rigorous WMT24++ benchmark tests, the 12B version of the translation quality actually exceeded the 27B baseline model, which has twice as many parameters. This means developers can achieve higher fidelity translation results with only half the computing power, greatly improving translation efficiency and response speed.

image.png

Additionally, it is worth noting that the smallest 4B model has also demonstrated strong capabilities, with performance comparable to the 12B model, making it particularly suitable for mobile devices and edge computing environments. This advancement allows more users to easily experience high-quality translation in daily life, especially when traveling, studying, and working.

From a technical perspective, the high performance of TranslateGemma is attributed to a unique "two-stage fine-tuning" process. First, Google conducts supervised fine-tuning using high-quality synthetic data and human-translated data, followed by a reinforcement learning phase, where an advanced reward model guides the model to generate more natural and contextually appropriate translations. This technological innovation brings new ideas to the field of translation.

image.png

To adapt to different application scenarios, Google has divided TranslateGemma into models of various sizes. The 4B model is optimized for smartphones and edge devices, the 12B model is suitable for consumer-grade laptops, and the 27B model is the ideal choice for users seeking the best translation quality, capable of running on high-end GPUs or cloud TPUs.

Currently, all models are available on Kaggle, Hugging Face, and Vertex AI platforms for developers and researchers to download and use. With the release of TranslateGemma, Google once again demonstrates its leading position in the AI field and opens up new possibilities for the future of language translation.