Article Content

Tencent's New AI Painting Upgrade! Fine-tuning Technology Enhances the Aesthetic Quality of Generated Images by 300%

Published in Latest AI News

Time :Sep 16, 2025

Read :4minute

Recently, Tencent has introduced a new method aimed at enhancing the realism and aesthetic score of AI-generated images. According to reports, this fine-tuning technique achieves significant convergence within just 10 minutes of training using 32 H20 GPUs, with human evaluation scores increasing by more than 300%.

Although current diffusion models can optimize image quality through reward mechanisms, they face several challenges. First, the number of model optimization steps is limited, leading to a phenomenon known as "reward cheating," where the model generates low-quality images to achieve high scores. Second, the offline adjustment of the reward model is not flexible enough, limiting the ability for real-time optimization.

To address these issues, the Tencent team proposed two innovative methods. The first is called "Direct-Align," which enables the model to recover the original image from any point in time by pre-injecting noise. This method reduces gradient explosion during early backpropagation, allowing the model to be optimized throughout the entire diffusion process, not just in the final few steps.

The second innovation is "Semantic Relative Preference Optimization" (SRPO). This method transforms the reward signal into a text-controlled signal. By adding positive and negative prompt words, the model can flexibly adjust the style of generated images without requiring additional data. This means users can simply add a control phrase before the prompt word to achieve functions such as brightness adjustment or style transfer.

Experimental results show that the FLUX.1-dev model trained with SRPO has significantly improved performance in terms of realism and aesthetic quality. In a test involving 3,200 prompts, the excellent rate of the SRPO-trained model in the realism dimension increased from 8.2% to 38.9%, while the excellent rate for aesthetic quality rose from 9.8% to 40.5%. Compared to other methods, SRPO not only maintains high aesthetic quality but also produces more natural image textures.

This successful application of the technology demonstrates Tencent's further exploration in the field of AI painting and points the way for future AI image generation technologies.

Paper link: https://arxiv.org/pdf/2509.06942

Related Recommendations

Tencent Hunyuan Welcomes a Top Scientist: Tianyu Peng Joins and Leads Multimodal Reinforcement Learning

Tencent strengthens AI talent by hiring Dr. Tianyu Pang, former senior research scientist at Sea AI Lab, as chief research scientist for its Hunyuan multimodal division, focusing on reinforcement learning to advance multimodal AI development.....

Feb 3, 2026

134.9k

Tencent Hunyuan Model Welcomes Top Scientist: Tsinghua PhD Peng Tianyu Joins and Leads Multi-Modal Reinforcement Learning

Tencent recruits AI talent, hiring Tsinghua PhD Pang Tianyu as Chief Research Scientist and head of multimodal reinforcement learning for its Hunyuan model team.....

Jan 30, 2026

176.1k

The Era of Thinking Has Begun for Image Editing: Tencent Releases HuanYuan Image 3.0 Image-to-Image Model

Tencent Hunyuan launches Image 3.0, an 800B-parameter image-to-image model with a Mixture of Experts architecture, enabling intelligent, instruction-based image editing on Yuanbao Assistant and its official website.....

Jan 26, 2026

238.9k

Ma Huateng stated at the annual meeting: Yuanbao will distribute 1 billion yuan in cash during the Spring Festival, aiming to recreate the success of WeChat red envelopes, similar to the Pearl Harbor attack

At Tencent's annual meeting, Ma Huateng announced that the AI application 'Yuanbao' will launch a cash distribution activity of 1 billion yuan during the Spring Festival on February 1st, with a maximum of 10,000 yuan per person, aiming to replicate the success of WeChat red envelopes. At the same time, Tencent revealed its previously confidential social AI project 'Yuanbao Party', officially integrating AI into its core social field. The project aims to create a multi-person social space with deep AI involvement, where AI can summarize group chats and take on roles such as fitness and reading companions.

Jan 26, 2026

266.1k

Tencent Yuanbao Launches Internal Testing: AI Deeply Enters WeChat and QQ Social Circles

Tencent's AI assistant 'Yuanbao' has launched an internal test for its social function 'Yuanbao Party', exploring the application of AI in multi-person social scenarios. It aims to create a social space where AI and users can entertain and collaborate together, marking Tencent's AI expansion from efficiency tools to social interaction.

Jan 26, 2026

189.2k

Intelligent Future, Your Artificial Intelligence Solution Think Tank

English 简体中文繁體中文にほんご