Tencent Hunyuan Open-Sources WorldCompass, a Reinforcement Learning Framework for World Models
Tencent's 3D team has announced the open-sourcing of WorldCompass, a reinforcement learning post-training framework designed to enhance world models such as Hunyuan WorldPlay by improving instruction following and visual consistency.
IT Home reports on March 10.
Tencent Hunyuan's 3D team announced today that it has open-sourced WorldCompass, which it describes as the industry's first reinforcement learning post-training framework designed for world models. The release serves as the official reinforcement learning extension module for the previously released Hunyuan World Model 1.5 and is aimed at making world model interactions more accurate and improving the overall user experience.
WorldCompass: The 'Compass' for World Models
WorldCompass is a specialized reinforcement learning (RL) post-training framework designed for long-horizon, interactive world models. The team describes the relationship with an analogy: the world model is the "engine," and WorldCompass is the "compass." By introducing reinforcement learning, WorldCompass guides the model to follow user instructions more accurately as it explores the world and to maintain visual consistency over long sequences.
Performance Improvements
The team validated WorldCompass on WorldPlay, which it describes as the latest state-of-the-art (SOTA) open-source world model. According to the results, models post-trained with WorldCompass follow actions markedly more accurately.
In the most challenging "Combined Action" scenarios, where the user issues compound inputs such as simultaneous movement and rotation, the base model executed the requested action sequences correctly only about 20% of the time. After post-training with WorldCompass, accuracy rose to approximately 55%, a gain of roughly 35 percentage points. Accuracy for basic actions also improved by about 10%.
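The jump from about 20% to about 55% can be read two ways, and the difference matters when comparing results. The short sketch below works through the arithmetic using the approximate figures quoted in the article; the function name `accuracy_gain` is purely illustrative and is not part of WorldCompass.

```python
def accuracy_gain(before: float, after: float) -> tuple[float, float]:
    """Return (absolute gain in percentage points, relative gain in percent)."""
    absolute_points = after - before
    relative_percent = (after - before) / before * 100.0
    return absolute_points, relative_percent


# Approximate Combined Action accuracies quoted in the article (assumed exact here).
points, relative = accuracy_gain(20.0, 55.0)
print(f"{points:.0f} percentage points, {relative:.0f}% relative increase")
# prints "35 percentage points, 175% relative increase"
```

In other words, the reported improvement is about 35 percentage points in absolute terms, which corresponds to accuracy nearly tripling relative to the base model.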
Models trained with WorldCompass also achieved better scores on the Stanford World Model Benchmark (WorldScore).
Availability
According to the team, WorldCompass has been validated on the Hunyuan WorldPlay model, and the related code and model details are now open source.
GitHub Link: https://github.com/Tencent-Hunyuan/HY-WorldPlay