Unitree Robotics Aims to Launch Foundation Model for General Humanoid Robots Within Three Years
Chinese robotics firm Unitree Robotics has disclosed in its latest IPO filing responses that it plans to release a 'General Humanoid Robot Embodied Foundation Model' within three years. The model is designed to have four core capabilities: scene generalization, instruction generalization, action generalization, and task generalization. The plan is part of the company's push to commercialize embodied AI and follows the acceptance of its application to list on Shanghai's STAR Market, where it is seeking to raise about 4.2 billion yuan.
SHANGHAI — Robotics company Unitree Robotics has outlined an ambitious three-year roadmap to develop a foundational AI model for general-purpose humanoid robots, according to documents filed as part of its initial public offering (IPO) process.
The disclosure was made in the company's response to a second round of pre-review inquiries from the Shanghai Stock Exchange (SSE), which recently accepted Unitree's application to list on the Science and Technology Innovation Board (STAR Market). The company aims to raise approximately 4.202 billion yuan (about $588 million).
The Foundation Model Goal
The core of the plan is the development and release of a "General Humanoid Robot Embodied Foundation Model." Unitree states this model will be systematically equipped with four key generalization capabilities:
- Scene Generalization: Adapting to diverse physical environments.
- Instruction Generalization: Understanding and executing a wide range of verbal or coded commands.
- Action Generalization: Performing a broad repertoire of physical movements and manipulations.
- Task Generalization: Completing multi-step, complex objectives.
Initially, the model is intended to "efficiently empower standardized vertical scenarios" in production and manufacturing. Unitree envisions a complete operational loop of "cloud-based model training, on-device inference and execution, and online data collection."
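The three-stage loop described in the filing can be pictured with a toy sketch. The filing gives no implementation details, so everything below is a hypothetical illustration: the names `Cloud`, `Policy`, `run_on_device`, and `closed_loop` are invented for this example, "training" is reduced to a version bump, and "inference" is a trivial rule.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Policy:
    """Hypothetical stand-in for a trained model; only tracks a version."""
    version: int = 0


@dataclass
class Cloud:
    """Hypothetical cloud side: stores collected data and 'retrains'."""
    dataset: list = field(default_factory=list)
    policy: Policy = field(default_factory=Policy)

    def train(self) -> Policy:
        # Placeholder for real training: publish a new version only
        # when fresh data has arrived, then consume that data.
        if self.dataset:
            self.policy = Policy(self.policy.version + 1)
            self.dataset.clear()
        return self.policy


def run_on_device(policy: Policy, steps: int, rng: random.Random) -> list:
    """Placeholder on-device inference that also logs what it did."""
    logs = []
    for _ in range(steps):
        obs = rng.random()                       # fake sensor reading
        action = "grasp" if obs > 0.5 else "wait"  # trivial decision rule
        logs.append((obs, action, policy.version))
    return logs


def closed_loop(rounds: int, steps_per_round: int = 5, seed: int = 0) -> Policy:
    """Run: on-device execution -> data collection -> cloud training."""
    rng = random.Random(seed)
    cloud = Cloud()
    policy = cloud.policy
    for _ in range(rounds):
        logs = run_on_device(policy, steps_per_round, rng)  # inference
        cloud.dataset.extend(logs)                          # data collection
        policy = cloud.train()                              # cloud training
    return policy
```

In this sketch each round of on-device execution feeds the next round of training, which is the sense in which the loop is "closed"; a real system would replace every placeholder with far more involved components.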
Path to Commercialization and Broader Applications
The filing indicates a phased commercialization strategy. As the model matures in terms of generalization, reliability, stability, and safety, Unitree plans to expand its application from industrial settings into domestic and personal service domains.
"The application field of the General Humanoid Robot Embodied Foundation Model will expand from vertical industrial scenarios to life domains such as home services and healthcare companionship," the company stated, aiming to "accelerate the process of embodied intelligence entering millions of households."
Technical Roadmap: WMA and VLA Models
On the research front, Unitree is focusing its efforts on the "World Model-Action" (WMA) embodied large model as a primary direction. Simultaneously, the company says it will continue to track and benchmark the Vision-Language-Action (VLA) technical pathway.
A key research challenge is finding sound ways to integrate "world modeling" capabilities with the VLA architecture. Unitree says it will develop WMA and VLA large models in parallel while conducting "systematic evaluation and verification" of dual-system coordination mechanisms, trigger strategies, and performance gains. This work is intended to lay the technical foundation for large-scale deployment in complex, long-horizon task scenarios.
The disclosure follows a period of rapid financial growth. For 2025, Unitree reported revenue of 1.708 billion yuan, up 335.36% year over year, and net profit of 600 million yuan, up 674%.
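For readers who want a sense of scale, the growth rates can be inverted to estimate the prior-year base. The short calculation below is a back-of-the-envelope sketch, not a figure from the filing: it assumes each percentage is a simple year-over-year increase on the reported total, and the function name `implied_prior` is chosen here for illustration.

```python
def implied_prior(current: float, pct_increase: float) -> float:
    """Back out the prior-period value from a current value and its % increase.

    current = prior * (1 + pct_increase / 100), so
    prior   = current / (1 + pct_increase / 100).
    """
    return current / (1 + pct_increase / 100)


# Figures as reported in the article, in millions of yuan.
revenue = 1708.0      # 1.708 billion yuan, up 335.36%
net_profit = 600.0    # 600 million yuan, up 674%

prior_revenue = implied_prior(revenue, 335.36)
prior_profit = implied_prior(net_profit, 674.0)

print(f"Implied prior-year revenue: {prior_revenue:.1f} million yuan")
print(f"Implied prior-year net profit: {prior_profit:.1f} million yuan")
```

Under those assumptions, the reported figures imply a prior-year base of roughly 390 million yuan in revenue and under 80 million yuan in net profit. These are derived estimates and may differ from the company's audited prior-year numbers because of rounding in the reported percentages.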