MiniMax Empowers 'OpenClaw' AI with Voice Customization and Music Composition Skills
MiniMax has integrated its advanced Speech and Music models into the OpenClaw ecosystem, equipping the '小龙虾' AI assistant with new skills for custom voice generation and full song creation within popular workplace chat applications.
MiniMax has launched a significant update for its OpenClaw "小龙虾" AI assistant, introducing deep capabilities for voice synthesis and music generation directly within enterprise communication platforms.
The company announced the formal integration of its MiniMax Speech and Music model APIs into the OpenClaw ecosystem. By acquiring new Skills from the Clawhub, users can now empower their AI assistant to perform sophisticated audio tasks within Feishu, WeChat Work, and DingTalk.
Voice Maker: The Polyglot Sound Designer
At the core of the update is the Voice Maker skill. Once a user's "小龙虾" learns this skill and validates a MiniMax API Key, it unlocks what the company terms a "Language Master" identity. This capability includes:
- Support for over 40 languages and nearly a hundred pre-set voice tones.
- Automatic script segmentation for handling multi-voice, multi-emotion paragraph dubbing with ease.
- An integrated Voice Design function, allowing users to create custom voice tones through natural language descriptions, enabling precise, multi-dimensional sound crafting.
Music Maker: The AI Composer
Simultaneously, the Music Maker skill grants the assistant full music composition abilities. After learning this skill and API validation, the AI can generate:
- Complete pop songs up to 5 minutes in length.
- Instrumental music for relaxation.
- Hummed vocal melody demos.
For users requiring more granular control, an additional 'Music Expert' Skill is available on Clawhub. This tool can automatically add structured tags and propose different arrangement schemes for various song sections.
Accessibility and Use
These new Skills are accessible through multiple channels:
- Learning the Skills directly within the OpenClaw platform.
- Direct experience within the MaxClaw interface.
- Activation via commands within paired IM software (Feishu, WeChat Work, DingTalk).
For first-time use in MaxClaw, users download the Skill's ZIP file, upload it to the chat dialog, and provide their API Key through natural language communication. With the right commands and parameter configuration, the assistant can even bypass traditional file sharing and send audio content directly as a message within Feishu.
This move significantly expands the role of the "小龙虾" from a text-based helper to a versatile, multi-modal AI companion capable of handling complex creative audio tasks on demand.