Tencent Meeting Rolls Out AI Simultaneous Interpretation with Voice Cloning and Sub-3-Second Latency
Tencent Meeting has launched an AI-powered simultaneous interpretation feature that can reproduce each speaker's voice in the translated audio, delivers translations with less than three seconds of latency, and works alongside the app's real‑time transcription and captions.
Tencent Meeting announced the official launch of its "AI Simultaneous Interpretation" (AI 同传) feature, enabling real‑time cross‑language communication directly inside meetings without the need for plugins or external devices. The tool supports independent language channels for every participant, so each attendee hears the conversation in their preferred language.
A standout capability is voice cloning: when activated, listeners hear the translated speech in the speaker's own vocal tone, making it sound as if the speaker themselves were fluent in the target language. The system achieves a latency of less than three seconds, allowing dialogue to flow almost as naturally as a native‑language conversation.
Users can adjust the volume balance between the original audio and the interpretation. In formal settings, retaining some original sound helps verify accuracy; for casual talks, participants can mute the source entirely for a cleaner listening experience.
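As a rough illustration of what such a balance control does, the sketch below blends two audio streams with a single slider value. It is not Tencent Meeting's implementation: the function name `mix_frames`, the `balance` parameter, and the linear crossfade are all assumptions made for this example.

```python
# Hypothetical sketch: linear crossfade between original and interpreted audio.
# Nothing here reflects Tencent Meeting's actual code or API.

def mix_frames(original, interpreted, balance):
    """Blend two equal-length lists of audio samples.

    balance = 0.0 plays only the original audio,
    balance = 1.0 plays only the interpretation (source muted).
    """
    if not 0.0 <= balance <= 1.0:
        raise ValueError("balance must be between 0.0 and 1.0")
    if len(original) != len(interpreted):
        raise ValueError("frames must have the same length")
    # Weight each stream so the two gains always sum to 1.
    return [
        (1.0 - balance) * o + balance * i
        for o, i in zip(original, interpreted)
    ]


if __name__ == "__main__":
    source = [0.5, -0.5]
    translated = [0.25, 0.25]
    # Formal setting: keep some of the original for cross-checking.
    print(mix_frames(source, translated, 0.5))
    # Casual setting: mute the source entirely.
    print(mix_frames(source, translated, 1.0))
```

In this model, the "formal" case in the paragraph above corresponds to a mid-range `balance`, while muting the source corresponds to `balance = 1.0`.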
The feature is deeply integrated with Tencent Meeting's existing transcription and caption tools. During a multilingual session, participants can simultaneously hear the interpreted voice, read real‑time captions, view translated text, and access a written record.
To enable AI Simultaneous Interpretation, users select the function from the app toolbar during a meeting. Settings can be fine‑tuned by tapping the "Interpreting" indicator. With this rollout, Tencent Meeting's cross‑language tools cover interpreted audio, translated text, live captions, and a written record in a single workflow.