MiniMax Unveils Voice Design Feature for Speech Model Update

Breaking NewsJun. 23, 2025

Summarized by NextFin AI

Chinese AI startup MiniMax has launched an update to its Speech-02 voice model, introducing a new feature called Voice Design.
This feature allows users to describe their ideal voice characteristics using natural language, enabling precise control over voice outputs.
Voice Design can generate entirely novel voice tones that do not exist in reality, enhancing text-to-speech capabilities.
The integration with Speech-02 ensures a “what you describe is what you get” performance in voice generation tasks.

AsianFin — Chinese AI startup MiniMax has rolled out a new update to its Speech-02 voice model, introducing a feature called “Voice Design,” according to a statement on Monday.

The new functionality allows users to describe their ideal voice characteristics using natural language, enabling precise, multi-dimensional control over voice outputs — including the ability to generate entirely novel voice tones that don’t exist in the real world.

Integrated seamlessly with the Speech-02 model, the Voice Design feature empowers users to achieve true “what you describe is what you get” performance in text-to-speech tasks, the company said.

Explore more exclusive insights at nextfin.ai.

Insights

What is the concept behind MiniMax's Voice Design feature?

How does the Voice Design functionality work within the Speech-02 model?

What are the key improvements in the Speech-02 voice model with the new update?

What user feedback has been received regarding the Voice Design feature?

How does the Voice Design feature compare to existing text-to-speech technologies?

What market trends are influencing the development of voice technologies?

What recent news has emerged regarding advancements in AI speech models?

How do users describe their ideal voice characteristics using the Voice Design feature?

What challenges does MiniMax face in the competitive voice technology market?

What potential impacts could the Voice Design feature have on the future of voice applications?

How does the ability to create novel voice tones expand the possibilities in content creation?

What are the limitations of the current Speech-02 model despite its new features?

Are there any controversies surrounding the use of AI-generated voices?

What historical advancements have led to the current capabilities of voice models?

How does MiniMax's technology stack up against competitors in the AI voice landscape?

What are the implications of allowing users to create entirely new voice tones?

NextFin.Al

No Noise, only Signal.

Open App