Choosing a voice
Browse the full voice library in the XUNA AI Voice Library. Voices are organized by:- Gender — Male, female, or neutral.
- Age — Young, middle-aged, or old.
- Accent — American, British, Australian, and many more.
- Use case — Narration, conversational, customer service, etc.
- Language — Filtered by language support.
Supported languages
XUNA AI Conversational AI supports 31 languages for both speech recognition and synthesis. Set the agent’s primary language to ensure the ASR model is tuned for the correct language.View all supported languages
View all supported languages
English, Spanish, French, German, Italian, Portuguese, Polish, Dutch, Russian, Japanese, Korean, Chinese (Mandarin), Arabic, Hindi, Turkish, Swedish, Norwegian, Danish, Finnish, Czech, Slovak, Romanian, Hungarian, Ukrainian, Greek, Bulgarian, Croatian, Catalan, Hebrew, Malay, and Indonesian.
Automatic language detection
If your agent serves multilingual users, you can enable automatic language detection. The agent detects the user’s language from their first utterance and switches to matching speech recognition and synthesis automatically. Enable language detection through the tools configuration using the built-in language detection system tool.Automatic language detection works best when users speak a full sentence. Short utterances like “hi” may not provide enough signal to detect language reliably.
Voice settings
Fine-tune how the voice sounds using these settings:| Setting | Description | Range |
|---|---|---|
| Stability | How consistent the voice sounds across sentences. Higher = more consistent, lower = more expressive. | 0.0 – 1.0 |
| Similarity boost | How closely the synthesized voice matches the original voice clone. | 0.0 – 1.0 |
| Style exaggeration | Amplifies the style of the voice. Use sparingly — high values can distort quality. | 0.0 – 1.0 |

