How to generate speech
Describe what you need
Tell the agent what text to convert. You can type the text directly or ask the agent to write it first.
Choose a voice (optional)
If you do not specify a voice, the agent uses Rachel (the default). To use a different voice, mention it in your prompt.
You can also use a cloned voice by its ID. See Voice Cloning.
| Voice | Description |
|---|---|
| Rachel (default) | Neutral, clear, conversational |
| George | Male, warm tone |
| Sarah | Female, professional |
| Charlie | Male, casual |
| Lily | Female, friendly |
| Chris | Male, energetic |
Example prompts
Phone greeting for a small accounting firm
Voiceover for a bakery's Instagram Reel
Explainer narration for a fitness studio's website
What you cannot do
- You cannot control speed, pitch, or emphasis. The voice model determines pacing naturally.
- You cannot add pauses or breaks at specific positions in the text.
- You cannot output in formats other than MP3 or WAV.
- You cannot mix speech with background music in one step. Generate them separately.
Next steps
Multi-Speaker Dialogue
Generate conversations between multiple speakers.
Voice Cloning
Clone a voice from audio samples to use for speech generation.
