Text to Audio
With Text to Audio you convert text into an audio file. This is useful for voice-overs, audience communication, instructional videos, training materials, and scripts.
Getting started from the dashboard
On the dashboard, choose Text to Audio under the input field. The input field expands so you can conveniently enter longer scripts. You can then fill in the text and generate audio.
Settings
Via the settings button next to the input field you can adjust the speech settings.
| Setting | Explanation |
|---|---|
| Model | Choose the text-to-speech model. |
| Language | Choose the language in which the text should be spoken. |
| Voice | Choose a voice suitable for the chosen language. |
| System prompt | Provide instructions for pronunciation, tone, speed, accent, and special terms. |
| Style reference | Add extra cues about the desired speaking style. |
The voice list is filtered by the chosen language. If a voice is intended for only certain languages, you will see that language listed with the voice.
Pronunciation and style
The system prompt controls how the voice should sound. You can indicate, for example:
- that the speaker should sound as native Dutch,
- that words like AI, AI-Public, ChatGPT, OpenAI and Gemini may be pronounced in English,
- that Claude should be pronounced as a French name,
- or that the tone should be calm, warm, formal, informal, low, or energetic.
When you choose another language, AI-Public adjusts the default instructions to that language.
Saving and restoring
You can save your settings to your account. AI-Public will remember, among other things, the model, language, voice, and system prompt. With Restore defaults you remove these saved preferences.
Result
After generation the audio file appears directly in the chat. You can play it there with the audio player and download it with the download button.
During generation the input form is temporarily disabled. This prevents multiple audio generations from running concurrently.