Text to Audio

Text to Audio converts text into an audio file. This is useful for voice-overs, audience communication, instructional videos, training materials, and scripts.

Getting started from the dashboard

On the dashboard, select Text to Audio below the input field. The input field expands so that longer scripts are easier to enter. Enter your text, then generate the audio.

Settings

Use the settings button next to the input field to adjust the speech settings.

Setting           Explanation
Model             Choose the text-to-speech model.
Language          Choose the language in which the text should be spoken.
Voice             Choose a voice suitable for the chosen language.
System prompt     Provide instructions for pronunciation, tone, speed, accent, and special terms.
Style reference   Add extra cues about the desired speaking style.

The voice list is filtered by the chosen language. If a voice is intended only for certain languages, those languages are listed with the voice.
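
For readers who want a mental model of this filtering, here is a minimal sketch in Python. It does not show how AI-Public implements the voice list. The `Voice` class, the `voices_for` and `label` functions, the voice names, and the language codes are all hypothetical, chosen only to illustrate the behavior described above: a voice without a language restriction is always shown, and a restricted voice is shown only for its own languages, with those languages listed next to its name.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Voice:
    """Hypothetical voice record; not AI-Public's actual data model."""
    name: str
    # An empty tuple means the voice is not restricted to particular languages.
    languages: tuple = ()


def voices_for(language, voices):
    """Keep voices that are unrestricted or that list the chosen language."""
    return [v for v in voices if not v.languages or language in v.languages]


def label(voice):
    """Show the language restriction next to the voice name, if there is one."""
    if voice.languages:
        return f"{voice.name} ({', '.join(voice.languages)})"
    return voice.name


# Made-up example data.
VOICES = [
    Voice("Voice A"),
    Voice("Voice B", ("nl",)),
    Voice("Voice C", ("en", "fr")),
]

print([label(v) for v in voices_for("nl", VOICES)])
# prints ['Voice A', 'Voice B (nl)']
```

In this sketch, choosing Dutch hides "Voice C" because it is limited to English and French, while the unrestricted "Voice A" stays available for every language.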

Pronunciation and style

The system prompt controls how the voice should sound. For example, you can specify:

  • that the speaker should sound like a native Dutch speaker,
  • that words like AI, AI-Public, ChatGPT, OpenAI and Gemini may be pronounced in English,
  • that Claude should be pronounced as a French name,
  • or that the tone should be calm, warm, formal, informal, low, or energetic.

When you choose another language, AI-Public adjusts the default instructions to that language.

Saving and restoring

You can save your settings to your account. AI-Public then remembers, among other things, the model, language, voice, and system prompt. Restore defaults removes these saved preferences.

Result

After generation, the audio file appears directly in the chat. You can play it there with the audio player or download it with the download button.

During generation, the input form is temporarily disabled. This prevents multiple audio generations from running at the same time.
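
The idea behind disabling the form can be sketched as a simple guard. This is an illustration under stated assumptions, not AI-Public's actual code: the `AudioForm` class, its `enabled` flag, and the `fake_generate` function are hypothetical names. The form is switched off before generation starts and switched back on when it finishes, so a second request that arrives in between is ignored.

```python
class AudioForm:
    """Hypothetical input form that allows one generation at a time."""

    def __init__(self, generate):
        self._generate = generate  # function that turns text into audio
        self.enabled = True

    def submit(self, text):
        if not self.enabled:
            # A generation is already running, so this request is ignored.
            return None
        self.enabled = False
        try:
            return self._generate(text)
        finally:
            # Re-enable the form whether generation succeeded or failed.
            self.enabled = True


def fake_generate(text):
    # Simulate a second click arriving while the first generation is running.
    second = form.submit("second request")
    return {"audio": f"<audio for {text}>", "second": second}


form = AudioForm(fake_generate)
result = form.submit("first request")
print(result["second"])  # prints None: the second request was rejected
print(form.enabled)      # prints True: the form is usable again
```

Re-enabling the form in a `finally` clause means it becomes usable again even if generation fails, which is one reasonable design choice for a guard like this.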