Text to Speech
Learn how to use the JigsawStack Text to Speech API to convert text into natural speech
Overview
The Text to Speech API converts written text into natural-sounding speech output. You can use predefined speaker accents or clone a voice from an audio sample to create personalized speech output.
- Support for multiple languages and accents
- Voice cloning from audio samples
- High-quality, natural-sounding output
- Simple integration options
API Endpoint
Quick Start
See Speaker Voice Accents for the complete list of supported speaker voice accents.
Response
The API returns the generated audio file directly in the response body as binary data, typically in MP3 format.
Using Text to Speech Effectively
- Keep text natural: Write text that sounds natural when spoken aloud
- Consider punctuation: Use commas and periods to create natural pauses
- Test different accents: Try different voice options to find the best fit for your use case
- For voice cloning: Use high-quality audio samples with minimal background noise
Examples
Voice Clone Management
JigsawStack provides functionalities to create, list, and delete voice clones for Text to Speech. These voice clones can be reused across multiple text-to-speech requests.
Create Voice Clone
Create a new voice clone from an audio sample.
List Voice Clones
Retrieve a list of all your voice clones.
Delete Voice Clone
Delete a voice clone when you no longer need it.
Using Voice Clones for Text to Speech
Once you’ve created a voice clone, you can use it for text-to-speech by referencing its ID.