Skip to Content

How to Get text-to-speech audio trained on your voice

AudioBox is Meta’s new foundation research model for audio generation. In this tutorial, we’ll show you how to leverage it to generate AI audio that sounds like your own voice for free.

How to Get text-to-speech audio trained on your voice

Step-by-step

  1. Head to the AudioBox demo and scroll down.
  2. Click “Record your voice“ to use your audio sample, or use text descriptions or pre-loaded voices instead.
  3. AudioBox will then prompt you to read a short sentence to upload your vocals to the model.
  4. Type the text you’d like to generate after recording (or using a sample recording).
  5. That’s it! AudioBox will then generate two recordings in your vocal style 🎉