MaixCAM MaixPy speech synthesis
2025-08-15
Update history
| Date | Version | Author | Update content |
|---|---|---|---|
| 2025-08-15 | 1.0.0 | lxowalle | Initial document |
Introduction
This document provides instructions on using the built-in TTS functionality to convert text into speech.
TTS Support List:
| | MaixCAM | MaixCAM Pro | MaixCAM2 |
|---|---|---|---|
| MeloTTS | ❌ | ❌ | ✅ |
About TTS
TTS (Text-to-Speech) converts text into speech. You write a piece of text and feed it to a model that supports TTS; the model then outputs audio data containing the spoken version of that text.
In practice, TTS is commonly used for video dubbing, navigation guidance, public announcements, and more. Simply put, TTS is “technology that reads text aloud.”
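To make the "text in, audio data out" flow concrete, here is a minimal sketch in plain Python. It does not use the MaixPy or MeloTTS API: `fake_tts` is a hypothetical stand-in that returns a short tone instead of real speech, and the 16 kHz, 16-bit mono PCM format is an assumption chosen for illustration. The sketch only shows how raw audio data returned by a TTS step could be wrapped into a WAV container using the standard library.

```python
import io
import math
import struct
import wave

SAMPLE_RATE = 16000      # assumed output sample rate (Hz), for illustration only
SAMPLES_PER_CHAR = 800   # hypothetical: 0.05 s of audio per input character


def fake_tts(text: str) -> bytes:
    """Hypothetical stand-in for a TTS model.

    Returns 16-bit little-endian mono PCM. Instead of speech it produces a
    440 Hz tone whose length grows with the length of the text.
    """
    n_samples = SAMPLES_PER_CHAR * len(text)
    frames = bytearray()
    for i in range(n_samples):
        value = int(12000 * math.sin(2 * math.pi * 440 * i / SAMPLE_RATE))
        frames += struct.pack("<h", value)
    return bytes(frames)


def pcm_to_wav_bytes(pcm: bytes, sample_rate: int = SAMPLE_RATE) -> bytes:
    """Wrap raw 16-bit mono PCM data in a WAV container."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wf:
        wf.setnchannels(1)      # mono
        wf.setsampwidth(2)      # 2 bytes per sample = 16-bit
        wf.setframerate(sample_rate)
        wf.writeframes(pcm)
    return buf.getvalue()


pcm = fake_tts("hello")             # text in, raw audio data out
wav_bytes = pcm_to_wav_bytes(pcm)   # audio data packaged as a playable file
```

With a real model, only the `fake_tts` step would change; the audio format (sample rate, bit depth, channel count) must then match what that model actually outputs.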
MeloTTS
For details on using MeloTTS, see the MeloTTS Text to Speech Model document.