Introduction
MeloTTS is a high-quality multi-lingual text-to-speech library by MyShell.ai. Supported languages include:
Some other features include:
- The Chinese speaker supports mixed Chinese and English.
- Fast enough for CPU real-time inference.
Usage
The Python API and model cards can be found in this repo or on HuggingFace.
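As a rough illustration of what a call through the Python API might look like, here is a minimal sketch. It assumes the `melo` package from this repo is installed and exposes `melo.api.TTS` with a `tts_to_file` method and a `hps.data.spk2id` speaker map. The wrapper name `synthesize_to_file`, the language code `EN`, and the speaker name `EN-US` are illustrative assumptions, so check them against the API docs in the repo before relying on them.

```python
# Minimal sketch, not the official usage: a hypothetical wrapper around the
# MeloTTS Python API. All melo names below are assumptions to verify in the repo.
def synthesize_to_file(text, output_path, language="EN", speaker="EN-US",
                       speed=1.0, device="auto"):
    """Synthesize `text` into a WAV file at `output_path` and return the path."""
    # Imported lazily so this function can be defined without MeloTTS installed.
    from melo.api import TTS

    # Load the model for the requested language; "auto" is assumed to let the
    # library pick a GPU when one is available and fall back to CPU otherwise.
    model = TTS(language=language, device=device)

    # Assumed mapping from speaker (accent) names to integer speaker IDs.
    speaker_ids = model.hps.data.spk2id

    # Run synthesis and write the audio file.
    model.tts_to_file(text, speaker_ids[speaker], output_path, speed=speed)
    return output_path
```

Under these assumptions, calling `synthesize_to_file("Hello from MeloTTS.", "en-us.wav")` would write the synthesized speech to `en-us.wav`. Passing `device="cpu"` would force CPU inference.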
Discord
Join our Discord community and select the Developer role upon joining to gain exclusive access to our developer-only channel! Don't miss out on valuable discussions and collaboration opportunities.
Contributing
If you find this work useful, please consider contributing to this repo.
Authors
Citation
@software{zhao2024melo,
author={Zhao, Wenliang and Yu, Xumin and Qin, Zengyi},
title = {MeloTTS: High-quality Multi-lingual Multi-accent Text-to-Speech},
url = {https://github.com/myshell-ai/MeloTTS},
year = {2023}
}
TODO
License
This library is released under the MIT License, which means it is free for both commercial and non-commercial use.
Acknowledgements
This implementation is based on TTS, VITS, VITS2 and Bert-VITS2. We appreciate their awesome work.