A versatile and user-friendly Python Text-to-Speech engine
A text-to-speech server and client using gtts and pydub
image-upscaling.net API client
A text-to-speech conversion tool using Google Translate API
A Python CLI for Ruth NLP
OuteAI Text-to-Speech (TTS)
Stream text into audio with an easy-to-use, highly configurable library delivering voice output with minimal latency.
Run local open-source AI models (Stable Diffusion, LLMs, TTS, STT, chatbots) in a lightweight Python GUI
Minimax MCP Server
Japanese text preprocessor for text-to-speech applications (an OpenJTalk rewrite in Rust).
A simple text splitter based on Tortoise for use in text-to-speech applications
A simple text-to-speech client for Azure TTS API.
Orpheus Text-to-Speech System
Convert any content to a podcast
Translate, transliterate, and detect the language of texts in no time with the help of multiple APIs!
Python package for text normalization, for use as a front end in text-to-speech research
A clean interface to Windows speech recognition and text-to-speech capabilities.
Manim plugin for all things voiceover
ElevenLabs MCP Server
A Python text-to-speech library
MLX-Audio is a package for inference of text-to-speech (TTS) and speech-to-speech (STS) models locally on your Mac using MLX
my dash mic recorder
Orca Streaming Text-to-Speech Engine
bard is a text-to-speech tool for your desktop, based on existing open-source models (installed locally) and APIs
A powerful, Transformer-based text-to-speech (TTS) tool.
Official Python client for the Smallest AI API
A high quality multi-voice text-to-speech library
pyttsx - cross-platform text-to-speech
A Python package for Gemini Text-to-Speech using the official API.
TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.
Google EMEA gTech Ads Data Science Team's solution to automatically translate and dub video ads into multiple languages using AI.
Toolkit for using and training Parler-TTS, a high-quality text-to-speech model.
A Media Toolkit. Text-to-speech is currently available.
Voice-Activated Natural Language UI
CLI tool to manage piper voices.
A Python library for computing the Mel-Cepstral Distance (also known as Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the paper 'Mel-Cepstral Distance Measure for Objective Speech Quality Assessment' by Kubichek (1993).
Official Speechify API SDK
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models. Original authors: Yinghao Aaron Li, Cong Han, Vinay S. Raghavan, Gavin Mischler, Nima Mesgarani.
AI-powered Telegram bot for chatting, text-to-image and text-to-speech conversions
Dimits is a Python library that provides an easy-to-use interface to the Piper text-to-speech (TTS) system. It utilizes the powerful Piper TTS engine, which is optimized for Raspberry Pi 4, to generate high-quality synthesized speech.
Orca Streaming Text-to-Speech Engine demos
A Python package for converting PDF and TXT files to audio