A powerful, Transformer-based text-to-speech (TTS) tool.
Google EMEA gTech Ads Data Science Team's solution to automatically translate and dub video ads into multiple languages using AI.
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
LangChain integration for DeepAI API with chat, images, and speech capabilities
My MCP Server
AI powered Telegram bot for chatting, text-to-image and text-to-speech conversions
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models. Original authors: Yinghao Aaron Li, Cong Han, Vinay S. Raghavan, Gavin Mischler, Nima Mesgarani.
Orca Streaming Text-to-Speech Engine demos
TTS with RVC pipeline
A local/offline-capable voice assistant with speech recognition, LLM processing, and text-to-speech
A Python package for converting PDF and TXT files to audio
Microcontroller and python interface
Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control
A simple CLI tool for text-to-speech using OpenAI's API
A Python client for interacting with the Deepdub API
AllVoiceLab MCP Server
Text-to-Speech API Client with OpenAI compatibility
Python text-to-speech library with built-in voice effects and support for multiple TTS engines.
Text-to-speech
Minimal CosyVoice2 European inference CLI (bundles runtime + Matcha)
AI powered text-ehancer and offline text-to-speech
A package for text-to-speech and speech-to-text tools
Effective evaluations for Text-to-Speech (TTS) systems
PythonAIBrain is a versatile, plug-and-play Python package designed to help you build offline intelligent AI assistants and applications effortlessly. With modules covering speech recognition, text-to-speech, image generation, natural language understanding, and more, PythonAIBrain lets you create powerful AI solutions without deep expertise or complex setup. Whether you’re a beginner or an experienced developer, get ready to bring your AI ideas to life quickly and efficiently.
Unlimited text-to-speech generation with chunking and seamless merging
Python SDK for the Tsetsen, Mongolian Text-to-Speech API
Python client for PlomTTS AI Text-to-Speech server
A Python library for computing the Mel-Cepstral Distance (also known as Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the paper 'Mel-Cepstral Distance Measure for Objective Speech Quality Assessment' by Kubichek (1993).
Python SDK for Aurora
A powerful desktop client for Mistral LLMs
Python package for convert text to phoneme ipa, use for cross language embedding Text-to-speech Reseach
Aiola Text-To-Speech Python SDK
Dimits is a Python library that provides an easy-to-use interface to the Piper text-to-speech (TTS) system. It utilizes the powerful Piper TTS engine, which is optimized for Raspberry Pi 4, to generate high-quality synthesized speech.
Python Acapela Text-To-Speech
Bambara Text-to-Speech system using Maliba-AI models
Fish Speech pipeline as library so you don't need to webui.
A Python wrapper for A.I.VOICE Editor API
Text-To-Speech with MSSpeak
text-to-speech synthesis
A bot that uses the musixmatch API to transform songs into Google Text-to-Speech
Tool to make high quality text to speech (tts) corpus from audio + text books.
Simple, hackable text-to-speech with PyTorch or MLX.
Tools for Twilio+AWS translation projects