A versatile and user-friendly Python Text-to-Speech engine
Minimax MCP Server
A text-to-speech server and client using gtts and pydub
OuteAI Text-to-Speech (TTS)
Stream text into audio with an easy-to-use, highly configurable library delivering voice output with minimal latency.
image-upscaling.net api client
A text-to-speech conversion tool using Google Translate API
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language).
A Python CLI for Ruth NLP
A simple text splitter based on Tortoise for use in text-to-speech applications
TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.
Run local opensource AI models (Stable Diffusion, LLMs, TTS, STT, chatbots) in a lightweight Python GUI
Translate, transliterate, get the language of texts in no time with the help of multiple APIs!
A simple text-to-speech client for Azure TTS API.
Manim plugin for all things voiceover
PythonAIBrain is a versatile, plug-and-play Python package designed to help you build offline intelligent AI assistants and applications effortlessly. With modules covering speech recognition, text-to-speech, image generation, natural language understanding, and more, PythonAIBrain lets you create powerful AI solutions without deep expertise or complex setup. Whether you’re a beginner or an experienced developer, get ready to bring your AI ideas to life quickly and efficiently.
MLX-Audio is a package for inference of text-to-speech (TTS) and speech-to-speech (STS) models locally on your Mac using MLX
A Python library for computing the Mel-Cepstral Distance (also known as Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the paper 'Mel-Cepstral Distance Measure for Objective Speech Quality Assessment' by Kubichek (1993).
Speech Utils
Python package for text normalization, use for frontend of Text-to-speech Reseach
ElevenLabs MCP Server
Orpheus Text-to-Speech System
A Python client for the DynaSpark API - Free AI text generation, text-to-speech, and image generation
pyttsx - cross platform text-to-speech
A Python client for interacting with the Deepdub API
A clean interface to Windows speech recognition and text-to-speech capabilities.
A high quality multi-voice text-to-speech library
Orca Streaming Text-to-Speech Engine
text-to-speech synthesis lite
Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control
Official Python client for the Smallest AI API
LLM Agent Toolkit provides minimal, modular interfaces for core components in LLM-based applications.
A Python package to generate AI-powered podcasts in Hindi, Marathi, or English using Gemini API.
my dash mic recorder
My MCP Server
Google EMEA gTech Ads Data Science Team's solution to automatically translate and dub video ads into multiple languages using AI.
Mobvoi MCP Server
A python text to speech library
Toolkit for using and training Parler-TTS, a high-quality text-to-speech model.
AI-powered Text-to-Speech web application with multiple provider support
Text-to-Speech API Client with OpenAI compatibility
A Media Toolkit. Text-to-speech is currently available.
Convert any content to a podcast
A Python package for converting PDF and TXT files to audio
Voice-Activated Natural Language UI