A versatile and user-friendly Python Text-to-Speech engine
Fast and local neural text-to-speech engine
The official Python library for the Fish Audio API
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
OmniVoice: Towards Omnilingual Zero-Shot Text-to-Speech with Diffusion Language Models
A text-to-speech server and client using gtts and pydub
Official Python SDK for KugelAudio TTS API
MLX-Audio is a package for inference of text-to-speech (TTS) and speech-to-speech (STS) models locally on your Mac using MLX
Orca Streaming Text-to-Speech Engine
Translate, transliterate, get the language of texts in no time with the help of multiple APIs!
TTS caching integration for Pipecat to reduce API costs on repeated phrases
A Python CLI for Ruth NLP
Kyutai's pocket-sized text-to-speech!
Fast multilingual text-to-phoneme converter for South East Asian languages.
A text-to-speech conversion tool using Google Translate API
image-upscaling.net api client
Text-to-speech CLI, MCP server, and Claude Code plugin (ElevenLabs, AWS Polly, OpenAI)
Minimax MCP Server
ElevenLabs MCP Server
A Python client for interacting with the Deepdub API
NeuTTS - a package for text-to-speech generation using Neuphonic's TTS models.
Stream text into audio with an easy-to-use, highly configurable library delivering voice output with minimal latency.
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language).
LiveKit Agents plugin for Camb.ai TTS
Prebuilt binary wheels for pyopenjtalk (Python wrapper for OpenJTalk)
Advanced on-device Vietnamese TTS with instant voice cloning
A simple text splitter based on Tortoise for use in text-to-speech applications
Echo TTS - Text-to-Speech synthesis with voice cloning
Coze Coding Dev SDK - 优雅的多功能 AI SDK,支持图片生成、视频生成、语音合成、语音识别、大语言模型、联网搜索和文本/多模态 Embedding。包含命令行工具 coze-coding-ai,支持 Context 上下文追踪
Deepgram STT and TTS integration for Vision Agents
OuteAI Text-to-Speech (TTS)
Local voice layer for AI coding tools — 100% offline voice assistant for Claude Code
Cartesia TTS integration for Vision Agents
Manim plugin for all things voiceover
OpenAI-compatible HTTP server for OmniVoice TTS
High-quality Text-to-Speech synthesis with ONNX Runtime
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
ElevenLabs TTS and STT integration for Vision Agents
Kokoro TTS integration for Vision Agents
Run local opensource AI models (Stable Diffusion, LLMs, TTS, STT, chatbots) in a lightweight Python GUI
Shunyalabs ASR & TTS services for Pipecat
Python package for text normalization, use for frontend of Text-to-speech Reseach
Inworld AI TTS integration for Vision Agents
Fish Audio TTS and STT integration for Vision Agents
Ultra-lightweight English text-to-speech model (1.6M params, ~3.4MB ONNX)
A high-performance inference engine specifically designed for the GPT-SoVITS text-to-speech model