Speech recognition and synthesis module for Windows - Python 2 and 3.
PDF text-to-speech
Manim plugin for all things voiceover (fork with updated ElevenLabs API)
Library to save and load pronunciation dictionaries (language-independent).
Continuous speech-to-text using Watson in Python, over WebSocket, recording from the microphone
A flexible routing library for GenAI text-to-speech (TTS).
CLI and library to modify pronunciation dictionaries (any language).
Python SDK for the Tsetsen, Mongolian Text-to-Speech API
A Python package for out-of-the-box ML solutions
AxelSolver is a series of Solvers (classes) that run specific tasks. For example, VoiceSolver handles text-to-speech and NuclearSolver handles a KillSwitch website (just in case AI takes over).
A Python package for out-of-the-box ML solutions
`SpeakerPy` is a Python library for speech synthesis based on Silero Text-to-Speech models.
Text-to-speech using Watson
Streamlit extensions for text-to-speech
Command-line interface (CLI) to train Tacotron 2 using .wav <=> .TextGrid pairs.
A simple CLI tool for text-to-speech using OpenAI's API
Audio and voice package
Python text-to-speech IVONA Wrapper
Python package for synthesizing text into speech
Web app, command-line interface and Python library for synthesizing English texts into speech.
Fork of StyleTTS 2 Python package. StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models. Original authors: Yinghao Aaron Li, Cong Han, Vinay S. Raghavan, Gavin Mischler, Nima Mesgarani, Sidharth Rajaram.
Turns your voice into text-to-speech!
Text-To-Speech gbp-webhook plugin
MCP server for ElevenLabs text-to-speech API integration
Automate chunking long texts to produce a single audio file from text-to-speech APIs
A command-line interface and Python library for encoding text into synthetic speech using Google Cloud Text-To-Speech or ElevenLabs APIs.
JoTTS is a German text-to-speech engine.
A Text-to-Speech (TTS) service based on Model Context Protocol (MCP), using Microsoft Edge TTS engine
Command-line interface (CLI) to select lines of a text file.
Library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings according to Ribeiro et al. (2011).
Text-to-speech using the Azure OpenAI TTS API
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech.
A Model Context Protocol (MCP) project for VoiSpark.
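Tool for automatically generating subtitled videos, with support for text-to-speech, subtitle synchronization, and video compositing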
A Python client library for the Aristech Text-to-Speech API
Command-line interface (CLI) to create a pronunciation dictionary based on annotations.
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Minimax MCP Server
Simple text-to-speech converter with multiple languages and voices
TTS (Text-to-Speech) library for Discord bots
Descript Audio Codec - MLX
A text-to-speech interface with mplayer-like bindings, using espeak
Text-to-Speech (TTS) with a natural human voice: converts written text into spoken words using machine learning models trained to reproduce the nuances, intonation, and rhythm of human speech, so the output sounds natural and lifelike.
EVA ICS v4 text-to-speech service
CLI to modify text files.
Python client for the Player2 API - AI powered gaming tools
Advanced voice processing package with comprehensive TTS and speech recognition capabilities
A Text-to-Speech MCP server for Cursor IDE