You're Invited:Meet the Socket Team at BlackHat and DEF CON in Las Vegas, Aug 4-6.RSVP →

Book a Demo Install Sign in

ttsvowelviz

Package Overview

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

ttsvowelviz

A tool for visualizing static and dynamic vowel spaces during text-to-speech model training

0.1.0

PyPI

Maintainers: 1

TTSVowelViz

TTSVowelViz is a tool for visualizing static and dynamic vowel spaces during the training of text-to-speech ( TTS) models. This helps researchers and developers monitor the progression of vowel quality over training steps.

✨ Features

📊 Visualize static and dynamic vowel spaces across training steps.
📈 Track vowel space evolution during model training.
🔍 Examine the shape of the learned vowel space.
🆚 Compare learned vowel spaces at training steps against the ground truth.
🛠️ Customize visualizations with various user-defined inputs and configurations.
🧠 Analyze, evaluate, and interpret TTS systems.
🧩 Easily integrate into TTS training pipelines with minimal effort.

📦 Installation

Install using pip:

pip install ttsvowelviz

Or install the latest version from source:

git clone https://github.com/pasindu-ud/ttsvowelviz.git
cd ttsvowelviz
pip install .

🔧 Usage

Basic Example

from typing import List, Union

from ttsvowelviz import Synthesizer, TTSVowelViz
from ttsvowelviz.forced_aligner import ForcedAligner, WebMAUSBasicAligner
from ttsvowelviz.formant_extractor import FormantExtractor, PraatFormantExtractor


class ExampleSynthesizer(Synthesizer):
    def synthesize(self, step: int, text: str) -> str:
        # Code to generate speech from text at a given step
        return "Path to the synthesized audio file"


static_vowels: List[str] = ["3:", "6", "6:", "I", "O", "U", "e", "i:", "o:", "{", "}:"]
static_time_points: List[Union[int, float]] = [50]
point_vowels: List[str] = ["i:", "o:", "6:"]
dynamic_vowels: List[str] = ["@}", "Ae", "e:", "oI", "{I", "{O"]
dynamic_time_points: List[Union[int, float]] = [20, 50, 80]
intermediate_steps: List[int] = [0, 1000, 3000]
synthesizer: Synthesizer = ExampleSynthesizer()
forced_aligner: ForcedAligner = WebMAUSBasicAligner(language="eng-NZ")
formant_extractor: FormantExtractor = PraatFormantExtractor()
text_list: List[str] = ["Heard foot hud heed head had hard hod thought goose hid heard.",
                        "How'd hear oat hide lloyd hare aid how'd."]
ground_truth_src_dir_path: str = "Path to the ground truth directory"
vowel_space_dst_dir_path: str = "Path to the directory where vowel spaces should be saved"

tool: TTSVowelViz = TTSVowelViz(static_vowels=static_vowels, static_time_points=static_time_points,
                                point_vowels=point_vowels, dynamic_vowels=dynamic_vowels,
                                dynamic_time_points=dynamic_time_points, intermediate_steps=intermediate_steps,
                                synthesizer=synthesizer, forced_aligner=forced_aligner,
                                formant_extractor=formant_extractor, text_list=text_list,
                                ground_truth_src_dir_path=ground_truth_src_dir_path,
                                vowel_space_dst_dir_path=vowel_space_dst_dir_path)
for s in intermediate_steps:
    tool.execute(step=s)

📚 Citation

If you use this tool in your research, please cite:

@misc{ttsvowelviz2025,
  author = {Pasindu Udawatta and Jesin James and B.T. Balamurali and Catherine I. Watson and Ake Nicholas and Binu Abeysinghe},
  title = {TTSVowelViz},
  year = {2025},
  url = {https://github.com/pasindu-ud/ttsvowelviz}
}

📄 License

MIT License. See LICENSE file for details.

Keywords

FAQs

What is ttsvowelviz?

Is ttsvowelviz well maintained?

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

ttsvowelviz

TTSVowelViz

✨ Features

📦 Installation

🔧 Usage

Basic Example

📚 Citation

📄 License

Keywords

Related posts

Malicious npm Packages Target WhatsApp Developers with Remote Kill Switch

11 Malicious Go Packages Distribute Obfuscated Remote Payloads

TC39 Advances 11 Proposals for Math Precision, Binary APIs, and More