
Research
/Security News
60 Malicious Ruby Gems Used in Targeted Credential Theft Campaign
A RubyGems malware campaign used 60 malicious packages posing as automation tools to steal credentials from social media and marketing tool users.
A fully custom, offline Text-to-Speech system for the Twi language.
pip install kalakan-tts
git clone https://github.com/kalakan-ai/kalakan-tts.git
cd kalakan-tts
pip install -e ".[dev,api,training]"
from kalakan.synthesis.synthesizer import Synthesizer
# Initialize the synthesizer with specific models
synthesizer = Synthesizer(
acoustic_model="/path/to/acoustic_model.pt", # or use a model name like "tacotron2", "fastspeech2", "transformer_tts"
vocoder="/path/to/vocoder.pt", # or use a model name like "griffin_lim", "hifigan", "melgan", "waveglow"
device="cuda:0", # or "cpu"
)
# Generate audio from Twi text
audio = synthesizer.synthesize(
text="Akwaaba! Wo ho te sɛn?",
normalize=True,
clean=True,
speed=1.0, # Control speech speed (for FastSpeech2)
pitch=1.0, # Control pitch (for FastSpeech2)
energy=1.0, # Control energy/volume (for FastSpeech2)
)
# Save the audio to a file
synthesizer.save_audio(audio, "output.wav")
# Using the demo script
python demo.py --text "Akwaaba! Wo ho te sɛn?" --output output.wav --acoustic_model /path/to/acoustic_model.pt --vocoder /path/to/vocoder.pt
# Start the REST API server
python -m kalakan.api.server --host 0.0.0.0 --port 8000 --acoustic_model /path/to/acoustic_model.pt --vocoder /path/to/vocoder.pt
# Start the gRPC API server
python -m kalakan.api.grpc_api --host 0.0.0.0 --port 50051 --acoustic_model /path/to/acoustic_model.pt --vocoder /path/to/vocoder.pt
# REST API client
python rest_client.py --text "Akwaaba! Wo ho te sɛn?" --output output.wav --host localhost --port 8000
# gRPC API client
python grpc_client.py --text "Akwaaba! Wo ho te sɛn?" --output output.wav --host localhost --port 50051
Kalakan TTS includes a comprehensive metadata generation tool for preparing TTS datasets:
# Generate basic metadata for a dataset
kalakan gen-metadata --input-dir /path/to/dataset --verbose
# Generate metadata with phonemes, dataset splitting, and quality control
kalakan gen-metadata \
--input-dir /path/to/dataset \
--generate-phonemes \
--split-dataset \
--val-ratio 0.2 \
--test-ratio 0.1 \
--output-format both \
--include-stats \
--check-duplicates \
--remove-duplicates \
--normalize-text \
--clean-text \
--speaker-id "speaker_001" \
--verbose
For detailed documentation, see docs/metadata_generation_guide.md.
Kalakan TTS includes a powerful text normalization tool for preparing Twi text:
# Normalize Twi text directly
kalakan norm --text "Dr. Kwame na Prof. Ama bɛba ha 25 mu."
# Normalize text from files with various options
kalakan norm \
--file input.txt \
--output normalized.txt \
--format json \
--show-diff \
--verbose
For detailed documentation, visit docs/.
Apache License 2.0
FAQs
A fully custom, offline Text-to-Speech system for the Twi language
We found that kalakan-tts demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Research
/Security News
A RubyGems malware campaign used 60 malicious packages posing as automation tools to steal credentials from social media and marketing tool users.
Security News
The CNA Scorecard ranks CVE issuers by data completeness, revealing major gaps in patch info and software identifiers across thousands of vulnerabilities.
Research
/Security News
Two npm packages masquerading as WhatsApp developer libraries include a kill switch that deletes all files if the phone number isn’t whitelisted.