PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Voice Cloning Python Packages

Python packages with the GitHub topic voice-cloning. Sorted by relevance, with stars and monthly downloads.
coqui-ai
tts

πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

183K 45K 6K
fishaudio
fish-audio-sdk

The official Python library for the Fish Audio API.

176K 178 33
OpenBMB
voxcpm

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

116K 19K 2K
playht
pyht

PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API

55K 219 32
chinokikiss
gsv-tts-lite

GSV-TTS-Lite A high-performance inference engine specifically designed for the GPT-SoVITS text-to-speech model.(few shot voice cloning)

5K 96 10
PaddlePaddle
paddlespeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

3K 13K 2K
sidharthrajaram
styletts2

🐍 πŸ€– Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

3K 161 40
PaddlePaddle
paddlespeech-feat

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

3K 13K 2K
hinanohart
vocaboot

Terminal-first OSS singing-voice-synthesis quality gate. Alpha (v0.1.0).

2K 0 0
sidharthrajaram
tts-webui-styletts2

🐍 πŸ€– Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

2K 161 40
SocAIty
speechcraft

πŸ”Š Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark

1K 70 8
appautomaton
mlx-speech

Pure-MLX speech synthesis, voice cloning, dialogue, sound-effects, and ASR for Apple Silicon β€” Fish S2 Pro, VibeVoice, LongCat, MOSS, Step-Audio, Cohere ASR.

1K 16 2
herimor
voxtream

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control

1K 231 27
PaddlePaddle
paddleaudio

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

1K 13K 2K
High-Logic
genie-tts

GPT-SoVITS ONNX Inference Engine & Model Converter

908 2K 107
hathibelagal-dev
str2speech

An easy-to-use library and command-line tool for TTS

831 15 1
PaddlePaddle
paddlespeech-ctcdecoders

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

798 13K 2K
jasminsternkopf
mel-cepstral-distance

A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the method proposed by Robert F. Kubichek in "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment".

539 67 12
DrewThomasson
ebook2audiobook

Convert eBooks to audiobooks with chapters and metadata

423 19K 2K
falcoschaefer99-eng
muse-tts

Three TTS engines, 54 voices, voice cloning. Mac, Windows, Linux. Nothing phones home.

381 0 1
gooofy
zerovox

zero-shot realtime TTS system, fully offline, free and open source

261 53 10
travisvn
chatterbox-tts-api

Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI API is used (e.g. Open WebUI, AnythingLLM, etc.)

184 599 141
travisvn
chatterbox-api

Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI API is used (e.g. Open WebUI, AnythingLLM, etc.)

135 599 141
coqui-ai
tts2

πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

132 45K 6K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery