PyRank

Voice Recognition Python Packages

Python packages tagged with the GitHub topic voice-recognition, sorted by relevance and shown with stars and monthly downloads.
snakers4
silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

825K 9K 771
alphacep
vosk

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

571K 15K 2K
moonshine-ai
moonshine-voice

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

27K 8K 423
ai-bot-pro
achatbot

An open-source chatbot architecture for voice/vision (and multimodal) assistants that runs locally (CPU/GPU bound) or remotely (I/O bound).

18K 89 18
collabora
whisper-live

A nearly-live implementation of OpenAI's Whisper.

12K 4K 554
usefulsensors
fastrtc-moonshine-onnx

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

7K 8K 423
Picovoice
pvcheetah

On-device streaming speech-to-text engine powered by deep learning

7K 663 77
coqui-ai
stt

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

6K 3K 300
Picovoice
pvleopard

On-device speech-to-text engine powered by deep learning

5K 479 29
moonshine-ai
useful-moonshine-onnx

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

4K 8K 423
Picovoice
pvrhino

On-device Speech-to-Intent engine powered by deep learning

4K 701 95
shhossain
banglaspeech2text

BanglaSpeech2Text: An open-source offline speech-to-text package for the Bangla language, fine-tuned on the latest Whisper speech-to-text model for optimal performance.

3K 123 19
PaddlePaddle
paddlespeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

3K 13K 2K
PaddlePaddle
paddlespeech-feat

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

3K 13K 2K
coqui-ai
iara-stt

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

2K 3K 300
spokestack
spokestack

Spokestack Library for Python

2K 142 15
coqui-ai
coqui-stt-training

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

1K 3K 300
botbahlul
autosrt

A utility for automatic speech recognition and subtitle generation

1K 69 13
OpenJarbas
hivemind-voice-sat

Hivemind Voice Satellite

1K 21 13
PaddlePaddle
paddleaudio

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

1K 13K 2K
coqui-ai
stt-tflite

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

1K 3K 300
solovieff
kibernikto

Kibernikto is a multi-agent AI framework with a Telegram bot connection.

972 34 8
Picovoice
pvcheetahdemo

Cheetah speech-to-text engine demos

908 663 77
PaddlePaddle
paddlespeech-ctcdecoders

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

798 13K 2K
Data from PyPI, GitHub, ClickHouse, and BigQuery