PyRank

Speech Recognition Python Packages

Python packages tagged with the GitHub topic speech-recognition, sorted by relevance. The figures under each entry include monthly downloads and GitHub stars.
huggingface
transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

149.1M 161K 33K
Uberi
speechrecognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

7.9M 9K 2K
SYSTRAN
faster-whisper

Faster Whisper transcription with CTranslate2

7.1M 23K 2K
deepgram
deepgram-sdk

Official Python SDK for Deepgram.

2.7M 430 130
openvinotoolkit
openvino

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

1.4M 10K 3K
m-bain
whisperx

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

1M 22K 2K
nttcslab-sp
kaldiio

A pure python module for reading and writing kaldi ark files

847K 268 36
alphacep
vosk

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

571K 15K 2K
alibaba-damo-academy
funasr

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

347K 16K 2K
Ailln
cn2an

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

312K 760 83
revdotcom
rev-ai

Rev AI Python SDK

254K 36 13
Blaizzy
mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

186K 7K 594
openvinotoolkit
openvino-dev

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

181K 10K 3K
cmusphinx
pocketsphinx

A small speech recognizer

127K 4K 729
Picovoice
pvporcupine

On-device wake word detection powered by deep learning

107K 5K 576
linto-ai
whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

96K 3K 211
huggingface
pytorch-pretrained-bert

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

85K 161K 33K
speechmatics
speechmatics-python

Python library and CLI for Speechmatics

52K 75 23
huggingface
pytorch-transformers-pvt-nightly

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

43K 161K 33K
Softcatala
whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

40K 1K 126
huggingface
pytorch-transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

32K 161K 33K
dictation-toolbox
dragonfly2

Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx

30K 412 79
espnet
espnet

End-to-End Speech Processing Toolkit

27K 10K 2K
istupakov
onnx-asr

A lightweight Python package for Automatic Speech Recognition using ONNX models

27K 316 30
Data from PyPI, GitHub, ClickHouse, and BigQuery