PyRank

ASR Python Packages

Python packages tagged with the GitHub topic asr (automatic speech recognition). Sorted by relevance, with stars and monthly downloads.
jdepoix
youtube-transcript-api

A Python API that lets you get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles, and it requires neither an API key nor a headless browser, unlike Selenium-based solutions.

22.5M 8K 775
deepgram
deepgram-sdk

Official Python SDK for Deepgram.

2.7M 430 130
m-bain
whisperx

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

1M 22K 2K
NVIDIA
nemo-toolkit

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

934K 17K 3K
alphacep
vosk

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

571K 15K 2K
kurianbenoy
whisper-normalizer

A Python package for the Whisper normalizer

504K 76 17
Ailln
cn2an

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

312K 760 83
k2-fsa
sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime, with no Internet connection required. Supports embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, and websocket server/client, across 12 programming languages.

258K 12K 1K
k2-fsa
sherpa-onnx-core

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime, with no Internet connection required. Supports embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, and websocket server/client, across 12 programming languages.

163K 12K 1K
speechmatics
speechmatics-rt

Python SDKs for Speechmatics APIs

154K 17 7
speechmatics
speechmatics-voice

Python SDKs for Speechmatics APIs

109K 17 7
linto-ai
whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

96K 3K 211
k2-fsa
sherpa-onnx-bin

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime, with no Internet connection required. Supports embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, and websocket server/client, across 12 programming languages.

52K 12K 1K
speechmatics
speechmatics-batch

Python SDKs for Speechmatics APIs

30K 17 7
istupakov
onnx-asr

A lightweight Python package for Automatic Speech Recognition using ONNX models

27K 316 30
ai-bot-pro
achatbot

An open-source chatbot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).

18K 89 18
k2-fsa
sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn, with no Internet connection required. Supports iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A, etc.

15K 2K 213
mbailey
voice-mode

Natural voice conversations with Claude Code

15K 1K 165
fgnt
meeteval

MeetEval - A meeting transcription evaluation toolkit

13K 155 18
analyticsinmotion
werpy

🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error Rate (WER). Built for the scalable evaluation of speech and transcription accuracy.

12K 26 6
sccn
eegprep

EEGPrep is an automated preprocessing tool for human EEG data built on a benchmarked EEGLAB pipeline

9K 23 5
Picovoice
pvcheetah

On-device streaming speech-to-text engine powered by deep learning

7K 663 77
coqui-ai
stt

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

6K 3K 300
speechmatics
speechmatics-flow

Python SDKs for Speechmatics APIs

6K 17 7
    • Data from PyPI, GitHub, ClickHouse, and BigQuery