PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Voice Activity Detection Python Packages

Python packages with the GitHub topic voice-activity-detection. Sorted by relevance, with stars and monthly downloads.
snakers4
silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

825K 9K 771
alibaba-damo-academy
funasr

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

347K 16K 2K
amsehili
auditok

An audio/acoustic activity detection and audio segmentation tool

27K 848 100
k2-fsa
sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.

15K 2K 213
lifeiteng
omnivad

Cross-platform VAD & Audio Event Detection toolkit — Python (PyPI) + TypeScript (npm) + C API. DFSMN models ~2MB, 200x real-time. Runs everywhere: native, browser (WASM), Node.js.

14K 63 3
smacke
ffsubsync

Automagically synchronize subtitles with video.

13K 8K 317
ina-foss
inaspeechsegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

8K 888 150
alibaba-damo-academy
funasr-onnx

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7K 16K 2K
chicogong
ffvoice

🎙️ 高性能 C++ 语音引擎 - 实时音频处理 + AI 语音识别 + 边录边转写 | High-performance C++ voice engine with real-time ASR and RNNoise

7K 2 0
juanmc2005
diart

A python package to build AI-powered real-time audio applications

5K 2K 164
shashikg
whisper-s2t

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

5K 572 76
daanzu
silero-vad-lite

Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies

3K 18 1
FireRedTeam
fireredvad

A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages, outperforming Silero-VAD, TEN-VAD, FunASR-VAD and WebRTC-VAD

3K 388 26
FoxNoseTech
diarize

Speaker diarization for Python — "who spoke when?" CPU-only, no API keys, Apache 2.0. ~10.8% DER on VoxConverse, 8x faster than real-time.

3K 67 7
decibri
decibri

Cross-platform audio capture, playback, and voice activity detection for Python, Rust, and Node.js powered by a single Rust core.

3K 8 0
Picovoice
pvcobra

On-device voice activity detection (VAD) powered by deep learning

2K 253 17
strands-labs
pywebrtc-audio

Python bindings for WebRTC's audio processing pipeline, bringing noise suppression, echo cancellation, automatic gain control, and voice activity detection to the Python ecosystem via pybind11.

2K 2 0
baxtree
subaligner

Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/

2K 505 24
alibaba-damo-academy
funasr-torch

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

785 16K 2K
stefanwebb
open-voice-activity-detection

Fully open-source and state-of-the-art Voice Activity Detection (VAD) models for academic research and commercial applications.

762 7 0
dangvansam
livekit-plugins-tenvad

TEN VAD low-latency voice activity detection for real-time streaming, integrated with livekit-agents

661 25 6
k2-fsa
sherpa-ncnn-bin

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.

645 2K 213
k2-fsa
sherpa-ncnn-core

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.

585 2K 213
mechanicalsea
spectra-torch

Spectra Extraction based on PyTorch

558 41 4
    • Data from PyPI, GitHub, ClickHouse, and BigQuery