Speech Detection Python Packages

ffsubsync

Automagically synchronize subtitles with video.

24K 8K 320

inaspeechsegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

10K 900 152

omnivad

Cross-platform VAD & Audio Event Detection toolkit — Python (PyPI) + TypeScript (npm) + C API. DFSMN models ~2MB, 200x real-time. Runs everywhere: native, browser (WASM), Node.js.

8K 86 5

subsync

Synchronize your subtitles using machine learning

402 158 15

subsume

Automagically synchronize subtitles with video.

95 8K 320

ffsubs

Automagically synchronize subtitles with video.

76 8K 320