PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Speaker Diarization Python Packages

Python packages with the GitHub topic speaker-diarization. Sorted by relevance, with stars and monthly downloads.
alibaba-damo-academy
funasr

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

346K 16K 2K
linto-ai
whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

100K 3K 211
espnet
espnet

End-to-End Speech Processing Toolkit

27K 10K 2K
alibaba-damo-academy
funasr-onnx

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7K 16K 2K
wenet-e2e
wespeakerruntime

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

7K 1K 195
juanmc2005
diart

A python package to build AI-powered real-time audio applications

5K 2K 164
wq2012
spectralcluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

4K 551 75
FoxNoseTech
diarize

Speaker diarization for Python — "who spoke when?" CPU-only, no API keys, Apache 2.0. ~10.8% DER on VoxConverse, 8x faster than real-time.

3K 67 7
ekhodzitsky
polyvoice

Speaker diarization for Rust — who spoke when, without Python. Silero VAD + WeSpeaker + AHC in a single Pipeline::run() call.

3K 0 0
NavodPeiris
speechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

1K 263 27
thc1006
taiwan-asr-toolkit

Production-grade Traditional Chinese / Taiwan Mandarin speech-to-text. Qwen3-ASR + MediaTek Breeze-ASR-25, hot-word injection, LLM polish, speaker diarization. RTF up to 1554x on RTX 5090, 56 TDD tests.

1K 2 0
Anes1032
qwen-aligner-toolkit

Production toolkit around Qwen3-ForcedAligner: multilingual word-level alignment, VAD, and speaker diarization.

816 0 0
alibaba-damo-academy
funasr-torch

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

746 16K 2K
wq2012
simpleder

A lightweight library to compute Diarization Error Rate (DER).

698 62 9
gorkemkaramolla
whisper-run

Faster Whisper with Speaker Diarization

588 9 1
XqFeng-Josie
voxkitchen

Declarative speech data processing toolkit — write a YAML recipe, run vkit run, get training-ready data. Ships a Claude Code skill.

580 2 0
narcotic-sh
senko

Very fast, accurate speaker diarization

563 259 29
google
diarizationlm

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

558 449 38
Picovoice
pvfalcon

Falcon Speaker Diarization Engine

464 71 7
hwk06023
sonata-asr

SONATA (SOund and Narrative Advanced Transcription Assistant): An advanced ASR system that captures human expressions including emotive sounds and non-verbal cues.

433 6 2
google
sidlingvo

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

408 449 38
ringger
transcribe-critic

Multi-source transcript merging inspired by textual criticism — LLM adjudicates multiple Whisper, YouTube captions & external transcripts for higher quality. Includes speaker diarization and summarization.

398 23 2
clement-pages
gryannote

Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.

342 70 8
google
uisrnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

330 2K 319
    • Data from PyPI, GitHub, ClickHouse, and BigQuery