PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Diarization Python Packages

Python packages with the GitHub topic diarization. Sorted by relevance, with stars and monthly downloads.
desh2608
spy-der

Simple Python package for fast DER computation

15K 35 7
FoxNoseTech
diarize

Speaker diarization for Python — "who spoke when?" CPU-only, no API keys, Apache 2.0. ~10.8% DER on VoxConverse, 8x faster than real-time.

3K 67 7
ekhodzitsky
polyvoice

Speaker diarization for Rust — who spoke when, without Python. Silero VAD + WeSpeaker + AHC in a single Pipeline::run() call.

2K 0 0
patrikx3
p3x-meet-assistant

Real-time AI speech-to-text for meetings with GPT-4o Transcribe and GPU speaker diarization

726 0 0
wq2012
simpleder

A lightweight library to compute Diarization Error Rate (DER).

698 62 9
narcotic-sh
senko

Very fast, accurate speaker diarization

584 259 29
namastexlabs
murmurai-core

Modern speech recognition with word-level timestamps and speaker diarization. Fork of WhisperX with torch 2.6+, pyannote 4.x compatibility.

480 41 17
Picovoice
pvfalcon

Falcon Speaker Diarization Engine

472 71 7
namastexlabs
murmurai

GPU-powered transcription API with speaker diarization

356 41 17
desh2608
dover-lap

Python package for combining diarization system outputs.

355 94 12
Picovoice
pvfalcondemo

Falcon Speaker Diarization engine demos

296 71 7
yaniv-golan
dlzoom

CLI tool to download Zoom cloud recordings and extract audio for transcription

264 0 0
Gr122lyBr
voicetag

Speaker identification powered by pyannote and resemblyzer

228 50 4
harmlessman
pafts

Library That Preprocessing Audio For TTS.

133 27 5
jakariaemon
whisper-speaker-id

Whisper Speaker Identification: A Python library for multiligual speaker identification and speaker embedding generation.

131 26 1
namastexlabs
whisperx-api

🎙️ Drop-in replacement for paid transcription APIs. Self-hosted, GPU-powered, speaker diarization. Free forever: uvx murmurai

124 41 17
bunyaminergen
wavlmmsdd

This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.

110 12 3
eddiegulay
rtrimmer

Lightweight Python package to trim RTTM diarization files and audio files

63 1 0
penkow
opensono

Open source - Voice AI

3 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery