Speaker Recognition Python Packages

nemo-toolkit

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

1.1M 18K 3K

wespeakerruntime

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

4K 1K 199

diarize

Speaker diarization for Python — "who spoke when?" CPU-only, no API keys, Apache 2.0. ~10.8% DER on VoxConverse, 8x faster than real-time.

3K 91 8

nemo-toolkit-asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

2K 18K 3K

pveagle

On-device speaker recognition engine powered by deep learning

2K 51 7

speechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

2K 265 27

nemo-asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

1K 18K 3K

mvector

Voice Print Recognition toolkit on Pytorch

989 1K 169

ppvector

Voice Print Recognition toolkit on PaddlePaddle

973 314 49

nemo-nlp

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

960 18K 3K

diarizationlm

DiarizationLM

718 452 38

whisper-speaker-id

Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.

549 26 1

pveagledemo

On-device speaker recognition engine powered by deep learning

506 51 7

pvfalcon

On-device speaker diarization powered by deep learning

498 74 7

bmnspeechlib

467 265 27

wavencoder

WavEncoder - PyTorch backed audio encoder

427 92 14

audioperm

A python library for generating different permutations of audible segments from audio files.

397 13 2

hyperion-ml

Python toolkit for speech processing

395 72 22

uisrnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

335 2K 319

voicetag

Speaker identification powered by pyannote and resemblyzer

334 51 5

sidlingvo

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

332 452 38

pvfalcondemo

On-device speaker diarization powered by deep learning

301 74 7

nemo-tts

Collection of Neural Modules for Speech Synthesis

126 18K 3K

piwho

A python wrapper around MARF speaker recognition frameworkfor raspberry pi and other SBCs

117 57 20