PyRank

Speaker Recognition Python Packages

Python packages with the GitHub topic speaker-recognition. Sorted by relevance, with stars and monthly downloads.
NVIDIA
nemo-toolkit

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

934K 17K 3K
wenet-e2e
wespeakerruntime

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

8K 1K 195
FoxNoseTech
diarize

Speaker diarization for Python — "who spoke when?" CPU-only, no API keys, Apache 2.0. ~10.8% DER on VoxConverse, 8x faster than real-time.

3K 67 7
Picovoice
pveagle

On-device speaker recognition engine powered by deep learning

2K 46 7
NavodPeiris
speechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

2K 263 27
NVIDIA
nemo-toolkit-asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

1K 17K 3K
Picovoice
pveagledemo

On-device speaker recognition engine powered by deep learning

1K 46 7
nvidia
nemo-nlp

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

804 17K 3K
yeyupiaoling
mvector

This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.

700 1K 168
nvidia
nemo-asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

620 17K 3K
yeyupiaoling
ppvector

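This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++. It also supports multiple data preprocessing methods, including MelSpectrogram, Spectrogram, MFCC, and Fbank.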

614 313 49
google
diarizationlm

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

556 449 38
Picovoice
pvfalcon

Falcon Speaker Diarization Engine

472 71 7
google
sidlingvo

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

451 449 38
zabir-nabil
audioperm

A Python library for generating different permutations of audible segments from audio files.

358 13 2
google
uisrnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

345 2K 319
hyperion-ml
hyperion-ml

Python toolkit for speech processing

300 72 21
Picovoice
pvfalcondemo

Falcon Speaker Diarization engine demos

296 71 7
shangeth
wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

251 92 14
NavodPeiris
bmnspeechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

249 263 27
Gr122lyBr
voicetag

Speaker identification powered by pyannote and resemblyzer

228 50 4
jakariaemon
whisper-speaker-id

Whisper Speaker Identification: A Python library for multilingual speaker identification and speaker embedding generation.

131 26 1
nvidia
nemo-tts

Collection of Neural Modules for Speech Synthesis

91 17K 3K
maxhollmann
voxceleb-luigi

Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments

89 43 4
    • Data from PyPI, GitHub, ClickHouse, and BigQuery