PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Transcription Python Packages

Python packages with the GitHub topic transcription. Sorted by relevance, with stars and monthly downloads.
speechmatics
speechmatics-python

Python library and CLI for Speechmatics

52K 75 23
neocl
speach

🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, JSON, SQLite, VTT, Audacity, TTL, TIG, ISF, etc.)

34K 21 6
Picovoice
pvcheetah

On-device streaming speech-to-text engine powered by deep learning

7K 663 77
juanmc2005
diart

A python package to build AI-powered real-time audio applications

5K 2K 164
Picovoice
pvleopard

On-device speech-to-text engine powered by deep learning

5K 479 29
deepgram
deepctl

Official Deepgram CLI — speech-to-text, text-to-speech, and audio intelligence from your terminal

5K 5 2
Yujia-Yan
transkun

A simple yet effective Audio-to-Midi Automatic Piano Transcription system

3K 341 35
nomadkaraoke
lyrics-transcriber

Automatically create synchronised lyrics files in ASS and LRC with word-level timestamps, using Whisper and lyrics from online sources, with anchor sequences and LLMs to auto-correct transcription

2K 91 17
HYP3R00T
voicepad

Command-line interface for voice recording and GPU-accelerated transcription.

2K 1 0
HYP3R00T
voicepad-core

Core audio processing for Voicepad.

2K 1 0
baxtree
subaligner

Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/

2K 505 24
kristofferv98
voiceprocessingtoolkit

A comprehensive library for voice processing tasks such as wake word detection, speech recognition, translation, and text-to-speech.

2K 4 1
BasedHardware
omi-cli

Command-line interface for Omi — talk to memories, conversations, action items, and goals from your terminal.

2K 13K 2K
NavodPeiris
speechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

2K 263 27
d-flood
criticus

A collection of computer tools for aiding the text critical workflow from transcription to collation to analysis.

1K 26 2
andbue
nashi

A webapp for the transcription of scanned pages

1K 17 4
plribeiro3000
meeting-hive

A local-first pipeline for correcting speech-to-text errors in meeting transcripts, using a vocabulary you maintain. Adapter-based (sources, vocabularies, summarizers), cross-platform, markdown output.

1K 0 0
Picovoice
pvcheetahdemo

Cheetah speech-to-text engine demos

908 663 77
paberr
ownscribe

Fully local meeting transcription and summarization CLI

834 51 6
tiroq
mnemofy

mnemofy extracts audio from media files, transcribes speech, and produces documented meeting notes with topics, decisions, and concrete mentions.

775 0 0
andbue
nashi-ocr

An OCR client complementing webapp for the transcription of scanned pages

758 17 4
dimastatz
whisperflow

Whisper-Flow is a framework designed to enable real-time transcription of audio content using OpenAI’s Whisper model. Rather than processing entire files after upload (“batch mode”), Whisper-Flow accepts a continuous stream of audio chunks and produces incremental transcripts immediately.

729 739 103
patrikx3
p3x-meet-assistant

Real-time AI speech-to-text for meetings with GPT-4o Transcribe and GPU speaker diarization

726 0 0
AbdullahHendy
live-translation

Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM audio from client to server for live transcription and optional translation. Supports CLI and Python API.

670 15 7
    • Data from PyPI, GitHub, ClickHouse, and BigQuery