PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Stt Python Packages

Python packages with the GitHub topic stt. Sorted by relevance, with stars and monthly downloads.
alphacep
vosk

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

571K 15K 2K
speechmatics
speechmatics-rt

Python SDKs for Speechmatics APIs

154K 17 7
speechmatics
speechmatics-voice

Python SDKs for Speechmatics APIs

109K 17 7
khoj-ai
khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

37K 35K 2K
GetStream
vision-agents

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

33K 8K 653
speechmatics
speechmatics-batch

Python SDKs for Speechmatics APIs

30K 17 7
istupakov
onnx-asr

A lightweight Python package for Automatic Speech Recognition using ONNX models

27K 316 30
moonshine-ai
moonshine-voice

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

27K 8K 423
khoj-ai
khoj-assistant

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

18K 35K 2K
analyticsinmotion
werpy

🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error Rate (WER). Built for the scalable evaluation of speech and transcription accuracy.

12K 26 6
GetStream
vision-agents-plugins-getstream

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

9K 8K 653
GetStream
vision-agents-plugins-deepgram

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

8K 8K 653
GetStream
vision-agents-plugins-cartesia

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

8K 8K 653
GetStream
vision-agents-plugins-openai

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

8K 8K 653
usefulsensors
fastrtc-moonshine-onnx

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

7K 8K 423
GetStream
vision-agents-plugins-openrouter

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

7K 8K 653
Picovoice
pvcheetah

On-device streaming speech-to-text engine powered by deep learning

7K 663 77
GetStream
vision-agents-plugins-gemini

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

7K 8K 653
GetStream
vision-agents-plugins-ultralytics

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

7K 8K 653
GetStream
vision-agents-plugins-smart-turn

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

7K 8K 653
coqui-ai
stt

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

6K 3K 300
GetStream
vision-agents-plugins-moondream

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

6K 8K 653
GetStream
vision-agents-plugins-kokoro

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

6K 8K 653
GetStream
vision-agents-plugins-anthropic

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

6K 8K 653
    • Data from PyPI, GitHub, ClickHouse, and BigQuery