PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Speech Translation Python Packages

Python packages with the GitHub topic speech-translation. Sorted by relevance, with stars and monthly downloads.
NVIDIA
nemo-toolkit

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

934K 17K 3K
espnet
espnet

End-to-End Speech Processing Toolkit

27K 10K 2K
KevKibe
africanwhisper

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

3K 38 6
PaddlePaddle
paddlespeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

3K 13K 2K
PaddlePaddle
paddlespeech-feat

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

3K 13K 2K
huggingface
speech-to-speech

Build local voice agents with open-source models

2K 5K 565
NVIDIA
nemo-toolkit-asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

1K 17K 3K
PaddlePaddle
paddleaudio

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

1K 13K 2K
PalabraAI
palabra-ai

Python SDK for Palabra AI's real-time speech-to-speech translation API. Break down language barriers and enable seamless communication across 25+ languages

1K 38 6
nvidia
nemo-nlp

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

804 17K 3K
PaddlePaddle
paddlespeech-ctcdecoders

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

798 13K 2K
nvidia
nemo-asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

620 17K 3K
George0828Zhang
torch-cif

A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/abs/1905.11235.

318 37 4
nvidia
nemo-tts

Collection of Neural Modules for Speech Synthesis

91 17K 3K
PaddlePaddle
paddlespeech-ldd-ctcdecoders

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

66 13K 2K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery