PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Parallel Corpus Python Packages

Python packages with the GitHub topic parallel-corpus. Sorted by relevance, with stars and monthly downloads.
Helsinki-NLP
opusfilter

Toolbox for filtering parallel corpora

913 115 26
yonkornilov
opus-api

OPUS (opus.nlpl.eu) Python API

653 18 5
BramVanroy
astred

An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For instance useful for comparing a translation with the original text, to find differences and similarities between two different translations, or to see how a machine translation differs from a reference translation.

519 26 0
UUDigitalHumanitieslab
perfectextractor

Extracting present perfects (and related forms) from parallel corpora

338 7 2
rggdmonk
hadal

A simple and efficient tool for mining and aligning sentences with pre-trained models.

165 6 0
time-in-translation
preprocess-corpora

Preprocessing and sentence-aligning for parallel corpora

117 2 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery