PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Text Data Python Packages

Python packages with the GitHub topic text-data. Sorted by relevance, with stars and monthly downloads.
asyml
texar-pytorch

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/

2K 747 113
asyml
forte

Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/

1K 252 60
infinitode
duplipy

DupliPy is a quick and easy-to-use package that can handle text formatting and data augmentation tasks for NLP in Python. It now offers support for image augmentation tasks as well.

638 1 0
asyml
texar

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

509 2K 368
thu-coai
cotk

Conversational Toolkits

250 129 26
yaledhlab
wordmap

Visualize large text collections with WebGL

230 28 5
lolei
redditcleaner

Cleans Reddit Text Data :scroll: :broom:

175 83 2
BALaka-18
rake-new2

A Python library that enables smooth keyword extraction from any text using the RAKE(Rapid Automatic Keyword Extraction) algorithm.

175 29 20
signaln
parallelio

For reading from and writing to parallel data files in Python

57 3 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery