PyRank

Topic Modeling Python Packages

Python packages tagged with the GitHub topic topic-modeling, sorted by relevance and shown with their monthly downloads and stars.
RaRe-Technologies
gensim

Topic Modelling for Humans

5M 16K 4K
MaartenGr
bertopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

416K 8K 897
nomic-ai
nomic

Nomic Developer API SDK

37K 2K 197
JasonKessler
scattertext

Beautiful visualizations of how language differs among document types.

16K 2K 285
bab2min
tomotopy

Python package of Tomoto, the Topic Modeling Tool

15K 595 65
ddangelov
top2vec

Top2Vec learns jointly embedded topic, document and word vectors.

9K 3K 377
MilaNLProc
contextualized-topic-models

A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

4K 1K 152
gregversteeg
corextopic

Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx

3K 640 117
MIND-LAB
octis

OCTIS: a library for Optimizing and Comparing Topic Models.

3K 800 118
raphschlatt
ads-bib

Pipeline for querying and turning NASA's ADS publications metadata into curated, analysis-ready datasets, topic maps, and citation networks.

2K 2 0
stephenhky
shorttext

Various Algorithms for Short Text Mining

2K 471 74
mortazavilab
topyfic

Topyfic is a Python package designed to identify reproducible latent Dirichlet allocation (LDA) using Leiden clustering and Harmony for single-cell epigenomics data

2K 11 1
bobxwu
topmost

Topmost: A Topic Modeling System Toolkit

2K 288 27
ddbourgin
numpy-ml

Machine learning, in numpy

2K 16K 4K
bobxwu
fastopic

A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)

2K 156 13
lffloyd
embedded-topic-model

A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM

2K 94 10
demetrius-mp
sesg

SeSG (Search String Generator) Python package repository.

2K 1 0
drob-xx
topicmodeltuner

HDBSCAN Tuning for BERTopic Models

2K 52 3
yaniv-shulman
chunkey-bert

ChunkeyBert is a minimal and easy-to-use keyword extraction technique that leverages BERT embeddings for unsupervised keyphrase extraction from long text documents.

2K 1 0
Sinapsis-AI
sinapsis-bertopic

Package for topic modeling using BERTopic, including templates for fitting models and making predictions.

1K 0 0
maximtrp
bitermplus

Biterm Topic Model (BTM): modeling topics in short texts

1K 85 15
DecafSunrise
simpletopicmodel

An NLP Package for generating Topic Models

1K 1 0
machine-intelligence-laboratory
topicnet

Interface for easier topic modelling.

1K 143 17
ContextLab
hypertools

A Python toolbox for gaining geometric insights into high-dimensional data

1K 2K 162
Data from PyPI, GitHub, ClickHouse, and BigQuery