PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Multimodal Deep Learning Python Packages

Python packages with the GitHub topic multimodal-deep-learning. Sorted by relevance, with stars and monthly downloads.
jrzaurin
pytorch-widedeep

A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch

3K 1K 197
theislab
scarches

Reference mapping for single-cell genomics

3K 404 70
AI4Finance-Foundation
finrobot

FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs 🚀 🚀 🚀

1K 7K 1K
thuiar
mmsa-fet

A Tool for extracting multimodal features from videos.

1K 212 27
kyegomez
navit-torch

navit - Pytorch

1K 273 15
kyegomez
bitnet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

1K 2K 172
kyegomez
swarms-torch

Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊

863 144 16
kyegomez
pali-torch

Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model"

632 95 8
kyegomez
pegasusx

pegasus - Pytorch

560 14 5
jaisidhsingh
loraclip

Easy wrapper for inserting LoRA layers in CLIP.

546 40 2
kyegomez
medpalm

MedPalm - Pytorch

537 432 59
idso-fa1-pathology
vitaminp

VitaminP: a vision transformer-assisted multimodal integration network for pathology cell segmentation

523 8 1
automatika-robotics
roboml

RoboML is an aggregator package written for prototyping and deploying open source ML models for robotics

468 11 0
georgepar
slp

Speech, Language and Multimodal Processing models and utilities in PyTorch

374 22 6
pytorch-duo
torchmm

PyTorch Data loaders and abstraction for multi-modal data.

329 0 0
kyegomez
the-compiler

Seed, Code, Harvest: Grow Your Own App with Tree of Thoughts!

299 145 17
kelechi-c
ripple-net

Text-image search and image tagging library

268 13 1
kyegomez
cross-attn

MultiModalCrossAttn - Pytorch

251 37 1
kyegomez
kosmos2-torch

My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"

231 74 6
multimind-dev
multimind-sdk

Your SDK solves all of this. One interface. Unified logic. Local + hosted models. Fine-tuning. Agent tools. Enterprise-ready. Hybrid RAG.Star 🌟 if you like it!

202 92 14
kyegomez
pali3

pali3 - Pytorch

188 147 4
asnelt
mmae

Package for Multimodal Autoencoders in TensorFlow / Keras

178 20 12
kyegomez
mmqqa

Experiments around using Multi-Modal Casual Attention with Multi-Grouped Query Attention

176 5 0
kyegomez
mmmgqa

Experiments around using Multi-Modal Casual Attention with Multi-Grouped Query Attention

123 5 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery