PyRank

Multi Modality Python Packages

Python packages with the GitHub topic multi-modality. Sorted by relevance, with stars and monthly downloads.
hanxiao
bert-serving-client

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

14K 13K 2K
hanxiao
bert-serving-server

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

13K 13K 2K
jina-ai
clip-as-service

Embed images and sentences into fixed-length vectors via CLIP

8K 13K 2K
jina-ai
clip-server

Embed images and sentences into fixed-length vectors via CLIP

8K 13K 2K
jina-ai
clip-client

Embed images and sentences into fixed-length vectors via CLIP

8K 13K 2K
lucidrains
deep-daze

Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun

2K 4K 310
jina-ai
open-gpt-torch

An open-source cloud-native of large multi-modal models (LMMs) serving framework.

2K 167 22
dvlab-research
visionzip

Official repository for VisionZip (CVPR 2025)

1K 431 27
haotian-liu
llava-torch

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

968 25K 3K
kyegomez
gemini-torch

The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google

931 466 65
kyegomez
sophia-optimizer

Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.

737 382 26
kyegomez
andromeda-llm

andromeda - Pytorch

678 151 22
kyegomez
andromeda-torch

An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast

656 151 22
kyegomez
tiny-gptv

Tiny GPTV - Pytorch

543 16 0
kyegomez
mm1-torch

PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"

476 28 1
kyegomez
andromeda-transformer

andromeda - Pytorch

470 151 22
jina-ai
rungpt

An open-source cloud-native of large multi-modal models (LMMs) serving framework.

401 167 22
jina-ai
v-clip-server

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

339 13K 2K
kyegomez
thebestllmever

An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast

326 151 22
kyegomez
the-compiler

Seed, Code, Harvest: Grow Your Own App with Tree of Thoughts!

299 145 17
kyegomez
hsss

Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling"

286 15 2
kyegomez
aoa-torch

Attention on Attention - Pytorch

277 6 0
Luodian
otter-ai

Otter: A Multi-Modal Model with In-Context Instruction Tuning

256 3K 212
jina-ai
open-gpts

An open-source cloud-native of large multi-modal models (LMMs) serving framework.

252 167 22
Data from PyPI, GitHub, ClickHouse, and BigQuery.