PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Llava Python Packages

Python packages with the GitHub topic llava. Sorted by relevance, with stars and monthly downloads.
Blaizzy
mlx-vlm

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

383K 5K 539
PaddlePaddle
ppdiffusers

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.

5K 722 225
unum-cloud
uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

2K 1K 79
zhudotexe
kani-vision

Kani extension for supporting vision-language models (VLMs). Comes with model-agnostic support for GPT-Vision and LLaVA.

1K 7 0
nrl-ai
llama-assistant

AI-powered assistant to help you with your daily tasks, powered by Llama 3, DeepSeek R1, and many more models on HuggingFace.

996 529 43
haotian-liu
llava-torch

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

968 25K 3K
Blaizzy
mlx-vlm-nell

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

373 5K 539
corentin-ryr
multimedeval

A Python tool to evaluate the performance of VLM on the medical domain.

265 88 8
0jc1
autovod

Automatically download livestreams, clip with AI, and upload in realtime

210 27 15
om-ai-lab
omagent-core

[EMNLP-2024] Build multimodal language agents for fast prototype and production

171 3K 288
Blaizzy
fount-vlm-nell-02

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

123 5K 539
    • Data from PyPI, GitHub, ClickHouse, and BigQuery