PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Vision Language Python Packages

Python packages with the GitHub topic vision-language. Sorted by relevance, with stars and monthly downloads.
OFA-Sys
cn-clip

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

6K 6K 550
IRVLUTD
proto-clip-toolkit

Code release for Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning | IROS 2024

1K 55 7
shikras
ddd-dataset

A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).

865 137 7
IDEA-Research
nw-groundingdino

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

481 10K 1K
IDEA-Research
gdinopy

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

345 10K 1K
IDEA-Research
groundingdino-iscas

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

290 10K 1K
IDEA-Research
groundingdino-yl

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

275 10K 1K
IDEA-Research
gddet

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

200 10K 1K
IDEA-Research
groundingdino-mod

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

154 10K 1K
EmmanuelleB985
mmeval-vrag

Evaluation Framework for Multimodal RAG Systems

113 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery