multimodal-llm
Kani extension for supporting vision-language models (VLMs). Comes with model-agnostic support for GPT-Vision and LLaVA.
Open-source red teaming framework for MLLMs with 42+ attack methods