cross-attention
CALM - Pytorch
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
attention map for diffusers
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!