gpt-4v-alternative
Local UI-grounding specialist for hybrid AI agents. Qwen3-VL-2B LoRA. Screenshot + text target → strict JSON bbox. Drop-in for Claude Code, Codex, browser-use. Cuts GPT-4V cost & latency.