#1 SketchVLM Enables Vision Models to Draw Explanations Directly on Images
SketchVLM allows vision models to draw SVG overlays directly on images for explanations, moving beyond text-only answers. This approach enhances visual reasoning task accuracy by 28.5% and is training-free and model-agnostic.