OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diverse visual/search tools, and fatal-aware agentic reinforcement learning.
Updated May 19, 2026 - Python
MobileUse: an open-source mobile GUI agent for Android phone automation, AndroidWorld/AndroidLab evaluation, hierarchical reflection, and proactive exploration.
Official code for "AnomalyClaw: A Universal Visual Anomaly Detection Agent via Tool-Grounded Refutation". Ships the CrossDomainVAD-12 benchmark.