Covering Human Action Space for Computer Use: Data Synthesis and Benchmark
CUActSpot benchmark enhances GUI complex interaction performance via data synthesis and multimodal evaluation; Phi-Ground-Any-4B excels.
Miaosen Zhang, Xiaohan Zhao, Zhihong Tan et al.