GUI Exploration Lab: Enhancing Screen Navigation in Agents via...
With the rapid development of Large Vision Language Models, the focus of Graphical User Interface (GUI) agent tasks shifts from single-screen tasks to complex screen navigation challenges. However...
openreview.net