Multi-Modal

[2025-2] 백승우 - GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning

BaekDaBang 2025. 11. 26. 17:15
 

GUI Exploration Lab: Enhancing Screen Navigation in Agents via...

With the rapid development of Large Vision Language Models, the focus of Graphical User Interface (GUI) agent tasks shifts from single-screen tasks to complex screen navigation challenges. However...

openreview.net