Agent4 [2026-1] 백승우 - The Evolution of Human-Like Computer-Using Agents From Perception to Command UFO: A UI-Focused Agent for Windows OS InteractionWe introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision. UFO employs a dual-agent framework to meticulously observe and analyze the graphical user interfacearxiv.org UFO2: The Desktop AgentOSRecent Computer-Using Agents (CUAs), powered by multimod.. 2026. 1. 21. [2025-2] 백승우 - Toward Autonomous UI Exploration: The UIExplorer Benchmark https://arxiv.org/abs/2506.17779 2025. 12. 3. [2025-2] 백승우 - GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning GUI Exploration Lab: Enhancing Screen Navigation in Agents via...With the rapid development of Large Vision Language Models, the focus of Graphical User Interface (GUI) agent tasks shifts from single-screen tasks to complex screen navigation challenges. However...openreview.net 2025. 11. 26. [2025-2] 백승우 - Agent Learning via Early Experience Agent Learning via Early ExperienceA long-term goal of language agents is to learn and improve through their own experience, ultimately outperforming humans in complex, real-world tasks. However, training agents from experience data with reinforcement learning remains difficult in many enviarxiv.org 2025. 10. 15. 이전 1 다음