분류 전체보기371 [2025-2] 백승우 - Toward Autonomous UI Exploration: The UIExplorer Benchmark https://arxiv.org/abs/2506.17779 2025. 12. 3. [2025-2] 백승우 - GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning GUI Exploration Lab: Enhancing Screen Navigation in Agents via...With the rapid development of Large Vision Language Models, the focus of Graphical User Interface (GUI) agent tasks shifts from single-screen tasks to complex screen navigation challenges. However...openreview.net 2025. 11. 26. [2025-2] 최민서 - Direct Preference Optimization:Your Language Model is Secretly a Reward Model [논문링크] https://arxiv.org/abs/2305.18290 Direct Preference Optimization: Your Language Model is Secretly a Reward ModelWhile large-scale unsupervised language models (LMs) learn broad world knowledge and some reasoning skills, achieving precise control of their behavior is difficult due to the completely unsupervised nature of their training. Existing methods for gaining sarxiv.org 1. Introductio.. 2025. 11. 19. [2025-2] 이루가 - "Why Should I Trust You?": Explaining the Predictions of Any Classifier 논문 링크: https://arxiv.org/abs/1602.04938 "Why Should I Trust You?": Explaining the Predictions of Any ClassifierDespite widespread adoption, machine learning models remain mostly black boxes. Understanding the reasons behind predictions is, however, quite important in assessing trust, which is fundamental if one plans to take action based on a prediction, or when charxiv.org 1. Introduction머신러닝 발.. 2025. 11. 8. 이전 1 2 3 4 5 6 7 8 ··· 93 다음