Natural Language Processing

[2025-2] 백승우 - ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

BaekDaBang 2025. 7. 29. 12:39
 

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

While reasoning models (e.g., DeepSeek R1) trained with reinforcement learning (RL), excel in textual reasoning, they struggle in scenarios requiring structured problem-solving, such as geometric reasoning, concise computation, or complex equation solving-

arxiv.org