Natural Language Processing [2025-2] 백승우 - ReTool: Reinforcement Learning for Strategic Tool Use in LLMs BaekDaBang 2025. 7. 29. 12:39 ReTool: Reinforcement Learning for Strategic Tool Use in LLMs While reasoning models (e.g., DeepSeek R1) trained with reinforcement learning (RL), excel in textual reasoning, they struggle in scenarios requiring structured problem-solving, such as geometric reasoning, concise computation, or complex equation solving- arxiv.org