본문 바로가기

책상 밖 세상을 경험할 수 있는 Playground를 제공하고, 수동적 학습에서 창조의 삶으로의 전환을 위한 새로운 라이프 스타일을 제시합니다.

Miscellaneous84

[2025-2] 김지원 - Introduction to Reinforcement Learning 논문 정보: Ghasemi, M., & Ebrahimi, D. (2024). Introduction to reinforcement learning. arXiv preprint arXiv:2408.07712.논문 링크: https://arxiv.org/pdf/2408.07712?논문 인용수: 17회 (2025.09.12 기준)Introduction 강화 학습은 AI의 한 분야로 시간에 따른 누적 보상을 최대화하는 것을 목표로 환경과 상호작용함으로써 훈련된다.지도 학습&비지도 학습과 달리 RL은 직관적인 결정들을 내려야 하는 자율적인 에이전트들을 다루고 종종 데이터 없이 이들의 행동으로부터 학습하기도 한다.핵심 아이디어는 시행착오 탐색을 통해 시간을 지나 누적 보상을 최대화하기 위해 어떻게 세상이 작동하는 .. 2025. 9. 13.

[2025-2] LLM-based agent : DrBioRight 2.0 논문 출처 : Liu, W., Li, J., Tang, Y. et al. DrBioRight 2.0: an LLM-powered bioinformatics chatbot for large-scale cancer functional proteomics analysis. Nat Commun 16, 2256 (2025). https://doi.org/10.1038/s41467-025-57430-4 Agent란?그냥 “사람 대리인”이 아니라, 목표를 스스로 이해하고, 계획을 세우고, 필요한 도구를 사용해 문제를 해결하는 소프트웨어 시스템LLM에서의 agentLLM 단독 → 단순히 질문에 답변하거나 텍스트 생성.LLM agent → LLM이 여러 외부 도구(tool)나 API, 데이터베이스, 실험 장비를 스스.. 2025. 8. 16.

[2025-2] 이루가 - A survey on large language model based autonomous agents 논문 링크: https://arxiv.org/abs/2308.11432 A Survey on Large Language Model based Autonomous AgentsAutonomous agents have long been a prominent research focus in both academic and industry communities. Previous research in this field often focuses on training agents with limited knowledge within isolated environments, which diverges significantly from harxiv.org1. Introduction자율 에이전트의 정의환경에 위치하고 그 .. 2025. 8. 16.

[2025-2] 김지원 - Auto-GPT for Online Decision Making: Benchmarks andAdditional Opinions 논문 정보: Yang, H., Yue, S., & He, Y. (2023). Auto-gpt for online decision making: Benchmarks and additional opinions. arXiv preprint arXiv:2306.02224.링크: https://arxiv.org/pdf/2306.02224인용수: 226회 (2025-08-16 기준)초록Auto-GPT는 의사 결정 태스크를 수행하는 LLM을 활용하는 자율 agent이다.Auto-GPT 스타일의 연구가 진행되고 있으나 Auto-GPT의 실제 세계 의사 결정 태스크를 풀기에 효과성과 유연성과 관련된 의문이 여전히 존재한다.이러한 불확실성은 벤치마크의 부재와 제한된 실제 세계 관여 능력 때문이다.이 연구에서 Auto-GP.. 2025. 8. 16.

이전 1 2 3 4 5 ··· 21 다음

티스토리툴바