본문 바로가기

책상 밖 세상을 경험할 수 있는 Playground를 제공하고, 수동적 학습에서 창조의 삶으로의 전환을 위한 새로운 라이프 스타일을 제시합니다.

전체 글344

[2025-2] 이루가 - A survey on large language model based autonomous agents 논문 링크: https://arxiv.org/abs/2308.11432 A Survey on Large Language Model based Autonomous AgentsAutonomous agents have long been a prominent research focus in both academic and industry communities. Previous research in this field often focuses on training agents with limited knowledge within isolated environments, which diverges significantly from harxiv.org1. Introduction자율 에이전트의 정의환경에 위치하고 그 .. 2025. 8. 16.

[2025-2] 김지원 - Auto-GPT for Online Decision Making: Benchmarks andAdditional Opinions 논문 정보: Yang, H., Yue, S., & He, Y. (2023). Auto-gpt for online decision making: Benchmarks and additional opinions. arXiv preprint arXiv:2306.02224.링크: https://arxiv.org/pdf/2306.02224인용수: 226회 (2025-08-16 기준)초록Auto-GPT는 의사 결정 태스크를 수행하는 LLM을 활용하는 자율 agent이다.Auto-GPT 스타일의 연구가 진행되고 있으나 Auto-GPT의 실제 세계 의사 결정 태스크를 풀기에 효과성과 유연성과 관련된 의문이 여전히 존재한다.이러한 불확실성은 벤치마크의 부재와 제한된 실제 세계 관여 능력 때문이다.이 연구에서 Auto-GP.. 2025. 8. 16.

[2025-2] 전연주 - DiT: Scalable Diffusion Models with Transformers 논문 링크: 2212.09748깃허브 링크: facebookresearch/DiT: Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" GitHub - facebookresearch/DiT: Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" - facebookresearch/DiTgithub.comAbstract아이디어: 기존 Diffusion 모델은 U-Net을 backbone.. 2025. 8. 16.

[2025-2] 박제우 - Playing Atari with Deep Reinforcement Learning 1. Introduction이미지, 음성 등 고차원의 감각 데이터를 제어하는 것은 강화학습의 오랜 과제이다.기존 딥러닝 모델을 강화학습에 적용하기에는 한계가 있었다우선 이러한 딥러닝 방식은 라벨링 된 데이터가 필요한데, 강화학습은 스칼라값으로 된 리워드가 필요하다.피쳐간의 독립, 고정된 데이터 분포 등 딥러닝의 기본적인 전제 역시 강화학습과는 맞지 않는 부분이 많다.본 연구는 따라서 CNN방식과 Experience Replay Mechanism을 통해 이러한 한계를 극복한다.experience replay mechanism을 활용한다. 이는 과거의 전이 행렬을 무작위로 샘플링 하여 다양한 과거 행동에 걸쳐 학습 분포를 부드럽게 한다.신경망은 Q-learning 알고리즘을 사용하고 가중치 갱신에는 SGD .. 2025. 8. 15.

이전 1 2 3 4 5 6 7 8 ··· 86 다음

티스토리툴바