Miscellaneous (69)

[2025-2] 박제우 - FLAT REWARD IN POLICY PARAMETER SPACE IMPLIES ROBUST REINFORCEMENT LEARNING
https://openreview.net/forum?id=4OaO3GjP7k
Investigating flat minima on loss surfaces in parameter space is well-documented in the supervised learning context, highlighting its advantages for model generalization. However, limited attention...
Reinforcement learning, together with supervised and unsupervised learning, is one of the representative ways of training AI models. With data points and labels..
2025. 7. 18.

[2025-2] 박지원 - GPTQ
Paper: https://arxiv.org/abs/2210.17323
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Generative Pre-trained Transformer models, known as GPT or OPT, set themselves apart through breakthrough performance across complex language modelling tasks, but also by their extremely high computational and storage costs. Specifically, due to their mass...
1. What is GPTQ? GPTQ..
2025. 7. 1.

[2025-1] 유경석 - nnDetection: A Self-configuring Method for Medical Object Detection
https://arxiv.org/abs/2106.00817
Simultaneous localisation and categorization of objects in medical images, also referred to as medical object detection, is of high clinical relevance because diagnostic decisions often depend on rating of objects rather than e.g. pixels. For this task, th...
https://github.com/MIC-DKFZ/nnDet..
2025. 5. 24.

[2025-1] 김지원 - An Algorithmic Crystal Ball: Forecasts-based on Machine Learning
Paper introduction
Title: An Algorithmic Crystal Ball: Forecasts-based on Machine Learning
Year of publication: 2018
Authors: Jin-Kyu Jung, Manasa Patnam, and Anna Ter-Martirosyan
Note: IMF (International Monetary Fund) Working Paper
Research Question: Does deep learning also achieve high accuracy when forecasting macroeconomic data, such as next-quarter GDP growth?
Background: Institutions such as the IMF and the World Bank forecast next-quarter GDP growth in the outlook reports they publish for each country. However, according to Timmermann (2007), the IMF's World Economic Outl..
2025. 5. 10.