본문 바로가기

책상 밖 세상을 경험할 수 있는 Playground를 제공하고, 수동적 학습에서 창조의 삶으로의 전환을 위한 새로운 라이프 스타일을 제시합니다.

Multi-Modal32

[2026-1] 강민정, 염제원 - GDPval: Evaluating AI Model Performance on Real-World Economically Valuable Tasks Paperhttps://arxiv.org/abs/2510.04374 GDPval: Evaluating AI Model Performance on Real-World Economically Valuable TasksWe introduce GDPval, a benchmark evaluating AI model capabilities on real-world economically valuable tasks. GDPval covers the majority of U.S. Bureau of Labor Statistics Work Activities for 44 occupations across the top 9 sectors contributing to U.S. GDParxiv.orgArticlehttps://.. 2026. 3. 20.

[2026-1] 백승우 - AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines https://arxiv.org/abs/2602.14296 AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State MachinesThe performance of autonomous Web GUI agents heavily relies on the quality and quantity of their training data. However, a fundamental bottleneck persists: collecting interaction trajectories from real-world websites is expensive and difficult to verify. Tarxiv.org 2026. 3. 10.

[2026-1] 정유림 - FiLM: Visual Reasoning with a General Conditioning Layer paper link : https://arxiv.org/pdf/1709.07871 CLEVR datset : 다단계 추론의 학습이 필요. 기존 방법의 성능이 좋지않았음.reasoninng 능력 평가 : CLEVR datset : 다단계 추론의 학습이 필요. 기존 방법의 성능이 좋지않았음.FiLM (Feature-wise Linear Modulation): 조건 입력(질문)에 따라, 신경망 중간 feature에 대해, feature별 변환 수행. 시각적 추론에서, FiLM layer를 추가해서, 질문을 처리하는 RNN이 이미지 처리를 담당하는 CNN의 계산에 영향을 미치게됨.즉, 질문의 내용에 따라 이미지를 처리하는 방식 자체가 달라짐.→ Conditional Normalization의 일반화로 볼수있.. 2026. 2. 21.

[2026-1] 정재훈 - CoCa: Contrastive Captioners are Image-Text Foundation Models https://arxiv.org/abs/2205.01917v2 1. Introduction최근 BERT, T5, GPT-3와 같이 web-scale data로 pretrained된 기반 언어 모델들이 zero-shot, few-shot, 전이학습 등을 통해 대규모 멀티태스킹 능력을 증명하며 부상하고 있습니다. 각각 task에 전문화된 개별 모델에 비해 대규모 downstream을 위해 pretrained된 모델은 학습비용을 상각할 수 있어 인간 수준 지능의 모델을 위한 한계를 뛰어넘을 수 있는 가능성을 제시합니다. vision-language problem에 대하여 여러 기반 모델들이 후보로 탐색되었다.1. Single-encoder : 이전 연구들은 cross-entropy loss로 pretraine.. 2026. 2. 21.

이전 1 2 3 4 5 ··· 8 다음

티스토리툴바