본문 바로가기

책상 밖 세상을 경험할 수 있는 Playground를 제공하고, 수동적 학습에서 창조의 삶으로의 전환을 위한 새로운 라이프 스타일을 제시합니다.

분류 전체보기331

[2025-1] 황영희 - U-Net: Convolutional Networks for Biomedical Image Segmentation https://arxiv.org/abs/1505.04597 U-Net: Convolutional Networks for Biomedical Image SegmentationThere is large consent that successful training of deep networks requires many thousand annotated training samples. In this paper, we present a network and training strategy that relies on the strong use of data augmentation to use the available annotatedarxiv.org1. U-Net 이란?이미지 세그멘테이션(Image Segmenta.. 2025. 2. 13.

[2025-1] 임재열 - Hymba: A Hybrid-head Architecture for Small Language Models Hymba는 2024년 NVIDIA에서 제안한 모델입니다. [Hymba]https://arxiv.org/abs/2411.13676 Hymba: A Hybrid-head Architecture for Small Language ModelsWe propose Hymba, a family of small language models featuring a hybrid-head parallel architecture that integrates transformer attention mechanisms with state space models (SSMs) for enhanced efficiency. Attention heads provide high-resolution recall, whilearxiv.org*.. 2025. 2. 12.

[2025-1] 김학선 - DeepSeek-Coder: When the Large Language Model Meets Programming - The Rise of Code Intelligence IntroductionLLMs의 급속한 발전으로 인해 소프트웨어 개발 분야는 크게 변화했다. 그러나 이러한 발전에도 불구하고 LLMs의 주요 도전 과제는 오픈 소스 모델과 폐쇄형 소스 모델간의 성능 격차이다. 강력한 폐쇄형 소스 모델들은 외부의 접근이 제한되며, 독점적인 성격으로 인해 활용에 제약이 따른다. 이러한 도전 과제에 대응하기 위해 DeepSeek-Coder 시리즈를 제시했다.DeepSeek-Coder 시리즈Size: 1.3B ~ 33BVersion: Base, InstructPre-train data: Repository 수준에서의 학습 데이터를 구성(→ 교차 파일 이해 능력 향상)Pre-train processLoss: Next token predictionMethod: Fill-In-the.. 2025. 2. 12.

[2025-1] 정규원 AD-NLP: A Benchmark for Anomaly Detection in Natural Language Processing https://aclanthology.org/2023.emnlp-main.664/ AD-NLP: A Benchmark for Anomaly Detection in Natural Language ProcessingMatei Bejan, Andrei Manolache, Marius Popescu. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2023.aclanthology.org 이전까지는 데이터셋의 일부 클래스를 다운샘플링 하였는데 이는 재현성 문제와 특정 유형의 이상을 감지하는 데 편향된 모델이라는 점에서 정교한 시나리오 인식이 어렵다는 문제를 야기했다. 본 논문에서는 통합된 벤치 마크를 제공.. 2025. 2. 11.

이전 1 ··· 32 33 34 35 36 37 38 ··· 83 다음

티스토리툴바