PaliGemma Implementation Part 2
modeling_gemma
modeling_gemma
modeling_siglip, processing_paligemma
Mistral 7B, Mixtral 8x7B
Phi-3-3.8B (Multi-turn PE, Generated Knowledge PE)
Mistral-7B (CoT PE, Zero-shot PE)
LLaMA-3-8B (Multi-turn PE, Few-shot PE)
Flash Attention: Concepts and Hands-on Code
Flash Attention: Concepts and Hands-on Code
Hugging Face, Ollama, LangChain, VectorDB, RAG
LLM Evaluation, Evaluation of LLM-based Systems
sLLM, sLLM vs LLM, sLLM Examples
94 Architectures
94 Architectures
VLM downstream tasks
arxiv 2024
Inference
Libraries for LLM Inference
Building a DPO Dataset & Running DPO
SFT Data & Full Fine-tuning
Evolving
Data Generation with LLMs
Open Source Models: Types and Characteristics
Preprocessing & Generating DPO Data
Multi-GPU
FSDP, ZeRO Examples
Distributed Processing Techniques
Running LLMs in a Single-GPU Environment
Hugging Face & PEFT
GPU vs CPU
LLM & GPU
NeurIPS 2024
arxiv 2024
arxiv 2024
arxiv 2024
arxiv 2024
arxiv 2024
arxiv 2024
arxiv 2024
Diffusion Models and Representation Learning: A Survey (TPAMI 2024)
Diffusion Models and Representation Learning: A Survey (TPAMI 2024)
MME, MMMU, GQA, ChartQA, POPE, NoCaps, TextVQA
ICML 2024 Oral
NeurIPS 2022
arxiv 2024
arxiv 2024
CVPR 2023 Highlighted Paper
arxiv 2023
Proximal Policy Optimization, Direct Preference Optimization
Offload, DeepSpeed
NeurIPS 2023
NeurIPS 2024 Oral
Float32 vs Float16 vs BFloat16
ICLR 2025 under review, arxiv 2024
NeurIPS 2023 Oral
ICLR 2025 under review, arxiv 2024
feat ChatGPT
ACL 2024
arxiv 2024
feat TeddyNote
Neural Discrete Representation Learning (NeurIPS 2017)
ICLR 2024
NeurIPSW TSALM 2024
ICLR 2025 submission
ICLR 2025 submission
NeurIPSW TSALM 2024
NeurIPS 2024
ICML 2024
arxiv 2023
NeurIPS 2023
A Survey on Speech Large Language Models
A Survey on Speech Large Language Models
A Survey on Speech Large Language Models
NeurIPS 2018 Best paper
ICLR 2021
PEFT, Prompt Tuning
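Practical LLMs, Learned Quickly and Easily
Practical LLMs, Learned Quickly and Easily
Practical LLMs, Learned Quickly and Easily
Practical LLMs, Learned Quickly and Easily
Practical LLMs, Learned Quickly and Easily
Practical LLMs, Learned Quickly and Easily
Practical LLMs, Learned Quickly and Easily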
Workshop on Robustness of Few-shot and Zero-shot Learning in Foundation Models at NeurIPS 2023
arxiv 2023