Large Language Models

GPT4MTS; Prompt-based Large Language Model for Multimodal Time-series Forecasting

2 minute read

AAAI 2024

TimePFN; Effective Multivariate Time Series Forecasting with Synthetic Data

5 minute read

AAAI 2025

TimeCMA; Towards LLM-Empowered Multivariate Time Series Forecasting via Cross-Modality Alignment

1 minute read

AAAI 2025 Oral

TimeCAP; Learning to Contextualize, Augment, and Predict Time Series Events with Large Language Model Agents

3 minute read

AAAI 2025

TempoPFN; Synthetic Pre-training of Linear RNNs for Zero-shot Time Series Forecasting

4 minute read

arxiv 2025

Multi-Modal Forecaster; Jointly Predicting Time Series and Textual Data

1 minute read

arxiv 2024

FNSPID; A Comprehensive Financial News Dataset in Time Series

9 minute read

KDD ADS 2024

Chronos-2; From Univariate to Universal Forecasting

10 minute read

arxiv 2025

KV Cache, Prefill chunk

2 minute read

Function Calling

1 minute read

Speculative Decoding

1 minute read

DALL-E 2

5 minute read

arxiv 2022

DALL-E

2 minute read

arxiv 2021

(OpenAI API) OpenAI Fine-tuning

1 minute read

(OpenAI API) OpenAPI Embedding

less than 1 minute read

(OpenAI API) Sentiment Analysis

4 minute read

(OpenAI API) SQL Translate

1 minute read

(OpenAI API) Structured Output

1 minute read

JSON vs JSONL

less than 1 minute read

TS-RAG

2 minute read

arxiv 2025

Distributed Training 101 - Part 2

less than 1 minute read

Distributed Training 101 - Part 2

Distributed Training 101 - Part 1

less than 1 minute read

Distributed Training 101 - Part 1

Tree-of-Thoughts (ToT)

4 minute read

NeurIPS 2023

Chain-of-Thought Reasoning without Prompting

2 minute read

NeurIPS 2024

Test-time scaling (TTS)

4 minute read

ITI-C (Inference-Time Intervention – Contrastive)

1 minute read

NeurIPS 2023 Spotlight

CAA (Contrastive Activation Addition)

less than 1 minute read

ACL 2024

AURA (Activation Unlearning with Ranking Activation)

1 minute read

ICML 2024

ACTADD (Activation Addition)

less than 1 minute read

ICLR 2025 submission

VLM2Vec; Training Vision-Language Models for Massive Multimodal Embedding Tasks

7 minute read

arxiv 2025

Towards Time-Series Reasoning with LLMs

1 minute read

NeurIPSW 2024

TRACE; Grounding Time Series in Context for Multimodal Embedding and Retrieval

2 minute read

arxiv 2025

MTBench; A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering

2 minute read

arxiv 2025

MCD-TSF; Multimodal Conditioned Diffusive Time Series Forecasting

2 minute read

arxiv 2025

ITFormer; Bridging Time Series and Natural Language for Multi-Modal QA with Large-Scale Multitask Dataset

1 minute read

ICML 2025

How Can Time Series Analysis Benefit From Multiple Modalities? A Survey and Outlook

1 minute read

arxiv 2025

Exploiting Language Power for Time Series Forecasting with Exogenous Variables

2 minute read

WWW 2025

Beyond Numbers; A Survey of Time Series Analysis in the Era of Multimodal LLMs

1 minute read

arxiv 2025

(Quantization) (7) FakeQuant, GGUF, GPTQ, AW, SpinQuant

7 minute read

PTQ, QAT 방법론들

Q-Former

2 minute read

VLM Tasks,Benchmarks

7 minute read

(Quantization) (6) QAT, PTQ

1 minute read

QAT (Quantization-Aware Training), PTQ (Post-Training Quantization)

(Quantization) (5) Symmetric vs. Asymmetric quantization

1 minute read

Q-LoRA (Quantized Low-Rank Adaptation)

(Quantization) (4) Q-LoRA

1 minute read

Q-LoRA (Quantized Low-Rank Adaptation)

(Quantization) (3) Scaling factor, Zero-point

1 minute read

(Quantization) (1) Integer Quantization

2 minute read

INT8, INT4

(Quantization) (1) Floating Point

2 minute read

LLaVA; Visual Instruction Tuning 간단 요약

1 minute read

arxiv 2025

Test-time Scaling (TTS) - Budget Forcing

1 minute read

TTS, Budget Forcing

Rotary Positional Embedding (RoPE)

4 minute read

Fine-tuning LLMs

7 minute read

ChatTime; A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data

11 minute read

AAAI 2025

RMSNorm

1 minute read

SwiGLU

2 minute read

Tie Embeddings

1 minute read

Gradient Norm

1 minute read

LLaVE; Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning

12 minute read

MMRL; Multi-Modal Representation Learning for Vision-Language Models

5 minute read

CVPR 2025

Foundation Models for Time Series; A Survey

10 minute read

arxiv 2025

Sequence Packing

2 minute read

Flash Attention

2 minute read

PaliGemma 구현 Part 2

8 minute read

modeling_gemma

PaliGemma 구현 Part 1

6 minute read

modeling_siglip, processing_paligemma

All about Mistral

3 minute read

Mistral 7B, Mixtral 8x7b

(sLM-6c) Prompt Engineering 실습 3

3 minute read

Phi-3-3.8B (Multi-turn PE, Generated Knowledge PE)

(sLM-6b) Prompt Engineering 실습 2

2 minute read

Mistral-7B (CoT PE, Zero-shot PE)

(sLM-6a) Prompt Engineering 실습 1

2 minute read

LLaMA-3-8B (Multi-turn PE, Few-shot PE)

(sLM-6) Prompt Engineering

less than 1 minute read

Flash Attention 개념, 코드 실습

(sLM-5) Flash Attention

1 minute read

Flash Attention 개념, 코드 실습

(sLM-4) Quantization 실습

2 minute read

(sLM-3) sLM 구축을 위한 기반 기술

8 minute read

Hugging Face, OLLaMA, LangChain, VectorDB, RAG

(sLM-2) LLM 모델 평가방법

4 minute read

LLM 평가, LLM 기반 시스템 평가

Enhancing Financial Time-Series Forecasting with Retrieval-Augmented Large Language Models

9 minute read

(sLM-1) Introduction to sLM

1 minute read

sLLM, sLLM vs LLM, sLLM 예시

VLM survey - slides

less than 1 minute read

94 Architectures

VLM summary

less than 1 minute read

94 Architectures

FinTSB; A Comprehensive and Practical Benchmark for Financial Time Series Forecasting

12 minute read

VLM downstream tasks

3 minute read

VLM downstream tasks

All about DeepSeek

6 minute read

arxiv 2024

vllm의 PagedAttention

2 minute read

LLM Fine-tuning 실습 프로젝트 - Part 7

3 minute read

Inference

LLM Inference를 위한 라이브러리

2 minute read

LLM Inference를 위한 라이브러리

LLM Fine-tuning 실습 프로젝트 - Part 6

4 minute read

DPO 데이터셋 구축 & DPO 수행

LLM Fine-tuning 실습 프로젝트 - Part 5

4 minute read

SFT 데이터 & Full-finetuning 하기

LLM Fine-tuning 실습 프로젝트 - Part 4

5 minute read

Evolving

LLM Fine-tuning 실습 프로젝트 - Part 3

5 minute read

LLM을 통한 데이터 생성

LLM Fine-tuning 실습 프로젝트 - Part 2

1 minute read

Open Source Model 종류 및 특징

LLM Fine-tuning 실습 프로젝트 - Part 1

1 minute read

최적화 기법 심화3 - LLM 성능 판단하기

1 minute read

DPO 데이터 관련 실습

2 minute read

DPO 데이터 전처리 & 생성하기

최적화 기법 심화2 - LLM을 위한 데이터 전처리

6 minute read

최적화 기법 심화1 - GPU 메모리 관리 및 최적화

3 minute read

LLM 모델 파인튜닝을 위한 GPU 최적화 (4) 실습2

3 minute read

Multi-GPU

FSDP, ZeRO 예제

3 minute read

FSDP, ZeRO 예제

LLM 모델 파인튜닝을 위한 GPU 최적화 (5)

6 minute read

분산 처리 기법

LLM 모델 파인튜닝을 위한 GPU 최적화 (4) 실습

6 minute read

Single GPU 환경에서 LLM 돌리기

LLM 모델 파인튜닝을 위한 GPU 최적화 (3)

3 minute read

Hugging Face & PEFT

LLM 모델 파인튜닝을 위한 GPU 최적화 (2)

3 minute read

GPU vs CPU

LLM 모델 파인튜닝을 위한 GPU 최적화 (1)

3 minute read

LLM & GPU

Unveiling Encoder-Free Vision-Language Models

6 minute read

NeurIPS 2024

(Pytorch) Distributed Training - DDP

1 minute read

DDP

Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization

less than 1 minute read

ICLR 2024

Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization

6 minute read

ICLR 2024

MMICL; Empowering Vision-Language Model with Multi-Modal In-context Learning

4 minute read

ICLR 2024

UNIT; Unifying Image and Text Recognition in One Vision Encoder

1 minute read

NeurIPS 2024

CLIPS; An Enhanced CLIP Framework for Learning with Synthetic Captions

6 minute read

arxiv 2024

VLMo; Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts

less than 1 minute read

NeurIPS 2022

(VLM survey) (Part 6; Performance Comparison & Future Works)

3 minute read

arxiv 2024

(VLM survey) (Part 5; VLM Knowledge Distillation)

10 minute read

arxiv 2024

(VLM survey) (Part 4; VLM Transfer Learning)

11 minute read

arxiv 2024

(VLM survey) (Part 3; VLM Pretraining)

8 minute read

arxiv 2024

(VLM survey) (Part 2; VLM Foundations & Datasets)

3 minute read

arxiv 2024

(VLM survey) (Part 1; Intro & Background)

6 minute read

arxiv 2024

Large Language Models; A Survey (Part 4)

7 minute read

arxiv 2024

Large Language Models; A Survey (Part 3)

7 minute read

arxiv 2024

Large Language Models; A Survey (Part 2)

9 minute read

arxiv 2024

Large Language Models; A Survey (Part 1)

7 minute read

arxiv 2024

(Diffusion survey) (Part 1; xxx)

4 minute read

Diffusion Models and Representation Learning; A Survey (TPAMI 2024)

(Diffusion survey) (Part 1; xxx)

6 minute read

Diffusion Models and Representation Learning; A Survey (TPAMI 2024)

MLLM Benchmarks

less than 1 minute read

MME, MMMU, GQA, ChartQA, POPE, NoCaps, TextVQA

Vision-Language Models (VLMs)

4 minute read

Unicoder-VL; A Universal Encoder for Vision and Language by Cross-modal Pre-training

less than 1 minute read

AAAI 2020

TinyGPT-V; Efficient Multimodal Large Language Model via Small Backbones

1 minute read

arxiv 2023

Perceiver IO; A General Architecture for Structured Inputs & Outputs

less than 1 minute read

NExT-GPT; Any-to-Any Multimodal LLM

2 minute read

ICML 2024 Oral

Mixture-of-Experts with Expert Choice Routing

2 minute read

NeurIPS 2022

DeepSeek-VL2; Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

3 minute read

arxiv 2024

KAN; Kolmogorov–Arnold Networks

3 minute read

arxiv 2024

ImageBind; One Embedding Space To Bind Them All

1 minute read

CVPR 2023 Highlighted Paper

DeepSpeed

2 minute read

DeepSeek-VL; Towards Real-World Vision-Language Understanding

3 minute read

arxiv 2024

Qwen-VL; A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond

3 minute read

arxiv 2023

PPO in RLHF vs DPO

1 minute read

Proximal Policy Optimization, Direct Preference Optimization

Offload

1 minute read

Offload, DeepSpeed

Meta-Transformer; A Unified Framework for Multimodal Learning

less than 1 minute read

arxiv 2023

Byte Latent Transformer; Patches Scale Better Than Tokens

1 minute read

FAIR 2024

Janus-Pro; Unified Multimodal Understanding and Generation with Data and Model Scaling

3 minute read

arxiv 2025

An Inverse Scaling Law for CLIP Training

1 minute read

NeurIPS 2023

Gradient Checkpointing

1 minute read

Flamingo; a Visual Language Model for Few-Shot Learning

less than 1 minute read

NeurIPS 2022

Q-LoRA

2 minute read

HydraLoRA; An Asymmetric LoRA Architecture for Efficient Fine-Tuning

1 minute read

NeurIPS 2024 Oral

BLIP-2; Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

2 minute read

ICML 2023

Quantization

less than 1 minute read

Float32 vs Float16 vs BFloat16

NV-Embed; Improved Techniques for Training LLMs as Generalist Embedding Models

1 minute read

ICLR 2025 under review, arxiv 2024

BLIP; Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

2 minute read

ICML 2022

Visual Instruction Tuning

2 minute read

NeurIPS 2023 Oral

Fine-Grained Abnormality Prompt Learning for Zero-shot Anomaly Detection

4 minute read

ICLR 2025 under review, arxiv 2024

LCM-LoRA; A Universal Stable-Diffusion Acceleration Module

less than 1 minute read

arxiv 2023

CoCa; Contrastive Captioners are Image-Text Foundation Models

3 minute read

arxiv 2022

Training LLMs to Reason in a Continuous Latent Space

2 minute read

arxiv 2024

SimVLM; Simple Visual Language Model Pretraining with Weak Supervision

1 minute read

ICLR 2022

NLP Tasks

2 minute read

GPU 설명

2 minute read

feat ChatGPT

Active prompting

5 minute read

ACL 2024

LLM Libraries 정리

2 minute read

Consistency Models – Optimizing Diffusion Models Inference

1 minute read

ICML 2023

DeepSeek-R1; Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

3 minute read

arxiv 2025

Titans; Learning to Memorize at Test Time

4 minute read

arxiv 2024

RAG 실습

2 minute read

feat 테디노트

Large Concept Models; Language Modeling in a Sentence Representation Space

4 minute read

arxiv 2024

Byte Latent Transformer; Patches Scale Better Than Tokens

3 minute read

arxiv 2024

Training Large Language Models to Reason in a Continuous Latent Space

1 minute read

arxiv 2024

VQ-VAE

1 minute read

Neural Discrete Representation Learning (NeurIPS 2017)

Hymba; A Hybrid-head Architecture for Small Language Models

2 minute read

ICLR 2025

LLaMA-Mesh; Unifying 3D Mesh Generation with Language Models

1 minute read

arxiv 2024

DALL-E; Zero-Shot Text-to-Image Generation

1 minute read

ICML 2021

TokenFormer; Rethinking Transformer Scaling with Tokenized Model Parameters

1 minute read

ICLR 2025

Transformers Can Do Arithmetic with the Right Embeddings

less than 1 minute read

NeurIPS 2024

CLLMs; Consistency Large Language Models

2 minute read

ICLR 2024

Towards Time-Series Reasoning with LLMs

2 minute read

NeurIPSW TSALM 2024

Metadata Matters for Time Series; Informative Forecasting with Transformers

2 minute read

ICLR 2025 submission

LeMoLE; LLM-Enhanced Mixture of Linear Experts for Time Series Forecasting

3 minute read

ICLR 2025 submission

LLM-Mixer; Multiscale Mixing in LLMs for Time Series Forecasting

1 minute read

LLM-TS Integrator; Integrating LLM for Enhanced Time Series Modeling

less than 1 minute read

NeurIPSW TSALM 2024

Are Language Models Actually Useful for Time Series Forecasting?

1 minute read

NeurIPS 2024

ReFT; Representation Finetuning for Language Models

1 minute read

NeurIPS 2024

Mixture-of-Agents Enhances Large Language Model Capabilities

less than 1 minute read

ICLR 2025

The Era of 1-bit LLMs; All Large Language Models are in 1.58 Bits

1 minute read

arxiv 2024

Self-Rewarding Language Models

1 minute read

ICML 2024

Fast Inference of Mixture-of-Experts Language Models with Offloading

3 minute read

arxiv 2023

Orca 2; Teaching Small Language Models How to Reason

2 minute read

arxiv 2023

CodeFusion; A Pre-trained Diffusion Model for Code Generation

2 minute read

EMNLP 2023

Table-GPT; Table-tuned GPT for Diverse Table Tasks

less than 1 minute read

SIGMOD 2024

ORPO; Large Language Models As Optimizers

1 minute read

ICLR 2024

Code Llama

2 minute read

arxiv 2023

Orca; Progressive Learning from Complex Explanation Traces of GPT-4

1 minute read

arxiv 2023

LongNet; Scaling Transformers to 1,000,000,000 Tokens

less than 1 minute read

arxiv 2023

LIMA; Less Is More for Alignment

1 minute read

NeurIPS 2023

Shepherd; A Critic for Language Model Generation

less than 1 minute read

arxiv 2023

Speech LLMs; 3) Multimodal Information Fusion and Training Strategies

2 minute read

A Survey on Speech Large Language Models

Universal and Transferable Adversarial LLM Attacks

1 minute read

Speech LLMs; 2) Recent Advances in Speech LLMs

12 minute read

A Survey on Speech Large Language Models

From Sparse to Soft Mixture of Experts (SoftMoE)

1 minute read

ICLR 2024

Speech LLMs; 1) Introduction

1 minute read

A Survey on Speech Large Language Models

Speech Tasks

less than 1 minute read

Neural ODE

1 minute read

NeurIPS 2018 Best paper

Low Rank Adaptation (LoRA)

less than 1 minute read

ICLR 2021

Prompt Tuning

1 minute read

PEFT, Prompt Tuning

(LLM 교재) 3.프롬프트 엔지니어링의 첫 번째 단계

2 minute read

쉽고 빠르게 익히는 실전 LLM

(LLM 교재) 3.프롬프트 엔지니어링의 첫 번째 단계

2 minute read

쉽고 빠르게 익히는 실전 LLM

(LLM 교재) 2.LLM을 이용한 의미 기반 검색 - 실습 2

2 minute read

쉽고 빠르게 익히는 실전 LLM

(LLM 교재) 2.LLM을 이용한 의미 기반 검색 - 실습 2

3 minute read

쉽고 빠르게 익히는 실전 LLM

(LLM 교재) 2.LLM을 이용한 의미 기반 검색 - 실습 1

4 minute read

쉽고 빠르게 익히는 실전 LLM

(LLM 교재) 2.LLM을 이용한 의미 기반 검색

5 minute read

쉽고 빠르게 익히는 실전 LLM

(LLM 교재) 1.LLM 소개

1 minute read

쉽고 빠르게 익히는 실전 LLM

Language Models Still Struggle to Zero-shot Reason about Time Series

3 minute read

EMNLP 2024

Time-LLM; TS Forecasting by Reprogrammming LLM

5 minute read

ICLR 2024

Lag-Llama; Towards Foundation Models for Time Series Forecasting

3 minute read

Workshop on Robustness of Few-shot and Zero-shot Learning in Foundation Models at NeurIPS 2023

TimeGPT-1

4 minute read

arxiv 2023

One Fits All; Power General TS Analysis by Pretrained LM

1 minute read

NeurIPS 2023

(paper 101) LLM4TS; Two-stage Fine-tuning for TSF with Pretrained LLMs

4 minute read

2023

Seunghan Lee

Large Language Models