Large Language Models

MLLM Benchmarks

less than 1 minute read

MME, MMMU, GQA, ChartQA, POPE, NoCaps, TextVQA

PPO in RLHF vs DPO

1 minute read

Proximal Policy Optimization, Direct Preference Optimization

Offload

1 minute read

Offload, DeepSpeed

Quantization

less than 1 minute read

Float32 vs Float16 vs BFloat16

VQ-VAE

1 minute read

Neural Discrete Representation Learning (NeurIPS 2017)