NLP

MLLM Benchmarks

MME, MMMU, GQA, ChartQA, POPE, NoCaps, TextVQA

PPO in RLHF vs DPO

Proximal Policy Optimization, Direct Preference Optimization