(VLM survey) (Part 6; Performance Comparison & Future Works)
arxiv 2024
arxiv 2024
arxiv 2024
arxiv 2024
arxiv 2024
arxiv 2024
arxiv 2024
arxiv 2024
Diffusion Models and Representation Learning; A Survey (TPAMI 2024)
Diffusion Models and Representation Learning; A Survey (TPAMI 2024)
ICML 2024 Oral
CVPR 2023 Highlighted Paper
Offload, DeepSpeed
Float32 vs Float16 vs BFloat16
분산 처리 기법
Hugging Face & PEFT
GPU vs CPU
ICLR 2024 Oral
LLM & GPU
feat ChatGPT
arxiv 2024
A Survey on Speech Large Language Models
A Survey on Speech Large Language Models
A Survey on Speech Large Language Models
Multimodal Transformer, Cross-modal attention, self-attention
Signal Data, Wav2Vec, SincNet, PASE
Signal Data, Wav2Vec, SincNet, PASE
Signal Data, Fourier Transform, MFCC
Signal Data, Fourier Transform, MFCC
Multimodal Learning, Multimodal Representations
Multimodal Learning, Translation
Multimodal Learning, Multimodal Representations
Multimodal Deep Learning에 대한 소개글