Deep Learning

Audio representation as waveform or spectrogram, Speech encoder and SpeechLLM

Mar 10, 2024

Multimodality, Fusion, Original tasks, datasets and models, CLIP and text2image Diffusion, Fusion of Large Pre-trained Models, LMM Assistants and Open-source recent datasets

Mar 8, 2024

Deep Learning

Deep Learning class CC66204

Feb 24, 2024