
Audio representation as waveform or spectrogram, Speech encoder and SpeechLLM
Mar 10, 2024

Multimodality, Fusion, Original tasks, datasets and models, CLIP and text2image Diffusion, Fusion of Large Pre-trained Models, LMM Assistants and Open-source recent datasets
Mar 8, 2024

Deep Learning class CC66204
Feb 24, 2024