A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification

Dec 10, 2025·
Nicolas Calbucura
Valentin Barriere
Valentin Barriere
· 1 min read
The Step one of our method consists in audio token selection pipeline based on an $\ell_1$ logistic regression using Bag-of-Word representation. This results on fewer Audio Tokens selected for a specific task.
Type
Publication
Arxiv (submitted)

This work goes directly in the context of my Fondecyt de Iniciacion🗣️💬🤖 project.