A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification

Dec 10, 2025·

Nicolas Calbucura

Valentin Barriere

· 1 min read

The Step one of our method consists in audio token selection pipeline based on an $\ell_1$ logistic regression using Bag-of-Word representation. This results on fewer Audio Tokens selected for a specific task.

Type

Conference paper

Publication

Arxiv (submitted)

This work goes directly in the context of my Fondecyt de Iniciacion🗣️💬🤖 project.

Last updated on Dec 10, 2025

Multimodality Natural Language Processing Social Interactions

Authors

Valentin Barriere

Researcher and Teacher

Adapting Bias Evaluation to Domain Contexts using Generative Models Aug 25, 2025 →