Result Details

Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification

VILLATORO-TELLO, E.; MADIKERI, S.; SHARMA, B.; KHALIL, D.; KUMAR, S.; NIGMATULINA, I.; MOTLÍČEK, P.; GANAPATHIRAJU, A. Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024. p. 12617-12621. ISBN: 979-8-3503-4485-1.
Type
conference paper
Language
English
Authors
VILLATORO-TELLO, E.
Madikeri Srikanth, FIT (FIT)
SHARMA, B.
KHALIL, D.
KUMAR, S.
NIGMATULINA, I.
Motlíček Petr, doc. Ing., Ph.D., DCGM (FIT)
GANAPATHIRAJU, A.
Abstract

Spoken Language Understanding (SLU) technologies have
greatly improved due to the effective pretraining of speech
representations. A common requirement of industry-based
solutions is the portability to deploy SLU models in voice-
assistant devices. Thus, distilling knowledge from large text-
based language models has become an attractive solution for
achieving good performance and guaranteeing portability. In
this paper, we introduce a novel architecture that uses a cross-
modal attention mechanism to extract bin-level contextual
embeddings from a word-confusion network (WNC) encod-
ing such that these can be directly compared and aligned with
traditional text-based contextual embeddings. This alignment
is achieved using a recently proposed tokenwise constrastive
loss function. We validate our architecture's effectiveness
by fine-tuning our WCN-based pretrained model to do intent
classification (IC) on the well-known SLURP dataset. Ob-
tained accuracy on the IC task (81%), depicts a 9.4% relative
improvement compared to a recent/equivalent E2E method

Keywords

Word-Confusion-Networks, Cross-modal Alignment, Knowledge Distillation, Intent Classification

URL
Published
2024
Pages
12617–12621
Proceedings
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Conference
2024 IEEE International Conference on Acoustics, Speech and Signal Processing IEEE
ISBN
979-8-3503-4485-1
Publisher
IEEE Signal Processing Society
Place
Seoul
DOI
BibTeX
@inproceedings{BUT196786,
  author="VILLATORO-TELLO, E. and MADIKERI, S. and SHARMA, B. and KHALIL, D. and KUMAR, S. and NIGMATULINA, I. and MOTLÍČEK, P. and GANAPATHIRAJU, A.",
  title="Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification",
  booktitle="ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)",
  year="2024",
  pages="12617--12621",
  publisher="IEEE Signal Processing Society",
  address="Seoul",
  doi="10.1109/ICASSP48485.2024.10445934",
  isbn="979-8-3503-4485-1",
  url="https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10445934"
}
Files
Projects
Exchanges for SPEech ReseArch aNd TechnOlogies, EU, Horizon 2020, start: 2021-01-01, end: 2025-12-31, running
Research groups
Departments
Back to top