Publication Details

End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations

YUSUF Bolaji, ČERNOCKÝ Jan and SARAÇLAR Murat. End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 31, no. 8, 2023, pp. 3070-3080. ISSN 2329-9290. Available from: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10201906
Czech title
Celostní vyhledávání klíčových slov s otevřeným slovníkem a vícejazyčnými neurálními reprezentacemi
Type
journal article
Language
English
Authors
Yusuf Bolaji (DCGM FIT BUT)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)
Saraçlar Murat (UBOGAZ)
URL
Keywords

Keyword search, spoken term detection, end-to-end keyword search, ASR-free keyword search, keyword spotting.

Abstract

Conventional keyword search systems operate on automatic speech recognition (ASR) outputs, which causes them to have a complex indexing and search pipeline. This has led to interest in ASR-free approaches to simplify the search procedure. We recently proposed a neural ASR-free keyword search model which achieves competitive performance while maintaining an efficient and simplified pipeline, where queries and documents are encoded with a pair of recurrent neural network encoders and the encodings are combined with a dot-product. In this article, we extend this work with multilingual pretraining and detailed analysis of the model. Our experiments show that the proposed multilingual training significantly improves the model performance and that despite not matching a strong ASR-based conventional keyword search system for short queries and queries comprising in-vocabulary words, the proposed model outperforms the ASR-based system for long queries and queries that do not appear in the training data.
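
The dot-product combination mentioned in the abstract can be illustrated with a minimal sketch. It assumes the query encoder yields a single vector and the document encoder yields one vector per frame; the sigmoid, the per-frame granularity, the function name frame_scores and the hand-written vectors are all assumptions made for illustration, not details taken from the abstract.

```python
import math


def frame_scores(query_vec, doc_frames):
    """Score each document frame against the query encoding.

    Each score is the sigmoid of the dot product between the query vector
    and one frame vector (the sigmoid is an assumption of this sketch).
    """
    scores = []
    for frame in doc_frames:
        dot = sum(q * f for q, f in zip(query_vec, frame))
        scores.append(1.0 / (1.0 + math.exp(-dot)))
    return scores


# Hypothetical encodings standing in for the outputs of the two encoders.
query_vec = [1.0, -1.0]
doc_frames = [[2.0, -2.0], [0.0, 0.0], [-2.0, 2.0]]

scores = frame_scores(query_vec, doc_frames)
```

In this toy setup, frames whose encoding points in the same direction as the query encoding score close to 1 and frames pointing the opposite way score close to 0, so searching reduces to thresholding the scores rather than traversing an ASR index.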

Published
2023
Pages
3070-3080
Journal
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 31, no. 8, ISSN 2329-9290
Publisher
IEEE Signal Processing Society
DOI
10.1109/TASLP.2023.3301239
UT WoS
001047323400008
EID Scopus
BibTeX
@ARTICLE{FITPUB13057,
   author = "Bolaji Yusuf and Jan \v{C}ernock\'{y} and Murat Sara\c{c}lar",
   title = "End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations",
   pages = "3070--3080",
   journal = "IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING",
   volume = 31,
   number = 8,
   year = 2023,
   ISSN = "2329-9290",
   doi = "10.1109/TASLP.2023.3301239",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/13057"
}