Result Details

End-to-End Open Vocabulary Keyword Search

YUSUF, B.; GOK, A.; GUNDOGDU, B.; SARAÇLAR, M. End-to-End Open Vocabulary Keyword Search. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021. no. 8, p. 4388-4392. ISSN: 1990-9772.
Type
conference paper
Language
English
Authors
Yusuf Bolaji, DCGM (FIT)
GOK, A.
GUNDOGDU, B.
SARAÇLAR, M.
Abstract

Recently, neural approaches to spoken content retrieval have become popular. However, they tend to be restricted in their vocabulary or in their ability to deal with imbalanced test settings. These restrictions limit their applicability in keyword search, where the set of queries is not known beforehand, and where the system should return not just whether an utterance contains a query but the exact location of any such occurrences. In this work, we propose a model directly optimized for keyword search. The model takes a query and an utterance as input and returns a sequence of probabilities, one for each frame of the utterance, of the query having occurred in that frame. Experiments show that the proposed model not only outperforms similar end-to-end models on a task where the ratio of positive and negative trials is artificially balanced, but it is also able to deal with the far more challenging task of keyword search with its inherent imbalance. Furthermore, using our system to rescore the outputs of an LVCSR-based keyword search system leads to significant improvements on the latter.

Keywords

keyword search, spoken term detection

URL
https://www.isca-speech.org/archive/interspeech_2021/yusuf21_interspeech.html
Published
2021
Pages
4388–4392
Journal
Proceedings of Interspeech, vol. 2021, no. 8, ISSN 1990-9772
Proceedings
Proceedings Interspeech 2021
Conference
Interspeech Conference
Publisher
International Speech Communication Association
Place
Brno
DOI
10.21437/Interspeech.2021-1399
UT WoS
000841879504096
BibTeX
@inproceedings{BUT175847,
  author="YUSUF, B. and GOK, A. and GUNDOGDU, B. and SARAÇLAR, M.",
  title="End-to-End Open Vocabulary Keyword Search",
  booktitle="Proceedings Interspeech 2021",
  year="2021",
  journal="Proceedings of Interspeech",
  volume="2021",
  number="8",
  pages="4388--4392",
  publisher="International Speech Communication Association",
  address="Brno",
  doi="10.21437/Interspeech.2021-1399",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/interspeech_2021/yusuf21_interspeech.html"
}
Projects
Modern methods for processing, analysis, and visualization of multimedia and 3D data, BUT, BUT internal projects, FIT-S-20-6460, start: 2020-03-01, end: 2023-02-28, completed
Multiple Intelligent Conversation Agent Services for Reception, Management and Integration of Third Country Nationals, EU, Horizon 2020, start: 2020-02-01, end: 2023-04-30, completed