Result detail

End-to-End Open Vocabulary Keyword Search

YUSUF, B.; GOK, A.; GUNDOGDU, B.; SARAÇLAR, M. End-to-End Open Vocabulary Keyword Search. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021. no. 8, p. 4388-4392. ISSN: 1990-9772.
Type
conference proceedings paper
Language
English
Authors
Yusuf Bolaji, UPGM (FIT)
GOK, A.
GUNDOGDU, B.
SARAÇLAR, M.
Abstract

Recently, neural approaches to spoken content retrieval have become popular. However, they tend to be restricted in their vocabulary or in their ability to deal with imbalanced test settings. These restrictions limit their applicability in keyword search, where the set of queries is not known beforehand, and where the system should return not just whether an utterance contains a query but the exact location of any such occurrences. In this work, we propose a model directly optimized for keyword search. The model takes a query and an utterance as input and returns a sequence of probabilities for each frame of the utterance of the query having occurred in that frame. Experiments show that the proposed model not only outperforms similar end-to-end models on a task where the ratio of positive and negative trials is artificially balanced, but it is also able to deal with the far more challenging task of keyword search with its inherent imbalance. Furthermore, using our system to rescore the outputs of an LVCSR-based keyword search system leads to significant improvements on the latter.

Keywords

keyword search, spoken term detection

URL
Year
2021
Pages
4388–4392
Journal
Proceedings of Interspeech, vol. 2021, no. 8, ISSN 1990-9772
Proceedings
Proceedings Interspeech 2021
Conference
Interspeech Conference
Publisher
International Speech Communication Association
Place
Brno
DOI
UT WoS
000841879504096
EID Scopus
BibTeX
@inproceedings{BUT175847,
  author="YUSUF, B. and GOK, A. and GUNDOGDU, B. and SARAÇLAR, M.",
  title="End-to-End Open Vocabulary Keyword Search",
  booktitle="Proceedings Interspeech 2021",
  year="2021",
  journal="Proceedings of Interspeech",
  volume="2021",
  number="8",
  pages="4388--4392",
  publisher="International Speech Communication Association",
  address="Brno",
  doi="10.21437/Interspeech.2021-1399",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/interspeech_2021/yusuf21_interspeech.html"
}
Files
Projects
Modern Methods of Processing, Analysis and Visualization of Multimedia and 3D Data, VUT, VUT internal projects, FIT-S-20-6460, start: 2020-03-01, end: 2023-02-28, completed
Multiple Intelligent Conversation Agent Services for Reception, Management and Integration of Third Country Nationals in the EU, EU, Horizon 2020, start: 2020-02-01, end: 2023-04-30, completed
Research groups
Departments