Result Details

Word-subword based keyword spotting with implications in OOV detection

ČERNOCKÝ, J.; SZŐKE, I.; HANNEMANN, M.; KOMBRINK, S. Word-subword based keyword spotting with implications in OOV detection. Pacific Grove: Institute of Electrical and Electronics Engineers, 2010. 34 p.
Type
presentation, poster
Language
English
Authors
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Szőke Igor, Ing., Ph.D., DCGM (FIT)
Hannemann Mirko, Ph.D., DCGM (FIT)
Kombrink Stefan, Dipl.-Linguist., DCGM (FIT)
Abstract

The talk is on our work in designing hybrid word-subword keyword spotting systems, that maintain the accuracy of LVCSR, while allowing for detecting OOVs as sequences of sub-word units.

Keywords

speech recognition, keyword spotting, spoken term detection, OOV

URL
Annotation

Main-stream systems for keyword spotting and spoken term detection are based on the series of Large Vocabulary Continuous Speech Recognizer with subsequent search in its output. These systems are limited by the vocabulary of the recognizer and are not able to detect Out of Vocabulary (OOV) words. This talk will present our work in designing hybrid word-subword keyword spotting systems, that maintain the accuracy of LVCSR, while allowing for detecting OOVs as sequences of sub-word units. We will also show the links of this work to the detection, description and clustering of OOVs, as investigated in the framework of the EC-sponsored project DIRAC.

Published
2010
Pages
34
Conference
Asilomar Conference on Signals, Systems, and Computers
Publisher
Institute of Electrical and Electronics Engineers
Place
Pacific Grove
BibTeX
@misc{BUT63577,
  author="Jan {Černocký} and Igor {Szőke} and Mirko {Hannemann} and Stefan {Kombrink}",
  title="Word-subword based keyword spotting with implications in OOV detection",
  year="2010",
  pages="34",
  publisher="Institute of Electrical and Electronics Engineers",
  address="Pacific Grove",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2010/asilomar_kwd_oov.ppt"
}
Projects
DIRAC - Detection and Identification of Rare Audio-visual Cues, MŠMT, Šestý rámcový program Evropského společenství pro výzkum, technický rozvoj a demonstrační činnosti, 027787, start: 2006-01-01, end: 2010-12-31, completed
Overcoming the language barrier complicating investigation into financing terrorism and serious financial crimes, MV, Program bezpečnostního výzkumu, VD20072010B16, start: 2007-08-01, end: 2010-12-31, completed
Security-Oriented Research in Information Technology, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running
Research groups
Departments
Back to top