Result Details

Improving Acoustic Based Keyword Spotting Using LVCSR Lattices

MOTLÍČEK, P.; VALENTE, F.; SZŐKE, I. Improving Acoustic Based Keyword Spotting Using LVCSR Lattices. Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012. p. 4413-4416. ISBN: 978-1-4673-0044-5.
Type
conference paper
Language
English
Authors
Motlíček Petr, doc. Ing., Ph.D., DCGM (FIT)
Valente Fabio
Szőke Igor, Ing., Ph.D., DCGM (FIT)
Abstract

This paper summarizes experimental results achieved with acousticand LVCSR-KWS systems exploited on conversational audiorecordings.

Keywords

KeyWord Spotting (KWS), Spoken Term Detection(STD), Confidence Measure (CM)

URL
Annotation

This paper investigates detection of English keywords in a conversational scenario using a combination of acoustic and LVCSR based keyword spotting systems. Acoustic KWS systems search predefined words in parameterized spoken data. Corresponding confidences are represented by likelihood ratios given the keyword models and a background model. First, due to the especially high number of false-alarms, the acoustic KWS system is augmented with confidence measures estimated from corresponding LVCSR lattices. Then, various strategies to combine scores estimated by the acoustic and several LVCSR based KWS systems are explored. We show that a linear regression based combination significantly outperforms other (model-based) techniques. Due to that, the relative number of false-alarms of the combined KWS system decreased by more than 50% compared to the acoustic KWS system. Finally, an attention is also paid to the complexities of the KWS systems enabling them to potentially be exploited in real-detection tasks.

Published
2012
Pages
4413–4416
Proceedings
Proc. International Conference on Acoustics, Speech, and Signal Processing 2012
Conference
The 37th International Conference on Acoustics, Speech, and Signal Processing
ISBN
978-1-4673-0044-5
Publisher
IEEE Signal Processing Society
Place
Kyoto
DOI
BibTeX
@inproceedings{BUT91501,
  author="Petr {Motlíček} and Fabio {Valente} and Igor {Szőke}",
  title="Improving Acoustic Based Keyword Spotting Using LVCSR Lattices",
  booktitle="Proc. International Conference on Acoustics, Speech, and Signal Processing 2012",
  year="2012",
  pages="4413--4416",
  publisher="IEEE Signal Processing Society",
  address="Kyoto",
  doi="10.1109/ICASSP.2012.6288898",
  isbn="978-1-4673-0044-5",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2012/motlicek_icassp2012_0004413.pdf"
}
Projects
National Support for Project Together Anywhere, Together Anytime, MŠMT, Podpora projektů sedmého rámcového programu Evropského společenství pro výzkum, technologický rozvoj a demonstrace (2007 až 2013) podle zákona č. 171/2007 Sb., 7E11024, start: 2011-01-01, end: 2011-12-31, running
Security-Oriented Research in Information Technology, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running
Research groups
Departments
Back to top