Result Details
Acoustic keyword spotter - optimization from end-user perspective
Grézl František, Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Fapšo Michal, Ing., Ph.D., DCGM (FIT)
This paper is on acoustic keyword spotting. It presents several steps that have to be done to obtain a usable acoustic keyword spotting system. The novelty of the system is in the calibration.
keyword spotting, spoken term detection, neural networks, calibration
The paper deals with the development of acoustic keyword spotter (KWS) meeting requirements of a real user from the security community. While the basic scheme of the KWS is relatively standard, it uses novel features derived by a hierarchy of neural networks, and score normalization trained to maximize a user-like evaluation metric. The results are reported on a selection of Czech conversational telephone speech (CTS), radio and read data.
@inproceedings{BUT35213,
author="Igor {Szőke} and František {Grézl} and Jan {Černocký} and Michal {Fapšo}",
title="Acoustic keyword spotter - optimization from end-user perspective",
booktitle="Proceedings of the 2010 IEEE Spoken Language Technology Workshop",
year="2010",
series="IEEE Catalog Number: CFP 10SLT-USB",
pages="177--181",
publisher="IEEE Signal Processing Society",
address="Berkeley, California",
isbn="978-1-4244-7902-3",
url="http://www.fit.vutbr.cz/research/groups/speech/publi/2010/sz%f6ke_SLT2010_p.177.pdf"
}
Recognition and presentation of multimedia data, BUT, Vnitřní projekty VUT, FIT-S-10-2, 2010, start: 2010-04-01, end: 2010-12-31, completed
Security-Oriented Research in Information Technology, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running
Speech Recognition under Real-World Conditions, GACR, Standardní projekty, GA102/08/0707, start: 2008-01-01, end: 2011-12-31, completed
Theory and applications of phoneme posterior estimation in speech processing, GACR, Doktorské granty, GP102/09/P635, start: 2009-01-01, end: 2011-12-31, completed