Detail výsledku
Comparison of methods for language-dependent and language-independent query-by-example spoken term detection
Fapšo Michal, Ing., Ph.D., FIT (FIT), UPGM (FIT)
Szőke Igor, Ing., Ph.D., UPGM (FIT)
Černocký Jan, prof. Dr. Ing., UPGM (FIT)
Grézl František, Ing., Ph.D., UPGM (FIT)
This article investigates query-by-example (QbE) spoken term detection (STD), in which the query is notentered as text, but selected in speech data or spoken. Two feature extractors based on neural networks(NN) are introduced: the first producing phone-state posteriors and the second making use of a compressiveNN layer. They are combined with three different QbE detectors: while the Gaussian mixture model/hiddenMarkov model (GMM/HMM) and dynamic time warping (DTW) both work on continuous feature vectors,the third one, based on weighted finite-state transducers (WFST), processes phone lattices.
Experimentation, Query-by-example, DTW-based query-by-example, GMM/HMM-basedquery-by-example, WFST-based query-by-example, bottleneck features, keyword spotting
@article{BUT97057,
author="Javier {Tejedor} and Michal {Fapšo} and Igor {Szőke} and Jan {Černocký} and František {Grézl}",
title="Comparison of methods for language-dependent and language-independent query-by-example spoken term detection",
journal="ACM TRANSACTIONS ON INFORMATION SYSTEMS",
year="2012",
volume="2012",
number="30",
pages="1--34",
doi="10.1145/2328967.2328971",
issn="1046-8188",
url="http://dl.acm.org/citation.cfm?id=2328971&CFID=187707319&CFTOKEN=67886685"
}
Jazykově nezávislá detekce klíčových slov, GAČR, Postdoktorandské granty, GPP202/12/P567, zahájení: 2012-01-01, ukončení: 2014-12-31, ukončen
Multiligvální rozpoznávání a vyhledávání v řeči pro elektronické slovníky, MPO, TIP, FR-TI1/034, zahájení: 2009-09-01, ukončení: 2013-08-31, ukončen
Výzkum informačních technologií z hlediska bezpečnosti, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, zahájení: 2007-01-01, ukončení: 2013-12-31, řešení