Detail výsledku

Comparison of Keyword Spotting Approaches for Informal Continuous Speech

SZŐKE, I.; SCHWARZ, P.; MATĚJKA, P.; BURGET, L.; FAPŠO, M.; KARAFIÁT, M.; ČERNOCKÝ, J. Comparison of Keyword Spotting Approaches for Informal Continuous Speech. 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms. Edinburgh: 2005. p. 1-12.
Typ
článek ve sborníku konference
Jazyk
anglicky
Autoři
Szőke Igor, Ing., Ph.D., UPGM (FIT)
Schwarz Petr, Ing., Ph.D., UPGM (FIT)
Matějka Pavel, Ing., Ph.D., UPGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., UPGM (FIT)
Fapšo Michal, Ing., Ph.D.
Karafiát Martin, Ing., Ph.D., UPGM (FIT)
Černocký Jan, prof. Dr. Ing., UPGM (FIT)
Abstrakt

This paper describes several approaches to keyword spotting (KWS) forinformal continuous speech. We compare acoustic keyword spotting,spotting in word lattices generated by large vocabulary continuousspeech recognition and a hybrid approach making use of phoneme latticesgenerated by a phoneme recognizer. The systems are compared oncarefully defined test data extracted from ICSI meeting database. Theadvantages and drawbacks of different approaches are discussed. Theacoustic and phoneme-lattice based KWS are based on a phonemerecognizer making use of temporal-pattern (TRAP) feature extraction andposterior estimation using neural nets. We show its superiority overtraditional HMM/GMM systems. A posterior probability transformationfunction is introduced for posterior based acoustic keyword spotting.We also propose a posterior masking algorithm to speed-up acoustickeyword spotting.

Klíčová slova

comparison, keyword spotting, hidden Markov model, long temporal trajectory, phoneme recognizer

URL
Rok
2005
Strany
1–12
Sborník
2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms
Konference
2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms
Místo
Edinburgh
BibTeX
@inproceedings{BUT18063,
  author="Igor {Szőke} and Petr {Schwarz} and Pavel {Matějka} and Lukáš {Burget} and Michal {Fapšo} and Martin {Karafiát} and Jan {Černocký}",
  title="Comparison of Keyword Spotting Approaches for Informal Continuous Speech",
  booktitle="2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms",
  year="2005",
  pages="1--12",
  address="Edinburgh",
  url="https://www.fit.vut.cz/research/publication/7887/"
}
Projekty
Daty řízené a antropické kódování a rozpoznávání řeči, GAČR, Postdoktorandské granty, GP102/02/D108, zahájení: 2002-09-01, ukončení: 2005-08-30, ukončen
Nové směry ve výzkumu a využití hlasových technologií, GAČR, Standardní projekty, GA102/05/0278, zahájení: 2005-01-01, ukončení: 2007-12-31, ukončen
Posílená skupinová interakce, EU, Sixth Framework programme, 506811-AMI, zahájení: 2004-01-01, ukončení: 2006-12-31, ukončen
Výzkumné skupiny
Pracoviště
Nahoru