Result Details

Progress in the BBN Keyword Search System for the DARPA RATS Program

NG, T.; HSIAO, R.; ZHANG, L.; KARAKOS, D.; MALLIDI, S.; KARAFIÁT, M.; VESELÝ, K.; SZŐKE, I.; ZHANG, B.; NGUYEN, L.; SCHWARTZ, R. Progress in the BBN Keyword Search System for the DARPA RATS Program. In Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014. p. 959-963. ISBN: 978-1-63439-435-2.
Type
conference paper
Language
English
Authors
Ng Tim
Hsiao Roger, FIT (FIT)
Zhang Le
Karakos Damianos, FIT (FIT)
Mallidi Sri Harish, FIT (FIT)
Karafiát Martin, Ing., Ph.D., DCGM (FIT)
Veselý Karel, Ing., Ph.D., DCGM (FIT)
Szőke Igor, Ing., Ph.D., DCGM (FIT)
Zhang Bing
Nguyen Long
Schwartz Richard, FIT (FIT)
Abstract

This article describes progress in the BBN keyword search system for the DARPA RATS (Robust Automatic Transcription of Speech) program.

Keywords

speech recognition, KWS, MLP, DNN

URL
http://www.isca-speech.org/archive/interspeech_2014/i14_0959.html
Annotation

This paper presents a set of techniques that we used to improve our keyword search system for the third phase of the DARPA RATS (Robust Automatic Transcription of Speech) program, which seeks to advance state-of-the-art detection capabilities on audio from highly degraded radio communication channels. Results are reported for both Levantine and Farsi, the two target languages for the keyword search (KWS) task. An absolute reduction of about 13% in word error rate (from 70.2% to 57.6%) is achieved by using acoustic features derived from stacked Multi-Layer Perceptrons (MLP) and Deep Neural Network (DNN) acoustic models. In addition to score normalization and score/system combination for keyword search, we show that the false alarm rate at the target false reject rate (15%) is reduced by about 1% absolute (from 5.39% to 4.45%) by reducing the deletion errors of the speech-to-text system.

Published
2014
Pages
959–963
Proceedings
Proceedings of Interspeech 2014
Conference
Interspeech Conference
ISBN
978-1-63439-435-2
Publisher
International Speech Communication Association
Place
Singapore
UT WoS
000395050100195
BibTeX
@inproceedings{BUT111666,
  author="Tim {Ng} and Roger {Hsiao} and Le {Zhang} and Damianos {Karakos} and Sri Harish {Mallidi} and Martin {Karafiát} and Karel {Veselý} and Igor {Szőke} and Bing {Zhang} and Long {Nguyen} and Richard {Schwartz}",
  title="Progress in the BBN Keyword Search System for the DARPA RATS Program",
  booktitle="Proceedings of Interspeech 2014",
  year="2014",
  pages="959--963",
  publisher="International Speech Communication Association",
  address="Singapore",
  isbn="978-1-63439-435-2",
  url="http://www.isca-speech.org/archive/interspeech_2014/i14_0959.html"
}
Projects
DARPA Robust Automatic Transcription of Speech (RATS) - RATS Patrol I, BBN, start: 2010-09-23, end: 2014-06-30, completed
Speech recognition for low-resource languages, GACR, Postdoctoral grants, GPP202/12/P604, start: 2012-01-01, end: 2014-12-31, completed