Result Details

Recovery of Rare Words in Lecture Speech

KOMBRINK, S.; HANNEMANN, M.; BURGET, L.; HEŘMANSKÝ, H. Recovery of Rare Words in Lecture Speech. Proc. Text, Speech and Dialogue 2010. Lecture Notes in Computer Science. Brno: Springer Verlag, 2010. no. 9, p. 330-337. ISBN: 978-3-642-15759-2. ISSN: 0302-9743.

Type

conference paper

Language

English

Authors

Kombrink Stefan, Dipl.-Linguist., DCGM (FIT)
Hannemann Mirko, Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Heřmanský Hynek, prof. Ing., Dr. Eng., DCGM (FIT)

Abstract

This paper is on recovery of rare words in lecture speech. We use a hybrid word/sub-word recognizer to detect OOV words occurring in English talks and describe them as sequences of sub-words.

Keywords

speech, rare words, recognizer, detect OOV words, sub-words, lectures

URL

https://www.fit.vut.cz/research/group/speech/public/publi/2010/kombrink_TSD…

Annotation

The vocabulary used in speech usually consists of two types of words: a limited set of common words, shared across multiple documents, and a virtually unlimited set of rare words, each of which might appear a few times only in particular documents. In most documents, however, these rare words are not seen at all. The first type of words is typically included in the language model of an automatic speech recognizer (ASR) and is thus widely referred to as invocabulary (IV). Words of the second type are missing in the language model and thus are called out-of-vocabulary (OOV). However, these words usually carry important information. We use a hybrid word/sub-word recognizer to detect OOV words occurring in English talks and describe them as sequences of sub-words.We detected about one third of all OOV words, and were able to recover the correct spelling for 26.2% of all detections by using a phoneme-to-grapheme (P2G) conversion trained on the recognition dictionary. By omitting detections corresponding to recovered IV words, we were able to increase the precision of the OOV detection substantially

Published

2010

Pages

330–337

Journal

Lecture Notes in Computer Science, vol. 2010, no. 9, ISSN 0302-9743

Proceedings

Proc. Text, Speech and Dialogue 2010

Conference

13th International Conference on Text, Speech and Dialogue, TSD 2010

ISBN

978-3-642-15759-2

Publisher

Springer Verlag

Place

Brno

BibTeX

@inproceedings{BUT34927,
  author="Stefan {Kombrink} and Mirko {Hannemann} and Lukáš {Burget} and Hynek {Heřmanský}",
  title="Recovery of Rare Words in Lecture Speech",
  booktitle="Proc. Text, Speech and Dialogue 2010",
  year="2010",
  journal="Lecture Notes in Computer Science",
  volume="2010",
  number="9",
  pages="330--337",
  publisher="Springer Verlag",
  address="Brno",
  isbn="978-3-642-15759-2",
  issn="0302-9743",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2010/kombrink_TSD_2010_330.pdf"
}

Projects

DIRAC - Detection and Identification of Rare Audio-visual Cues, MŠMT, Šestý rámcový program Evropského společenství pro výzkum, technický rozvoj a demonstrační činnosti, 027787, start: 2006-01-01, end: 2010-12-31, completed
Recognition and presentation of multimedia data, BUT, Vnitřní projekty VUT, FIT-S-10-2, 2010, start: 2010-04-01, end: 2010-12-31, completed
Security-Oriented Research in Information Technology, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running
Speech Recognition under Real-World Conditions, GACR, Standardní projekty, GA102/08/0707, start: 2008-01-01, end: 2011-12-31, completed

Research groups

Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)

Departments

Department of Computer Graphics and Multimedia (DCGM)