Detail výsledku

Unsupervised Language Model Adaptation for Speech Recognition with no Extra Resources

BENEŠ, K.; IRIE, K.; BECK, E.; SCHLÜTER, R.; NEY, H. Unsupervised Language Model Adaptation for Speech Recognition with no Extra Resources. Proceedings of DAGA 2019. Rostock: DEGA Head office, Deutsche Gesellschaft für Akustik, 2019. p. 954-957. ISBN: 978-3-939296-14-0.
Typ
článek ve sborníku konference
Jazyk
anglicky
Autoři
Beneš Karel, Ing., Ph.D., UPGM (FIT)
IRIE, K.
BECK, E.
SCHLÜTER, R.
NEY, H.
Abstrakt

Classically, automatic speech recognition (ASR) modelsare decomposed into acoustic models and language models(LM). LMs usually exploit the linguistic structure ona purely textual level and usually contribute strongly toan ASR systems performance. LMs are estimated onlarge amounts of textual data covering the target domain.However, most utterances cover more specic topics, e.g.inuencing the vocabulary used. Therefore, it's desirableto have the LM adjusted to an utterance's topic. Previouswork achieves this by crawling extra data from theweb or by using signicant amounts of previous speechdata to train topic-specic LM on. We propose a wayof adapting the LM directly using the target utteranceto be recognized. The corresponding adaptation needsto be done in an unsupervised or automatically supervisedway based on the speech input. To deal withcorresponding errors robustly, we employ topic encodingsfrom the recently proposed Subspace MultinomialModel. This model also avoids any need of explicit topiclabelling during training or recognition, making the proposedmethod straight-forward to use. We demonstratethe performance of the method on the Librispeech corpus,which consists of read ction books, and we discussit's behaviour qualitatively.

Klíčová slova

speech recognition

URL
Rok
2019
Strany
954–957
Sborník
Proceedings of DAGA 2019
Konference
DAGA 2019 - 45. Jahrestagung für Akustik, 18. - 21. März 2019
ISBN
978-3-939296-14-0
Vydavatel
DEGA Head office, Deutsche Gesellschaft für Akustik
Místo
Rostock
BibTeX
@inproceedings{BUT160005,
  author="BENEŠ, K. and IRIE, K. and BECK, E. and SCHLÜTER, R. and NEY, H.",
  title="Unsupervised Language Model Adaptation for Speech Recognition with no Extra Resources",
  booktitle="Proceedings of DAGA 2019",
  year="2019",
  pages="954--957",
  publisher="DEGA Head office, Deutsche Gesellschaft für Akustik",
  address="Rostock",
  isbn="978-3-939296-14-0",
  url="https://www.dega-akustik.de/publikationen/online-proceedings/"
}
Soubory
Projekty
Mezinárodní mobilita výzkumníků Vysokého učení technického v Brně, EU, OPVVV PO2 Mezinárodní mobilita výzkumných pracovníků, EF16_027/0008371, CZ.02.2.69/0.0/0.0/16_027/0008371, zahájení: 2018-01-01, ukončení: 2022-09-30, řešení
Výzkumné skupiny
Pracoviště
Nahoru