Faculty of Information Technology, BUT

Publication Details

Recurrent neural network based language model

MIKOLOV Tomáš, KARAFIÁT Martin, BURGET Lukáš, ČERNOCKÝ Jan and KHUDANPUR Sanjeev. Recurrent neural network based language model. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, pp. 1045-1048. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Czech title
Jazykový model založený na rekurentních neuronových sítích
Type
conference paper
Language
English
Authors
Mikolov Tomáš, Ing. (DCGM FIT BUT)
Karafiát Martin, Ing., Ph.D. (DCGM FIT BUT)
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT)
Černocký Jan, doc. Dr. Ing. (DCGM FIT BUT)
Khudanpur Sanjeev (JHU)
Keywords
language modeling, recurrent neural networks, speech recognition
Abstract
This paper presents a recurrent neural network based language model (RNN LM) and its application to speech recognition.
Annotation
A new recurrent neural network based language model (RNN LM) with applications to speech recognition is presented. Results indicate that a mixture of several RNN LMs can reduce perplexity by around 50% compared to a state-of-the-art backoff language model. Speech recognition experiments show around an 18% reduction in word error rate on the Wall Street Journal task when comparing models trained on the same amount of data, and around 5% on the much harder NIST RT05 task, even when the backoff model is trained on much more data than the RNN LM. We provide ample empirical evidence to suggest that connectionist language models are superior to standard n-gram techniques, except for their high computational (training) complexity.
Published
2010
Pages
1045-1048
Journal
Proceedings of Interspeech, vol. 2010, no. 9, ISSN 1990-9772
Proceedings
Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010)
Conference
Interspeech 2010, Tokyo, JP
ISBN
978-1-61782-123-3
Publisher
International Speech Communication Association
Place
Makuhari, Chiba, JP
BibTeX
@INPROCEEDINGS{FITPUB9362,
   author = "Tom\'{a}\v{s} Mikolov and Martin Karafi\'{a}t and Luk\'{a}\v{s} Burget and Jan \v{C}ernock\'{y} and Sanjeev Khudanpur",
   title = "Recurrent neural network based language model",
   pages = "1045--1048",
   booktitle = "Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010)",
   journal = "Proceedings of Interspeech",
   volume = 2010,
   number = 9,
   year = 2010,
   location = "Makuhari, Chiba, JP",
   publisher = "International Speech Communication Association",
   ISBN = "978-1-61782-123-3",
   ISSN = "1990-9772",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/9362"
}