Thesis Details

Exploring Contextual Information in Neural Machine Translation

Master's Thesis Student: Jon Josef Academic Year: 2018/2019 Supervisor: Smrž Pavel, doc. RNDr., Ph.D.
Czech title
Exploring Contextual Information in Neural Machine Translation
Language
English
Abstract

This works explores means of utilizing extra-sentential context in neural machine translation (NMT). Traditionally, NMT systems translate one source sentence into one target sentence, without any notion of the surrounding text. This is clearly insufficient and different from how humans translate text. For many high-resource language pairs, translations produced by NMT may be under certain, strict conditions, nearly indistinguishable from human produced translations. One of these conditions is that evaluators score the sentences separately. When evaluating whole documents, even the best NMT systems still fall short of human translators. This motivates the research of employing document level context in NMT, since there might not be much more space left to improve translations on the sentence level, at least for high resource languages and domains. This work summarizes recent state-of-the art approaches to context utilization, implements several of them, evaluates them both in terms of general translation quality and on specific context related phenomena, and analyzes their advantages and shortcomings. A hand-made context phenomena test set for English to Czech translation was created for this task.

Keywords

NMT, neural machine translation, context, recurrent neural networks, transformer, document level translation, discourse, cohesion, coherence

Department
Degree Programme
Information Technology, Field of Study Bioinformatics and Biocomputing
Files
Status
defended, grade A
Date
20 June 2019
Reviewer
Committee
Sekanina Lukáš, prof. Ing., Ph.D. (DCSY FIT BUT), předseda
Grégr Matěj, Ing., Ph.D. (DIFS FIT BUT), člen
Holík Lukáš, doc. Mgr., Ph.D. (DITS FIT BUT), člen
Martínek Tomáš, doc. Ing., Ph.D. (DCSY FIT BUT), člen
Matoušek Radomil, doc. Ing., Ph.D. (IACS FME BUT), člen
Ryšavý Ondřej, doc. Ing., Ph.D. (DIFS FIT BUT), člen
Citation
JON, Josef. Exploring Contextual Information in Neural Machine Translation. Brno, 2019. Master's Thesis. Brno University of Technology, Faculty of Information Technology. 2019-06-20. Supervised by Smrž Pavel. Available from: https://www.fit.vut.cz/study/thesis/21979/
BibTeX
@mastersthesis{FITMT21979,
    author = "Josef Jon",
    type = "Master's thesis",
    title = "Exploring Contextual Information in Neural Machine Translation",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2019,
    location = "Brno, CZ",
    language = "english",
    url = "https://www.fit.vut.cz/study/thesis/21979/"
}
Back to top