Exploring Contextual Information in Neural Machine Translation

Czech title

Language

English

Abstract

This works explores means of utilizing extra-sentential context in neural machine translation (NMT). Traditionally, NMT systems translate one source sentence into one target sentence, without any notion of the surrounding text. This is clearly insufficient and different from how humans translate text. For many high-resource language pairs, translations produced by NMT may be under certain, strict conditions, nearly indistinguishable from human produced translations. One of these conditions is that evaluators score the sentences separately. When evaluating whole documents, even the best NMT systems still fall short of human translators. This motivates the research of employing document level context in NMT, since there might not be much more space left to improve translations on the sentence level, at least for high resource languages and domains. This work summarizes recent state-of-the art approaches to context utilization, implements several of them, evaluates them both in terms of general translation quality and on specific context related phenomena, and analyzes their advantages and shortcomings. A hand-made context phenomena test set for English to Czech translation was created for this task.

Keywords

NMT, neural machine translation, context, recurrent neural networks, transformer, document level translation, discourse, cohesion, coherence

Department

Department of Computer Graphics and Multimedia FIT BUT

Degree Programme

Information Technology, Field of Study Bioinformatics and Biocomputing

Files

Status

defended, grade A

Date

20 June 2019

Reviewer

Fajčík Martin, Ing.

Committee

Sekanina Lukáš, prof. Ing., Ph.D. (DCSY FIT BUT), předseda
Grégr Matěj, Ing., Ph.D. (DIFS FIT BUT), člen
Holík Lukáš, doc. Mgr., Ph.D. (DITS FIT BUT), člen
Martínek Tomáš, doc. Ing., Ph.D. (DCSY FIT BUT), člen
Matoušek Radomil, doc. Ing., Ph.D. (IACS FME BUT), člen
Ryšavý Ondřej, doc. Ing., Ph.D. (DIFS FIT BUT), člen

Citation

JON, Josef. Exploring Contextual Information in Neural Machine Translation. Brno, 2019. Master's Thesis. Brno University of Technology, Faculty of Information Technology. 2019-06-20. Supervised by Smrž Pavel. Available from: https://www.fit.vut.cz/study/thesis/21979/

BibTeX

@mastersthesis{FITMT21979,
    author = "Josef Jon",
    type = "Master's thesis",
    title = "Exploring Contextual Information in Neural Machine Translation",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2019,
    location = "Brno, CZ",
    language = "english",
    url = "https://www.fit.vut.cz/study/thesis/21979/"
}