Thesis Details

Stylized Natural Language Generation in Dialogue Systems

Bachelor's Thesis
Student
Bolshakova Ksenia
Academic Year
2019/2020
Supervisor
Fajčík Martin, Ing.
Czech title
Generování stylizovaného lidského jazyka v dialogových systémech
Language
English
Abstract

This thesis explores approaches to generating natural language in various styles. It also examines the model's ability to control how strongly the style is manifested in a generated sequence. The natural language generation model was implemented for several aspects of style, namely poetry, humor, sentiment, and specificity. Beam search and nucleus sampling were used as decoding strategies for the language models. The proposed experiments are based on weighted decoding: the probability mass function computed by the language model that generates the response is modified in one of two ways. The first approach uses manually crafted features, such as NIDF. The second uses neural probabilistic language models trained on stylistic datasets. The model architecture is presented in two versions. The first is an LSTM-based baseline, and the second uses the state-of-the-art pre-trained models BART and GPT-2 for text generation. The experiments revealed that even current state-of-the-art models handle the trade-off between style and context poorly. In other words, the more strongly the style is manifested in the generated sequence, the less the sequence relates to the topic discussed in the dialogue.
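As a rough illustration of the decoding ideas named in the abstract, the following minimal Python sketch applies one step of weighted decoding with an NIDF-style feature and then filters the result with nucleus sampling. It is not the thesis implementation. The function names, the toy vocabulary, the document frequencies, the weight, and the NIDF formula used here (inverse document frequency min-max scaled to [0, 1]) are all assumptions made for this example.

```python
import math
import random


def softmax(scores):
    """Turn a token -> score mapping into a probability distribution."""
    top = max(scores.values())
    exps = {tok: math.exp(s - top) for tok, s in scores.items()}
    total = sum(exps.values())
    return {tok: e / total for tok, e in exps.items()}


def nidf(doc_freq, num_docs):
    """Assumed NIDF definition: IDF min-max scaled to [0, 1], rare words near 1."""
    idf = {tok: math.log(num_docs / df) for tok, df in doc_freq.items()}
    lo, hi = min(idf.values()), max(idf.values())
    return {tok: (v - lo) / (hi - lo) for tok, v in idf.items()}


def weighted_decode_step(lm_probs, feature, weight):
    """Add weight * feature(token) to each log-probability, then renormalize."""
    scores = {tok: math.log(p) + weight * feature.get(tok, 0.0)
              for tok, p in lm_probs.items()}
    return softmax(scores)


def nucleus_filter(probs, top_p):
    """Keep the smallest set of top tokens whose mass reaches top_p, renormalized."""
    kept, mass = {}, 0.0
    for tok, p in sorted(probs.items(), key=lambda kv: kv[1], reverse=True):
        kept[tok] = p
        mass += p
        if mass >= top_p:
            break
    return {tok: p / mass for tok, p in kept.items()}


# Toy next-token distribution from a hypothetical language model.
lm_probs = {"the": 0.5, "good": 0.3, "sublime": 0.15, "ineffable": 0.05}
# Made-up document frequencies, for illustration only.
doc_freq = {"the": 1000, "good": 200, "sublime": 10, "ineffable": 2}

specificity = nidf(doc_freq, num_docs=1000)
styled = weighted_decode_step(lm_probs, specificity, weight=4.0)
nucleus = nucleus_filter(styled, top_p=0.9)

# Sample the next token from the filtered, renormalized distribution.
tokens, weights = zip(*nucleus.items())
next_token = random.Random(0).choices(tokens, weights=weights)[0]
```

In this sketch the weight plays the role of the style-strength control: a weight of 0 leaves the language model's distribution unchanged, while larger weights shift probability toward rarer tokens. With these toy numbers, the frequent word "the" drops out of the nucleus entirely, which hints at the style-versus-context trade-off the abstract describes. For the second approach, the feature could instead be the log-probability assigned by a separate style language model, which is again an assumption about one possible formulation.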

Keywords

Artificial neural network, Natural language generation, Dialogue systems, seq2seq, BART, GPT-2, LSTM

Department
Degree Programme
Information Technology
Status
defended, grade A
Date
10 July 2020
Reviewer
Committee
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT), chair
Češka Milan, doc. RNDr., Ph.D. (DITS FIT BUT), member
Jaroš Jiří, doc. Ing., Ph.D. (DCSY FIT BUT), member
Orság Filip, Ing., Ph.D. (DITS FIT BUT), member
Rychlý Marek, RNDr., Ph.D. (DIFS FIT BUT), member
Citation
BOLSHAKOVA, Ksenia. Stylized Natural Language Generation in Dialogue Systems. Brno, 2020. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2020-07-10. Supervised by Fajčík Martin. Available from: https://www.fit.vut.cz/study/thesis/22587/
BibTeX
@bachelorsthesis{FITBT22587,
    author = "Ksenia Bolshakova",
    type = "Bachelor's thesis",
    title = "Stylized Natural Language Generation in Dialogue Systems",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2020,
    location = "Brno, CZ",
    language = "english",
    url = "https://www.fit.vut.cz/study/thesis/22587/"
}