Thesis Details

Posilované učení pro 3D hry

Bachelor's Thesis Student: Beránek Michal Academic Year: 2018/2019 Supervisor: Hradiš Michal, Ing., Ph.D.
English title
Reinforcement Learning for 3D Games
Language
Czech
Abstract

Thesis deals with neural network learning on simple tasks in 3D shooter Doom, mediated by research platform ViZDoom. The main goal is to create an agent, which is able to learn multiple tasks simultaneously. Reinforcement learning algorithm used to achieve this goal is called Rainbow, which combines several improvements of DQN algorithm. I proposed and experimented with two different architectures of neural network for learning multiple tasks. One of them was successful and after a relatively short period of learning it reached almost 50% of maximum possible reward. The key element of this achievement is an Embedding layer for parametric description of task environment. The main discovery is, that Rainbow is able to learn in 3D environment and with the help of Embedding layer, it is able to learn on multiple tasks simultaneously.

Keywords

neural network, reinforcement learning, ViZDoom, transfer learning, Rainbow algorithm, PyTorch, Embedding layer

Department
Degree Programme
Information Technology
Files
Status
defended, grade D
Date
11 June 2019
Reviewer
Committee
Herout Adam, prof. Ing., Ph.D. (DCGM FIT BUT), předseda
Bidlo Michal, doc. Ing., Ph.D. (DCSY FIT BUT), člen
Čadík Martin, doc. Ing., Ph.D. (DCGM FIT BUT), člen
Křivka Zbyněk, Ing., Ph.D. (DIFS FIT BUT), člen
Rogalewicz Adam, doc. Mgr., Ph.D. (DITS FIT BUT), člen
Citation
BERÁNEK, Michal. Posilované učení pro 3D hry. Brno, 2019. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2019-06-11. Supervised by Hradiš Michal. Available from: https://www.fit.vut.cz/study/thesis/22181/
BibTeX
@bachelorsthesis{FITBT22181,
    author = "Michal Ber\'{a}nek",
    type = "Bachelor's thesis",
    title = "Posilovan\'{e} u\v{c}en\'{i} pro 3D hry",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2019,
    location = "Brno, CZ",
    language = "czech",
    url = "https://www.fit.vut.cz/study/thesis/22181/"
}
Back to top