Thesis Details

Posilované učení pro hru typu Bomberman

Bachelor's Thesis Student: Adamčiak Jakub Academic Year: 2021/2022 Supervisor: Hradiš Michal, Ing., Ph.D.
English title
Reinforcement Learning for Bomberman Type Game

This bachelor's thesis aims to develop, implement and train reinforcement learning models for a Bomberman-type game. It is based on Bomberland environment from CoderOne. This environment was created for education and research in the field of artificial intelligence. In this thesis I tackle the settings and problems of implementing agent into the environment. I used 2 policies (MLP and CNN), 2 algorithms (PPO and A2C) and 5 setups of neural networks for feature extraction with the use of libraries stable baselines 3 and pytorch. Total training time resulted in 1207 real-world hours, 4168 computing hours and 271 milions of time steps. Although the training was not successful, this thesis shows the process of implementing a reinforcement learning model into a Gym environment.


artificial inteligence, AI, machine learning, ML, reinforcement learning, RL, convolutional neural networks, CNN, PPO, A2C, python, stable baselines3, ai-gym, pytorch, games, bomberman

Degree Programme
defended, grade C
16 June 2022
Smrž Pavel, doc. RNDr., Ph.D. (DCGM FIT BUT), předseda
Burgetová Ivana, Ing., Ph.D. (DIFS FIT BUT), člen
Kreslíková Jitka, doc. RNDr., CSc. (DIFS FIT BUT), člen
Smrčka Aleš, Ing., Ph.D. (DITS FIT BUT), člen
Strnadel Josef, Ing., Ph.D. (DCSY FIT BUT), člen
ADAMČIAK, Jakub. Posilované učení pro hru typu Bomberman. Brno, 2022. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2022-06-16. Supervised by Hradiš Michal. Available from:
    author = "Jakub Adam\v{c}iak",
    type = "Bachelor's thesis",
    title = "Posilovan\'{e} u\v{c}en\'{i} pro hru typu Bomberman",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2022,
    location = "Brno, CZ",
    language = "czech",
    url = ""
Back to top