Thesis Details

Posilované učení pro pohyb robota

Bachelor's Thesis Student: Hás David Academic Year: 2021/2022 Supervisor: Hradiš Michal, Ing., Ph.D.
English title
Reinforcement Learning for Mobile Robots
Language
Czech
Abstract

This paper is concerned with reinforcement learning for robotic movement in simulated physical environment. These are difficult problems for reinforcement learning, where agents need to face several challenges. One of them is continuous action space, as agent usually interacts with the environment by applying force on joints of the robot. Another problem is that parts of the robot often affect each other in complex ways and are also affected by gravity, inertia and other physical effects. For these and more reasons simple reinforcement learning algorithms are not suitable for these tasks. One of recent solutions is the Soft Actor-Critic algorithm (SAC), which emerged at the same time as similarly performing TD3, and both outperforming the older DDPG. SAC agents are rewarded for behaving more randomly, thus their goal is to maximize entropy along with maximizing the reward. This paper describes usage of this algorithm for teaching agents robotic movement. It describes implementation of the algorithms using the PyTorch machine learning framework and evaluates it on environments from OpenAI Gym platform using the PyBullet physics engine. Lastly, the algorithm is applied on custom environment with robot Atlas.

Keywords

machine learning, neural networks, reinforcement learning, deep learning, soft actor-critic, SAC, OpenAI Gym

Department
Degree Programme
Information Technology
Files
Status
defended, grade D
Date
14 June 2022
Reviewer
Committee
Čadík Martin, doc. Ing., Ph.D. (DCGM FIT BUT), předseda
Bařina David, Ing., Ph.D. (DCGM FIT BUT), člen
Burget Radek, doc. Ing., Ph.D. (DIFS FIT BUT), člen
Češka Milan, doc. RNDr., Ph.D. (DITS FIT BUT), člen
Mrázek Vojtěch, Ing., Ph.D. (DCSY FIT BUT), člen
Citation
HÁS, David. Posilované učení pro pohyb robota. Brno, 2022. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2022-06-14. Supervised by Hradiš Michal. Available from: https://www.fit.vut.cz/study/thesis/23906/
BibTeX
@bachelorsthesis{FITBT23906,
    author = "David H\'{a}s",
    type = "Bachelor's thesis",
    title = "Posilovan\'{e} u\v{c}en\'{i} pro pohyb robota",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2022,
    location = "Brno, CZ",
    language = "czech",
    url = "https://www.fit.vut.cz/study/thesis/23906/"
}
Back to top