Deep reinforcement learning in an OpenAI Gym  environment

Dadał, Sebastian

Simple view

Full metadata view

Authors

Statistics

Deep reinforcement learning in an OpenAI Gym environment

master

Alternative title

Głębokie uczenie ze wzmocnieniem w środowisku OpenAI Gym

Author

Dadał Sebastian

Reviewer

Białas Piotr

Kutt Krzysztof

Advisor

Kutt Krzysztof

Date of defence

2023-10-18

Keywords in Polish

uczenie maszynowe, uczenie ze wzmocnieniem, głębokie uczenie ze wzmocnieniem, a2c, dqn, ppo, trpo, openai gym, pygame, sieci neuronowe, gry wideo

Keywords in English

machine learning, reinforcement learning, deep reinforcement learning, ml, rl, drl, a2c, dqn, ppo, trpo, openai gym, pygame, neural networks, deep q-networks, advantage actor-critic, proximal policy optimization, trust region policy optimization, video games

Language

English

Abstract in Polish

Celem pracy było stworzenie nowego środowiska OpenAI Gym w formie gry arcade stworzonej przy użyciu biblioteki Pygame. Nowe środowisko zostało przetestowane przy pomocy zbioru nowoczesnych algorytmów głębokiego uczenia ze wzmocnieniem (A2C, DQN, PPO, TRPO) z użyciem referencyjnych implementacji z biblioteki Stable Baselines 3. Eksperymenty wykazały, że algorytmy były w stanie nauczyć się reguł gry, a w przypadku algorytmów PPO i TRPO osiągnąć efektywność lepszą od człowieka.

Abstract in English

The goal of the thesis was to create a new OpenAI Gym environment in the form of an arcade video game built in Pygame. The new environment has been tested with a set of modern deep reinforcement learning algorithms (A2C, DQN, PPO, TRPO) using the reference implementations from the Stable Baselines 3 library. Experiments have shown that algorithms were able to learn the dynamics of the game and, in case of PPO and TRPO algorithms, demonstrate super-human performance.

dc.abstract.en	The goal of the thesis was to create a new OpenAI Gym environment in the form of an arcade video game built in Pygame. The new environment has been tested with a set of modern deep reinforcement learning algorithms (A2C, DQN, PPO, TRPO) using the reference implementations from the Stable Baselines 3 library. Experiments have shown that algorithms were able to learn the dynamics of the game and, in case of PPO and TRPO algorithms, demonstrate super-human performance.	pl
dc.abstract.pl	Celem pracy było stworzenie nowego środowiska OpenAI Gym w formie gry arcade stworzonej przy użyciu biblioteki Pygame. Nowe środowisko zostało przetestowane przy pomocy zbioru nowoczesnych algorytmów głębokiego uczenia ze wzmocnieniem (A2C, DQN, PPO, TRPO) z użyciem referencyjnych implementacji z biblioteki Stable Baselines 3. Eksperymenty wykazały, że algorytmy były w stanie nauczyć się reguł gry, a w przypadku algorytmów PPO i TRPO osiągnąć efektywność lepszą od człowieka.	pl
dc.affiliation	Uniwersytet Jagielloński w Krakowie	pl
dc.contributor.advisor	Kutt, Krzysztof	pl
dc.contributor.author	Dadał, Sebastian	pl
dc.contributor.departmentbycode	UJK/UJK	pl
dc.contributor.reviewer	Białas, Piotr - 127296	pl
dc.contributor.reviewer	Kutt, Krzysztof	pl
dc.date.accessioned	2023-10-27T21:35:37Z
dc.date.available	2023-10-27T21:35:37Z
dc.date.submitted	2023-10-18	pl
dc.fieldofstudy	informatyka stosowana	pl
dc.identifier.apd	diploma-162701-162799	pl
dc.identifier.uri	https://ruj.uj.edu.pl/xmlui/handle/item/322416
dc.language	eng	pl
dc.subject.en	machine learning, reinforcement learning, deep reinforcement learning, ml, rl, drl, a2c, dqn, ppo, trpo, openai gym, pygame, neural networks, deep q-networks, advantage actor-critic, proximal policy optimization, trust region policy optimization, video games	pl
dc.subject.pl	uczenie maszynowe, uczenie ze wzmocnieniem, głębokie uczenie ze wzmocnieniem, a2c, dqn, ppo, trpo, openai gym, pygame, sieci neuronowe, gry wideo	pl
dc.title	Deep reinforcement learning in an OpenAI Gym environment	pl
dc.title.alternative	Głębokie uczenie ze wzmocnieniem w środowisku OpenAI Gym	pl
dc.type	master	pl
dspace.entity.type	Publication

dc.abstract.enpl

The goal of the thesis was to create a new OpenAI Gym environment in the form of an arcade video game built in Pygame. The new environment has been tested with a set of modern deep reinforcement learning algorithms (A2C, DQN, PPO, TRPO) using the reference implementations from the Stable Baselines 3 library. Experiments have shown that algorithms were able to learn the dynamics of the game and, in case of PPO and TRPO algorithms, demonstrate super-human performance.

dc.abstract.plpl

Celem pracy było stworzenie nowego środowiska OpenAI Gym w formie gry arcade stworzonej przy użyciu biblioteki Pygame. Nowe środowisko zostało przetestowane przy pomocy zbioru nowoczesnych algorytmów głębokiego uczenia ze wzmocnieniem (A2C, DQN, PPO, TRPO) z użyciem referencyjnych implementacji z biblioteki Stable Baselines 3. Eksperymenty wykazały, że algorytmy były w stanie nauczyć się reguł gry, a w przypadku algorytmów PPO i TRPO osiągnąć efektywność lepszą od człowieka.

dc.affiliationpl

Uniwersytet Jagielloński w Krakowie

dc.contributor.advisorpl

Kutt, Krzysztof

dc.contributor.authorpl

Dadał, Sebastian

dc.contributor.departmentbycodepl

UJK/UJK

dc.contributor.reviewerpl

Białas, Piotr - 127296

dc.contributor.reviewerpl

Kutt, Krzysztof

dc.date.accessioned

2023-10-27T21:35:37Z

dc.date.available

2023-10-27T21:35:37Z

dc.date.submittedpl

2023-10-18

dc.fieldofstudypl

informatyka stosowana

dc.identifier.apdpl

diploma-162701-162799

dc.identifier.uri

https://ruj.uj.edu.pl/xmlui/handle/item/322416

dc.languagepl

eng

dc.subject.enpl

machine learning, reinforcement learning, deep reinforcement learning, ml, rl, drl, a2c, dqn, ppo, trpo, openai gym, pygame, neural networks, deep q-networks, advantage actor-critic, proximal policy optimization, trust region policy optimization, video games

dc.subject.plpl

uczenie maszynowe, uczenie ze wzmocnieniem, głębokie uczenie ze wzmocnieniem, a2c, dqn, ppo, trpo, openai gym, pygame, sieci neuronowe, gry wideo

dc.titlepl

Deep reinforcement learning in an OpenAI Gym environment

dc.title.alternativepl

Głębokie uczenie ze wzmocnieniem w środowisku OpenAI Gym

dc.typepl

master

dspace.entity.type

Publication

Affiliations

No affiliation

Dadał, Sebastian

Białas, Piotr

Kutt, Krzysztof

* The migration of download and view statistics prior to the date of April 8, 2024 is in progress.

Views

103 Views per month

Views per city

Warsaw

20

Bielefeld

7

Johannesburg

3

Karachi

3

Semenyih

3

Istanbul

2

Kuala Lumpur

2

Mannheim

2

New York

2

Osaka

2

No access

Collections

Masters theses