Nauczanie przez wzmacnianie

Maresca, Rocco

licenciate

Alternative title

Reinforcement learning

Author

Maresca Rocco

Reviewer

Kapela Tomasz

Spurek Przemysław

Advisor

Kapela Tomasz

Date of defence

2018-07-05

Keywords in Polish

nauczanie maszynowe, wzmocnienie, nauczanie inkrementacyjne, problemy sterowania

Keywords in English

machine learning, reinforcement, incremental learning, control problems

Language

Polish

Abstract in Polish

Celem pracy jest przedstawienie algorytmów nauczania przez wzmacnianie. Omówione zostają metody rozwiązujące problemy z jednym stanem. Po wprowadzeniu pojęcia decyzyjnych procesów Markowa, zostaje wyprowadzone i omówione równanie Bellmana. Pod koniec pracy przedstawione są dwa algorytmy nauczania przez wzmacnianie - Sarsa oraz Q-Learning, oraz przy użyciu drugiego z nich, rozwiązany jest problem sterowania.

Abstract in English

The purpose of this thesis is to present the basics of reinforcement learning algorithms. First, methods for solving single-state problems are presented. After the concept of a Markov decision process is introduced, the Bellman equation is derived and discussed. Finally, the Sarsa and Q-Learning algorithms are introduced and compared, and the latter is used to solve a control problem.

dc.abstract.en	The purpose of this thesis is to present the basics of reinforcement learning algorithms. First, methods for solving single-state problems are presented. After the concept of a Markov decision process is introduced, the Bellman equation is derived and discussed. Finally, the Sarsa and Q-Learning algorithms are introduced and compared, and the latter is used to solve a control problem.	pl
dc.abstract.pl	Celem pracy jest przedstawienie algorytmów nauczania przez wzmacnianie. Omówione zostają metody rozwiązujące problemy z jednym stanem. Po wprowadzeniu pojęcia decyzyjnych procesów Markowa, zostaje wyprowadzone i omówione równanie Bellmana. Pod koniec pracy przedstawione są dwa algorytmy nauczania przez wzmacnianie - Sarsa oraz Q-Learning, oraz przy użyciu drugiego z nich, rozwiązany jest problem sterowania.	pl
dc.affiliation	Wydział Matematyki i Informatyki	pl
dc.area	obszar nauk ścisłych	pl
dc.contributor.advisor	Kapela, Tomasz - 128624	pl
dc.contributor.author	Maresca, Rocco	pl
dc.contributor.departmentbycode	UJK/WMI2	pl
dc.contributor.reviewer	Kapela, Tomasz - 128624	pl
dc.contributor.reviewer	Spurek, Przemysław	pl
dc.date.accessioned	2020-07-27T15:27:46Z
dc.date.available	2020-07-27T15:27:46Z
dc.date.submitted	2018-07-05	pl
dc.fieldofstudy	matematyka komputerowa	pl
dc.identifier.apd	diploma-122863-210808	pl
dc.identifier.project	APD / O	pl
dc.identifier.uri	https://ruj.uj.edu.pl/xmlui/handle/item/227273
dc.language	pol	pl
dc.subject.en	machine learning, reinforcement, incremental learning, control problems	pl
dc.subject.pl	nauczanie maszynowe, wzmocnienie, nauczanie inkrementacyjne, problemy sterowania	pl
dc.title	Nauczanie przez wzmacnianie	pl
dc.title.alternative	Reinforcement learning	pl
dc.type	licenciate	pl
dspace.entity.type	Publication

No access

Collections

Bachelor's theses

ROD UJ