Universal perturbation in the reinforcement learning

Stachura, Damian

Simple view

Full metadata view

Authors

Statistics

Universal perturbation in the reinforcement learning

master

Alternative title

Uniwersalne perturbacje w uczeniu ze wzmocnieniem

Author

Stachura Damian

Reviewer

Spurek Przemysław

Tabor Jacek

Advisor

Tabor Jacek

Date of defence

2020-09-14

Keywords in Polish

uczenie ze wzmocnieniem, przykłady adwersarialne, uniwersalne perturbacje, uczenie maszynowe, Atari 2600, metoda gradientu prostego, TensorFlow, OpenAI Gym

Keywords in English

reinforcement learning, adversarial examples, universal adversarial perturbations, machine learning, Atari 2600, gradient descent method, TensorFlow, OpenAI Gym

Language

English

Abstract in Polish

Głębokie sieci neuronowe, mimo tego że osiągają wyniki przekraczające ludzkie możliwości dla wielu zadań, są podatne na przykłady adwersarialne. Zjawisko to zostało zaprezentowane także dla uczenia ze wzmocnieniem. Demonstrujemy, że można skonstruować uniwersalny atak, tak że dodajemy tą samą perturbację do różnych ramek dla pewnej gry w celu oszukania strategii sieci neuronowej. Pokazujemy także, że nasze perturbacje przenoszą się, w pewnym stopniu, do gier na których perturbacja nie była trenowana. Do eksperymentów wykorzystaliśmy gry Atari 2600.

Abstract in English

Deep neural networks, despite their superhuman performance on many tasks, are susceptible to adversarial examples. This phenomenon has been demonstrated also in the context of reinforcement learning. We demonstrate that one can construct a universal attack that, for a given game, adds the same perturbation for different frames to fool the neural network policy. Also, we show that such perturbation transfers, to some extent, to different games that it was not trained on. We perform our experiments on Atari 2600 games.

dc.abstract.en	Deep neural networks, despite their superhuman performance on many tasks, are susceptible to adversarial examples. This phenomenon has been demonstrated also in the context of reinforcement learning. We demonstrate that one can construct a universal attack that, for a given game, adds the same perturbation for different frames to fool the neural network policy. Also, we show that such perturbation transfers, to some extent, to different games that it was not trained on. We perform our experiments on Atari 2600 games.	pl
dc.abstract.pl	Głębokie sieci neuronowe, mimo tego że osiągają wyniki przekraczające ludzkie możliwości dla wielu zadań, są podatne na przykłady adwersarialne. Zjawisko to zostało zaprezentowane także dla uczenia ze wzmocnieniem. Demonstrujemy, że można skonstruować uniwersalny atak, tak że dodajemy tą samą perturbację do różnych ramek dla pewnej gry w celu oszukania strategii sieci neuronowej. Pokazujemy także, że nasze perturbacje przenoszą się, w pewnym stopniu, do gier na których perturbacja nie była trenowana. Do eksperymentów wykorzystaliśmy gry Atari 2600.	pl
dc.affiliation	Wydział Matematyki i Informatyki	pl
dc.area	obszar nauk ścisłych	pl
dc.contributor.advisor	Tabor, Jacek - 132362	pl
dc.contributor.author	Stachura, Damian	pl
dc.contributor.departmentbycode	UJK/WMI2	pl
dc.contributor.reviewer	Spurek, Przemysław	pl
dc.contributor.reviewer	Tabor, Jacek - 132362	pl
dc.date.accessioned	2020-10-20T19:21:39Z
dc.date.available	2020-10-20T19:21:39Z
dc.date.submitted	2020-09-14	pl
dc.fieldofstudy	nauczanie maszynowe	pl
dc.identifier.apd	diploma-141438-209204	pl
dc.identifier.project	APD / O	pl
dc.identifier.uri	https://ruj.uj.edu.pl/xmlui/handle/item/249160
dc.language	eng	pl
dc.subject.en	reinforcement learning, adversarial examples, universal adversarial perturbations, machine learning, Atari 2600, gradient descent method, TensorFlow, OpenAI Gym	pl
dc.subject.pl	uczenie ze wzmocnieniem, przykłady adwersarialne, uniwersalne perturbacje, uczenie maszynowe, Atari 2600, metoda gradientu prostego, TensorFlow, OpenAI Gym	pl
dc.title	Universal perturbation in the reinforcement learning	pl
dc.title.alternative	Uniwersalne perturbacje w uczeniu ze wzmocnieniem	pl
dc.type	master	pl
dspace.entity.type	Publication

dc.abstract.enpl

Deep neural networks, despite their superhuman performance on many tasks, are susceptible to adversarial examples. This phenomenon has been demonstrated also in the context of reinforcement learning. We demonstrate that one can construct a universal attack that, for a given game, adds the same perturbation for different frames to fool the neural network policy. Also, we show that such perturbation transfers, to some extent, to different games that it was not trained on. We perform our experiments on Atari 2600 games.

dc.abstract.plpl

Głębokie sieci neuronowe, mimo tego że osiągają wyniki przekraczające ludzkie możliwości dla wielu zadań, są podatne na przykłady adwersarialne. Zjawisko to zostało zaprezentowane także dla uczenia ze wzmocnieniem. Demonstrujemy, że można skonstruować uniwersalny atak, tak że dodajemy tą samą perturbację do różnych ramek dla pewnej gry w celu oszukania strategii sieci neuronowej. Pokazujemy także, że nasze perturbacje przenoszą się, w pewnym stopniu, do gier na których perturbacja nie była trenowana. Do eksperymentów wykorzystaliśmy gry Atari 2600.

dc.affiliationpl

Wydział Matematyki i Informatyki

dc.areapl

obszar nauk ścisłych

dc.contributor.advisorpl

Tabor, Jacek - 132362

dc.contributor.authorpl

Stachura, Damian

dc.contributor.departmentbycodepl

UJK/WMI2

dc.contributor.reviewerpl

Spurek, Przemysław

dc.contributor.reviewerpl

Tabor, Jacek - 132362

dc.date.accessioned

2020-10-20T19:21:39Z

dc.date.available

2020-10-20T19:21:39Z

dc.date.submittedpl

2020-09-14

dc.fieldofstudypl

nauczanie maszynowe

dc.identifier.apdpl

diploma-141438-209204

dc.identifier.projectpl

APD / O

dc.identifier.uri

https://ruj.uj.edu.pl/xmlui/handle/item/249160

dc.languagepl

eng

dc.subject.enpl

reinforcement learning, adversarial examples, universal adversarial perturbations, machine learning, Atari 2600, gradient descent method, TensorFlow, OpenAI Gym

dc.subject.plpl

uczenie ze wzmocnieniem, przykłady adwersarialne, uniwersalne perturbacje, uczenie maszynowe, Atari 2600, metoda gradientu prostego, TensorFlow, OpenAI Gym

dc.titlepl

Universal perturbation in the reinforcement learning

dc.title.alternativepl

Uniwersalne perturbacje w uczeniu ze wzmocnieniem

dc.typepl

master

dspace.entity.type

Publication

Affiliations

No affiliation

Stachura, Damian

Spurek, Przemysław

Tabor, Jacek

* The migration of download and view statistics prior to the date of April 8, 2024 is in progress.

Views

24 Views per month

Views per city

Nowy Sącz

4

Krakow

3

Trzebnica

3

Wroclaw

2

Dublin

1

Katowice

1

Nowy Korczyn

1

Warsaw

1

No access

Collections

Masters theses

ROD UJ