Universal perturbation in the reinforcement learning

Stachura, Damian

Simple view

Full metadata view

Authors

Statistics

Universal perturbation in the reinforcement learning

master

Alternative title

Uniwersalne perturbacje w uczeniu ze wzmocnieniem

Author

Stachura Damian

Reviewer

Spurek Przemysław

Tabor Jacek

Advisor

Tabor Jacek

Date of defence

2020-09-14

Keywords in Polish

uczenie ze wzmocnieniem, przykłady adwersarialne, uniwersalne perturbacje, uczenie maszynowe, Atari 2600, metoda gradientu prostego, TensorFlow, OpenAI Gym

Keywords in English

reinforcement learning, adversarial examples, universal adversarial perturbations, machine learning, Atari 2600, gradient descent method, TensorFlow, OpenAI Gym

Language

English

Abstract in Polish

Głębokie sieci neuronowe, mimo tego że osiągają wyniki przekraczające ludzkie możliwości dla wielu zadań, są podatne na przykłady adwersarialne. Zjawisko to zostało zaprezentowane także dla uczenia ze wzmocnieniem. Demonstrujemy, że można skonstruować uniwersalny atak, tak że dodajemy tą samą perturbację do różnych ramek dla pewnej gry w celu oszukania strategii sieci neuronowej. Pokazujemy także, że nasze perturbacje przenoszą się, w pewnym stopniu, do gier na których perturbacja nie była trenowana. Do eksperymentów wykorzystaliśmy gry Atari 2600.

Abstract in English

Deep neural networks, despite their superhuman performance on many tasks, are susceptible to adversarial examples. This phenomenon has been demonstrated also in the context of reinforcement learning. We demonstrate that one can construct a universal attack that, for a given game, adds the same perturbation for different frames to fool the neural network policy. Also, we show that such perturbation transfers, to some extent, to different games that it was not trained on. We perform our experiments on Atari 2600 games.

dc.abstract.en	Deep neural networks, despite their superhuman performance on many tasks, are susceptible to adversarial examples. This phenomenon has been demonstrated also in the context of reinforcement learning. We demonstrate that one can construct a universal attack that, for a given game, adds the same perturbation for different frames to fool the neural network policy. Also, we show that such perturbation transfers, to some extent, to different games that it was not trained on. We perform our experiments on Atari 2600 games.	pl
dc.abstract.pl	Głębokie sieci neuronowe, mimo tego że osiągają wyniki przekraczające ludzkie możliwości dla wielu zadań, są podatne na przykłady adwersarialne. Zjawisko to zostało zaprezentowane także dla uczenia ze wzmocnieniem. Demonstrujemy, że można skonstruować uniwersalny atak, tak że dodajemy tą samą perturbację do różnych ramek dla pewnej gry w celu oszukania strategii sieci neuronowej. Pokazujemy także, że nasze perturbacje przenoszą się, w pewnym stopniu, do gier na których perturbacja nie była trenowana. Do eksperymentów wykorzystaliśmy gry Atari 2600.	pl
dc.affiliation	Wydział Matematyki i Informatyki	pl
dc.area	obszar nauk ścisłych	pl
dc.contributor.advisor	Tabor, Jacek - 132362	pl
dc.contributor.author	Stachura, Damian	pl
dc.contributor.departmentbycode	UJK/WMI2	pl
dc.contributor.reviewer	Spurek, Przemysław	pl
dc.contributor.reviewer	Tabor, Jacek - 132362	pl
dc.date.accessioned	2020-10-20T19:21:39Z
dc.date.available	2020-10-20T19:21:39Z
dc.date.submitted	2020-09-14	pl
dc.fieldofstudy	nauczanie maszynowe	pl
dc.identifier.apd	diploma-141438-209204	pl
dc.identifier.project	APD / O	pl
dc.identifier.uri	https://ruj.uj.edu.pl/xmlui/handle/item/249160
dc.language	eng	pl
dc.subject.en	reinforcement learning, adversarial examples, universal adversarial perturbations, machine learning, Atari 2600, gradient descent method, TensorFlow, OpenAI Gym	pl
dc.subject.pl	uczenie ze wzmocnieniem, przykłady adwersarialne, uniwersalne perturbacje, uczenie maszynowe, Atari 2600, metoda gradientu prostego, TensorFlow, OpenAI Gym	pl
dc.title	Universal perturbation in the reinforcement learning	pl
dc.title.alternative	Uniwersalne perturbacje w uczeniu ze wzmocnieniem	pl
dc.type	master	pl
dspace.entity.type	Publication

Affiliations

No affiliation

Stachura, Damian

* The migration of download and view statistics prior to the date of April 8, 2024 is in progress.

Views

0 Views per month

No access

Collections

Masters theses

ROD UJ