Simple view
Full metadata view
Authors
Statistics
Continual World : a robotic benchmark for continual reinforcement learning
Continual learning (CL) - the ability to continuously learn, building on previ ously acquired knowledge - is a natural requirement for long-lived autonomous reinforcement learning (RL) agents. While building such agents, one needs to balance opposing desiderata, such as constraints on capacity and compute, the ability to not catastrophically forget, and to exhibit positive transfer on new tasks. Understanding the right trade-off is conceptually and computationally challenging, which we argue has led the community to overly focus on catastrophic forgetting. In response to these issues, we advocate for the need to prioritize forward transfer and propose Continual World, a benchmark consisting of realistic and meaningfully diverse robotic tasks built on top of Meta-World [54] as a testbed. Following an in-depth empirical evaluation of existing CL methods, we pinpoint their limitations and highlight unique algorithmic challenges in the RL setting. Our benchmark aims to provide a meaningful and computationally inexpensive challenge for the community and thus help better understand the performance of existing and future solutions. Information about the benchmark, including the open-source code, is available at https://sites.google.com/view/continualworld.
dc.abstract.en | Continual learning (CL) - the ability to continuously learn, building on previ ously acquired knowledge - is a natural requirement for long-lived autonomous reinforcement learning (RL) agents. While building such agents, one needs to balance opposing desiderata, such as constraints on capacity and compute, the ability to not catastrophically forget, and to exhibit positive transfer on new tasks. Understanding the right trade-off is conceptually and computationally challenging, which we argue has led the community to overly focus on catastrophic forgetting. In response to these issues, we advocate for the need to prioritize forward transfer and propose Continual World, a benchmark consisting of realistic and meaningfully diverse robotic tasks built on top of Meta-World [54] as a testbed. Following an in-depth empirical evaluation of existing CL methods, we pinpoint their limitations and highlight unique algorithmic challenges in the RL setting. Our benchmark aims to provide a meaningful and computationally inexpensive challenge for the community and thus help better understand the performance of existing and future solutions. Information about the benchmark, including the open-source code, is available at https://sites.google.com/view/continualworld. | pl |
dc.affiliation | Wydział Matematyki i Informatyki | pl |
dc.affiliation | Szkoła Doktorska Nauk Ścisłych i Przyrodniczych | pl |
dc.conference | 35th Conference on Neural Information Processing Systems (NeurIPS 2021) | |
dc.conference.city | Virtual Event | |
dc.conference.country | Nieznany/błędny kraj | |
dc.conference.datefinish | 2021-12-14 | |
dc.conference.datestart | 2021-12-06 | |
dc.conference.series | Advances in Neural Information Processing Systems | |
dc.conference.seriesshortcut | NeurIPS | |
dc.conference.shortcut | NeurIPS 2021 | |
dc.contributor.author | Kucinski, Lukasz | pl |
dc.contributor.author | Miłoś, Piotr | pl |
dc.contributor.author | Pascanu, Razvan | pl |
dc.contributor.author | Wołczyk, Maciej - 247731 | pl |
dc.contributor.author | Zając, Michał - 220572 | pl |
dc.contributor.author | Pascanu, Razvan | |
dc.contributor.editor | Ranzato, M | pl |
dc.contributor.editor | Beygelzimer, A | pl |
dc.contributor.editor | Dauphin, Y | pl |
dc.contributor.editor | Liang, P.S. | pl |
dc.contributor.editor | Wortman Vaughan J. | pl |
dc.date.accession | 2022-01-25 | pl |
dc.date.accessioned | 2022-01-25T16:06:19Z | |
dc.date.available | 2022-01-25T16:06:19Z | |
dc.date.issued | 2022 | pl |
dc.description.conftype | international | pl |
dc.description.edition | Online First 2021-12-20 | pl |
dc.description.physical | 28496-28510 | |
dc.description.series | Advances in neural information processing systems | |
dc.description.seriesnumber | 34 | |
dc.identifier.bookweblink | https://search.worldcat.org/title/1422613707 | |
dc.identifier.isbn | 978-1-7138-4539-3 | |
dc.identifier.project | POIR.04.04.00-00-14DE/18-00 | pl |
dc.identifier.seriesissn | 1049-5258 | |
dc.identifier.uri | https://ruj.uj.edu.pl/xmlui/handle/item/287389 | |
dc.identifier.weblink | https://papers.nips.cc/paper/2021/file/ef8446f35513a8d6aa2308357a268a7e-Paper.pdf | pl |
dc.language | eng | pl |
dc.language.container | eng | pl |
dc.pubinfo | Red Hook : Curran Associates, Inc. | pl |
dc.rights | Dodaję tylko opis bibliograficzny | * |
dc.rights.licence | Bez licencji otwartego dostępu | |
dc.rights.uri | * | |
dc.subtype | ConferenceProceedings | pl |
dc.title | Continual World : a robotic benchmark for continual reinforcement learning | pl |
dc.title.container | Advances in Neural Information Processing Systems 34 | pl |
dc.title.volume | 35th Conference on Neural Information Processing Systems (NeurIPS 2021) | pl |
dc.type | BookSection | pl |
dspace.entity.type | Publication |