Modularization of End-to-End Learning: Case Study in Arcade Games

Melnik, Andrew; Fleer, Sascha; Schilling, Malte; Ritter, Helge

Melnik A, Fleer S, Schilling M, Ritter H (2019)
In: 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Workshop on Causal Learning.

Conference paper | English

Download

No files have been uploaded. Publication record only.

Authors

Melnik, Andrew (UniBi); Fleer, Sascha (UniBi); Schilling, Malte (UniBi); Ritter, Helge (UniBi)

Department

Center of Excellence - Cognitive Interaction Technology CITEC
Technische Fakultät > AG Neuroinformatik
Technische Fakultät > AG Visual AI for Extended Reality

Abstract / Notes

Complex environments and tasks pose a difficult problem for holistic end-to-end learning approaches. Decomposing an environment into interacting controllable and non-controllable objects allows supervised learning for the non-controllable objects and universal value function approximator learning for the controllable objects. Such a decomposition should lead to shorter learning time and better generalization capability. Here, we consider arcade-game environments as sets of interacting objects (controllable, non-controllable) and propose a set of functional modules that are specialized in mastering different types of interactions in a broad range of environments. The modules utilize regression, supervised learning, and reinforcement learning algorithms. Results of this case study in different Atari games suggest that human-level performance can be achieved by a learning agent within a human amount of game experience (10-15 minutes of game time) when a proper decomposition of an environment or a task is provided. However, automating such decomposition remains a challenging problem. This case study shows how a model of the causal structure underlying an environment or a task can benefit the learning time and generalization capability of the agent, and argues in favor of exploiting modular structure rather than using pure end-to-end learning approaches.
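
The decomposition described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the 1-D grid task, every function name, and all parameter values are assumptions made for illustration. A least-squares regression module predicts where a non-controllable object (a "ball") will be, and a tabular goal-conditioned Q-table, used here as a simple stand-in for a universal value function approximator, moves a controllable object (a "paddle") to that predicted position. To keep the sketch deterministic, the Q-learning-style updates are swept exhaustively over all (state, goal, action) triples instead of being sampled from play.

```python
"""Toy sketch of a modular agent (all names and the task itself are hypothetical)."""

ACTIONS = (0, -1, 1)  # stay, move left, move right (stay first so ties resolve to it)


def fit_linear_motion(times, positions):
    """Least-squares fit of position = intercept + velocity * time."""
    n = len(times)
    mean_t = sum(times) / n
    mean_x = sum(positions) / n
    var_t = sum((t - mean_t) ** 2 for t in times)
    cov = sum((t - mean_t) * (x - mean_x) for t, x in zip(times, positions))
    velocity = cov / var_t
    return mean_x - velocity * mean_t, velocity


def predict_position(model, time):
    """Extrapolate the fitted linear motion to a future time."""
    intercept, velocity = model
    return intercept + velocity * time


def step(state, action, width):
    """Deterministic paddle dynamics on a grid of `width` cells, clamped at the walls."""
    return min(width - 1, max(0, state + action))


def train_goal_conditioned_q(width, sweeps=200, alpha=0.5, gamma=0.9):
    """Learn Q(state, goal, action): reward -1 per step, 0 on reaching the goal."""
    q = {(s, g, a): 0.0 for s in range(width) for g in range(width) for a in ACTIONS}
    for _ in range(sweeps):
        for goal in range(width):
            for state in range(width):
                if state == goal:
                    continue  # terminal state: its entries stay at zero
                for action in ACTIONS:
                    nxt = step(state, action, width)
                    if nxt == goal:
                        target = 0.0  # goal reached, no bootstrapping
                    else:
                        target = -1.0 + gamma * max(q[(nxt, goal, a)] for a in ACTIONS)
                    q[(state, goal, action)] += alpha * (target - q[(state, goal, action)])
    return q


def greedy_action(q, state, goal):
    """Pick the action with the highest learned value for this state and goal."""
    return max(ACTIONS, key=lambda a: q[(state, goal, a)])


def rollout(q, state, goal, width, max_steps=50):
    """Follow the greedy policy until the goal is reached or max_steps elapse."""
    path = [state]
    while state != goal and len(path) <= max_steps:
        state = step(state, greedy_action(q, state, goal), width)
        path.append(state)
    return path


if __name__ == "__main__":
    # Regression module: observe the ball for four frames, predict frame 5.
    ball_model = fit_linear_motion([0, 1, 2, 3], [1.0, 2.0, 3.0, 4.0])
    goal = round(predict_position(ball_model, 5))
    # Control module: one table serves every goal, so no retraining per target.
    q_table = train_goal_conditioned_q(width=8)
    print(rollout(q_table, state=0, goal=goal, width=8))
```

The point of the split is that the two modules are trained independently: the regression module needs only a few observed frames, and the goal-conditioned table is reused for any target the regression module proposes.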

Keywords

Arcade Games; Causal Learning; Hierarchical Reinforcement Learning; End-to-End Learning

Year of publication

2019

Title of conference proceedings

32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Workshop on Causal Learning

Conference

32nd Conference on Neural Information Processing Systems (NeurIPS 2018)

Conference location

Montréal, Canada

Conference date

2018-12-02 – 2018-12-08

Page URI

https://pub.uni-bielefeld.de/record/2933988

Cite

Melnik A, Fleer S, Schilling M, Ritter H. Modularization of End-to-End Learning: Case Study in Arcade Games. In: 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Workshop on Causal Learning. 2019.

Melnik, A., Fleer, S., Schilling, M., & Ritter, H. (2019). Modularization of End-to-End Learning: Case Study in Arcade Games. 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Workshop on Causal Learning

Melnik, Andrew, Fleer, Sascha, Schilling, Malte, and Ritter, Helge. 2019. “Modularization of End-to-End Learning: Case Study in Arcade Games”. In 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Workshop on Causal Learning.

Melnik, A., Fleer, S., Schilling, M., and Ritter, H. (2019). “Modularization of End-to-End Learning: Case Study in Arcade Games” in 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Workshop on Causal Learning.

Melnik, A., et al., 2019. Modularization of End-to-End Learning: Case Study in Arcade Games. In 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Workshop on Causal Learning.

A. Melnik, et al., “Modularization of End-to-End Learning: Case Study in Arcade Games”, 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Workshop on Causal Learning, 2019.

Melnik, A., Fleer, S., Schilling, M., Ritter, H.: Modularization of End-to-End Learning: Case Study in Arcade Games. 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Workshop on Causal Learning. (2019).

Melnik, Andrew, Fleer, Sascha, Schilling, Malte, and Ritter, Helge. “Modularization of End-to-End Learning: Case Study in Arcade Games”. 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Workshop on Causal Learning. 2019.

External material:

Other relation

URL

https://nips.cc/Conferences/2018/Schedule?showEvent=10907

Sources

arXiv: 1901.09895
