Continuous Control with a Combination of Supervised and Reinforcement Learning

Kangin, D; Pugeault, N

dc.contributor.author	Kangin, D
dc.contributor.author	Pugeault, N
dc.date.accessioned	2018-04-23T09:13:26Z
dc.date.issued	2018-10-15
dc.description.abstract	Reinforcement learning methods have recently achieved impressive results on a wide range of control problems. However, especially with complex inputs, they still require an extensive amount of training data in order to converge to a meaningful solution. This limits their applicability to complex input spaces such as video signals, and makes them impractical for use in complex real-world problems, including many of those for video-based control. Supervised learning, on the contrary, is capable of learning from a relatively limited number of samples, but relies on arbitrary hand-labelling of data rather than task-derived reward functions, and hence does not yield independent control policies. In this article we propose a novel, model-free approach, which uses a combination of reinforcement and supervised learning for autonomous control and paves the way towards policy-based control in real-world environments. We use the SpeedDreams/TORCS video game to demonstrate that our approach requires far fewer samples (hundreds of thousands against millions or tens of millions) compared to state-of-the-art reinforcement learning techniques on similar data, and at the same time outperforms both supervised and reinforcement learning approaches in terms of quality. Additionally, we demonstrate the applicability of the method to MuJoCo control problems.	en_GB
dc.description.sponsorship	The authors are grateful for the support by the UK Engineering and Physical Sciences Research Council (EPSRC) project DEVA EP/N035399/1.	en_GB
dc.identifier.citation	International Joint Conference on Neural Networks, 8-13 July 2018, Rio de Janeiro, Brazil	en_GB
dc.identifier.doi	10.1109/IJCNN.2018.8489702
dc.identifier.uri	http://hdl.handle.net/10871/32566
dc.language.iso	en	en_GB
dc.publisher	Institute of Electrical and Electronics Engineers	en_GB
dc.relation.url	http://www.ecomp.poli.br/~wcci2018/	en_GB
dc.rights	© 2018 IEEE.
dc.subject	Reinforcement Learning	en_GB
dc.subject	Deep Learning	en_GB
dc.subject	Continuous control	en_GB
dc.title	Continuous Control with a Combination of Supervised and Reinforcement Learning	en_GB
dc.type	Article	en_GB
dc.identifier.issn	2161-4393
dc.description	This is the author accepted manuscript. The final version is available from the Institute of Electrical and Electronics Engineers via the DOI in this record.	en_GB
dc.identifier.journal	Proceedings of the International Joint Conference on Neural Networks	en_GB

Files in this item

Name: KanginPugeault_IJCNN2018.pdf
Size: 3.494Mb
Format: PDF

This item appears in the following Collection(s)

Computer Science
