Hierarchy through composition with multitask LMDPs

Saxe, AM; Earle, AC; Rosman, Benjamin S

Hierarchy through composition with multitask LMDPs

http://proceedings.mlr.press/v70/saxe17a.html
http://proceedings.mlr.press/v70/saxe17a/saxe17a.pdf
http://hdl.handle.net/10204/9586

Abstract:

Hierarchical architectures are critical to the scalability of reinforcement learning methods. Most current hierarchical frameworks execute actions serially, with macro-actions comprising sequences of primitive actions. We propose a novel alternative to these control hierarchies based on concurrent execution of many actions in parallel. Our scheme exploits the guaranteed concurrent compositionality provided by the linearly solvable Markov decision process (LMDP) framework, which naturally enables a learning agent to draw on several macro-actions simultaneously to solve new tasks. We introduce the Multitask LMDP module, which maintains a parallel distributed representation of tasks and may be stacked to form deep hierarchies abstracted in space and time.

Reference:

Saxe, A.M., Earle, A.C., and Rosman, B.S. 2017. Hierarchy through composition with multitask LMDPs. Proceedings of the 34th International Conference on Machine Learning, PMLR 70:3017-3026, Sydney, Australia, 6-11 August 2017

Saxe, A., Earle, A., & Rosman, B. S. (2017). Hierarchy through composition with multitask LMDPs. Proceedings of Machine Learning Research. http://hdl.handle.net/10204/9586

Saxe, AM, AC Earle, and Benjamin S Rosman. "Hierarchy through composition with multitask LMDPs." (2017): http://hdl.handle.net/10204/9586

Saxe A, Earle A, Rosman BS, Hierarchy through composition with multitask LMDPs; Proceedings of Machine Learning Research; 2017. http://hdl.handle.net/10204/9586 .

Download RIS

Proceedings of the 34th International Conference on Machine Learning, PMLR 70:3017-3026, Sydney, Australia, 6-11 August 2017

Saxe, AM
Earle, AC
Rosman, Benjamin S

Aug 2017

Linearly-solvable MDPs
Hierarchies
Reinforcement learning

Show full item record

Files in this item

Saxe_19462_2017.pdf

This item appears in the following Collection(s)

Conference Publications

Browse

All of ResearchSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Publication Type
- Cluster
- Impact Area

Quick Links

Legislation and compliance

General Enquiries

Tel: + 27 12 841 2911
Email: callcentre@csir.co.za

Physical Address
Meiring Naudé Road
Brummeria
Pretoria
South Africa

Postal Address
PO Box 395
Pretoria 0001
South Africa

Social Connect

Resources on this site are free to download and reuse according to associated licensing provision. Please read the terms and conditions of usage of each resource.

Hierarchy through composition with multitask LMDPs

Hierarchy through composition with multitask LMDPs

This item appears in the following Collection(s)

Browse

All of ResearchSpace

This Collection

Quick Links

Legislation and compliance

General Enquiries

Social Connect