The computational complexity of learning in sequential decision problems grows exponentially with the number of actions available to the agent in each state. We present a method for accelerating this process by learning action priors that express the usefulness of each action in each state. These priors are learned from the optimal policies of many tasks in the same state space, and are used to bias exploration away from less useful actions. This is shown to improve performance for tasks in the same domain but with different goals. We then extend the method to base action priors on perceptual cues rather than absolute states, allowing these priors to be transferred between tasks with differing state spaces and transition functions, and we demonstrate experimentally the advantages of learning with action priors in a reinforcement learning context.
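
To make the idea concrete, below is a minimal sketch (not the authors' implementation) of one way such priors could be estimated and used in a discrete state-action setting: action choices are counted over the optimal policies of previously solved tasks, smoothed with a Dirichlet-style pseudo-count so unseen actions retain non-zero mass, and exploratory actions in an epsilon-greedy learner are then sampled from the resulting prior rather than uniformly. All function and variable names are illustrative assumptions.

    import numpy as np

    def learn_action_priors(optimal_policies, n_states, n_actions, alpha=1.0):
        # Count how often each action is optimal in each state across the
        # solved tasks; alpha is a pseudo-count keeping every action possible.
        counts = np.full((n_states, n_actions), alpha)
        for policy in optimal_policies:       # policy: array mapping state -> action
            for s in range(n_states):
                counts[s, policy[s]] += 1.0
        # Normalise rows into per-state action distributions (the priors).
        return counts / counts.sum(axis=1, keepdims=True)

    def biased_epsilon_greedy(q_values, prior, state, epsilon, rng):
        # Exploit greedily with probability 1 - epsilon; otherwise draw the
        # exploratory action from the learned prior instead of uniformly.
        if rng.random() < epsilon:
            return int(rng.choice(q_values.shape[1], p=prior[state]))
        return int(np.argmax(q_values[state]))

In this sketch the prior only reshapes exploration; the learner's value estimates and greedy choices are left untouched, so tasks with different goals can still be learned in full.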
Reference:
Rosman, B. S., & Ramamoorthy, S. (2012). What good are actions? Accelerating learning using learned action priors. IEEE Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob 2012), San Diego, California, USA, 7-9 November 2012. http://hdl.handle.net/10204/6475