Understanding structure of concurrent actions

Moodley, P., Rosman, B. and Hong, X. (ORCID: https://orcid.org/0000-0002-6832-2298) (2019) Understanding structure of concurrent actions. In: AI-2019: The Thirty-ninth SGAI International Conference, 17-19 Dec 2019, Cambridge, UK, pp. 78-90.
Official URL: https://doi.org/10.1007/978-3-030-34885-4_6

Abstract

Whereas most work in reinforcement learning (RL) ignores the structure of, or relationships between, actions, in this paper we show that exploiting structure in the action space can improve sample efficiency during exploration. To show this we focus on concurrent action spaces, in which the RL agent selects multiple actions per timestep. Concurrent action spaces are challenging to learn in, especially when the number of actions is large, as this can lead to a combinatorial explosion of the action space. This paper proposes two methods: the first uses implicit structure to perform high-level action elimination using task-invariant actions; the second looks for more explicit structure in the form of action clusters. Both methods are context-free, relying only on an analysis of the action space, and show a significant improvement in policy convergence times.
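To make the combinatorial-explosion point concrete, the sketch below (not taken from the paper; the primitive action names, the number of effectors, and the choice of which actions are treated as task-invariant are all illustrative assumptions) shows how the joint action space of a concurrent agent grows with the number of simultaneous action slots, and how pruning primitives judged irrelevant to the task family shrinks the space the agent must explore.

```python
# Hypothetical illustration of a concurrent action space before and after
# high-level action elimination. All names below are assumptions, not the
# paper's actual domains or method.
from itertools import product

# Suppose the agent fills 3 action slots per timestep, each drawn from the
# same set of primitive actions.
primitives = ["noop", "left", "right", "up", "down", "grab"]
n_slots = 3

# The concurrent (joint) action space is the Cartesian product of the slots.
joint_actions = list(product(primitives, repeat=n_slots))
print(len(joint_actions))  # 6**3 = 216 joint actions

# Eliminate primitives assumed to be task-invariant (i.e. never useful for
# this family of tasks). Which primitives qualify is a hypothetical choice
# here, made only to show the effect on the size of the joint space.
task_invariant = {"grab"}
useful = [a for a in primitives if a not in task_invariant]

pruned_joint_actions = list(product(useful, repeat=n_slots))
print(len(pruned_joint_actions))  # 5**3 = 125 joint actions left to explore
```

Because the joint space scales as (number of primitives) raised to the number of slots, removing even one primitive per slot cuts the space exponentially, which is why exploiting structure in the action space can matter for exploration.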