Understanding structure of concurrent actions

concurrent actions.pdf - Accepted Version (289kB)

Moodley, P., Rosman, B. and Hong, X. ORCID: https://orcid.org/0000-0002-6832-2298 (2019) Understanding structure of concurrent actions. In: AI-2019: The Thirty-ninth SGAI International Conference, 17-19 Dec 2019, Cambridge, UK, pp. 78-90.

Abstract/Summary

Whereas most work in reinforcement learning (RL) ignores the structure or relationships between actions, in this paper we show that exploiting structure in the action space can improve sample efficiency during exploration. To show this, we focus on concurrent action spaces, where the RL agent selects multiple actions per timestep. Concurrent action spaces are challenging to learn in, especially when the number of actions is large, because this can lead to a combinatorial explosion of the action space. This paper proposes two methods: the first uses implicit structure to perform high-level action elimination using task-invariant actions; the second looks for more explicit structure in the form of action clusters. Both methods are context-free, focusing only on an analysis of the action space, and both show a significant improvement in policy convergence times.

Additional Information: International Conference on Innovative Techniques and Applications of Artificial Intelligence
Item Type: Conference or Workshop Item (Paper)
URI: https://reading-clone.eprints-hosting.org/id/eprint/88398
Refereed: Yes
Divisions: Science > School of Mathematical, Physical and Computational Sciences > Department of Computer Science