Learning to Solve Tasks with Exploring Prior Behaviour

Ruiqi Zhu, Siyuan Li, Tianhong Dai, Chongjie Zhang, Oya Celiktutan

Research output: Chapter in Book/Report/Conference proceeding › Published conference contribution

Abstract

Demonstrations are widely used in Deep Reinforcement Learning (DRL) to facilitate solving tasks with sparse rewards. However, tasks in real-world scenarios often have initial conditions that differ from those in the demonstration, which requires additional prior behaviours. For example, suppose we are given a demonstration for the task of picking up an object from an open drawer, but the drawer is closed during training. Without acquiring the prior behaviour of opening the drawer, the robot is unlikely to solve the task. To address this, in this paper we propose Intrinsic Rewards Driven Example-based Control (IRDEC). Our method endows agents with the ability to explore and acquire the required prior behaviours and then connect them to the task-specific behaviours in the demonstration, solving sparse-reward tasks without requiring additional demonstrations of the prior behaviours. Our method outperforms baselines on three navigation tasks and one robotic manipulation task with sparse rewards. Code is available at https://github.com/Ricky-Zhu/IRDEC.
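To make the idea concrete, below is a minimal, hypothetical sketch of how an intrinsic exploration bonus can be combined with an example-based reward, so that an agent first explores towards states resembling the demonstration and then follows the demonstrated behaviour. This is not the IRDEC implementation: the RND-style bonus, the Gaussian-kernel example reward, and names such as RNDBonus, example_based_reward, and beta are illustrative assumptions only; see the repository above for the actual method.

```python
import numpy as np


class RNDBonus:
    """Random-Network-Distillation-style novelty bonus: one common way to build
    an intrinsic reward (the paper may use a different formulation)."""

    def __init__(self, obs_dim, feat_dim=32, lr=1e-3, seed=0):
        rng = np.random.default_rng(seed)
        self.target = rng.normal(size=(obs_dim, feat_dim))  # fixed random projection
        self.pred = rng.normal(size=(obs_dim, feat_dim))    # trained to imitate it
        self.lr = lr

    def __call__(self, obs):
        obs = np.asarray(obs, dtype=float)
        err = obs @ self.target - obs @ self.pred   # prediction error = novelty
        self.pred += self.lr * np.outer(obs, err)   # familiar states lose their bonus
        return float(np.mean(err ** 2))


def example_based_reward(obs, demo_states, sigma=0.5):
    """Proxy task reward: similarity of the current state to demonstrated
    (success-region) states, standing in for a learned success classifier."""
    d = np.min(np.linalg.norm(demo_states - obs, axis=1))
    return float(np.exp(-(d ** 2) / (2 * sigma ** 2)))


def shaped_reward(obs, demo_states, rnd, beta=0.1):
    """Total reward = example-based term + weighted intrinsic exploration bonus."""
    return example_based_reward(obs, demo_states) + beta * rnd(obs)


# Toy usage with 4-D observations and a handful of demonstration states.
demo = np.random.randn(10, 4)
rnd = RNDBonus(obs_dim=4)
r = shaped_reward(np.random.randn(4), demo, rnd)
```

In this sketch the intrinsic term drives acquisition of the prior behaviour (e.g., opening the drawer) by rewarding novel states, while the example-based term takes over once the agent reaches states close to those in the demonstration.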
Original language: English
Title of host publication: 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Publisher: IEEE Xplore
Number of pages: 7
ISBN (Electronic): 978-1-6654-9190-7
DOIs
Publication status: Published - 13 Dec 2023
Event: IEEE/RSJ International Conference on Intelligent Robots and Systems - Detroit, United States
Duration: 1 Oct 2023 – 5 Oct 2023
https://ieee-iros.org/

Conference

Conference: IEEE/RSJ International Conference on Intelligent Robots and Systems
Country/Territory: United States
City: Detroit
Period: 1/10/23 – 5/10/23
Internet address: https://ieee-iros.org/
