Self-Supervised Adversarial Imitation Learning

Juarez Monteiro; Nathan Gavenski; Felipe Meneguzzi; Rodrigo C. Barros

doi:10.1109/IJCNN54540.2023.10191197

Self-Supervised Adversarial Imitation Learning

Juarez Monteiro, Nathan Gavenski, Felipe Meneguzzi, Rodrigo C. Barros

Research output: Chapter in Book/Report/Conference proceeding › Published conference contribution

Abstract

Behavioural cloning is an imitation learning technique that teaches an agent how to behave via expert demonstrations. Recent approaches use self-supervision of fully-observable unlabelled snapshots of the states to decode state pairs into actions. However, the iterative learning scheme employed by these techniques is prone to get trapped into bad local minima. Previous work uses goal-aware strategies to solve this issue. However, this requires manual intervention to verify whether an agent has reached its goal. We address this limitation by incorporating a discriminator into the original framework, offering two key advantages and directly solving a learning problem previous work had. First, it disposes of the manual intervention requirement. Second, it helps in learning by guiding function approximation based on the state transition of the expert's trajectories. Third, the discriminator solves a learning issue commonly present in the policy model, which is to sometimes perform a 'no action' within the environment until the agent finally halts.

Original language	English
Title of host publication	2023 International Joint Conference on Neural Networks (IJCNN)
Subtitle of host publication	18-23 June 2023
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781665488679
ISBN (Print)	978-1-6654-8868-6
DOIs	https://doi.org/10.1109/IJCNN54540.2023.10191197
Publication status	Published - 2 Aug 2023
Event	2023 International Joint Conference on Neural Networks, IJCNN 2023 - Gold Coast, Australia Duration: 18 Jun 2023 → 23 Jun 2023

Conference

Conference	2023 International Joint Conference on Neural Networks, IJCNN 2023
Country/Territory	Australia
City	Gold Coast
Period	18/06/23 → 23/06/23

Bibliographical note

This work was supported by UK Research and Innovation [grant number EP/S023356/1], in the UKRI Centre for Doctoral Training in Safe and Trusted Artificial Intelligence (www.safeandtrustedai.org) and made possible via King’s Computational Research, Engineering and Technology Environment (CREATE) [27].

Keywords

Adversarial Learning
Imitation Learning
Learning from Observation
Self-Supervised Learning

Access to Document

10.1109/IJCNN54540.2023.10191197Licence: Unspecified

Cite this

Monteiro, J, Gavenski, N, Meneguzzi, F & Barros, RC 2023, Self-Supervised Adversarial Imitation Learning. in 2023 International Joint Conference on Neural Networks (IJCNN): 18-23 June 2023. Institute of Electrical and Electronics Engineers Inc., 2023 International Joint Conference on Neural Networks, IJCNN 2023, Gold Coast, Australia, 18/06/23. https://doi.org/10.1109/IJCNN54540.2023.10191197

@inproceedings{5b308124f70a42d9ae0190819a344d04,

title = "Self-Supervised Adversarial Imitation Learning",

abstract = "Behavioural cloning is an imitation learning technique that teaches an agent how to behave via expert demonstrations. Recent approaches use self-supervision of fully-observable unlabelled snapshots of the states to decode state pairs into actions. However, the iterative learning scheme employed by these techniques is prone to get trapped into bad local minima. Previous work uses goal-aware strategies to solve this issue. However, this requires manual intervention to verify whether an agent has reached its goal. We address this limitation by incorporating a discriminator into the original framework, offering two key advantages and directly solving a learning problem previous work had. First, it disposes of the manual intervention requirement. Second, it helps in learning by guiding function approximation based on the state transition of the expert's trajectories. Third, the discriminator solves a learning issue commonly present in the policy model, which is to sometimes perform a 'no action' within the environment until the agent finally halts.",

keywords = "Adversarial Learning, Imitation Learning, Learning from Observation, Self-Supervised Learning",

author = "Juarez Monteiro and Nathan Gavenski and Felipe Meneguzzi and Barros, {Rodrigo C.}",

note = " This work was supported by UK Research and Innovation [grant number EP/S023356/1], in the UKRI Centre for Doctoral Training in Safe and Trusted Artificial Intelligence (www.safeandtrustedai.org) and made possible via King{\textquoteright}s Computational Research, Engineering and Technology Environment (CREATE) [27]. ; 2023 International Joint Conference on Neural Networks, IJCNN 2023 ; Conference date: 18-06-2023 Through 23-06-2023",

year = "2023",

month = aug,

day = "2",

doi = "10.1109/IJCNN54540.2023.10191197",

language = "English",

isbn = "978-1-6654-8868-6",

booktitle = "2023 International Joint Conference on Neural Networks (IJCNN)",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

address = "United States",

}

TY - GEN

T1 - Self-Supervised Adversarial Imitation Learning

AU - Monteiro, Juarez

AU - Gavenski, Nathan

AU - Meneguzzi, Felipe

AU - Barros, Rodrigo C.

N1 - This work was supported by UK Research and Innovation [grant number EP/S023356/1], in the UKRI Centre for Doctoral Training in Safe and Trusted Artificial Intelligence (www.safeandtrustedai.org) and made possible via King’s Computational Research, Engineering and Technology Environment (CREATE) [27].

PY - 2023/8/2

Y1 - 2023/8/2

N2 - Behavioural cloning is an imitation learning technique that teaches an agent how to behave via expert demonstrations. Recent approaches use self-supervision of fully-observable unlabelled snapshots of the states to decode state pairs into actions. However, the iterative learning scheme employed by these techniques is prone to get trapped into bad local minima. Previous work uses goal-aware strategies to solve this issue. However, this requires manual intervention to verify whether an agent has reached its goal. We address this limitation by incorporating a discriminator into the original framework, offering two key advantages and directly solving a learning problem previous work had. First, it disposes of the manual intervention requirement. Second, it helps in learning by guiding function approximation based on the state transition of the expert's trajectories. Third, the discriminator solves a learning issue commonly present in the policy model, which is to sometimes perform a 'no action' within the environment until the agent finally halts.

AB - Behavioural cloning is an imitation learning technique that teaches an agent how to behave via expert demonstrations. Recent approaches use self-supervision of fully-observable unlabelled snapshots of the states to decode state pairs into actions. However, the iterative learning scheme employed by these techniques is prone to get trapped into bad local minima. Previous work uses goal-aware strategies to solve this issue. However, this requires manual intervention to verify whether an agent has reached its goal. We address this limitation by incorporating a discriminator into the original framework, offering two key advantages and directly solving a learning problem previous work had. First, it disposes of the manual intervention requirement. Second, it helps in learning by guiding function approximation based on the state transition of the expert's trajectories. Third, the discriminator solves a learning issue commonly present in the policy model, which is to sometimes perform a 'no action' within the environment until the agent finally halts.

KW - Adversarial Learning

KW - Imitation Learning

KW - Learning from Observation

KW - Self-Supervised Learning

UR - http://www.scopus.com/inward/record.url?scp=85169569844&partnerID=8YFLogxK

U2 - 10.1109/IJCNN54540.2023.10191197

DO - 10.1109/IJCNN54540.2023.10191197

M3 - Published conference contribution

AN - SCOPUS:85169569844

SN - 978-1-6654-8868-6

BT - 2023 International Joint Conference on Neural Networks (IJCNN)

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2023 International Joint Conference on Neural Networks, IJCNN 2023

Y2 - 18 June 2023 through 23 June 2023

ER -

Self-Supervised Adversarial Imitation Learning

Abstract

Conference

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this