Abstract
Task-oriented grasping models aim to predict a grasp pose on an object that is suitable for a given task. Such systems generalize poorly to new tasks, although they can generalize to novel objects by recognizing affordances. This object-level generalization comes at the cost of not recognizing the category of the object being grasped, which can lead to unpredictable or risky behavior. To overcome these generalization limitations, we contribute a novel system for task-oriented grasping: the One-Shot Task-Oriented Grasping (OS-TOG) framework. OS-TOG comprises four interchangeable neural networks that interact through dependable reasoning components, yielding a single system that predicts multiple grasp candidates for a specified object and task in multi-object scenes. Embedded one-shot learning models leverage references in a database, allowing OS-TOG to generalize to novel objects and tasks more efficiently than existing alternatives. The paper also presents suitable candidates for the framework's neural components, covering the adjustments essential for their integration and evaluative comparisons to the state of the art. In physical experiments with novel objects, OS-TOG recognizes 69.4% of detected objects correctly and predicts suitable task-oriented grasps with 82.3% accuracy, with a physical grasp success rate of 82.3%.
| Original language | English |
|---|---|
| Pages (from-to) | 8232-8238 |
| Number of pages | 7 |
| Journal | IEEE Robotics and Automation Letters |
| Volume | 8 |
| Issue number | 12 |
| Early online date | Nov 2023 |
| DOIs | |
| Publication status | Published - 1 Dec 2023 |
Keywords
- computer vision for automation
- deep learning in grasping and manipulation
- grasping
- recognition