Cooperative Multiagent Attentional Communication for Large-Scale Task Space

Qijie Zou; Youkun Hu; Dewei Yi; Bing Gao; Jing Qin; Chi-Hua Chen

doi:10.1155/2022/4401653

Cooperative Multiagent Attentional Communication for Large-Scale Task Space

Qijie Zou, Youkun Hu^* (Corresponding Author), Dewei Yi, Bing Gao, Jing Qin, Chi-Hua Chen (Editor)

^*Corresponding author for this work

Dalian University of Technology

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

9 Downloads (Pure)

Abstract

With the rapid development of mobile robots, they have begun to be widely used in industrial manufacturing, logistics scheduling, intelligent medical, and other fields. For large-scale task space, the communication between multiagents is the key to affect cooperation productivity, and agents can coordinate more effectively with the help of dynamic communication. However, the traditional communication mechanism uses simple message aggregation and broadcast and, in some cases, lacks the distinction of the importance of information. Multiagent deep reinforcement learning (MDRL) is valid to solve the problem of informational coordination strategies. However, how different messages affect each agent’s decision-making process remains a challenging task for large-scale task. To solve this problem, we propose IMANet (Import Message Attention Network). It divides the decision-making process into two substages: communication and action, where communication is considered to be part of the environment. First, an attention mechanism based on query vectors is introduced. The correlation between the query vector agent’s own information and the current state information of other agents is estimated, and then, the results are used to distinguish the importance of information from other agents. Second, the LSTM network is used as the unit controller for each agent, and individual rewards are used to guide the agent training after communication. Finally, IMANet is evaluated on tasks on challenging multi-agent platforms, Predator and Prey (PP), and traffic junction. The results show that IMANet can improve the efficiency of learning and training, especially when applied to large-scale task space, with a success rate 12% higher than CommNet in baseline experiments.

Original language	English
Article number	4401653
Number of pages	13
Journal	Wireless Communications and Mobile Computing
Volume	2022
Early online date	24 Jan 2022
DOIs	https://doi.org/10.1155/2022/4401653
Publication status	Published - 24 Jan 2022

Bibliographical note

Acknowledgments
This work was supported by the Dalian University Research Platform Project Funding: Dalian Wise Information Technology of Med and Health Key Laboratory, the National Natural Science Foundation of China: Research on the stability of multi-surface high-speed unmanned boat formation and the method of cooperative collision avoidance in complex sea conditions, NO.61673084.

Data Availability Statement

The data used to support the findings of this study are available from the authors upon request.

Access to Document

10.1155/2022/4401653Licence: CC BY

Zou_etal_WCMC_Cooperative_multi-agent_attentional_AAM
This is the peer reviewed accepted manuscript of [cite] Version of record at[DOI] https://creativecommons.org/licenses/by/4.0/
Accepted author manuscript, 697 KBLicence: CC BY
Zou_etal_Cooperative_multiagent_task_VOR
This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.https://creativecommons.org/licenses/by/4.0/
Final published version, 1.31 MBLicence: CC BY

Cite this

@article{020c8822cef24031b5d5ca1e16a70310,

title = "Cooperative Multiagent Attentional Communication for Large-Scale Task Space",

abstract = "With the rapid development of mobile robots, they have begun to be widely used in industrial manufacturing, logistics scheduling, intelligent medical, and other fields. For large-scale task space, the communication between multiagents is the key to affect cooperation productivity, and agents can coordinate more effectively with the help of dynamic communication. However, the traditional communication mechanism uses simple message aggregation and broadcast and, in some cases, lacks the distinction of the importance of information. Multiagent deep reinforcement learning (MDRL) is valid to solve the problem of informational coordination strategies. However, how different messages affect each agent{\textquoteright}s decision-making process remains a challenging task for large-scale task. To solve this problem, we propose IMANet (Import Message Attention Network). It divides the decision-making process into two substages: communication and action, where communication is considered to be part of the environment. First, an attention mechanism based on query vectors is introduced. The correlation between the query vector agent{\textquoteright}s own information and the current state information of other agents is estimated, and then, the results are used to distinguish the importance of information from other agents. Second, the LSTM network is used as the unit controller for each agent, and individual rewards are used to guide the agent training after communication. Finally, IMANet is evaluated on tasks on challenging multi-agent platforms, Predator and Prey (PP), and traffic junction. The results show that IMANet can improve the efficiency of learning and training, especially when applied to large-scale task space, with a success rate 12% higher than CommNet in baseline experiments.",

author = "Qijie Zou and Youkun Hu and Dewei Yi and Bing Gao and Jing Qin and Chi-Hua Chen",

note = "Acknowledgments This work was supported by the Dalian University Research Platform Project Funding: Dalian Wise Information Technology of Med and Health Key Laboratory, the National Natural Science Foundation of China: Research on the stability of multi-surface high-speed unmanned boat formation and the method of cooperative collision avoidance in complex sea conditions, NO.61673084.",

year = "2022",

month = jan,

day = "24",

doi = "10.1155/2022/4401653",

language = "English",

volume = "2022",

journal = "Wireless Communications and Mobile Computing",

issn = "1530-8677",

publisher = "Hindawi Limited",

}

TY - JOUR

T1 - Cooperative Multiagent Attentional Communication for Large-Scale Task Space

AU - Zou, Qijie

AU - Hu, Youkun

AU - Yi, Dewei

AU - Gao, Bing

AU - Qin, Jing

A2 - Chen, Chi-Hua

N1 - Acknowledgments This work was supported by the Dalian University Research Platform Project Funding: Dalian Wise Information Technology of Med and Health Key Laboratory, the National Natural Science Foundation of China: Research on the stability of multi-surface high-speed unmanned boat formation and the method of cooperative collision avoidance in complex sea conditions, NO.61673084.

PY - 2022/1/24

Y1 - 2022/1/24

N2 - With the rapid development of mobile robots, they have begun to be widely used in industrial manufacturing, logistics scheduling, intelligent medical, and other fields. For large-scale task space, the communication between multiagents is the key to affect cooperation productivity, and agents can coordinate more effectively with the help of dynamic communication. However, the traditional communication mechanism uses simple message aggregation and broadcast and, in some cases, lacks the distinction of the importance of information. Multiagent deep reinforcement learning (MDRL) is valid to solve the problem of informational coordination strategies. However, how different messages affect each agent’s decision-making process remains a challenging task for large-scale task. To solve this problem, we propose IMANet (Import Message Attention Network). It divides the decision-making process into two substages: communication and action, where communication is considered to be part of the environment. First, an attention mechanism based on query vectors is introduced. The correlation between the query vector agent’s own information and the current state information of other agents is estimated, and then, the results are used to distinguish the importance of information from other agents. Second, the LSTM network is used as the unit controller for each agent, and individual rewards are used to guide the agent training after communication. Finally, IMANet is evaluated on tasks on challenging multi-agent platforms, Predator and Prey (PP), and traffic junction. The results show that IMANet can improve the efficiency of learning and training, especially when applied to large-scale task space, with a success rate 12% higher than CommNet in baseline experiments.

AB - With the rapid development of mobile robots, they have begun to be widely used in industrial manufacturing, logistics scheduling, intelligent medical, and other fields. For large-scale task space, the communication between multiagents is the key to affect cooperation productivity, and agents can coordinate more effectively with the help of dynamic communication. However, the traditional communication mechanism uses simple message aggregation and broadcast and, in some cases, lacks the distinction of the importance of information. Multiagent deep reinforcement learning (MDRL) is valid to solve the problem of informational coordination strategies. However, how different messages affect each agent’s decision-making process remains a challenging task for large-scale task. To solve this problem, we propose IMANet (Import Message Attention Network). It divides the decision-making process into two substages: communication and action, where communication is considered to be part of the environment. First, an attention mechanism based on query vectors is introduced. The correlation between the query vector agent’s own information and the current state information of other agents is estimated, and then, the results are used to distinguish the importance of information from other agents. Second, the LSTM network is used as the unit controller for each agent, and individual rewards are used to guide the agent training after communication. Finally, IMANet is evaluated on tasks on challenging multi-agent platforms, Predator and Prey (PP), and traffic junction. The results show that IMANet can improve the efficiency of learning and training, especially when applied to large-scale task space, with a success rate 12% higher than CommNet in baseline experiments.

UR - http://www.scopus.com/inward/record.url?scp=85124402202&partnerID=8YFLogxK

U2 - 10.1155/2022/4401653

DO - 10.1155/2022/4401653

M3 - Article

SN - 1530-8677

VL - 2022

JO - Wireless Communications and Mobile Computing

JF - Wireless Communications and Mobile Computing

M1 - 4401653

ER -

Cooperative Multiagent Attentional Communication for Large-Scale Task Space

Abstract

Bibliographical note

Data Availability Statement

Access to Document

Other files and links

Fingerprint

Cite this