TY - GEN
T1 - Real-time Semantic Segmentation of Solar Photovoltaic Arrays for Autonomous UAV Flights
AU - Li, Nanyan
AU - Chen, Hao
AU - Sun, Zhigang
AU - Gao, Jing
AU - Yi, Dewei
AU - Liu, Cunjia
AU - Su, Jinya
PY - 2024/9/17
Y1 - 2024/9/17
N2 - This paper considers the real-time semantic segmentation of solar PhotoVoltaic (PV) arrays using down-facing cameras onboard Unmanned Aerial Vehicles (UAVs) to support autonomous navigation for ultra-low altitude industrial inspection. Various semantic segmentation algorithms, including SegNet, UNet, Deeplabv3+, and their respective variants, are compared to identify the most suitable one that strikes a balance between segmentation and real-time performance. Subsequently, each video frame is processed by the selected segmentation network to create a solar array mask. This mask is then optimally fitted to diverse solar array shapes to determine angular and horizontal deviations between the image center and the solar array center-line. The algorithms are validated on a self-constructed dataset of 594 solar PV array UAV images, encompassing varied environmental conditions and solar farm layouts. The top-performing network, lightweight vgg 16bn-Unet, achieves 99.4% overall accuracy and 98.2% Intersection over Union (IoU), while the most balanced network, mobilev3-Deeplabv3+, attains 98.9% accuracy and 96.5% IoU, with a 27.59 fps (using an NVIDIA GeForce MX150 GPU) suitable for real-time UAV flights.
AB - This paper considers the real-time semantic segmentation of solar PhotoVoltaic (PV) arrays using down-facing cameras onboard Unmanned Aerial Vehicles (UAVs) to support autonomous navigation for ultra-low altitude industrial inspection. Various semantic segmentation algorithms, including SegNet, UNet, Deeplabv3+, and their respective variants, are compared to identify the most suitable one that strikes a balance between segmentation and real-time performance. Subsequently, each video frame is processed by the selected segmentation network to create a solar array mask. This mask is then optimally fitted to diverse solar array shapes to determine angular and horizontal deviations between the image center and the solar array center-line. The algorithms are validated on a self-constructed dataset of 594 solar PV array UAV images, encompassing varied environmental conditions and solar farm layouts. The top-performing network, lightweight vgg 16bn-Unet, achieves 99.4% overall accuracy and 98.2% Intersection over Union (IoU), while the most balanced network, mobilev3-Deeplabv3+, attains 98.9% accuracy and 96.5% IoU, with a 27.59 fps (using an NVIDIA GeForce MX150 GPU) suitable for real-time UAV flights.
KW - CNN
KW - Machine Vision
KW - Real-time and Embedded systems
KW - Solar Inspection
KW - UAV Navigation
UR - http://www.scopus.com/inward/record.url?scp=85205451327&partnerID=8YFLogxK
U2 - 10.23919/CCC63176.2024.10662016
DO - 10.23919/CCC63176.2024.10662016
M3 - Published conference contribution
AN - SCOPUS:85205451327
T3 - Chinese Control Conference, CCC
SP - 7292
EP - 7297
BT - Proceedings of the 43rd Chinese Control Conference, CCC 2024
A2 - Na, Jing
A2 - Sun, Jian
PB - IEEE Computer Society
T2 - 43rd Chinese Control Conference, CCC 2024
Y2 - 28 July 2024 through 31 July 2024
ER -