Coordinate-Aware Mask R-CNN with Group Normalization: A Underwater Marine Animal Instance Segmentation Framework

Dewei Yi* (Corresponding Author), Hasan Bayarov Ahmedov, Shouyong Jiang, Yiren Li, Sean Joseph Flinn, Paul G. Fernandes

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Unsustainable fishing, driven by bycatch and discards, harms marine ecosystems. Addressing this, we propose a Coordinate-Aware Mask R-CNN (CAM-RCNN) method to enhance fish detection in commercial trawls. Leveraging CoordConv and Group Normalization, our approach improves generalization and stability. To tackle class imbalance, a compound Dice and cross-entropy loss is employed, and image data are enhanced through multi-scale retinex and color restoration. Evaluating on two fishing datasets, CAM-RCNN excels in accuracy and generalization, achieving the best Average Precision (AP) for instance mask and BBOX prediction in both source (39.7%, 40.2%) and target domains (24.4%, 24.2%). This method promotes sustainable fishing by selectively capturing desired fish, reducing harm to non-target species.

Original languageEnglish
Article number127488
JournalNeurocomputing
Early online date6 Mar 2024
DOIs
Publication statusE-pub ahead of print - 6 Mar 2024

Bibliographical note

This work was supported in part by the Fisheries Innovation & Sustainability and U.K. Department for Environment, Food & Rural Affairs under Grant numbers: FIS039 and FIS045A.

Keywords

  • Instance segmentation
  • Convolutional neural network (CNN)
  • Underwater dataset
  • Generalisability

Fingerprint

Dive into the research topics of 'Coordinate-Aware Mask R-CNN with Group Normalization: A Underwater Marine Animal Instance Segmentation Framework'. Together they form a unique fingerprint.

Cite this