Abstract
The robustness of supervised deep learning-based medical image classification is significantly undermined by label noise. Although several methods have been proposed to enhance classification performance in the presence of noisy labels, they face some challenges: 1) a struggle with class-imbalanced datasets, leading to the frequent overlooking of minority classes as noisy samples; 2) a singular focus on maximizing performance using noisy datasets, without incorporating experts-in-theloop for actively cleaning the noisy labels. To mitigate these challenges, we propose a two phase approach that combines Learning with Noisy Labels (LNL) and active learning. This approach not only improves the robustness of medical image classification in the presence of noisy labels, but also iteratively improves the quality of the dataset by relabeling the important incorrect labels, under a limited annotation budget. Furthermore, we introduce a novel Variance of Gradients approach in LNL phase, which complements the loss-based sample selection by also sampling under-represented samples. Using two imbalanced noisy medical classification datasets, we demonstrate that that our proposed technique is superior to its predecessors at handling class imbalance by not misidentifying clean samples from minority classes as mostly noisy samples.
Original language | English |
---|---|
DOIs | |
Publication status | Published - 3 Oct 2024 |
Event | MICCAI 2024: 27th International Conference on Medical Image Computing And Computer Assisted Intervention - Marrakesh, Morocco Duration: 6 Oct 2024 → 10 Oct 2024 Conference number: 27 https://conferences.miccai.org/2024/en/ |
Conference
Conference | MICCAI 2024 |
---|---|
Country/Territory | Morocco |
City | Marrakesh |
Period | 6/10/24 → 10/10/24 |
Internet address |
Bibliographical note
Acknowledgements. Research reported in this publication was supported by the NIGMS Award No. R35GM128877 of the National Institutes of Health, and by OAC Award No. 1808530 and CBET Award No. 2245152, both of the National Science Foundation, and by the Aberdeen Startup Grant CF10834-10. We also acknowledge Research Computing at the Rochester Institute of Technology [?]for providing computing resources.Keywords
- Active label cleaning
- Label noise
- Learning with noisy labels (LNL)
- Medical image classification
- Imbalanced data
- Active learning
- Limited budget