A deep learning approach to photo–identification demonstrates high performance on two dozen cetacean species

Philip T. Patton; Ted Cheeseman; Kenshin Abe; Taiki Yamaguchi; Walter Reade; Ken Southerland; Addison Howard; Erin M. Oleson; Jason B. Allen; Erin Ashe; Aline Athayde; Robin W. Baird; Charla Basran; Elsa Cabrera; John Calambokidis; Júlio Cardoso; Emma L. Carroll; Amina Cesario; Barbara J. Cheney; Enrico Corsi; Jens Currie; John W. Durban; Erin A. Falcone; Holly Fearnbach; Kiirsten Flynn; Trish Franklin; Wally Franklin; Bárbara Galletti Vernazzani; Tilen Genov; Marie Hill; David R. Johnston; Erin L. Keene; Sabre D. Mahaffy; Tamara L. McGuire; Liah McPherson; Catherine Meyer; Robert Michaud; Anastasia Miliou; Dara N. Orbach; Heidi C. Pearson; Marianne H. Rasmussen; William J. Rayment; Caroline Rinaldi; Renato Rinaldi; Salvatore Siciliano; Stephanie Stack; Beatriz Tintore; Leigh G. Torres; Jared R. Towers; Cameron Trotter; Reny Tyson Moore; Caroline R. Weir; Rebecca Wellard; Randall Wells; Kymberly M. Yano; Jochen R. Zaeschmar; Lars Bejder

doi:10.1111/2041-210X.14167

A deep learning approach to photo–identification demonstrates high performance on two dozen cetacean species

Philip T. Patton^* (Corresponding Author), Ted Cheeseman, Kenshin Abe, Taiki Yamaguchi, Walter Reade, Ken Southerland, Addison Howard, Erin M. Oleson, Jason B. Allen, Erin Ashe, Aline Athayde, Robin W. Baird, Charla Basran, Elsa Cabrera, John Calambokidis, Júlio Cardoso, Emma L. Carroll, Amina Cesario, Barbara J. Cheney, Enrico CorsiJens Currie, John W. Durban, Erin A. Falcone, Holly Fearnbach, Kiirsten Flynn, Trish Franklin, Wally Franklin, Bárbara Galletti Vernazzani, Tilen Genov, Marie Hill, David R. Johnston, Erin L. Keene, Sabre D. Mahaffy, Tamara L. McGuire, Liah McPherson, Catherine Meyer, Robert Michaud, Anastasia Miliou, Dara N. Orbach, Heidi C. Pearson, Marianne H. Rasmussen, William J. Rayment, Caroline Rinaldi, Renato Rinaldi, Salvatore Siciliano, Stephanie Stack, Beatriz Tintore, Leigh G. Torres, Jared R. Towers, Cameron Trotter, Reny Tyson Moore, Caroline R. Weir, Rebecca Wellard, Randall Wells, Kymberly M. Yano, Jochen R. Zaeschmar, Lars Bejder

^*Corresponding author for this work

Aberdeen Centre For Environmental Sustainability

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

Abstract

Researchers can investigate many aspects of animal ecology through noninvasive photo–identification. Photo–identification is becoming more efficient as matching individuals between photos is increasingly automated. However, the convolutional neural network models that have facilitated this change need many training images to generalize well. As a result, they have often been developed for individual species that meet this threshold. These single-species methods might underperform, as they ignore potential similarities in identifying characteristics and the photo–identification process among species.
In this paper, we introduce a multi-species photo–identification model based on a state-of-the-art method in human facial recognition, the ArcFace classification head. Our model uses two such heads to jointly classify species and identities, allowing species to share information and parameters within the network. As a demonstration, we trained this model with 50,796 images from 39 catalogues of 24 cetacean species, evaluating its predictive performance on 21,192 test images from the same catalogues. We further evaluated its predictive performance with two external catalogues entirely composed of identities that the model did not see during training.
The model achieved a mean average precision (MAP) of 0.869 on the test set. Of these, 10 catalogues representing seven species achieved a MAP score over 0.95. For some species, there was notable variation in performance among catalogues, largely explained by variation in photo quality. Finally, the model appeared to generalize well, with the two external catalogues scoring similarly to their species' counterparts in the larger test set.
From our cetacean application, we provide a list of recommendations for potential users of this model, focusing on those with cetacean photo–identification catalogues. For example, users with high quality images of animals identified by dorsal nicks and notches should expect near optimal performance. Users can expect decreasing performance for catalogues with higher proportions of indistinct individuals or poor quality photos. Finally, we note that this model is currently freely available as code in a GitHub repository and as a graphical user interface, with additional functionality for collaborative data management, via Happywhale.com.

Original language	English
Pages (from-to)	2611-2625
Number of pages	15
Journal	Methods in Ecology and Evolution
Volume	14
Issue number	10
Early online date	13 Jul 2023
DOIs	https://doi.org/10.1111/2041-210X.14167
Publication status	Published - Oct 2023

Bibliographical note

We thank the countless individuals who collected and/or processed the nearly 85,000 images used in this study and those who assisted, particularly those who sorted these images from the millions that did not end up in the catalogues. Additionally, we thank the other Kaggle competitors who helped develop the ideas, models and data used here, particularly those who released their datasets to the public. The graduate assistantship for Philip T. Patton was funded by the NOAA Fisheries QUEST Fellowship. This paper represents HIMB and SOEST contribution numbers 1932 and 11679, respectively. The technical support and advanced computing resources from University of Hawaii Information Technology Services—Cyberinfrastructure, funded in part by the National Science Foundation CC* awards # 2201428 and # 2232862 are gratefully acknowledged. Every photo–identification image was collected under permits according to relevant national guidelines, regulation and legislation.

Data Availability Statement

The competition data are freely available at https://www.kaggle.com/competitions/happy-whale-and-dolphin. The data and code necessary to train, validate, and test the model are available at Zenodo (Abe, 2023) and GitHub https://github.com/knshnb/kaggle-happywhale-1st-place.

Keywords

artificial intelligence
cetacean
computer vision
convolutional neural network
deep learning
dolphin
dorsal
lateral
machine learning
multi–species
photo–identification
whale

Access to Document

10.1111/2041-210X.14167Licence: CC BY-NC

Patton_etal_MEE_A_Deep_Learning_VoR
This is an open access article under the terms of the Creative Commons Attribution-NonCommercial License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. © 2023 The Authors. Methods in Ecology and Evolution published by John Wiley & Sons Ltd on behalf of British Ecological Soc
Final published version, 7.44 MBLicence: CC BY-NC

Cite this

Patton, P. T., Cheeseman, T., Abe, K., Yamaguchi, T., Reade, W., Southerland, K., Howard, A., Oleson, E. M., Allen, J. B., Ashe, E., Athayde, A., Baird, R. W., Basran, C., Cabrera, E., Calambokidis, J., Cardoso, J., Carroll, E. L., Cesario, A., Cheney, B. J., ... Bejder, L. (2023). A deep learning approach to photo–identification demonstrates high performance on two dozen cetacean species. Methods in Ecology and Evolution, 14(10), 2611-2625. https://doi.org/10.1111/2041-210X.14167

Patton, PT, Cheeseman, T, Abe, K, Yamaguchi, T, Reade, W, Southerland, K, Howard, A, Oleson, EM, Allen, JB, Ashe, E, Athayde, A, Baird, RW, Basran, C, Cabrera, E, Calambokidis, J, Cardoso, J, Carroll, EL, Cesario, A, Cheney, BJ, Corsi, E, Currie, J, Durban, JW, Falcone, EA, Fearnbach, H, Flynn, K, Franklin, T, Franklin, W, Galletti Vernazzani, B, Genov, T, Hill, M, Johnston, DR, Keene, EL, Mahaffy, SD, McGuire, TL, McPherson, L, Meyer, C, Michaud, R, Miliou, A, Orbach, DN, Pearson, HC, Rasmussen, MH, Rayment, WJ, Rinaldi, C, Rinaldi, R, Siciliano, S, Stack, S, Tintore, B, Torres, LG, Towers, JR, Trotter, C, Tyson Moore, R, Weir, CR, Wellard, R, Wells, R, Yano, KM, Zaeschmar, JR & Bejder, L 2023, 'A deep learning approach to photo–identification demonstrates high performance on two dozen cetacean species', Methods in Ecology and Evolution, vol. 14, no. 10, pp. 2611-2625. https://doi.org/10.1111/2041-210X.14167

@article{052b15ef58914ff2a9d7cbe8cedd2b55,

title = "A deep learning approach to photo–identification demonstrates high performance on two dozen cetacean species",

abstract = "Researchers can investigate many aspects of animal ecology through noninvasive photo–identification. Photo–identification is becoming more efficient as matching individuals between photos is increasingly automated. However, the convolutional neural network models that have facilitated this change need many training images to generalize well. As a result, they have often been developed for individual species that meet this threshold. These single-species methods might underperform, as they ignore potential similarities in identifying characteristics and the photo–identification process among species.In this paper, we introduce a multi-species photo–identification model based on a state-of-the-art method in human facial recognition, the ArcFace classification head. Our model uses two such heads to jointly classify species and identities, allowing species to share information and parameters within the network. As a demonstration, we trained this model with 50,796 images from 39 catalogues of 24 cetacean species, evaluating its predictive performance on 21,192 test images from the same catalogues. We further evaluated its predictive performance with two external catalogues entirely composed of identities that the model did not see during training.The model achieved a mean average precision (MAP) of 0.869 on the test set. Of these, 10 catalogues representing seven species achieved a MAP score over 0.95. For some species, there was notable variation in performance among catalogues, largely explained by variation in photo quality. Finally, the model appeared to generalize well, with the two external catalogues scoring similarly to their species' counterparts in the larger test set.From our cetacean application, we provide a list of recommendations for potential users of this model, focusing on those with cetacean photo–identification catalogues. For example, users with high quality images of animals identified by dorsal nicks and notches should expect near optimal performance. Users can expect decreasing performance for catalogues with higher proportions of indistinct individuals or poor quality photos. Finally, we note that this model is currently freely available as code in a GitHub repository and as a graphical user interface, with additional functionality for collaborative data management, via Happywhale.com.",

keywords = "artificial intelligence, cetacean, computer vision, convolutional neural network, deep learning, dolphin, dorsal, lateral, machine learning, multi–species, photo–identification, whale",

author = "Patton, {Philip T.} and Ted Cheeseman and Kenshin Abe and Taiki Yamaguchi and Walter Reade and Ken Southerland and Addison Howard and Oleson, {Erin M.} and Allen, {Jason B.} and Erin Ashe and Aline Athayde and Baird, {Robin W.} and Charla Basran and Elsa Cabrera and John Calambokidis and J{\'u}lio Cardoso and Carroll, {Emma L.} and Amina Cesario and Cheney, {Barbara J.} and Enrico Corsi and Jens Currie and Durban, {John W.} and Falcone, {Erin A.} and Holly Fearnbach and Kiirsten Flynn and Trish Franklin and Wally Franklin and B{\'a}rbara Galletti Vernazzani and Tilen Genov and Marie Hill and Johnston, {David R.} and Keene, {Erin L.} and Mahaffy, {Sabre D.} and McGuire, {Tamara L.} and Liah McPherson and Catherine Meyer and Robert Michaud and Anastasia Miliou and Orbach, {Dara N.} and Pearson, {Heidi C.} and Rasmussen, {Marianne H.} and Rayment, {William J.} and Caroline Rinaldi and Renato Rinaldi and Salvatore Siciliano and Stephanie Stack and Beatriz Tintore and Torres, {Leigh G.} and Towers, {Jared R.} and Cameron Trotter and Reny Tyson Moore and Weir, {Caroline R.} and Rebecca Wellard and Randall Wells and Yano, {Kymberly M.} and Zaeschmar, {Jochen R.} and Lars Bejder",

note = "We thank the countless individuals who collected and/or processed the nearly 85,000 images used in this study and those who assisted, particularly those who sorted these images from the millions that did not end up in the catalogues. Additionally, we thank the other Kaggle competitors who helped develop the ideas, models and data used here, particularly those who released their datasets to the public. The graduate assistantship for Philip T. Patton was funded by the NOAA Fisheries QUEST Fellowship. This paper represents HIMB and SOEST contribution numbers 1932 and 11679, respectively. The technical support and advanced computing resources from University of Hawaii Information Technology Services—Cyberinfrastructure, funded in part by the National Science Foundation CC* awards # 2201428 and # 2232862 are gratefully acknowledged. Every photo–identification image was collected under permits according to relevant national guidelines, regulation and legislation.",

year = "2023",

month = oct,

doi = "10.1111/2041-210X.14167",

language = "English",

volume = "14",

pages = "2611--2625",

journal = "Methods in Ecology and Evolution",

issn = "2041-210X",

publisher = "WILEY-BLACKWELL",

number = "10",

}

TY - JOUR

T1 - A deep learning approach to photo–identification demonstrates high performance on two dozen cetacean species

AU - Patton, Philip T.

AU - Cheeseman, Ted

AU - Abe, Kenshin

AU - Yamaguchi, Taiki

AU - Reade, Walter

AU - Southerland, Ken

AU - Howard, Addison

AU - Oleson, Erin M.

AU - Allen, Jason B.

AU - Ashe, Erin

AU - Athayde, Aline

AU - Baird, Robin W.

AU - Basran, Charla

AU - Cabrera, Elsa

AU - Calambokidis, John

AU - Cardoso, Júlio

AU - Carroll, Emma L.

AU - Cesario, Amina

AU - Cheney, Barbara J.

AU - Corsi, Enrico

AU - Currie, Jens

AU - Durban, John W.

AU - Falcone, Erin A.

AU - Fearnbach, Holly

AU - Flynn, Kiirsten

AU - Franklin, Trish

AU - Franklin, Wally

AU - Galletti Vernazzani, Bárbara

AU - Genov, Tilen

AU - Hill, Marie

AU - Johnston, David R.

AU - Keene, Erin L.

AU - Mahaffy, Sabre D.

AU - McGuire, Tamara L.

AU - McPherson, Liah

AU - Meyer, Catherine

AU - Michaud, Robert

AU - Miliou, Anastasia

AU - Orbach, Dara N.

AU - Pearson, Heidi C.

AU - Rasmussen, Marianne H.

AU - Rayment, William J.

AU - Rinaldi, Caroline

AU - Rinaldi, Renato

AU - Siciliano, Salvatore

AU - Stack, Stephanie

AU - Tintore, Beatriz

AU - Torres, Leigh G.

AU - Towers, Jared R.

AU - Trotter, Cameron

AU - Tyson Moore, Reny

AU - Weir, Caroline R.

AU - Wellard, Rebecca

AU - Wells, Randall

AU - Yano, Kymberly M.

AU - Zaeschmar, Jochen R.

AU - Bejder, Lars

N1 - We thank the countless individuals who collected and/or processed the nearly 85,000 images used in this study and those who assisted, particularly those who sorted these images from the millions that did not end up in the catalogues. Additionally, we thank the other Kaggle competitors who helped develop the ideas, models and data used here, particularly those who released their datasets to the public. The graduate assistantship for Philip T. Patton was funded by the NOAA Fisheries QUEST Fellowship. This paper represents HIMB and SOEST contribution numbers 1932 and 11679, respectively. The technical support and advanced computing resources from University of Hawaii Information Technology Services—Cyberinfrastructure, funded in part by the National Science Foundation CC* awards # 2201428 and # 2232862 are gratefully acknowledged. Every photo–identification image was collected under permits according to relevant national guidelines, regulation and legislation.

PY - 2023/10

Y1 - 2023/10

N2 - Researchers can investigate many aspects of animal ecology through noninvasive photo–identification. Photo–identification is becoming more efficient as matching individuals between photos is increasingly automated. However, the convolutional neural network models that have facilitated this change need many training images to generalize well. As a result, they have often been developed for individual species that meet this threshold. These single-species methods might underperform, as they ignore potential similarities in identifying characteristics and the photo–identification process among species.In this paper, we introduce a multi-species photo–identification model based on a state-of-the-art method in human facial recognition, the ArcFace classification head. Our model uses two such heads to jointly classify species and identities, allowing species to share information and parameters within the network. As a demonstration, we trained this model with 50,796 images from 39 catalogues of 24 cetacean species, evaluating its predictive performance on 21,192 test images from the same catalogues. We further evaluated its predictive performance with two external catalogues entirely composed of identities that the model did not see during training.The model achieved a mean average precision (MAP) of 0.869 on the test set. Of these, 10 catalogues representing seven species achieved a MAP score over 0.95. For some species, there was notable variation in performance among catalogues, largely explained by variation in photo quality. Finally, the model appeared to generalize well, with the two external catalogues scoring similarly to their species' counterparts in the larger test set.From our cetacean application, we provide a list of recommendations for potential users of this model, focusing on those with cetacean photo–identification catalogues. For example, users with high quality images of animals identified by dorsal nicks and notches should expect near optimal performance. Users can expect decreasing performance for catalogues with higher proportions of indistinct individuals or poor quality photos. Finally, we note that this model is currently freely available as code in a GitHub repository and as a graphical user interface, with additional functionality for collaborative data management, via Happywhale.com.

AB - Researchers can investigate many aspects of animal ecology through noninvasive photo–identification. Photo–identification is becoming more efficient as matching individuals between photos is increasingly automated. However, the convolutional neural network models that have facilitated this change need many training images to generalize well. As a result, they have often been developed for individual species that meet this threshold. These single-species methods might underperform, as they ignore potential similarities in identifying characteristics and the photo–identification process among species.In this paper, we introduce a multi-species photo–identification model based on a state-of-the-art method in human facial recognition, the ArcFace classification head. Our model uses two such heads to jointly classify species and identities, allowing species to share information and parameters within the network. As a demonstration, we trained this model with 50,796 images from 39 catalogues of 24 cetacean species, evaluating its predictive performance on 21,192 test images from the same catalogues. We further evaluated its predictive performance with two external catalogues entirely composed of identities that the model did not see during training.The model achieved a mean average precision (MAP) of 0.869 on the test set. Of these, 10 catalogues representing seven species achieved a MAP score over 0.95. For some species, there was notable variation in performance among catalogues, largely explained by variation in photo quality. Finally, the model appeared to generalize well, with the two external catalogues scoring similarly to their species' counterparts in the larger test set.From our cetacean application, we provide a list of recommendations for potential users of this model, focusing on those with cetacean photo–identification catalogues. For example, users with high quality images of animals identified by dorsal nicks and notches should expect near optimal performance. Users can expect decreasing performance for catalogues with higher proportions of indistinct individuals or poor quality photos. Finally, we note that this model is currently freely available as code in a GitHub repository and as a graphical user interface, with additional functionality for collaborative data management, via Happywhale.com.

KW - artificial intelligence

KW - cetacean

KW - computer vision

KW - convolutional neural network

KW - deep learning

KW - dolphin

KW - dorsal

KW - lateral

KW - machine learning

KW - multi–species

KW - photo–identification

KW - whale

UR - http://www.scopus.com/inward/record.url?scp=85164831825&partnerID=8YFLogxK

U2 - 10.1111/2041-210X.14167

DO - 10.1111/2041-210X.14167

M3 - Article

SN - 2041-210X

VL - 14

SP - 2611

EP - 2625

JO - Methods in Ecology and Evolution

JF - Methods in Ecology and Evolution

IS - 10

ER -

A deep learning approach to photo–identification demonstrates high performance on two dozen cetacean species

Abstract

Bibliographical note

Data Availability Statement

Keywords

Access to Document

Other files and links

Fingerprint

Cite this