Hyperspherically Regularized Networks for Self-Supervision

Aiden Durrant; Georgios Leontidis

doi:10.1016/j.imavis.2022.104494

Hyperspherically Regularized Networks for Self-Supervision

Aiden Durrant^* (Corresponding Author), Georgios Leontidis

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

4 Downloads (Pure)

Abstract

Bootstrap Your Own Latent (BYOL) introduced an approach to self-supervised learning avoiding the contrastive paradigm and subsequently removing the com- putational burden of negative sampling associated with such methods. However, we empirically find that the image representations produced under the BYOL’s self-distillation paradigm are poorly distributed in representation space compared to contrastive methods. This work empirically demonstrates that feature diver- sity enforced by contrastive losses is beneficial to image representation uniformity when employed in BYOL, and as such, provides greater inter-class representa- tion separability. Additionally, we explore and advocate the use of regularization methods, specifically the layer-wise minimization of hyperspherical energy (i.e. maximization of entropy) of network weights to encourage representation unifor- mity. We show that directly optimizing a measure of uniformity alongside the standard loss, or regularizing the networks of the BYOL architecture to minimize the hyperspherical energy of neurons can produce more uniformly distributed and therefore better performing representations for downstream tasks.

Original language	English
Article number	104494
Journal	Image and Vision Computing
Volume	124
DOIs	https://doi.org/10.1016/j.imavis.2022.104494
Publication status	Published - 1 Jul 2022

Bibliographical note

Acknowledgments
This work used the Cirrus UK National Tier-2 HPC Service at EPCC (http://www.cirrus.ac.uk). Access granted through the project: ec173 - Next gen self-supervised learn- ing systems for vision tasks.
Open access via Elsevier Agreement

Keywords

Self-supervised learning
Representation learning
Representation separability
Image classification

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1016/j.imavis.2022.104494Licence: CC BY

Durrant_final versionFinal published version, 331 KBLicence: CC BY

"Maxwell" HPC for Research
Katie Wilde (Manager) & Andrew Phillips (Manager)
Research Facilities: Facility

Cite this

@article{4fcbd72231b4494883246e908132efe2,

title = "Hyperspherically Regularized Networks for Self-Supervision",

abstract = "Bootstrap Your Own Latent (BYOL) introduced an approach to self-supervised learning avoiding the contrastive paradigm and subsequently removing the com- putational burden of negative sampling associated with such methods. However, we empirically find that the image representations produced under the BYOL{\textquoteright}s self-distillation paradigm are poorly distributed in representation space compared to contrastive methods. This work empirically demonstrates that feature diver- sity enforced by contrastive losses is beneficial to image representation uniformity when employed in BYOL, and as such, provides greater inter-class representa- tion separability. Additionally, we explore and advocate the use of regularization methods, specifically the layer-wise minimization of hyperspherical energy (i.e. maximization of entropy) of network weights to encourage representation unifor- mity. We show that directly optimizing a measure of uniformity alongside the standard loss, or regularizing the networks of the BYOL architecture to minimize the hyperspherical energy of neurons can produce more uniformly distributed and therefore better performing representations for downstream tasks.",

keywords = "Self-supervised learning, Representation learning, Representation separability, Image classification",

author = "Aiden Durrant and Georgios Leontidis",

note = "Acknowledgments This work used the Cirrus UK National Tier-2 HPC Service at EPCC (http://www.cirrus.ac.uk). Access granted through the project: ec173 - Next gen self-supervised learn- ing systems for vision tasks. Open access via Elsevier Agreement",

year = "2022",

month = jul,

day = "1",

doi = "10.1016/j.imavis.2022.104494",

language = "English",

volume = "124",

journal = "Image and Vision Computing",

issn = "0262-8856",

publisher = "Elsevier Limited",

}

TY - JOUR

T1 - Hyperspherically Regularized Networks for Self-Supervision

AU - Durrant, Aiden

AU - Leontidis, Georgios

N1 - Acknowledgments This work used the Cirrus UK National Tier-2 HPC Service at EPCC (http://www.cirrus.ac.uk). Access granted through the project: ec173 - Next gen self-supervised learn- ing systems for vision tasks. Open access via Elsevier Agreement

PY - 2022/7/1

Y1 - 2022/7/1

N2 - Bootstrap Your Own Latent (BYOL) introduced an approach to self-supervised learning avoiding the contrastive paradigm and subsequently removing the com- putational burden of negative sampling associated with such methods. However, we empirically find that the image representations produced under the BYOL’s self-distillation paradigm are poorly distributed in representation space compared to contrastive methods. This work empirically demonstrates that feature diver- sity enforced by contrastive losses is beneficial to image representation uniformity when employed in BYOL, and as such, provides greater inter-class representa- tion separability. Additionally, we explore and advocate the use of regularization methods, specifically the layer-wise minimization of hyperspherical energy (i.e. maximization of entropy) of network weights to encourage representation unifor- mity. We show that directly optimizing a measure of uniformity alongside the standard loss, or regularizing the networks of the BYOL architecture to minimize the hyperspherical energy of neurons can produce more uniformly distributed and therefore better performing representations for downstream tasks.

AB - Bootstrap Your Own Latent (BYOL) introduced an approach to self-supervised learning avoiding the contrastive paradigm and subsequently removing the com- putational burden of negative sampling associated with such methods. However, we empirically find that the image representations produced under the BYOL’s self-distillation paradigm are poorly distributed in representation space compared to contrastive methods. This work empirically demonstrates that feature diver- sity enforced by contrastive losses is beneficial to image representation uniformity when employed in BYOL, and as such, provides greater inter-class representa- tion separability. Additionally, we explore and advocate the use of regularization methods, specifically the layer-wise minimization of hyperspherical energy (i.e. maximization of entropy) of network weights to encourage representation unifor- mity. We show that directly optimizing a measure of uniformity alongside the standard loss, or regularizing the networks of the BYOL architecture to minimize the hyperspherical energy of neurons can produce more uniformly distributed and therefore better performing representations for downstream tasks.

KW - Self-supervised learning

KW - Representation learning

KW - Representation separability

KW - Image classification

U2 - 10.1016/j.imavis.2022.104494

DO - 10.1016/j.imavis.2022.104494

M3 - Article

SN - 0262-8856

VL - 124

JO - Image and Vision Computing

JF - Image and Vision Computing

M1 - 104494

ER -

Hyperspherically Regularized Networks for Self-Supervision

Abstract

Bibliographical note

Keywords

UN SDGs

Access to Document

Fingerprint

Equipment

"Maxwell" HPC for Research

Cite this