UNet-eVAE: Iterative Refinement Using VAE Embodied Learning for Endoscopic Image Segmentation

Soumya Gupta; Sharib Ali; Xu Zhang; Binod Bhattarai; Jens Rittscher

doi:10.1007/978-3-031-21014-3_17

Soumya Gupta^* (Corresponding Author), Sharib Ali, Xu Zhang, Binod Bhattarai, Jens Rittscher

^*Corresponding author for this work

Computing Science

Research output: Chapter in Book/Report/Conference proceeding › Published conference contribution

Abstract

While endoscopy is routinely used for surveillance, its high operator dependence calls for robust automated image analysis methods. Automated segmentation of regions of interest (ROIs), including lesions, inflammation, and instruments, can help mitigate this operator dependence. Most supervised methods are developed by fitting models to the available ground truth masks only. This work proposes a joint training approach that couples a UNet with a variational auto-encoder (VAE) to improve endoscopic image segmentation by exploiting original samples, predicted masks and ground truth masks. In the proposed UNet-eVAE, the VAE uses the masks to constrain ROI-specific feature representations, with reconstruction as an auxiliary task. The fine-grained spatial information from the VAE is fused with the UNet decoder to enrich the feature representations and improve segmentation performance. Experimental results on both colonoscopy and ureteroscopy datasets demonstrate that the proposed architecture can learn robust representations, generalise to unseen samples, and improve on the baseline.
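The joint training idea in the abstract can be illustrated with a small sketch of a combined objective: a segmentation term for the UNet output plus a mask-reconstruction term and a KL term for the VAE. This is only a sketch under stated assumptions. The abstract does not give the loss functions or their weights, so the soft Dice loss, the binary cross-entropy reconstruction term, and the hypothetical weights `lam_rec` and `beta` are illustrative choices, not the authors' formulation. Masks are represented as flat lists of values to keep the example dependency-free.

```python
import math


def dice_loss(pred, target, eps=1e-6):
    """Soft Dice loss between predicted probabilities and a binary mask."""
    inter = sum(p * t for p, t in zip(pred, target))
    denom = sum(pred) + sum(target)
    return 1.0 - (2.0 * inter + eps) / (denom + eps)


def bce_loss(recon, target, eps=1e-7):
    """Mean binary cross-entropy, used here as the VAE reconstruction term."""
    total = 0.0
    for r, t in zip(recon, target):
        r = min(max(r, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= t * math.log(r) + (1.0 - t) * math.log(1.0 - r)
    return total / len(target)


def kl_divergence(mu, logvar):
    """KL(N(mu, exp(logvar)) || N(0, I)) for a diagonal Gaussian posterior."""
    return -0.5 * sum(1.0 + lv - m * m - math.exp(lv) for m, lv in zip(mu, logvar))


def joint_loss(seg_pred, mask, recon, mu, logvar, lam_rec=1.0, beta=0.1):
    """Assumed joint objective: segmentation + weighted VAE terms.

    lam_rec and beta are hypothetical weights chosen for illustration.
    """
    seg = dice_loss(seg_pred, mask)   # UNet segmentation term
    rec = bce_loss(recon, mask)       # auxiliary mask reconstruction term
    kl = kl_divergence(mu, logvar)    # latent regulariser
    return seg + lam_rec * rec + beta * kl


if __name__ == "__main__":
    mask = [1.0, 1.0, 0.0, 0.0]
    seg_pred = [0.9, 0.8, 0.2, 0.1]
    recon = [0.95, 0.9, 0.1, 0.05]
    print(joint_loss(seg_pred, mask, recon, mu=[0.1, -0.2], logvar=[0.0, 0.0]))
```

In a real model the three terms would be computed on tensors and backpropagated through both networks at once, which is what makes the training joint rather than sequential.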

Original language	English
Title of host publication	Machine Learning in Medical Imaging
Subtitle of host publication	13th International Workshop, MLMI 2022, Held in Conjunction with MICCAI 2022, Singapore, September 18, 2022, Proceedings
Publisher	Springer Nature
Pages	161-170
Number of pages	10
ISBN (Electronic)	978-3-031-21014-3
ISBN (Print)	978-3-031-21013-6
DOIs	https://doi.org/10.1007/978-3-031-21014-3_17
Publication status	Published - 16 Dec 2022

Publication series

Name	Lecture Notes in Computer Science
Publisher	Springer Nature
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Bibliographical note

We would like to thank Boston Scientific for funding this project (Grant No: DFR04690). SG and BT are funded by BSC, BB is funded by EndoMapper Horizon 2020 FET (GA 863146), SA and JR were supported by the NIHR Oxford Biomedical Research Centre.

Access to Document

10.1007/978-3-031-21014-3_17
Licence: Unspecified

Cite this

Gupta, S., Ali, S., Zhang, X., Bhattarai, B., & Rittscher, J. (2022). UNet-eVAE: Iterative Refinement Using VAE Embodied Learning for Endoscopic Image Segmentation. In Machine Learning in Medical Imaging: 13th International Workshop, MLMI 2022, Held in Conjunction with MICCAI 2022, Singapore, September 18, 2022, Proceedings (pp. 161-170). (Lecture Notes in Computer Science). Springer Nature. https://doi.org/10.1007/978-3-031-21014-3_17

UNet-eVAE: Iterative Refinement Using VAE Embodied Learning for Endoscopic Image Segmentation. / Gupta, Soumya (Corresponding Author); Ali, Sharib; Zhang, Xu et al.
Machine Learning in Medical Imaging: 13th International Workshop, MLMI 2022, Held in Conjunction with MICCAI 2022, Singapore, September 18, 2022, Proceedings. Springer Nature, 2022. p. 161-170 (Lecture Notes in Computer Science).

Gupta, S, Ali, S, Zhang, X, Bhattarai, B & Rittscher, J 2022, UNet-eVAE: Iterative Refinement Using VAE Embodied Learning for Endoscopic Image Segmentation. in Machine Learning in Medical Imaging: 13th International Workshop, MLMI 2022, Held in Conjunction with MICCAI 2022, Singapore, September 18, 2022, Proceedings. Lecture Notes in Computer Science, Springer Nature, pp. 161-170. https://doi.org/10.1007/978-3-031-21014-3_17

Gupta S, Ali S, Zhang X, Bhattarai B, Rittscher J. UNet-eVAE: Iterative Refinement Using VAE Embodied Learning for Endoscopic Image Segmentation. In Machine Learning in Medical Imaging: 13th International Workshop, MLMI 2022, Held in Conjunction with MICCAI 2022, Singapore, September 18, 2022, Proceedings. Springer Nature. 2022. p. 161-170. (Lecture Notes in Computer Science). doi: 10.1007/978-3-031-21014-3_17

Gupta, Soumya; Ali, Sharib; Zhang, Xu et al. / UNet-eVAE: Iterative Refinement Using VAE Embodied Learning for Endoscopic Image Segmentation. Machine Learning in Medical Imaging: 13th International Workshop, MLMI 2022, Held in Conjunction with MICCAI 2022, Singapore, September 18, 2022, Proceedings. Springer Nature, 2022. pp. 161-170 (Lecture Notes in Computer Science).

@inproceedings{c7bcc0f5423448c7987b2904af1b3304,
  title = "UNet-eVAE: Iterative Refinement Using VAE Embodied Learning for Endoscopic Image Segmentation",
  abstract = "While endoscopy is routinely used for surveillance, high operator dependence demands robust automated image analysis methods. Automated segmentation of region-of-interest (ROI) that includes lesions, inflammations, and instruments can serve to cope with the operator dependence problem in this field. Most supervised methods are developed by fitting models on the available ground truth mask samples only. This work proposes a joint training approach using the UNet coupled with a variational auto-encoder (VAE) to improve endoscopic image segmentation by exploiting original samples, predicted masks and ground truth masks. In the proposed UNet-eVAE, VAE utilises the masks to constrain ROI-specific feature representations for reconstruction as an auxiliary task. The fine-grained spatial information from VAE is fused with the UNet decoder to enrich the feature representations and improve segmentation performance. Our experimental results on both colonoscopy and ureteroscopy datasets demonstrate that the proposed architecture can learn robust representations and generalise segmentation performance on unseen samples while improving the baseline.",
  author = "Soumya Gupta and Sharib Ali and Xu Zhang and Binod Bhattarai and Jens Rittscher",
  note = "We would like to thank Boston Scientific for funding this project (Grant No: DFR04690). SG and BT are funded by BSC, BB is funded by EndoMapper Horizon 2020 FET (GA 863146), SA and JR were supported by the NIHR Oxford Biomedical Research Centre.",
  year = "2022",
  month = dec,
  day = "16",
  doi = "10.1007/978-3-031-21014-3_17",
  language = "English",
  isbn = "978-3-031-21013-6",
  series = "Lecture Notes in Computer Science",
  publisher = "Springer Nature",
  pages = "161--170",
  booktitle = "Machine Learning in Medical Imaging",
  address = "United States",
}

TY  - GEN
T1  - UNet-eVAE
T2  - Iterative Refinement Using VAE Embodied Learning for Endoscopic Image Segmentation
AU  - Gupta, Soumya
AU  - Ali, Sharib
AU  - Zhang, Xu
AU  - Bhattarai, Binod
AU  - Rittscher, Jens
N1  - We would like to thank Boston Scientific for funding this project (Grant No: DFR04690). SG and BT are funded by BSC, BB is funded by EndoMapper Horizon 2020 FET (GA 863146), SA and JR were supported by the NIHR Oxford Biomedical Research Centre.
PY  - 2022/12/16
Y1  - 2022/12/16
N2  - While endoscopy is routinely used for surveillance, high operator dependence demands robust automated image analysis methods. Automated segmentation of region-of-interest (ROI) that includes lesions, inflammations, and instruments can serve to cope with the operator dependence problem in this field. Most supervised methods are developed by fitting models on the available ground truth mask samples only. This work proposes a joint training approach using the UNet coupled with a variational auto-encoder (VAE) to improve endoscopic image segmentation by exploiting original samples, predicted masks and ground truth masks. In the proposed UNet-eVAE, VAE utilises the masks to constrain ROI-specific feature representations for reconstruction as an auxiliary task. The fine-grained spatial information from VAE is fused with the UNet decoder to enrich the feature representations and improve segmentation performance. Our experimental results on both colonoscopy and ureteroscopy datasets demonstrate that the proposed architecture can learn robust representations and generalise segmentation performance on unseen samples while improving the baseline.
AB  - While endoscopy is routinely used for surveillance, high operator dependence demands robust automated image analysis methods. Automated segmentation of region-of-interest (ROI) that includes lesions, inflammations, and instruments can serve to cope with the operator dependence problem in this field. Most supervised methods are developed by fitting models on the available ground truth mask samples only. This work proposes a joint training approach using the UNet coupled with a variational auto-encoder (VAE) to improve endoscopic image segmentation by exploiting original samples, predicted masks and ground truth masks. In the proposed UNet-eVAE, VAE utilises the masks to constrain ROI-specific feature representations for reconstruction as an auxiliary task. The fine-grained spatial information from VAE is fused with the UNet decoder to enrich the feature representations and improve segmentation performance. Our experimental results on both colonoscopy and ureteroscopy datasets demonstrate that the proposed architecture can learn robust representations and generalise segmentation performance on unseen samples while improving the baseline.
U2  - 10.1007/978-3-031-21014-3_17
DO  - 10.1007/978-3-031-21014-3_17
M3  - Published conference contribution
SN  - 978-3-031-21013-6
T3  - Lecture Notes in Computer Science
SP  - 161
EP  - 170
BT  - Machine Learning in Medical Imaging
PB  - Springer Nature
ER  -
