Let’s Agree to Disagree: Learning Highly Debatable Multirater Labelling

Sudre, Carole H.; Anson, Beatriz Gomez; Ingala, Silvia; Lane, Chris D.; Jimenez, Daniel; Haider, Lukas; Varsavsky, Thomas; Tanno, Ryutaro; Smith, Lorna; Ourselin, Sébastien; Jäger, Rolf H.; Cardoso, M. Jorge

doi:10.1007/978-3-030-32251-9_73

Carole H. Sudre^16,17,18,
Beatriz Gomez Anson¹⁹,
Silvia Ingala²⁰,
Chris D. Lane¹⁷,
Daniel Jimenez¹⁷,
Lukas Haider²¹,
Thomas Varsavsky^16,18,
Ryutaro Tanno¹⁸,
Lorna Smith²²,
Sébastien Ourselin¹⁶,
Rolf H. Jäger²³ &
…
M. Jorge Cardoso^16,17,18

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11767))

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

9031 Accesses
6 Citations
1 Altmetric

Abstract

Classification and differentiation of small pathological objects may greatly vary among human raters due to differences in training, expertise and their consistency over time. In a radiological setting, objects commonly have high within-class appearance variability whilst sharing certain characteristics across different classes, making their distinction even more difficult. As an example, markers of cerebral small vessel disease, such as enlarged perivascular spaces (EPVS) and lacunes, can be very varied in their appearance while exhibiting high inter-class similarity, making this task highly challenging for human raters. In this work, we investigate joint models of individual rater behaviour and multi-rater consensus in a deep learning setting, and apply it to a brain lesion object-detection task. Results show that jointly modelling both individual and consensus estimates leads to significant improvements in performance when compared to directly predicting consensus labels, while also allowing the characterization of human-rater consistency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Deep learning from multiple experts improves identification of amyloid neuropathologies

Article Open access 28 April 2022

Learning to Segment When Experts Disagree

On the Effect of Inter-observer Variability for a Reliable Estimation of Uncertainty of Medical Image Segmentation

Notes

1.
http://www.itksnap.org/pmwiki/pmwiki.php?n=Main.HomePage.

References

Bouguelia, M.R., Nowaczyk, S., Santosh, K.C., Verikas, A.: Agreeing to disagree: active learning with noisy labels without crowdsourcing. Int. J. Mach. Learn. Cybern. 9(8), 1307–1319 (2018)
Article Google Scholar
Wang, H., Suh, J.W., Das, S.R., Pluta, J.B., Craige, C., Yushkevich, P.A.: Multi-atlas segmentation with joint label fusion. IEEE TPAMI 35(3), 611–623 (2013). https://doi.org/10.1109/TPAMI.2012.143
Jiang, L., Zhou, Z., Leung, T., Li, L.J., Fei-Fei, L.: MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels, December 2017. http://arxiv.org/abs/1712.05055
Li, Y., Yang, J., Song, Y., Cao, L., Luo, J., Li, L.J.: Learning From Noisy Labels With Distillation (2017)
Google Scholar
Ramirez, J., Berezuk, C., McNeely, A.A., Gao, F., McLaurin, J., Black, S.E.: Imaging the perivascular space as a potential biomarker of neurovascular and neurodegenerative diseases. Cell. Mol. Neurobiol. 36(2), 289–299 (2016)
Article Google Scholar
Sudre, C., et al.: 3D multirater RCNN for multimodal multiclass detection and characterization of extremely small objects. In: Proceedings of the 2nd International MIDL Conference on Proceedings of Machine Learning Research, vol. 102, pp. 447–456. PMLR, London, United Kingdom, 08–10 Jul 2019
Google Scholar
Tanno, R., Saeedi, A., Sankaranarayanan, S., Alexander, D.C., Silberman, N.: Learning from noisy labels by regularized estimation of annotator confusion. In: Conference on Computer Vision and Pattern Recognition (2019)
Google Scholar
Tillin, T., Forouhi, N.G., McKeigue, P.M., Chatuverdi, N.F.T.S., Chaturvedi, N.: Southall and Brent REvisited: cohort profile of SABRE, a UK population-based comparison of cardiovascular disease and diabetes in people of European, Indian Asian and African Caribbean origins. Int. J. Epidemiol. 41(1), 33–42 (2012). https://doi.org/10.1093/ije/dyq175
Wardlaw, J.M., et al.: Neuroimaging standards for research into small vessel disease and its contribution to ageing and neurodegeneration. Lancet Neurol. 12, 822–838 (2013)
Article Google Scholar
Warfield, S.K., Zou, K.H., Wells, W.M.: Simultaneous truth and performance level estimation (STAPLE): an algorithm for the validation of image segmentation. IEEE TMI 23(7), 903–921 (2004)
Google Scholar

Download references

Acknowledgments

We are extremely grateful to all the participants of the SABRE study, and past and present members of the SABRE team. This work was supported by an Alzheimer’s Society Junior Fellowship (AS-JF-17-011), the Wellcome/EPSRC Centre for Medical Engineering [WT 203148/Z/16/Z], IMI2 grant AMYPAD [115952], the MSCA-ITN-Demo [721820], and the Wellcome Flagship Programme in High-Dimensional Neurology. The SABRE study was funded at baseline by the Medical Research Council, Diabetes UK, and the British Heart Foundation. At follow-up, the study was funded by the Wellcome Trust (067100, 37055891 and 086676/7/08/Z), the British Heart Foundation (PG/06/145, PG/08/103/26133, PG/12/29/29497 and CS/13/1/30327) and Diabetes UK (13/0004774). We gratefully acknowledge NVIDIA corporation for the donation of a GPU Tesla K40 that was used in the preparation of this work.

Author information

Authors and Affiliations

School of Biomedical Engineering and Imaging Sciences, KCL, London, UK
Carole H. Sudre, Thomas Varsavsky, Sébastien Ourselin & M. Jorge Cardoso
Dementia Research Centre, UCL Institute of Neurology, London, UK
Carole H. Sudre, Chris D. Lane, Daniel Jimenez & M. Jorge Cardoso
Department of Medical Physics and Biomedical Engineering, UCL, London, UK
Carole H. Sudre, Thomas Varsavsky, Ryutaro Tanno & M. Jorge Cardoso
Santa Creu i Sant Pau Hospital, Universitat Autonòma de Barcelona, Barcelona, Spain
Beatriz Gomez Anson
Vrije University Medical Centre Amsterdam, Amsterdam, The Netherlands
Silvia Ingala
Queen Square Multiple Sclerosis Centre, UCL Institute of Neurology, London, UK
Lukas Haider
Cardiometabolic Phenotyping Group, Institute of Cardiovascular Science, UCL, London, UK
Lorna Smith
Brain Repair and Rehabilitation Group, Institute of Neurology, UCL, London, UK
Rolf H. Jäger

Authors

Carole H. Sudre
View author publications
You can also search for this author in PubMed Google Scholar
Beatriz Gomez Anson
View author publications
You can also search for this author in PubMed Google Scholar
Silvia Ingala
View author publications
You can also search for this author in PubMed Google Scholar
Chris D. Lane
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Jimenez
View author publications
You can also search for this author in PubMed Google Scholar
Lukas Haider
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Varsavsky
View author publications
You can also search for this author in PubMed Google Scholar
Ryutaro Tanno
View author publications
You can also search for this author in PubMed Google Scholar
Lorna Smith
View author publications
You can also search for this author in PubMed Google Scholar
Sébastien Ourselin
View author publications
You can also search for this author in PubMed Google Scholar
Rolf H. Jäger
View author publications
You can also search for this author in PubMed Google Scholar
M. Jorge Cardoso
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Carole H. Sudre .

Editor information

Editors and Affiliations

University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Dinggang Shen
University of Georgia, Athens, GA, USA
Tianming Liu
Western University, London, ON, Canada
Terry M. Peters
Yale University, New Haven, CT, USA
Lawrence H. Staib
University of Strasbourg, Illkirch, France
Caroline Essert
United Imaging Intelligence, Shanghai, China
Sean Zhou
University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Pew-Thian Yap
Western University, London, ON, Canada
Ali Khan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sudre, C.H. et al. (2019). Let’s Agree to Disagree: Learning Highly Debatable Multirater Labelling. In: Shen, D., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2019. MICCAI 2019. Lecture Notes in Computer Science(), vol 11767. Springer, Cham. https://doi.org/10.1007/978-3-030-32251-9_73

Download citation

DOI: https://doi.org/10.1007/978-3-030-32251-9_73
Published: 10 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32250-2
Online ISBN: 978-3-030-32251-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)

Let’s Agree to Disagree: Learning Highly Debatable Multirater Labelling

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Deep learning from multiple experts improves identification of amyloid neuropathologies

Learning to Segment When Experts Disagree

On the Effect of Inter-observer Variability for a Reliable Estimation of Uncertainty of Medical Image Segmentation

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Subscribe and save

Buy Now

Navigation

Let’s Agree to Disagree: Learning Highly Debatable Multirater Labelling

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Deep learning from multiple experts improves identification of amyloid neuropathologies

Learning to Segment When Experts Disagree

On the Effect of Inter-observer Variability for a Reliable Estimation of Uncertainty of Medical Image Segmentation

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation