CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection

Nwoye, Chinedu Innocent; Yu, Tong; Sharma, Saurav; Murali, Aditya; Alapatt, Deepak; Vardazaryan, Armine; Yuan, Kun; Hajek, Jonas; Reiter, Wolfgang; Yamlahi, Amine; Smidt, Finn-Henri; Zou, Xiaoyang; Zheng, Guoyan; Oliveira, Bruno; Torres, Helena R.; Kondo, Satoshi; Kasai, Satoshi; Holm, Felix; Özsoy, Ege; Gui, Shuangchun; Li, Han; Raviteja, Sista; Sathish, Rachana; Poudel, Pranav; Bhattarai, Binod; Wang, Ziheng; Rui, Guo; Schellenberg, Melanie; Vilaça, João L.; Czempiel, Tobias; Wang, Zhenkun; Sheet, Debdoot; Thapa, Shrawan Kumar; Berniker, Max; Godau, Patrick; Morais, Pedro; Regmi, Sudarshan; Tran, Thuy Nuong; Fonseca, Jaime; Nölke, Jan-Hinrich; Lima, Estevão; Vazquez, Eduard; Maier-Hein, Lena; Navab, Nassir; Mascagni, Pietro; Seeliger, Barbara; Gonzalez, Cristians; Mutter, Didier; Padoy, Nicolas

doi:10.1016/j.media.2023.102888

arXiv:2302.06294 (eess)

[Submitted on 13 Feb 2023 (v1), last revised 14 Jul 2023 (this version, v2)]

Abstract: Formalizing surgical activities as triplets of the instruments used, the actions performed, and the target anatomies is becoming a gold standard approach for surgical activity modeling. This formalization yields a more detailed understanding of tool-tissue interaction, which can be used to develop better Artificial Intelligence assistance for image-guided surgery. Earlier efforts and the CholecTriplet challenge introduced in 2021 have put together techniques aimed at recognizing these triplets from surgical footage. Estimating the spatial locations of the triplets as well would offer more precise intraoperative context-aware decision support for computer-assisted intervention. This paper presents the CholecTriplet2022 challenge, which extends surgical action triplet modeling from recognition to detection. It includes weakly-supervised bounding box localization of every visible surgical instrument (or tool), as the key actors, and the modeling of each tool-activity in the form of an <instrument, verb, target> triplet. The paper describes a baseline method and 10 new deep learning algorithms presented at the challenge to solve the task. It also provides thorough methodological comparisons of the methods; an in-depth analysis of the obtained results across multiple metrics and across visual and procedural challenges; their significance; and useful insights for future research directions and applications in surgery.
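
To make the step from recognition to detection concrete, the following is a minimal sketch of how a single detected triplet could be represented: an <instrument, verb, target> label paired with the bounding box of the instrument. It is an illustration under stated assumptions, not the challenge's data format or evaluation code. The names `TripletDetection`, `iou`, and `is_match`, the (x, y, width, height) box convention, the example labels, and the 0.5 overlap threshold are all hypothetical choices made for this example.

```python
from dataclasses import dataclass
from typing import Tuple

# Hypothetical box convention for this sketch: (x, y, width, height).
Box = Tuple[float, float, float, float]


@dataclass(frozen=True)
class TripletDetection:
    """One detected surgical action triplet with the instrument's box."""
    instrument: str
    verb: str
    target: str
    box: Box
    score: float = 1.0  # detector confidence (assumed field)

    def triplet(self) -> Tuple[str, str, str]:
        return (self.instrument, self.verb, self.target)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two (x, y, w, h) boxes."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    inter_w = max(0.0, min(ax + aw, bx + bw) - max(ax, bx))
    inter_h = max(0.0, min(ay + ah, by + bh) - max(ay, by))
    inter = inter_w * inter_h
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


def is_match(pred: TripletDetection, gt: TripletDetection,
             iou_threshold: float = 0.5) -> bool:
    """A prediction matches when all three labels agree and the boxes
    overlap by at least the (assumed) IoU threshold."""
    return (pred.triplet() == gt.triplet()
            and iou(pred.box, gt.box) >= iou_threshold)


# Illustrative labels only; they are not taken from the abstract.
gt = TripletDetection("grasper", "retract", "gallbladder", (0.0, 0.0, 2.0, 2.0))
pred = TripletDetection("grasper", "retract", "gallbladder", (0.5, 0.0, 2.0, 2.0), 0.9)
print(is_match(pred, gt))
```

In a recognition-only setting the `box` field would be absent and only the label tuple would be compared; the detection setting adds the spatial overlap test on top of the label match.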

Comments:	MICCAI EndoVis CholecTriplet2022 challenge report. Published in the Elsevier journal Medical Image Analysis. 25 pages, 15 figures, 8 tables
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2302.06294 [eess.IV]
	(or arXiv:2302.06294v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2302.06294
Journal reference:	Medical Image Analysis, Volume 89, 2023, 102888, ISSN 1361-8415
Related DOI:	https://doi.org/10.1016/j.media.2023.102888

Submission history

From: Chinedu Nwoye
[v1] Mon, 13 Feb 2023 11:53:14 UTC (5,239 KB)
[v2] Fri, 14 Jul 2023 19:06:27 UTC (5,461 KB)
