-
2018 Robotic Scene Segmentation Challenge
Authors:
Max Allan,
Satoshi Kondo,
Sebastian Bodenstedt,
Stefan Leger,
Rahim Kadkhodamohammadi,
Imanol Luengo,
Felix Fuentes,
Evangello Flouty,
Ahmed Mohammed,
Marius Pedersen,
Avinash Kori,
Varghese Alex,
Ganapathy Krishnamurthi,
David Rauber,
Robert Mendel,
Christoph Palm,
Sophia Bano,
Guinther Saibro,
Chi-Sheng Shih,
Hsun-An Chiang,
Juntang Zhuang,
Junlin Yang,
Vladimir Iglovikov,
Anton Dobrenkii,
Madhu Reddiboina
, et al. (16 additional authors not shown)
Abstract:
In 2015 we began a sub-challenge at the EndoVis workshop at MICCAI in Munich using endoscope images of ex-vivo tissue with automatically generated annotations from robot forward kinematics and instrument CAD models. However, the limited background variation and simple motion rendered the dataset uninformative in learning about which techniques would be suitable for segmentation in real surgery. In…
▽ More
In 2015 we began a sub-challenge at the EndoVis workshop at MICCAI in Munich using endoscope images of ex-vivo tissue with automatically generated annotations from robot forward kinematics and instrument CAD models. However, the limited background variation and simple motion rendered the dataset uninformative in learning about which techniques would be suitable for segmentation in real surgery. In 2017, at the same workshop in Quebec we introduced the robotic instrument segmentation dataset with 10 teams participating in the challenge to perform binary, articulating parts and type segmentation of da Vinci instruments. This challenge included realistic instrument motion and more complex porcine tissue as background and was widely addressed with modifications on U-Nets and other popular CNN architectures. In 2018 we added to the complexity by introducing a set of anatomical objects and medical devices to the segmented classes. To avoid over-complicating the challenge, we continued with porcine data which is dramatically simpler than human tissue due to the lack of fatty tissue occluding many organs.
△ Less
Submitted 2 August, 2020; v1 submitted 30 January, 2020;
originally announced January 2020.
-
CaDIS: Cataract Dataset for Image Segmentation
Authors:
Maria Grammatikopoulou,
Evangello Flouty,
Abdolrahim Kadkhodamohammadi,
Gwenol'e Quellec,
Andre Chow,
Jean Nehme,
Imanol Luengo,
Danail Stoyanov
Abstract:
Video feedback provides a wealth of information about surgical procedures and is the main sensory cue for surgeons. Scene understanding is crucial to computer assisted interventions (CAI) and to post-operative analysis of the surgical procedure. A fundamental building block of such capabilities is the identification and localization of surgical instruments and anatomical structures through semanti…
▽ More
Video feedback provides a wealth of information about surgical procedures and is the main sensory cue for surgeons. Scene understanding is crucial to computer assisted interventions (CAI) and to post-operative analysis of the surgical procedure. A fundamental building block of such capabilities is the identification and localization of surgical instruments and anatomical structures through semantic segmentation. Deep learning has advanced semantic segmentation techniques in the recent years but is inherently reliant on the availability of labelled datasets for model training. This paper introduces a dataset for semantic segmentation of cataract surgery videos complementing the publicly available CATARACTS challenge dataset. In addition, we benchmark the performance of several state-of-the-art deep learning models for semantic segmentation on the presented dataset. The dataset is publicly available at https://cataracts-semantic-segmentation2020.grand-challenge.org/.
△ Less
Submitted 22 February, 2022; v1 submitted 27 June, 2019;
originally announced June 2019.
-
SurReal: enhancing Surgical simulation Realism using style transfer
Authors:
Imanol Luengo,
Evangello Flouty,
Petros Giataganas,
Piyamate Wisanuvej,
Jean Nehme,
Danail Stoyanov
Abstract:
Surgical simulation is an increasingly important element of surgical education. Using simulation can be a means to address some of the significant challenges in developing surgical skills with limited time and resources. The photo-realistic fidelity of simulations is a key feature that can improve the experience and transfer ratio of trainees. In this paper, we demonstrate how we can enhance the v…
▽ More
Surgical simulation is an increasingly important element of surgical education. Using simulation can be a means to address some of the significant challenges in developing surgical skills with limited time and resources. The photo-realistic fidelity of simulations is a key feature that can improve the experience and transfer ratio of trainees. In this paper, we demonstrate how we can enhance the visual fidelity of existing surgical simulation by performing style transfer of multi-class labels from real surgical video onto synthetic content. We demonstrate our approach on simulations of cataract surgery using real data labels from an existing public dataset. Our results highlight the feasibility of the approach and also the powerful possibility to extend this technique to incorporate additional temporal constraints and to different applications.
△ Less
Submitted 7 November, 2018;
originally announced November 2018.
-
FaceOff: Anonymizing Videos in the Operating Rooms
Authors:
Evangello Flouty,
Odysseas Zisimopoulos,
Danail Stoyanov
Abstract:
Video capture in the surgical operating room (OR) is increasingly possible and has potential for use with computer assisted interventions (CAI), surgical data science and within smart OR integration. Captured video innately carries sensitive information that should not be completely visible in order to preserve the patient's and the clinical teams' identities. When surgical video streams are store…
▽ More
Video capture in the surgical operating room (OR) is increasingly possible and has potential for use with computer assisted interventions (CAI), surgical data science and within smart OR integration. Captured video innately carries sensitive information that should not be completely visible in order to preserve the patient's and the clinical teams' identities. When surgical video streams are stored on a server, the videos must be anonymized prior to storage if taken outside of the hospital. In this article, we describe how a deep learning model, Faster R-CNN, can be used for this purpose and help to anonymize video data captured in the OR. The model detects and blurs faces in an effort to preserve anonymity. After testing an existing face detection trained model, a new dataset tailored to the surgical environment, with faces obstructed by surgical masks and caps, was collected for fine-tuning to achieve higher face-detection rates in the OR. We also propose a temporal regularisation kernel to improve recall rates. The fine-tuned model achieves a face detection recall of 88.05 % and 93.45 % before and after applying temporal-smoothing respectively.
△ Less
Submitted 6 August, 2018;
originally announced August 2018.
-
DeepPhase: Surgical Phase Recognition in CATARACTS Videos
Authors:
Odysseas Zisimopoulos,
Evangello Flouty,
Imanol Luengo,
Petros Giataganas,
Jean Nehme,
Andre Chow,
Danail Stoyanov
Abstract:
Automated surgical workflow analysis and understanding can assist surgeons to standardize procedures and enhance post-surgical assessment and indexing, as well as, interventional monitoring. Computer-assisted interventional (CAI) systems based on video can perform workflow estimation through surgical instruments' recognition while linking them to an ontology of procedural phases. In this work, we…
▽ More
Automated surgical workflow analysis and understanding can assist surgeons to standardize procedures and enhance post-surgical assessment and indexing, as well as, interventional monitoring. Computer-assisted interventional (CAI) systems based on video can perform workflow estimation through surgical instruments' recognition while linking them to an ontology of procedural phases. In this work, we adopt a deep learning paradigm to detect surgical instruments in cataract surgery videos which in turn feed a surgical phase inference recurrent network that encodes temporal aspects of phase steps within the phase classification. Our models present comparable to state-of-the-art results for surgical tool detection and phase recognition with accuracies of 99 and 78% respectively.
△ Less
Submitted 17 July, 2018;
originally announced July 2018.