Intranet

Accueil

ICube > Agenda : Thèse : Vision-based approaches for surgical activity recognition using laparoscopic and RGBD videos

Thèse : Vision-based approaches for surgical activity recognition using laparoscopic and RGBD videos

27 gennaio 2017

13h30

Strasbourg - IRCAD - Amphi Hirsch

Soutenance de thèse : Andru Putra TWINANDA

Équipe : AVR

Titre : Vision-based approaches for surgical activity recognition using laparoscopic and RGBD videos

Résumé : The main objective of this thesis is to address the problem of activity recognition in the operating room (OR). Activity recognition is an essential component in the development of context-aware and data management systems, which will allow various applications, such as automated assistance during difficult procedures and automatic report generation. Here, we focus on vision-based approaches since cameras are a common source of information to observe the OR without disrupting the surgical workflow. Specifically, we propose to use two complementary types of videos: laparoscopic and OR-scene RGBD videos. Laparoscopic videos record in detail the tool-tissue interactions inside the patients during minimally invasive surgeries, while OR-scene RGBD videos are recordings from a multi-view ceiling-mounted camera system which captures the activities occurring in the whole room. Despite the vast literature on activity recognition in computer vision, activity recognition in surgical setups is still far from being solved. The OR is a very particular and challenging environment where the objects are of similar colors and the scene contains a lot of clutter and occlusions. Furthermore, laparoscopic videos capture a scene completely different from the conventional videos used in the computer vision community, which typically contain humans in the scene. The laparoscopic videos also contain inherent visual challenges, such as rapid camera motion and specular reflection.

In this thesis, we investigate how state-of-the-art computer vision approaches perform on these videos and propose novel approaches to overcome some of the aforementioned challenges. First, we establish recognition pipelines to address activity recognition problems on both laparoscopic and OR-scene RGBD videos using the bag-of-word (BOW) approach, Support Vector Machines (SVM), and hidden Markov models (HMM). Second, we propose an extension to the BOW approach used on multi-view RGBD data to retain more spatial and temporal information during the encoding process. Ultimately, to alleviate the difficulties in manually designing the visual features to represent the data, we design deep learning architectures for surgical activity recognition. We also propose an end-to-end approach with an LSTM network to eliminate the need for SVM and HMM. To evaluate our proposed approaches, we generate large datasets of real surgical videos, including either laparoscopic videos or multi-view RGBD videos. The results demonstrate that the proposed approaches outperform the state-of-the-art methods in performing surgical activity recognition on these new datasets.

Cette thèse a été dirigée par Michel de Mathelin.

La soutenance aura lieu en anglais le vendredi 27 janvier 2017 à 13h30 dans l'amphithéâtre Hirsch de l'IRCAD.

À la une

Les postes d'enseignants-chercheurs ouverts aux concours sont publiés !

Le dépôt des candidatures pour les postes d’enseignants-chercheur est ouvert. Les offres sont...

Actualités

mar 7 2025

Interview de Thomas Alfroy, doctorant lauréat de Mature your PhD 2024

Dans cette interview, Thomas Alfroy, doctorant et membre de l’équipe Réseaux au Laboratoire ICube...

mar 6 2025

Interview d'Emmanuel Martins Seromenho, doctorant lauréat de Mature your PhD 2024

Dans cette interview, Emmanuel Martins Seromenho, doctorant et membre de l’équipe IPP...

feb 28 2025

Premier prix pour Iliass Ayaou au défi Textmine 2025

La conférence EGC (Extraction et Gestion des Connaissances) s’est déroulée du 27 au 31 janvier 2025...

dic 3 2024

Terdepol, finaliste du Pollutec Innovation Challenge 2024

Le salon Pollutec est l'événement international de référence des solutions pour l'environnement...

dic 3 2024

OptHySource, finaliste du Pollutec Innovation Challenge 2024

Le salon Pollutec est l'événement international de référence des solutions pour l'environnement...

nov 28 2024

Haitao Ge remporte le 13ème Prix Abertis France

Haitao Ge, doctorant à l'INSA Strasbourg au sein de l'équipe Génie civil - énergétique (GCE) a...

nov 26 2024

Visite du partenaire 2CRSi dans le cadre du projet 2PhaseEx

Dans le cadre du projet Interreg Offensive Science 2PhaseEx, cinq membres de l’équipe ICube/Mécaflu...

nov 18 2024

100 Start-up où investir en 2024 - Twinical citée

Le 13 novembre, le CNRS a réuni les 26 start-up issues de ses laboratoires sous tutelle,...

ott 4 2024

Notre participation au challenge "Au boulot à vélo"

L'équipe de l'Université de Strasbourg et la délégation Alsace du CNRS se sont brillamment...

ott 1 2024

2PhaseEx kick-off meeting

Le vendredi 20 septembre a eu lieu la réunion de lancement du projet INTERREG 2PhaseEx, au...

Toutes les actualités

Agenda

22 aprile 2025

11h00

Salle A501 - Pôle API Illkirch

Séminaire : High-speed imaging at ICube

23 aprile 2025

15h00

Fachhochschule Nordwestschweiz - salle 02.S.21

Thèse : Système de tracking électromagnétique pour l’implantation d’électrodes de stimulation cérébrale profonde utilisant des capteurs magnétiques intégrés

Agenda complet