Thesis: Vision-based approaches for surgical activity recognition using laparoscopic and RGBD videos

27 January 2017

At 13:30

Strasbourg - IRCAD - Hirsch Amphitheater

Thesis defense: Andru Putra TWINANDA

Team: AVR

Title: Vision-based approaches for surgical activity recognition using laparoscopic and RGBD videos

Abstract: The main objective of this thesis is to address the problem of activity recognition in the operating room (OR). Activity recognition is an essential component in the development of context-aware and data management systems, which enable applications such as automated assistance during difficult procedures and automatic report generation. Here, we focus on vision-based approaches, since cameras are a common source of information for observing the OR without disrupting the surgical workflow. Specifically, we propose to use two complementary types of videos: laparoscopic and OR-scene RGBD videos. Laparoscopic videos record in detail the tool-tissue interactions inside the patient during minimally invasive surgery, while OR-scene RGBD videos are recorded by a multi-view ceiling-mounted camera system that captures the activities occurring in the whole room. Despite the vast literature on activity recognition in computer vision, activity recognition in surgical setups is still far from solved. The OR is a particularly challenging environment: objects have similar colors, and the scene contains substantial clutter and occlusion. Furthermore, laparoscopic videos capture a scene completely different from the conventional videos used in the computer vision community, which typically show humans. Laparoscopic videos also pose inherent visual challenges, such as rapid camera motion and specular reflection.

In this thesis, we investigate how state-of-the-art computer vision approaches perform on these videos and propose novel approaches to overcome some of the aforementioned challenges. First, we establish recognition pipelines to address activity recognition problems on both laparoscopic and OR-scene RGBD videos using the bag-of-words (BOW) approach, support vector machines (SVM), and hidden Markov models (HMM). Second, we propose an extension to the BOW approach for multi-view RGBD data that retains more spatial and temporal information during the encoding process. Finally, to alleviate the difficulty of manually designing visual features to represent the data, we design deep learning architectures for surgical activity recognition. We also propose an end-to-end approach with an LSTM network to eliminate the need for SVM and HMM. To evaluate the proposed approaches, we generate large datasets of real surgical videos, consisting of either laparoscopic videos or multi-view RGBD videos. The results demonstrate that the proposed approaches outperform state-of-the-art methods for surgical activity recognition on these new datasets.
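As a rough illustration of the kind of BOW and HMM pipeline mentioned above, the sketch below encodes local descriptors as a bag-of-words histogram and then smooths per-frame activity scores with Viterbi decoding. It is not the implementation from the thesis: the function names, the toy codebook, and the transition probabilities are all assumptions made for this example, and the SVM step is omitted (the per-frame scores are assumed to come from some classifier).

```python
import math


def bow_histogram(descriptors, codebook):
    """Encode local descriptors as a normalized histogram over codewords."""
    counts = [0] * len(codebook)
    for d in descriptors:
        # Assign the descriptor to its nearest codeword (squared Euclidean distance).
        nearest = min(
            range(len(codebook)),
            key=lambda k: sum((a - b) ** 2 for a, b in zip(d, codebook[k])),
        )
        counts[nearest] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else [0.0] * len(counts)


def viterbi(log_emissions, log_trans, log_prior):
    """Return the most likely state sequence given per-frame log-scores."""
    n_states = len(log_prior)
    score = [log_prior[s] + log_emissions[0][s] for s in range(n_states)]
    back = []
    for frame in log_emissions[1:]:
        prev = score
        pointers, score = [], []
        for s in range(n_states):
            # Best predecessor state for reaching state s at this frame.
            best = max(range(n_states), key=lambda p: prev[p] + log_trans[p][s])
            pointers.append(best)
            score.append(prev[best] + log_trans[best][s] + frame[s])
        back.append(pointers)
    # Backtrack from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return path[::-1]


# Hypothetical toy data: two codewords, two activity states.
hist = bow_histogram([[0.0, 0.0], [0.1, 0.0], [1.0, 1.0]], [[0.0, 0.0], [1.0, 1.0]])

# Per-frame classifier probabilities; the third frame weakly favors state 1.
probs = [[0.8, 0.2], [0.8, 0.2], [0.4, 0.6], [0.8, 0.2], [0.8, 0.2]]
log_em = [[math.log(p) for p in frame] for frame in probs]
log_tr = [[math.log(0.9), math.log(0.1)], [math.log(0.1), math.log(0.9)]]
log_pr = [math.log(0.5), math.log(0.5)]
smoothed = viterbi(log_em, log_tr, log_pr)
```

In this toy example the transition model penalizes switching states, so the isolated frame that weakly favors the second activity is absorbed into the surrounding segment. This is the role temporal models such as HMMs typically play on top of frame-wise classifiers.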

This thesis was supervised by Michel de Mathelin.

The defense will be held in English on Friday, 27 January 2017 at 13:30 in the Hirsch Amphitheater at IRCAD.
