PhD defense: Vision-based approaches for surgical activity recognition using laparoscopic and RGBD videos

January 27, 2017

13:30

Strasbourg - IRCAD - Amphi Hirsch

PhD defense: Andru Putra TWINANDA

Team: AVR

Title: Vision-based approaches for surgical activity recognition using laparoscopic and RGBD videos

Abstract: The main objective of this thesis is to address the problem of activity recognition in the operating room (OR). Activity recognition is an essential component in the development of context-aware and data management systems, which will enable applications such as automated assistance during difficult procedures and automatic report generation. Here, we focus on vision-based approaches, since cameras are a common source of information for observing the OR without disrupting the surgical workflow. Specifically, we propose to use two complementary types of videos: laparoscopic videos and OR-scene RGBD videos. Laparoscopic videos record in detail the tool-tissue interactions inside the patient during minimally invasive surgery, while OR-scene RGBD videos are recorded by a multi-view ceiling-mounted camera system that captures the activities occurring in the whole room. Despite the vast literature on activity recognition in computer vision, activity recognition in surgical setups is still far from solved. The OR is a particularly challenging environment in which objects have similar colors and the scene contains a lot of clutter and occlusions. Furthermore, laparoscopic videos capture a scene completely different from the conventional videos used in the computer vision community, which typically contain humans. Laparoscopic videos also present inherent visual challenges, such as rapid camera motion and specular reflections.

In this thesis, we investigate how state-of-the-art computer vision approaches perform on these videos and propose novel approaches to overcome some of the aforementioned challenges. First, we establish recognition pipelines to address activity recognition on both laparoscopic and OR-scene RGBD videos using the bag-of-words (BOW) approach, support vector machines (SVM), and hidden Markov models (HMM). Second, we propose an extension of the BOW approach for multi-view RGBD data that retains more spatial and temporal information during the encoding process. Finally, to alleviate the difficulty of manually designing visual features to represent the data, we design deep learning architectures for surgical activity recognition. We also propose an end-to-end approach with an LSTM network that eliminates the need for SVM and HMM. To evaluate the proposed approaches, we generate large datasets of real surgical videos, consisting of either laparoscopic videos or multi-view RGBD videos. The results demonstrate that the proposed approaches outperform state-of-the-art methods for surgical activity recognition on these new datasets.
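For readers unfamiliar with the first pipeline mentioned in the abstract, the following is a minimal illustrative sketch of two of its ingredients, and not the implementation used in the thesis. It assumes a precomputed codebook of visual words and per-frame class scores standing in for the outputs of a classifier such as an SVM. The function names `bow_histogram` and `viterbi`, the toy codebook, the transition matrix, and all numbers are hypothetical choices made for this example.

```python
import math

def bow_histogram(descriptors, codebook):
    """Encode local descriptors as a normalized histogram of nearest codewords."""
    counts = [0] * len(codebook)
    for d in descriptors:
        # Index of the closest codeword under squared Euclidean distance.
        nearest = min(
            range(len(codebook)),
            key=lambda k: sum((a - b) ** 2 for a, b in zip(d, codebook[k])),
        )
        counts[nearest] += 1
    total = sum(counts) or 1  # avoid division by zero for empty input
    return [c / total for c in counts]

def viterbi(frame_scores, transition, prior):
    """Most likely state sequence, treating per-frame scores as emission likelihoods."""
    n_states = len(prior)
    log = lambda p: math.log(max(p, 1e-12))  # guard against log(0)
    best = [log(prior[s]) + log(frame_scores[0][s]) for s in range(n_states)]
    back = []  # back[t][s] = best predecessor of state s at frame t + 1
    for scores in frame_scores[1:]:
        step, new_best = [], []
        for s in range(n_states):
            prev = max(range(n_states), key=lambda r: best[r] + log(transition[r][s]))
            step.append(prev)
            new_best.append(best[prev] + log(transition[prev][s]) + log(scores[s]))
        back.append(step)
        best = new_best
    # Trace the best path backwards from the most likely final state.
    state = max(range(n_states), key=lambda s: best[s])
    path = [state]
    for step in reversed(back):
        state = step[state]
        path.append(state)
    return path[::-1]

# Toy data (hypothetical): two codewords, two activity classes, six frames.
codebook = [[0.0, 0.0], [1.0, 1.0]]
descriptors = [[0.1, 0.0], [0.9, 1.0], [1.0, 1.2], [0.0, 0.2]]
histogram = bow_histogram(descriptors, codebook)

frame_scores = [[0.9, 0.1], [0.8, 0.2], [0.4, 0.6], [0.9, 0.1], [0.2, 0.8], [0.1, 0.9]]
transition = [[0.9, 0.1], [0.1, 0.9]]  # activities tend to persist between frames
prior = [0.5, 0.5]
phases = viterbi(frame_scores, transition, prior)
print(histogram, phases)
```

In this toy run, the third frame slightly favors the second class when taken alone, but the temporal model keeps it in the first class because an isolated switch is unlikely under the assumed transition matrix. This is the kind of temporal consistency that motivates placing a sequence model after a frame-wise classifier.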

This thesis was directed by Michel de Mathelin.

The presentation, in English, will take place in the Hirsch auditorium at IRCAD on Friday, January 27, 2017, at 1:30 pm.
