• Skip to main content
  • Skip to header right navigation
  • Skip to site footer

Log in
www.sages.org

SAGES

Reimagining surgical care for a healthier world

  • Home
    • SAGES Home
    • SAGES Foundation Home
  • About
    • Awards
    • Who Is SAGES?
    • Leadership
    • Our Mission
    • Advocacy
    • Committees
      • SAGES Board of Governors
      • Officers and Representatives of the Society
      • Committee Chairs and Co-Chairs
      • Committee Rosters
      • SAGES Past Presidents
    • Why Should You Support SAGES?
    • SAGES Swag
  • Meetings
    • SAGES NBT Innovation Weekend
    • SAGES Annual Meeting
      • 2026 Annual Meeting
      • 2027 Scientific Session Call for Abstracts
      • 2027 Emerging Technology Call for Abstracts
    • CME Claim Form
    • SAGES Past, Present, Future, and Related Meeting Information
    • SAGES Related Meetings & Events Calendar
  • Join SAGES!
    • Membership Application
    • Membership Benefits
    • Membership Types
      • Requirements and Applications for Active Membership in SAGES
      • Requirements and Applications for Affiliate Membership in SAGES
      • Requirements and Applications for Associate Active Membership in SAGES
      • Requirements and Applications for Candidate Membership in SAGES
      • Requirements and Applications for International Membership in SAGES
      • Requirements for Medical Student Membership
    • Member Spotlight
    • Give the Gift of SAGES Membership
  • Patients
    • Join the SAGES Patient Partner Network (PPN)
    • Patient Information Brochures
    • Healthy Sooner – Patient Information for Minimally Invasive Surgery
    • Choosing Wisely – An Initiative of the ABIM Foundation
    • All in the Recovery: Colorectal Cancer Alliance
    • Find A SAGES Surgeon
  • Publications
    • Clinical / Practice / Training Guidelines, Statements, and Standards of Practice
    • Sustainability in Surgical Practice
    • SAGES Stories Podcast
    • SAGES Lead Up Podcast
    • Patient Information Brochures
    • Patient Information From SAGES
    • TAVAC – Technology and Value Assessments
    • Surgical Endoscopy and Other Journal Information
    • Innovative Surgical Trends
    • SAGES Manuals
    • MesSAGES – The SAGES Newsletter
    • COVID-19 Archive
    • Troubleshooting Guides
  • Education
    • Wellness Resources – You Are Not Alone
    • Avoid Opiates After Surgery
    • SAGES Subscription Catalog
    • SAGES TV: Home of SAGES Surgical Videos
    • The SAGES Safe Cholecystectomy Program
    • Masters Program
    • Resident and Fellow Opportunities
      • MIS Fellows Course
      • SAGES Robotics Residents and Fellows Courses
      • SAGES Free Resident Webinar Series
      • Advanced Laparoscopy and Fluorescence-Guided Surgery Course for Fellows
      • Fellows’ Career Development Course
    • SAGES S.M.A.R.T. Enhanced Recovery Program
    • SAGES @ Cine-Med Products
      • SAGES Top 21 Minimally Invasive Procedures Every Practicing Surgeon Should Know
      • SAGES Pearls Step-by-Step
      • SAGES Flexible Endoscopy 101
    • SAGES OR SAFETY Video Activity
    • Foregut Video Atlas
  • Opportunities
    • Join the SAGES Patient Partner Network (PPN)
    • Fellowship Recognition Opportunities
    • SAGES Advanced Flexible Endoscopy Area of Concentrated Training (ACT) SEAL
    • Multi-Society Foregut Fellowship Certification
    • Research Opportunities
    • FLS
    • FES
    • FUSE
    • Jobs Board
    • SAGES Go Global: Global Affairs
  • Learning Hub
You are here: Home / Abstracts / Unsupervised detection of tool presence in endoscopic video frames

Unsupervised detection of tool presence in endoscopic video frames

David Z Li1, Masaru Ishii, MD, PhD2, Russell H Taylor, PhD1, Gregory D Hager, PhD1, Ayushi Sinha, PhD1. 1The Johns Hopkins University, 2Johns Hopkins Medical Institutions

Objective: The aim of our work is to develop an unsupervised approach for tool detection in endoscopic video data. There is an abundance of medical imaging data available, but most of this data is unlabeled because manually annotating data is extremely tedious. In order to overcome this limitation in endoscopic video data, we hope to coarsely classify endoscopic video frames into two classes – with tools and without tools. These coarse labels can then open up the potential for more fine-grained labels, like tool segmentation, tool pose classification, etc.

Description: During endoscopic procedures, surgical tools enter and leave the endoscopic field of view. Knowing when these events occur provides crucial information about surgical phase and activity. Additionally, computer vision-based navigation systems rely on anatomical features from endoscopic video to align video and preoperative image data. Therefore, being able to ignore frames with tools or to mask out tools in frames containing tools is important. By detecting frames with and without surgical tools, these two distinct classes of data can be exploited in different ways, allowing us to then perform more fine-grained learning tasks.

Method: Variational autoencoders (VAEs) are generative models that are often used for performing unsupervised learning since they do not require labeled training data. VAEs model the data distribution as nonlinear transformations of latent variables. Inference of these latent variables given the observed data is performed by an encoding model that learns a lower-dimensional representation of the data. The decoding model infers samples belonging to the modeled distribution given the latent variables.

The first step in our study is to use VAEs to learn a useful latent representation of sequences of endoscopic video. We hope to manipulate the variables in latent space and study their effect on the decoded output to learn, for instance, which latent variables are responsible for encoding tool movement or background movement, etc. We expect that the latent variables encoding tool movement will change drastically when the tool in not present in the frame. We can leverage this to separate frames with and without tools.

Conclusion: Our unsupervised approach to detect tool presence in endoscopic video data will allow us to separate video frames with and without tools and to treat these two classes differently. This initial coarse classification can make more fine-grained learning tasks easier to approach. For instance, most anatomical structures can be expected move coherently relative to the endoscope, while a surgical tool can move randomly. Knowing whether a sequence of frames contains tools allows us to look for differences in, for instance, the optical flow between frames to detect or segment the tool from the background tissue. Additionally, by encoding tool movement, we can learn whether different surgical tasks or phases appear different in latent space, enabling surgical phase detection.


Presented at the SAGES 2017 Annual Meeting in Houston, TX.

Abstract ID: 98877

Program Number: ETP749

Presentation Session: Emerging Technology Poster Session (Non CME)

Presentation Type: Poster

View this Poster

Related



Hours & Info

15821 Ventura Blvd Ste 400
Encino, CA 91436

1-310-437-0544

[email protected]

Monday – Friday
8am to 5pm Pacific Time

Find Us Around the Web!

  • Bluesky
  • X
  • Instagram
  • Facebook
  • YouTube

Copyright © 2026 · SAGES · All Rights Reserved

Important Links

Healthy Sooner: Patient Information

SAGES Guidelines, Statements, & Standards of Practice

SAGES Manuals

Refine Search