← Back to Benchmarks
simmediumvision-robotmetric · varies

VLM-Guided Iterative Refinement for Surgical Image Segmentation with Foundation Models

Description

Surgical image segmentation is essential for robot-assisted surgery and intraoperative guidance. However, existing methods are constrained to predefined categories, produce one-shot predictions without adaptive refinement, and lack mechanisms for clinician interaction. We propose IR-SIS, an iterative refinement system for surgical image segmentation that accepts natural language descriptions. IR-SIS leverages a fine-tuned SAM3 for initial segmentation, employs a Vision-Language Model to detect i

Source

http://arxiv.org/abs/2602.09252v1