Search papers, labs, and topics across Lattice.
The Open Suturing Skills (OSS) Challenge benchmarked vision-based skill assessment methods for open surgery using a dataset of suturing task videos with instrument trajectories. Participants tackled tasks including skill level classification, OSATS score prediction, and hand/tool tracking, employing diverse solutions like deep learning video models and tracking-driven methods. Spatiotemporal video models performed best overall, but the challenge also highlighted the difficulty of fine-grained OSATS prediction and keypoint tracking due to occlusions.
Despite advances in AI-driven surgical skill assessment, reliably tracking hands and tools in open surgery videos remains a surprisingly difficult problem, hindering motion-based analysis.
Achieving high levels of surgical skill through effective training is essential for optimal patient outcomes. Automated, data-driven skill assessment holds significant potential to improve surgical training. While machine learning-based methods are increasingly popular for assessing skills in minimally invasive surgery, their application to open surgery remains limited. We present the results of a dedicated MICCAI challenge designed to benchmark and advance vision-based skill assessment in open surgery. The challenge dataset comprises videos of an open suturing training task recorded with a static GoPro camera in a dry-lab setting, with instrument trajectories available in addition to the primary video modality. The OSS Challenge was hosted over two consecutive years, comprising two and three independent tasks, respectively: (1) classifying skill level into four classes, (2) predicting the full Objective Structured Assessment of Technical Skills across eight categories, and (3) tracking hands and surgical tools. Participants submitted diverse solutions including deep learning-based video models, tracking-driven methods, and hybrid approaches. General-purpose spatiotemporal video models consistently achieved the strongest performance, though conceptually diverse approaches reached competitive levels when well-executed. Predicting fine-grained OSATS scores remains challenging but benefits substantially from increased training data. Keypoint tracking proves difficult given frequent occlusions and out-of-frame instances, limiting current applicability for motion-based skill analysis. This work benchmarks innovative and diverse solutions for surgical skill assessment, highlighting both the promise and current limitations of video-based evaluation in open surgery and identifying critical directions for advancing automated skill assessment toward clinical impact.