Search papers, labs, and topics across Lattice.
This paper introduces a real-time glottis segmentation network, GlottisNet, designed to improve vision-assisted Nasotracheal Intubation (NTI) by addressing challenges like scale variability and computational cost. GlottisNet employs a lightweight, multi-receptive field feature extraction module and an advanced label assignment method to reduce intra-class differences and enhance segmentation accuracy. The proposed network achieves a mDice score of 92.9% with a model size of 19MB and an inference speed of 170+ FPS, outperforming state-of-the-art methods on three datasets.
Real-time glottis segmentation during Nasotracheal Intubation just got a whole lot faster and more accurate, thanks to a new network that's both lightweight and scale-robust.
Nasotracheal intubation (NTI) is a critical clinical procedure for establishing and maintaining patient airway patency. Machine-assisted NTI has emerged as a pivotal approach for optimizing procedural efficiency and minimizing manual intervention. However, visual detection algorithms employed for NTI navigation encounter significant challenges, including complex anatomical environments and suboptimal illumination conditions surrounding the glottis. Additionally, the glottis presents considerable scale variability throughout the procedure, initially appearing as a small, difficult-to-capture structure before expanding to occupy nearly the entire field of view. Moreover, traditional visual detection methods often have high computational costs, making real-time, high-precision detection on portable devices challenging. To enhance NTI efficacy and address these challenges, this paper proposes a novel glottis segmentation framework optimized for vision-assisted NTI applications. First, we designed a lightweight, multi-receptive field feature extraction module to reduce intra-class differences, achieving robustness to scale variations of the glottis. This module was then stacked to form the backbone and neck of our network. Subsequently, we developed an advanced label assignment method and redefined the number of samples to further reduce intra-class differences and enhance accuracy in the complex NTI environment. Experiments on three distinct datasets demonstrate that our network surpasses state-of-the-art algorithms, achieving a segmentation mDice of 92.9\% with a compact model size of 19 MB and an inference speed exceeding 170 frames per second. % Our code and datasets will be open-sourced on GitHub after the manuscript is accepted. Our code and datasets are available at https://github.com/HBUT-CV/GlottisNet.