Construction and verification of intelligent endoscopic image analysis system for monitoring upper gastrointestinal blind spots
10.3760/cma.j.cn321463-20230905-00347
- VernacularTitle:监测上消化道盲区智能内镜影像分析系统的构建及验证
- Author:
Xiaoquan ZENG
1
;
Zehua DONG
;
Lianlian WU
;
Yanxia LI
;
Yunchao DENG
;
Honggang YU
Author Information
1. 武汉大学人民医院消化内科,武汉 430060
- Keywords:
Endoscopy, digestive system;
Upper gastrointestinal tract;
Deep learning;
Blind spot monitoring
- From:
Chinese Journal of Digestive Endoscopy
2024;41(5):391-396
- CountryChina
- Language:Chinese
-
Abstract:
Objective:To construct an intelligent endoscopic image analysis system that could monitor the blind spot of the upper gastrointestinal tract, and to test its performance.Methods:A total of 87 167 upper gastrointestinal endoscopy images (dataset 1) including 75 551 for training and 11 616 for testing, and a total of 2 414 pharyngeal images (dataset 2) including 2 233 for training and 181 for testing were retrospectively collected from the Digestive Endoscopy Center of Renmin Hospital of Wuhan University between 2016 to 2020. A 27-category-classification model for blind spot monitoring in the upper gastrointestinal tract (model 1, which distinguished 27 anatomical sites such as the pharynx, esophagus, and stomach) and a 5-category-classification model for blind spot monitoring in the pharynx (model 2, which distinguished palate, posterior pharyngeal wall, larynx, left and right pyriform sinuses) were constructed. The above models were trained and tested based on dataset 1 and 2, respectively, and trained based on the EfficientNet-B4, ResNet50 and VGG16 models of the keras framework. Thirty complete upper gastrointestinal endoscopy videos were retrospectively collected from the Digestive Endoscopy Center of Renmin Hospital of Wuhan University in 2021 to test model 2 blind spot monitoring performance.Results:The cross-sectional comparison results of the accuracy of model 1 in identifying 27 anatomical sites of the upper gastrointestinal tract in images showed that the mean accuracy of EfficientNet-B4, ResNet50, and VGG16 were 90.90%, 90.24%, and 89.22%, respectively, with the EfficientNet-B4 model performance the best, and the accuracy of EfficientNet-B4 model for each site ranged from 80.49% to 97.80%. The cross-sectional comparison results of the accuracy of model 2 in identifying the 5 anatomical sites of the pharynx in the images showed that the mean accuracy of EfficientNet-B4, ResNet50, and VGG16 were 99.40%, 98.56%, and 97.01%, respectively, in which the EfficientNet-B4 model had the best performance, and the accuracy of EfficientNet-B4 model for each site ranged from 96.15% to 100.00%. The overall accuracy of model 2 in identifying the 5 anatomical sites of the pharynx in the video was 97.33% (146/150).Conclusion:The intelligent endoscopic image analysis system based on deep learning can monitor blind spots in the upper gastrointestinal tract, coupled with pharyngeal blind spot monitoring and esophagogastroduodenal blind spot monitoring functions. The system shows high accuracy in both images and videos, which is expected to have a potential role in clinical practice and assisting endoscopists to achieve full observation of the upper gastrointestinal tract.