1.Development of an abdominal acupoint localization system based on AI deep learning.
Mo ZHANG ; Yuming LI ; Zongming SHI
Chinese Acupuncture & Moxibustion 2025;45(3):391-396
This study aims to develop an abdominal acupoint localization system based on computer vision and convolutional neural networks (CNNs). To address the challenge of abdominal acupoint localization, a multi-task CNNs architecture was constructed and trained to locate the Shenque (CV8) and human body boundaries. Based on the identified Shenque (CV8), the system further deduces key characteristics of four acupoints: Shangwan (CV13), Qugu (CV2), and bilateral Daheng (SP15). An affine transformation matrix is applied to accurately map image coordinates to an acupoint template space, achieving precise localization of abdominal acupoints. Testing has verified that this system can accurately identify and locate abdominal acupoints in images. The development of this localization system provides technical support for TCM remote education, diagnostic assistance, and advanced TCM equipment, such as intelligent acupuncture robots, facilitating the standardization and intelligent advancement of acupuncture.
Acupuncture Points
;
Humans
;
Deep Learning
;
Abdomen/diagnostic imaging*
;
Neural Networks, Computer
;
Acupuncture Therapy
;
Image Processing, Computer-Assisted
2.Large models in medical imaging: Advances and prospects.
Mengjie FANG ; Zipei WANG ; Sitian PAN ; Xin FENG ; Yunpeng ZHAO ; Dongzhi HOU ; Ling WU ; Xuebin XIE ; Xu-Yao ZHANG ; Jie TIAN ; Di DONG
Chinese Medical Journal 2025;138(14):1647-1664
Recent advances in large models demonstrate significant prospects for transforming the field of medical imaging. These models, including large language models, large visual models, and multimodal large models, offer unprecedented capabilities in processing and interpreting complex medical data across various imaging modalities. By leveraging self-supervised pretraining on vast unlabeled datasets, cross-modal representation learning, and domain-specific medical knowledge adaptation through fine-tuning, large models can achieve higher diagnostic accuracy and more efficient workflows for key clinical tasks. This review summarizes the concepts, methods, and progress of large models in medical imaging, highlighting their potential in precision medicine. The article first outlines the integration of multimodal data under large model technologies, approaches for training large models with medical datasets, and the need for robust evaluation metrics. It then explores how large models can revolutionize applications in critical tasks such as image segmentation, disease diagnosis, personalized treatment strategies, and real-time interactive systems, thus pushing the boundaries of traditional imaging analysis. Despite their potential, the practical implementation of large models in medical imaging faces notable challenges, including the scarcity of high-quality medical data, the need for optimized perception of imaging phenotypes, safety considerations, and seamless integration with existing clinical workflows and equipment. As research progresses, the development of more efficient, interpretable, and generalizable models will be critical to ensuring their reliable deployment across diverse clinical environments. This review aims to provide insights into the current state of the field and provide directions for future research to facilitate the broader adoption of large models in clinical practice.
Humans
;
Diagnostic Imaging/methods*
;
Precision Medicine/methods*
;
Image Processing, Computer-Assisted/methods*
3.Role of artificial intelligence in medical image analysis.
Lu WANG ; Shimin ZHANG ; Nan XU ; Qianqian HE ; Yuming ZHU ; Zhihui CHANG ; Yanan WU ; Huihan WANG ; Shouliang QI ; Lina ZHANG ; Yu SHI ; Xiujuan QU ; Xin ZHOU ; Jiangdian SONG
Chinese Medical Journal 2025;138(22):2879-2894
With the emergence of deep learning techniques based on convolutional neural networks, artificial intelligence (AI) has driven transformative developments in the field of medical image analysis. Recently, large language models (LLMs) such as ChatGPT have also started to achieve distinction in this domain. Increasing research shows the undeniable role of AI in reshaping various aspects of medical image analysis, including processes such as image enhancement, segmentation, detection in image preprocessing, and postprocessing related to medical diagnosis and prognosis in clinical settings. However, despite the significant progress in AI research, studies investigating the recent advances in AI technology in the aforementioned aspects, the changes in research hotspot trajectories, and the performance of studies in addressing key clinical challenges in this field are limited. This article provides an overview of recent advances in AI for medical image analysis and discusses the methodological profiles, advantages, disadvantages, and future trends of AI technologies.
Artificial Intelligence
;
Humans
;
Image Processing, Computer-Assisted/methods*
;
Neural Networks, Computer
;
Deep Learning
;
Diagnostic Imaging/methods*
4.A low-dose CT reconstruction method using sub-pixel anisotropic diffusion.
Shizhou TANG ; Ruolan SU ; Shuting LI ; Zhenzhen LAI ; Jinhong HUANG ; Shanzhou NIU
Journal of Southern Medical University 2025;45(1):162-169
OBJECTIVES:
We present a new low-dose CT reconstruction method using sub-pixel and anisotropic diffusion.
METHODS:
The sub-pixel intensity values and their second-order differences were obtained using linear interpolation techniques, and the new gradient information was then embedded into an anisotropic diffusion process, which was introduced into a penalty-weighted least squares model to reduce the noise in low-dose CT projection data. The high-quality CT image was finally reconstructed using the classical filtered back-projection (FBP) algorithm from the estimated data.
RESULTS:
In the Shepp-Logan phantom experiments, the structural similarity (SSIM) index of the CT image reconstructed by the proposed algorithm, as compared with FBP, PWLS-Gibbs and PWLS-TV algorithms, was increased by 28.13%, 5.49%, and 0.91%, the feature similarity (FSIM) index was increased by 21.08%, 1.78%, and 1.36%, and the root mean square error (RMSE) was reduced by 69.59%, 18.96%, and 3.90%, respectively. In the digital XCAT phantom experiments, the SSIM index of the CT image reconstructed by the proposed algorithm, as compared with FBP, PWLS-Gibbs and PWLS-TV algorithms, was increased by 14.24%, 1.43% and 7.89%, the FSIM index was increased by 9.61%, 1.78% and 5.66%, and the RMSE was reduced by 26.88%, 9.41% and 18.39%, respectively. In clinical experiments, the SSIM index of the image reconstructed using the proposed algorithm was increased by 19.24%, 15.63% and 3.68%, the FSIM index was increased by 4.30%, 2.92% and 0.43%, and the RMSE was reduced by 44.60%, 36.84% and 15.22% in comparison with FBP, PWLS-Gibbs and PWLS-TV algorithms, respectively.
CONCLUSIONS
The proposed method can effectively reduce the noises and artifacts while maintaining the structural details in low-dose CT images.
Tomography, X-Ray Computed/methods*
;
Algorithms
;
Phantoms, Imaging
;
Anisotropy
;
Image Processing, Computer-Assisted/methods*
;
Humans
;
Radiation Dosage
5.A sparse-view cone-beam CT reconstruction algorithm based on bidirectional flow field- guided projection completion.
Wenwei LI ; Zerui MAO ; Yongbo WANG ; Zhaoying BIAN ; Jing HUANG
Journal of Southern Medical University 2025;45(2):395-408
OBJECTIVES:
We propose a sparse-view cone-beam CT reconstruction algorithm based on bidirectional flow field guided projection completion (BBC-Recon) to solve the ill-posed inverse problem in sparse-view cone-beam CT imaging.
METHODS:
The BBC-Recon method consists of two main modules: the projection completion module and the image restoration module. Based on flow field estimation, the projection completion module, through the designed bidirectional and multi-scale correlators, fully calculates the correlation information and redundant information among projections to precisely guide the generation of bidirectional flow fields and missing frames, thus achieving high-precision completion of missing projections and obtaining pseudo complete projections. The image restoration module reconstructs the obtained pseudo complete projections and then refines the image to remove the residual artifacts and further improve the image quality.
RESULTS:
The experimental results on the public datasets of Mayo Clinic and Guilin Medical University showed that in the case of a 4-fold sparse angle, compared with the suboptimal method, the BBC-Recon method increased the PSNR index by 1.80% and the SSIM index by 0.29%, and reduced the RMSE index by 4.12%; In the case of an 8-fold sparse angle, the BBC-Recon method increased the PSNR index by 1.43% and the SSIM index by 1.49%, and reduced the RMSE index by 0.77%.
CONCLUSIONS
The BBC-Recon algorithm fully exploits the correlation information between projections to allow effective removal of streak artifacts while preserving image structure information, and demonstrates significant advantages in maintaining inter-slice consistency.
Algorithms
;
Cone-Beam Computed Tomography/methods*
;
Image Processing, Computer-Assisted/methods*
;
Humans
6.A segmented backprojection tensor degradation feature encoding model for motion artifacts correction in dental cone beam computed tomography.
Zhixiong ZENG ; Yongbo WANG ; Zongyue LIN ; Zhaoying BIAN ; Jianhua MA
Journal of Southern Medical University 2025;45(2):422-436
OBJECTIVES:
We propose a segmented backprojection tensor degradation feature encoding (SBP-MAC) model for motion artifact correction in dental cone beam computed tomography (CBCT) to improve the quality of the reconstructed images.
METHODS:
The proposed motion artifact correction model consists of a generator and a degradation encoder. The segmented limited-angle reconstructed sub-images are stacked into the tensors and used as the model input. A degradation encoder is used to extract spatially varying motion information in the tensor, and the generator's skip connection features are adaptively modulated to guide the model for correcting artifacts caused by different motion waveforms. The artifact consistency loss function was designed to simplify the learning task of the generator.
RESULTS:
The proposed model could effectively remove motion artifacts and improve the quality of the reconstructed images. For simulated data, the proposed model increased the peak signal-to-noise ratio by 8.28%, increased the structural similarity index measurement by 2.29%, and decreased the root mean square error by 23.84%. For real clinical data, the proposed model achieved the highest expert score of 4.4221 (against a 5-point scale), which was significantly higher than those of all the other comparison methods.
CONCLUSIONS
The SBP-MAC model can effectively extract spatially varying motion information in the tensors and achieve adaptive artifact correction from the tensor domain to the image domain to improve the quality of reconstructed dental CBCT images.
Cone-Beam Computed Tomography/methods*
;
Artifacts
;
Humans
;
Motion
;
Image Processing, Computer-Assisted/methods*
;
Signal-To-Noise Ratio
;
Algorithms
7.A multi-scale supervision and residual feedback optimization algorithm for improving optic chiasm and optic nerve segmentation accuracy in nasopharyngeal carcinoma CT images.
Jinyu LIU ; Shujun LIANG ; Yu ZHANG
Journal of Southern Medical University 2025;45(3):632-642
OBJECTIVES:
We propose a novel deep learning segmentation algorithm (DSRF) based on multi-scale supervision and residual feedback strategy for precise segmentation of the optic chiasm and optic nerves in CT images of nasopharyngeal carcinoma (NPC) patients.
METHODS:
We collected 212 NPC CT images and their ground truth labels from SegRap2023, StructSeg2019 and HaN-Seg2023 datasets. Based on a hybrid pooling strategy, we designed a decoder (HPS) to reduce small organ feature loss during pooling in convolutional neural networks. This decoder uses adaptive and average pooling to refine high-level semantic features, which are integrated with primary semantic features to enable network learning of finer feature details. We employed multi-scale deep supervision layers to learn rich multi-scale and multi-level semantic features under deep supervision, thereby enhancing boundary identification of the optic chiasm and optic nerves. A residual feedback module that enables multiple iterations of the network was designed for contrast enhancement of the optic chiasm and optic nerves in CT images by utilizing information from fuzzy boundaries and easily confused regions to iteratively refine segmentation results under supervision. The entire segmentation framework was optimized with the loss from each iteration to enhance segmentation accuracy and boundary clarity. Ablation experiments and comparative experiments were conducted to evaluate the effectiveness of each component and the performance of the proposed model.
RESULTS:
The DSRF algorithm could effectively enhance feature representation of small organs to achieve accurate segmentation of the optic chiasm and optic nerves with an average DSC of 0.837 and an ASSD of 0.351. Ablation experiments further verified the contributions of each component in the DSRF method.
CONCLUSIONS
The proposed deep learning segmentation algorithm can effectively enhance feature representation to achieve accurate segmentation of the optic chiasm and optic nerves in CT images of NPC.
Humans
;
Tomography, X-Ray Computed/methods*
;
Optic Chiasm/diagnostic imaging*
;
Optic Nerve/diagnostic imaging*
;
Algorithms
;
Nasopharyngeal Carcinoma
;
Deep Learning
;
Nasopharyngeal Neoplasms/diagnostic imaging*
;
Neural Networks, Computer
;
Image Processing, Computer-Assisted/methods*
8.A low-dose CT image restoration method based on central guidance and alternating optimization.
Xiaoyu ZHANG ; Hao WANG ; Dong ZENG ; Zhaoying BIAN
Journal of Southern Medical University 2025;45(4):844-852
OBJECTIVES:
We propose a low-dose CT image restoration method based on central guidance and alternating optimization (FedGP).
METHODS:
The FedGP framework revolutionizes the traditional federated learning model by adopting a structure without a fixed central server, where each institution alternatively serves as the central server. This method uses an institution-modulated CT image restoration network as the core of client-side local training. Through a federated learning approach of central guidance and alternating optimization, the central server leverages local labeled data to guide client-side network training to enhance the generalization capability of the CT imaging model across multiple institutions.
RESULTS:
In the low-dose and sparse-view CT image restoration tasks, the FedGP method showed significant advantages in both visual and quantitative evaluation and achieved the highest PSNR (40.25 and 38.84), the highest SSIM (0.95 and 0.92), and the lowest RMSE (2.39 and 2.56). Ablation study of FedGP demonstrated that compared with FedGP(w/o GP) without central guidance, the FedGP method better adapted to data heterogeneity across institutions, thus ensuring robustness and generalization capability of the model in different imaging conditions.
CONCLUSIONS
FedGP provides a more flexible FL framework to solve the problem of CT imaging heterogeneity and well adapts to multi-institutional data characteristics to improve generalization ability of the model under diverse imaging geometric configurations.
Tomography, X-Ray Computed/methods*
;
Humans
;
Radiation Dosage
;
Image Processing, Computer-Assisted/methods*
;
Algorithms
9.AConvLSTM U-Net: a multi-scale jaw cyst segmentation model based on bidirectional dense connection and attention mechanism.
Suqiang LI ; Zhouyang WANG ; Sixian CHAN ; Xiaolong ZHOU
Journal of Southern Medical University 2025;45(5):1082-1092
OBJECTIVES:
We propose a multi-scale jaw cyst segmentation model, AConvLSTM U-Net, which is based on bidirectional dense connections and attention mechanisms to achieve accurate automatic segmentation of mandibular cyst images.
METHODS:
A dataset consisting of 2592 jaw cyst images was used. AConvLSTM U-Net designs a MBC on the encoding path to enhance feature extraction capabilities. A DPD was used to connect the encoder and decoder, and a bidirectional ConvLSTM was introduced in the jump connection to obtain rich semantic information. A decoding block based on scSE was then used on the decoding path to enhance the focus on important information. Finally, a DS was designed, and the model was optimized by integrating a joint loss function to further improve the segmentation accuracy.
RESULTS:
The experiment with AConvLSTM U-Net for jaw cyst lesion segmentation showed a MCC of 93.8443%, a DSC of 93.9067%, and a JSC of 88.5133%, outperforming all the other comparison segmentation models.
CONCLUSIONS
The proposed algorithm shows a high accuracy and robustness on the jaw cyst dataset, demonstrating its superior performance over many existing methods for automatic segmentation of jaw cyst images and its potential to assist clinical diagnosis.
Humans
;
Jaw Cysts/diagnostic imaging*
;
Algorithms
;
Image Processing, Computer-Assisted/methods*
;
Neural Networks, Computer
10.SG-UNet: a melanoma segmentation model enhanced with global attention and self-calibrated convolution.
Huanyu JI ; Rui WANG ; Shengxiang GAO ; Wengang CHE
Journal of Southern Medical University 2025;45(6):1317-1326
OBJECTIVES:
We propose a new melanoma segmentation model, SG-UNet, to enhance the precision of melanoma segmentation in dermascopy images to facilitate early melanoma detection.
METHODS:
We utilized a U-shaped convolutional neural network, UNet, and made improvements to its backbone, skip connections, and downsampling pooling sections. In the backbone, with reference to the structure of VGG, we increased the number of convolutions from 10 to 13 in the downsampling part of UNet to achieve a deepened network hierarchy that allowed capture of more refined feature representations. To further enhance feature extraction and detail recognition, we replaced the traditional convolution the backbone section with self-calibrated convolution to enhance the model's ability to capture both spatial and channel dimensional features. In the pooling part, the original pooling layer was replaced by Haar wavelet downsampling to achieve more effective multi-scale feature fusion and reduce the spatial resolution of the feature map. The global attention mechanism was then incorporated into the skip connections at each layer to enhance the understanding of contextual information of the image.
RESULTS:
The experimental results showed that the SG-UNet model achieved significantly improved segmentation accuracy on ISIC 2017 and ISIC 2018 datasets as compared with other current state-of-the-art segmentation models, with Dice reached 92.41% and 86.62% and IoU reaching 92.31% and 86.48% on the two datasets, respectively.
CONCLUSIONS
The proposed model is capable of effective and accurate segmentation of melanoma from dermoscopy images.
Melanoma/diagnosis*
;
Humans
;
Neural Networks, Computer
;
Dermoscopy/methods*
;
Skin Neoplasms
;
Image Processing, Computer-Assisted/methods*
;
Calibration
;
Algorithms

Result Analysis
Print
Save
E-mail