Search Results

1.Role of artificial intelligence in medical image analysis.

Lu WANG ; Shimin ZHANG ; Nan XU ; Qianqian HE ; Yuming ZHU ; Zhihui CHANG ; Yanan WU ; Huihan WANG ; Shouliang QI ; Lina ZHANG ; Yu SHI ; Xiujuan QU ; Xin ZHOU ; Jiangdian SONG

Chinese Medical Journal 2025;138(22):2879-2894

With the emergence of deep learning techniques based on convolutional neural networks, artificial intelligence (AI) has driven transformative developments in the field of medical image analysis. Recently, large language models (LLMs) such as ChatGPT have also started to achieve distinction in this domain. Increasing research shows the undeniable role of AI in reshaping various aspects of medical image analysis, including processes such as image enhancement, segmentation, detection in image preprocessing, and postprocessing related to medical diagnosis and prognosis in clinical settings. However, despite the significant progress in AI research, studies investigating the recent advances in AI technology in the aforementioned aspects, the changes in research hotspot trajectories, and the performance of studies in addressing key clinical challenges in this field are limited. This article provides an overview of recent advances in AI for medical image analysis and discusses the methodological profiles, advantages, disadvantages, and future trends of AI technologies.
Artificial Intelligence ; Humans ; Image Processing, Computer-Assisted/methods* ; Neural Networks, Computer ; Deep Learning ; Diagnostic Imaging/methods*

2.Optimization of extraction process for Shenxiong Huanglian Jiedu Granules based on AHP-CRITIC hybrid weighting method, grey correlation analysis, and BP-ANN.

Zi-An LI ; De-Wen LIU ; Xin-Jian LI ; Bing-Yu WU ; Qun LAN ; Meng-Jia GUO ; Jia-Hui SUN ; Nan-Yang LIU ; Hui PEI ; Hao LI ; Hong YI ; Jin-Yu WANG ; Liang-Mian CHEN

China Journal of Chinese Materia Medica 2025;50(10):2674-2683

By employing the analytic hierarchy process(AHP), the CRITIC method(a weight determination method based on indicator correlations), and the AHP-CRITIC hybrid weighting method, the weight coefficients of evaluation indicators were determined, followed by a comprehensive score comparison. The grey correlation analysis was then performed to analyze the results calculated using the hybrid weighting method. Subsequently, a backpropagation-artificial neural network(BP-ANN) model was constructed to predict the extraction process parameters and optimize the extraction process for Shenxiong Huanglian Jiedu Granules(SHJG). In the extraction process, an L_9(3~4) orthogonal experiment was designed to optimize three factors at three levels, including extraction frequency, water addition amount, and extraction time. The evaluation indicators included geniposide, berberine, ginsenoside Rg_1 + Re, ginsenoside Rb_1, ferulic acid, and extract yield. Finally, the optimal extraction results obtained by the orthogonal experiment, grey correlation analysis, and BP-ANN method were compared, and validation experiments were conducted. The results showed that the optimal extraction process involved two rounds of aqueous extraction, each lasting one hour; the first extraction used ten times the amount of added water, while the second extraction used eight times the amount. In the validation experiments, the average content of each indicator component was higher than the average content obtained in the orthogonal experiment, with a higher comprehensive score. The optimized extraction process parameters were reliable and stable, making them suitable for subsequent preparation process research.
Drugs, Chinese Herbal/analysis* ; Neural Networks, Computer

3.The joint analysis of heart health and mental health based on continual learning.

Hongxiang GAO ; Zhipeng CAI ; Jianqing LI ; Chengyu LIU

Journal of Biomedical Engineering 2025;42(1):1-8

Cardiovascular diseases and psychological disorders represent two major threats to human physical and mental health. Research on electrocardiogram (ECG) signals offers valuable opportunities to address these issues. However, existing methods are constrained by limitations in understanding ECG features and transferring knowledge across tasks. To address these challenges, this study developed a multi-resolution feature encoding network based on residual networks, which effectively extracted local morphological features and global rhythm features of ECG signals, thereby enhancing feature representation. Furthermore, a model compression-based continual learning method was proposed, enabling the structured transfer of knowledge from simpler tasks to more complex ones, resulting in improved performance in downstream tasks. The multi-resolution learning model demonstrated superior or comparable performance to state-of-the-art algorithms across five datasets, including tasks such as ECG QRS complex detection, arrhythmia classification, and emotion classification. The continual learning method achieved significant improvements over conventional training approaches in cross-domain, cross-task, and incremental data scenarios. These results highlight the potential of the proposed method for effective cross-task knowledge transfer in ECG analysis and offer a new perspective for multi-task learning using ECG signals.
Humans ; Electrocardiography/methods* ; Mental Health ; Algorithms ; Signal Processing, Computer-Assisted ; Machine Learning ; Arrhythmias, Cardiac/diagnosis* ; Cardiovascular Diseases ; Neural Networks, Computer ; Mental Disorders

4.Research on motor imagery recognition based on feature fusion and transfer adaptive boosting.

Yuxin ZHANG ; Chenrui ZHANG ; Shihao SUN ; Guizhi XU

Journal of Biomedical Engineering 2025;42(1):9-16

This paper proposes a motor imagery recognition algorithm based on feature fusion and transfer adaptive boosting (TrAdaboost) to address the issue of low accuracy in motor imagery (MI) recognition across subjects, thereby increasing the reliability of MI-based brain-computer interfaces (BCI) for cross-individual use. Using the autoregressive model, power spectral density and discrete wavelet transform, time-frequency domain features of MI can be obtained, while the filter bank common spatial pattern is used to extract spatial domain features, and multi-scale dispersion entropy is employed to extract nonlinear features. The IV-2a dataset from the 4 ^th International BCI Competition was used for the binary classification task, with the pattern recognition model constructed by combining the improved TrAdaboost integrated learning algorithm with support vector machine (SVM), k nearest neighbor (KNN), and mind evolutionary algorithm-based back propagation (MEA-BP) neural network. The results show that the SVM-based TrAdaboost integrated learning algorithm has the best performance when 30% of the target domain instance data is migrated, with an average classification accuracy of 86.17%, a Kappa value of 0.723 3, and an AUC value of 0.849 8. These results suggest that the algorithm can be used to recognize MI signals across individuals, providing a new way to improve the generalization capability of BCI recognition models.
Brain-Computer Interfaces ; Humans ; Support Vector Machine ; Algorithms ; Neural Networks, Computer ; Imagination/physiology* ; Pattern Recognition, Automated/methods* ; Electroencephalography ; Wavelet Analysis

5.Research on emotion recognition methods based on multi-modal physiological signal feature fusion.

Zhiwen ZHANG ; Naigong YU ; Yan BIAN ; Jinhan YAN

Journal of Biomedical Engineering 2025;42(1):17-23

Emotion classification and recognition is a crucial area in emotional computing. Physiological signals, such as electroencephalogram (EEG), provide an accurate reflection of emotions and are difficult to disguise. However, emotion recognition still faces challenges in single-modal signal feature extraction and multi-modal signal integration. This study collected EEG, electromyogram (EMG), and electrodermal activity (EDA) signals from participants under three emotional states: happiness, sadness, and fear. A feature-weighted fusion method was applied for integrating the signals, and both support vector machine (SVM) and extreme learning machine (ELM) were used for classification. The results showed that the classification accuracy was highest when the fusion weights were set to EEG 0.7, EMG 0.15, and EDA 0.15, achieving accuracy rates of 80.19% and 82.48% for SVM and ELM, respectively. These rates represented an improvement of 5.81% and 2.95% compared to using EEG alone. This study offers methodological support for emotion classification and recognition using multi-modal physiological signals.
Humans ; Emotions/physiology* ; Electroencephalography ; Support Vector Machine ; Electromyography ; Signal Processing, Computer-Assisted ; Galvanic Skin Response/physiology* ; Machine Learning ; Male

6.Dynamic continuous emotion recognition method based on electroencephalography and eye movement signals.

Yangmeng ZOU ; Lilin JIE ; Mingxun WANG ; Yong LIU ; Junhua LI

Journal of Biomedical Engineering 2025;42(1):32-41

Existing emotion recognition research is typically limited to static laboratory settings and has not fully handle the changes in emotional states in dynamic scenarios. To address this problem, this paper proposes a method for dynamic continuous emotion recognition based on electroencephalography (EEG) and eye movement signals. Firstly, an experimental paradigm was designed to cover six dynamic emotion transition scenarios including happy to calm, calm to happy, sad to calm, calm to sad, nervous to calm, and calm to nervous. EEG and eye movement data were collected simultaneously from 20 subjects to fill the gap in current multimodal dynamic continuous emotion datasets. In the valence-arousal two-dimensional space, emotion ratings for stimulus videos were performed every five seconds on a scale of 1 to 9, and dynamic continuous emotion labels were normalized. Subsequently, frequency band features were extracted from the preprocessed EEG and eye movement data. A cascade feature fusion approach was used to effectively combine EEG and eye movement features, generating an information-rich multimodal feature vector. This feature vector was input into four regression models including support vector regression with radial basis function kernel, decision tree, random forest, and K-nearest neighbors, to develop the dynamic continuous emotion recognition model. The results showed that the proposed method achieved the lowest mean square error for valence and arousal across the six dynamic continuous emotions. This approach can accurately recognize various emotion transitions in dynamic situations, offering higher accuracy and robustness compared to using either EEG or eye movement signals alone, making it well-suited for practical applications.
Humans ; Electroencephalography/methods* ; Emotions/physiology* ; Eye Movements/physiology* ; Signal Processing, Computer-Assisted ; Support Vector Machine ; Algorithms

7.Research on arrhythmia classification algorithm based on adaptive multi-feature fusion network.

Mengmeng HUANG ; Mingfeng JIANG ; Yang LI ; Xiaoyu HE ; Zefeng WANG ; Yongquan WU ; Wei KE

Journal of Biomedical Engineering 2025;42(1):49-56

Deep learning method can be used to automatically analyze electrocardiogram (ECG) data and rapidly implement arrhythmia classification, which provides significant clinical value for the early screening of arrhythmias. How to select arrhythmia features effectively under limited abnormal sample supervision is an urgent issue to address. This paper proposed an arrhythmia classification algorithm based on an adaptive multi-feature fusion network. The algorithm extracted RR interval features from ECG signals, employed one-dimensional convolutional neural network (1D-CNN) to extract time-domain deep features, employed Mel frequency cepstral coefficients (MFCC) and two-dimensional convolutional neural network (2D-CNN) to extract frequency-domain deep features. The features were fused using adaptive weighting strategy for arrhythmia classification. The paper used the arrhythmia database jointly developed by the Massachusetts Institute of Technology and Beth Israel Hospital (MIT-BIH) and evaluated the algorithm under the inter-patient paradigm. Experimental results demonstrated that the proposed algorithm achieved an average precision of 75.2%, an average recall of 70.1% and an average F ₁-score of 71.3%, demonstrating high classification accuracy and being able to provide algorithmic support for arrhythmia classification in wearable devices.
Humans ; Arrhythmias, Cardiac/diagnosis* ; Algorithms ; Electrocardiography/methods* ; Neural Networks, Computer ; Signal Processing, Computer-Assisted ; Deep Learning ; Classification Algorithms

8.Research on intelligent fetal heart monitoring model based on deep active learning.

Bin QUAN ; Yajing HUANG ; Yanfang LI ; Qinqun CHEN ; Honglai ZHANG ; Li LI ; Guiqing LIU ; Hang WEI

Journal of Biomedical Engineering 2025;42(1):57-64

Cardiotocography (CTG) is a non-invasive and important tool for diagnosing fetal distress during pregnancy. To meet the needs of intelligent fetal heart monitoring based on deep learning, this paper proposes a TWD-MOAL deep active learning algorithm based on the three-way decision (TWD) theory and multi-objective optimization Active Learning (MOAL). During the training process of a convolutional neural network (CNN) classification model, the algorithm incorporates the TWD theory to select high-confidence samples as pseudo-labeled samples in a fine-grained batch processing mode, meanwhile low-confidence samples annotated by obstetrics experts were also considered. The TWD-MOAL algorithm proposed in this paper was validated on a dataset of 16 355 prenatal CTG records collected by our group. Experimental results showed that the algorithm proposed in this paper achieved an accuracy of 80.63% using only 40% of the labeled samples, and in terms of various indicators, it performed better than the existing active learning algorithms under other frameworks. The study has shown that the intelligent fetal heart monitoring model based on TWD-MOAL proposed in this paper is reasonable and feasible. The algorithm significantly reduces the time and cost of labeling by obstetric experts and effectively solves the problem of data imbalance in CTG signal data in clinic, which is of great significance for assisting obstetrician in interpretations CTG signals and realizing intelligence fetal monitoring.
Humans ; Pregnancy ; Female ; Cardiotocography/methods* ; Deep Learning ; Neural Networks, Computer ; Algorithms ; Fetal Monitoring/methods* ; Heart Rate, Fetal ; Fetal Distress/diagnosis* ; Fetal Heart/physiology*

9.Neural network for auditory speech enhancement featuring feedback-driven attention and lateral inhibition.

Yudong CAI ; Xue LIU ; Xiang LIAO ; Yi ZHOU

Journal of Biomedical Engineering 2025;42(1):82-89

The processing mechanism of the human brain for speech information is a significant source of inspiration for the study of speech enhancement technology. Attention and lateral inhibition are key mechanisms in auditory information processing that can selectively enhance specific information. Building on this, the study introduces a dual-branch U-Net that integrates lateral inhibition and feedback-driven attention mechanisms. Noisy speech signals input into the first branch of the U-Net led to the selective feedback of time-frequency units with high confidence. The generated activation layer gradients, in conjunction with the lateral inhibition mechanism, were utilized to calculate attention maps. These maps were then concatenated to the second branch of the U-Net, directing the network's focus and achieving selective enhancement of auditory speech signals. The evaluation of the speech enhancement effect was conducted by utilising five metrics, including perceptual evaluation of speech quality. This method was compared horizontally with five other methods: Wiener, SEGAN, PHASEN, Demucs and GRN. The experimental results demonstrated that the proposed method improved speech signal enhancement capabilities in various noise scenarios by 18% to 21% compared to the baseline network across multiple performance metrics. This improvement was particularly notable in low signal-to-noise ratio conditions, where the proposed method exhibited a significant performance advantage over other methods. The speech enhancement technique based on lateral inhibition and feedback-driven attention mechanisms holds significant potential in auditory speech enhancement, making it suitable for clinical practices related to artificial cochleae and hearing aids.
Humans ; Attention/physiology* ; Speech Perception/physiology* ; Neural Networks, Computer ; Speech ; Noise ; Feedback

10.Research on multi-scale convolutional neural network hand muscle strength prediction model improved based on convolutional attention module.

Yihao DU ; Mengyu SUN ; Jingjin LI ; Xiaoran WANG ; Tianfu CAO

Journal of Biomedical Engineering 2025;42(1):90-95

In order to realize the quantitative assessment of muscle strength in hand function rehabilitation and then formulate scientific and effective rehabilitation training strategies, this paper constructs a multi-scale convolutional neural network (MSCNN) - convolutional block attention module (CBAM) - bidirectional long short-term memory network (BiLSTM) muscle strength prediction model to fully explore the spatial and temporal features of the data and simultaneously suppress useless features, and finally achieve the improvement of the accuracy of the muscle strength prediction model. To verify the effectiveness of the model proposed in this paper, the model in this paper is compared with traditional models such as support vector machine (SVM), random forest (RF), convolutional neural network (CNN), CNN - squeeze excitation network (SENet), MSCNN-CBAM and MSCNN-BiLSTM, and the effect of muscle strength prediction by each model is investigated when the hand force application changes from 40% of the maximum voluntary contraction force (MVC) to 60% of the MVC. The research results show that as the hand force application increases, the effect of the muscle strength prediction model becomes worse. Then the ablation experiment is used to analyze the influence degree of each module on the muscle strength prediction result, and it is found that the CBAM module plays a key role in the model. Therefore, by using the model in this article, the accuracy of muscle strength prediction can be effectively improved, and the characteristics and laws of hand muscle activities can be deeply understood, providing assistance for further exploring the mechanism of hand functions .
Neural Networks, Computer ; Humans ; Hand Strength/physiology* ; Support Vector Machine ; Muscle Strength/physiology* ; Hand/physiology* ; Convolutional Neural Networks