1.Establishment and evaluation of a machine learning prediction model for sepsis-related encephalopathy in the elderly.
Xiao YUE ; Yiwen WANG ; Zhifang LI ; Lei WANG ; Li HUANG ; Shuo WANG ; Yiming HOU ; Shu ZHANG ; Zhengbin WANG
Chinese Critical Care Medicine 2025;37(10):937-943
OBJECTIVE:
To construct machine learning prediction model for sepsis-associated encephalopathy (SAE), and analyze the application value of the model on early identification of SAE risk in elderly septic patients.
METHODS:
Patients aged over 60 years with a primary diagnosis of sepsis admitted to intensive care unit (ICU) from 2008 to 2023 were selected from Medical Information Mart for Intensive Care-IV 2.2 (MIMIC-IV 2.2). Demographic variables, disease severity scores, comorbidities, interventions, laboratory indicators, and hospitalization details were collected. Key factors associated with SAE were identified using univariate Logistic regression analysis. The data were randomly divided into training and validation sets in a 7 : 3 ratio. Multivariable Logistic regression analysis was conducted in the training set and visualized using a nomogram model for prediction of SAE. The discrimination of the model was evaluated in the validation set using the receiver operator characteristic curve (ROC curve), and its calibration was assessed using calibration curve. Furthermore, multiple machine learning algorithms, including multi-layer perceptron (MLP), support vector machine (SVM), naive bayes (NB), gradient boosting machine (GBM), random forest (RF), and extreme gradient boosting (XGB), were constructed in the training set. Their predictive performance was subsequently evaluated on the validation set. Taking the XGB model as an example, the interpretability of the model through the SHapley Additive exPlanations (SHAP) algorithm was enhanced to identify the key predictive factors and their contributions.
RESULTS:
A total of 2 204 septic patients were finally enrolled, of whom 840 developed SAE (38.1%). A total of 21 variables associated with SAE were screened through univariate Logistic regression analysis. Multivariable Logistic regression analysis showed that endotracheal intubation [odds ratio (OR) = 0.40, 95% confidence interval (95%CI) was 0.19-0.88, P < 0.001], oxygen therapy (OR = 0.76, 95%CI was 0.53-0.95, P = 0.023), tracheotomy (OR = 0.20, 95%CI was 0.07-0.53, P < 0.001), continuous renal replacement therapy (CRRT; OR = 0.32, 95%CI was 0.15-0.70, P < 0.001), cerebrovascular disease (OR = 0.31, 95%CI was 0.16-0.60, P < 0.001), rheumatic disease (OR = 0.44, 95%CI was 0.19-0.99, P < 0.001), male (OR = 0.68, 95%CI was 0.54-0.86, P = 0.001), and maximum anion gap (AG; OR = 0.95, 95%CI was 0.93-0.97, P < 0.001) were associated with an decreased probability of SAE, and age (OR = 1.05, 95%CI was 1.03-1.06, P < 0.001), acute physiology score III (APSIII; OR = 1.02, 95%CI was 1.01-1.02, P < 0.001), Oxford acute severity of illness score (OASIS; OR = 1.04, 95%CI was 1.03-1.06, P < 0.001), and length of hospital stay (OR = 1.01, 95%CI was 1.01-1.02, P < 0.001) were associated with an increased probability of SAE. A nomogram model was constructed based on these variables. In the validation set, ROC curve analysis showed that the model achieved an area under the ROC curve (AUC) of 0.723, and the calibration curve showed good consistency between the predicted probability of the model and the observed probability. Among the machine learning algorithms, including MLP, SVM, NB, GBM, RF, and XGB, the SVM model and RF model demonstrated relatively good predictive performance, with AUC of 0.748 and 0.739, respectively, and the sensitivity was both exceeding 85%. The predictive performance of the XGB model was explained through SHAP analysis, and the results indicated that APSIII score (SHAP value was 0.871), age (SHAP value was 0.521), and OASIS score (SHAP value was 0.443) were important factors affecting the predictive performance of the model.
CONCLUSIONS
The machine learning-based SAE prediction model exhibits good predictive capability and holds significant application value for the early identification of SAE risk in elderly septic patients.
Humans
;
Machine Learning
;
Aged
;
Sepsis-Associated Encephalopathy
;
Sepsis/complications*
;
Intensive Care Units
;
Logistic Models
;
Middle Aged
;
Male
;
ROC Curve
;
Female
;
Bayes Theorem
;
Nomograms
;
Support Vector Machine
;
Algorithms
2.KG-CNNDTI: a knowledge graph-enhanced prediction model for drug-target interactions and application in virtual screening of natural products against Alzheimer's disease.
Chengyuan YUE ; Baiyu CHEN ; Long CHEN ; Le XIONG ; Changda GONG ; Ze WANG ; Guixia LIU ; Weihua LI ; Rui WANG ; Yun TANG
Chinese Journal of Natural Medicines (English Ed.) 2025;23(11):1283-1292
Accurate prediction of drug-target interactions (DTIs) plays a pivotal role in drug discovery, facilitating optimization of lead compounds, drug repurposing and elucidation of drug side effects. However, traditional DTI prediction methods are often limited by incomplete biological data and insufficient representation of protein features. In this study, we proposed KG-CNNDTI, a novel knowledge graph-enhanced framework for DTI prediction, which integrates heterogeneous biological information to improve model generalizability and predictive performance. The proposed model utilized protein embeddings derived from a biomedical knowledge graph via the Node2Vec algorithm, which were further enriched with contextualized sequence representations obtained from ProteinBERT. For compound representation, multiple molecular fingerprint schemes alongside the Uni-Mol pre-trained model were evaluated. The fused representations served as inputs to both classical machine learning models and a convolutional neural network-based predictor. Experimental evaluations across benchmark datasets demonstrated that KG-CNNDTI achieved superior performance compared to state-of-the-art methods, particularly in terms of Precision, Recall, F1-Score and area under the precision-recall curve (AUPR). Ablation analysis highlighted the substantial contribution of knowledge graph-derived features. Moreover, KG-CNNDTI was employed for virtual screening of natural products against Alzheimer's disease, resulting in 40 candidate compounds. 5 were supported by literature evidence, among which 3 were further validated in vitro assays.
Alzheimer Disease/drug therapy*
;
Biological Products/therapeutic use*
;
Humans
;
Neural Networks, Computer
;
Machine Learning
;
Drug Discovery/methods*
;
Algorithms
;
Drug Evaluation, Preclinical/methods*
3.Optimization of fermentation processes in intelligent biomanufacturing: on online monitoring, artificial intelligence, and digital twin technologies.
Jianye XIA ; Dongjiao LONG ; Min CHEN ; Anxiang CHEN
Chinese Journal of Biotechnology 2025;41(3):1179-1196
As a strategic emerging industry, biomanufacturing faces core challenges in achieving precise optimization and efficient scale-up of fermentation processes. This review focuses on two critical aspects of fermentation-real-time sensing and intelligent control-and systematically summarizes the advancements in online monitoring technologies, artificial intelligence (AI)-driven optimization strategies, and digital twin applications. First, online monitoring technologies, ranging from conventional parameters (e.g., temperature, pH, and dissolved oxygen) to advanced sensing systems (e.g., online viable cell sensors, spectroscopy, and exhaust gas analysis), provide a data foundation for real-time microbial metabolic state characterization. Second, conventional static control relying on expert experience is evolving toward AI-driven dynamic optimization. The integration of machine learning technologies (e.g., artificial neural networks and support vector machines) and genetic algorithms significantly enhances the regulation efficiency of feeding strategies and process parameters. Finally, digital twin technology, integrating real-time sensing data with multi-scale models (e.g., cellular metabolic kinetics and reactor hydrodynamics), offers a novel paradigm for lifecycle optimization and rational scale-up of fermentation. Future advancements in closed-loop control systems based on intelligent sensing and digital twin are expected to accelerate the industrialization of innovative achievements in synthetic biology and drive biomanufacturing toward higher efficiency, intelligence, and sustainability.
Artificial Intelligence
;
Fermentation
;
Bioreactors/microbiology*
;
Neural Networks, Computer
;
Algorithms
;
Biotechnology/methods*
4.pLM4ACP: a model for predicting anticancer peptides based on machine learning and protein language models.
Yitong LIU ; Wenxin CHEN ; Juanjuan LI ; Xue CHI ; Xiang MA ; Yanqiong TANG ; Hong LI
Chinese Journal of Biotechnology 2025;41(8):3252-3261
Cancer is a serious global health problem and a major cause of human death. Conventional cancer treatments often run the risk of impairing vital organ functions. Anticancer peptides (ACPs) are considered to be one of the most promising therapeutic agents against common human cancers due to their small sizes, high specificity, and low toxicity. Since ACP recognition is highly limited to the laboratory, expensive, and time-consuming, we proposed pLM4ACP, a model for predicting ACPs based on machine learning and protein language models. In this model, the protein language model ProtT5 was used to extract the features of ACPs, and the extracted features were input into the support vector machine (SVM) classification algorithm for optimization and performance evaluation. The model showcased significantly higher accuracy than other methods, with the overall accuracy of 0.763, F1-score of 0.767, Matthews correlation coefficient of 0.527, and area under the curve of 0.827 on the independent test set. This study constructs an efficient anticancer peptide prediction model based on protein language models, further advancing the application of artificial intelligence in the biomedical field and promoting the development of precision medicine and computational biology.
Machine Learning
;
Antineoplastic Agents/chemistry*
;
Humans
;
Peptides/chemistry*
;
Support Vector Machine
;
Algorithms
;
Computational Biology/methods*
;
Neoplasms/drug therapy*
5.An intelligent recognition method for crop density based on Faster R-CNN.
Xiuhua LI ; Qian LI ; Hanwen ZHANG ; Lu DING ; Zeping WANG
Chinese Journal of Biotechnology 2025;41(10):3828-3839
Accurately obtaining the crop quantity and density is not only crucial for the demand-based input of water and fertilizer in the field but also vital for ensuring the yield and quality of crops. Aerial photography by unmanned aerial vehicles (UAVs) can quickly acquire the distribution image information of crops over a large area. However, the accurate recognition of a single type of dense targets is a huge challenge for most recognition algorithms. Taking banana seedlings as an example in this study, we captured the images of banana plantations by UAVs from high altitudes to explore an efficient recognition method for dense targets. We proposed a strategy of "cut-recognition-stitch" and constructed a counting method based on the improved Faster R-CNN algorithm. First, the images containing highly dense targets were cropped into a large number of image tiles according to different sizes (simulating different flight altitudes), and the Contrast Limited Adaptive Histogram Equalization (CLAHE) algorithm was adopted to improve the image quality. A banana seedling dataset containing 36 000 image tiles was constructed. Then, the Faster R-CNN network with optimized parameters was used to train the banana seedling recognition model. Finally, the recognition results were reversely stitched together, and a boundary deduplication algorithm was designed to correct the final counting results to reduce the repeated recognition caused by image cropping. The results show that the recognition accuracy of the Faster R-CNN with optimized parameters for banana image datasets of different sizes can reach up to 0.99 at most. The deduplication algorithm can reduce the average counting error for the original aerial images from 1.60% to 0.60%, and the average counting accuracy of banana seedlings reaches 99.4%. The proposed method effectively addresses the challenge of recognizing dense small objects in high-resolution aerial images, providing an efficient and reliable technical solution for intelligent crop density monitoring in precision agriculture.
Musa/growth & development*
;
Crops, Agricultural/growth & development*
;
Algorithms
;
Neural Networks, Computer
;
Unmanned Aerial Devices
;
Seedlings/growth & development*
;
Image Processing, Computer-Assisted/methods*
;
Photography
;
Agriculture/methods*
6.Personalized mandibular reconstruction assisted by three-dimensional retrieval model based on fully connected neural network and a database of mandibles.
Shiyu QIU ; Yang LIAN ; Yifan KANG ; Lei ZHANG ; Yiwang CAI ; Xiaofeng SHAN ; Zhigang CAI
Journal of Peking University(Health Sciences) 2025;57(2):360-368
OBJECTIVE:
To propose a new protocol for personalized mandibular reconstruction assisted by three-dimensional (3D) retrieval model based on fully connected neural network (FCNN) and a database of mandibles, and to verify clinical feasibility of the protocol.
METHODS:
A database of mandibles of 300 normal northern Chinese Han people was established. On the basis of cephalometry, the mandible landmarks with good stability were further screened. Mandibular landmarks were selected and geometric features of the mandible were extracted. A 3D retrieval algorithm was developed, which could retrieve the mandible most similar to a given mandible from the database. A FCNN was built to train the algorithm to improve accuracy of the 3D retrieval model. Using Geomagic Control 2014 software, matching accuracy of the 3D retrieval model was based on aforementioned mandible database and algorithm. From December 2019 to March 2021, a total of 5 patients underwent personalized mandibular reconstruction assisted by a 3D retrieval model based on mandible database and FCNN in the Department of Oral and Maxillofacial Surgery, Peking University School and Hospital of Stomatology. The most similar mandible was retrieved from mandible database through 3D retrieval algorithm. It was used to restore the premorbid morphology of defect area and guide mandibular reconstruction. For the 5 patients, mandible was reconstructed with iliac flap. Virtual surgical plan was transformed using individual surgical guides.
RESULTS:
Through screening, mandibular landmarks with high reproducibility and stability were identified and composed of mandibular landmarker protocols. After training, the average deviation between most similar mandible retrieved from the 300-case mandible database through 3D retrieval model based on FCNN and given mandible was (1.77±0.44) mm. And the root-mean-square deviation between the most similar mandible retrieved from the database and given mandible was (2.58±0.86) mm. The mandibular reconstruction surgery was successful in all the 5 patients. Their facial symmetry and occlusion were restored. All the patients were satisfied with postoperative appearance. The mean deviation between postoperative mandible and preoperative design was (0.98±0.17) mm. The area with a deviation ≤1 mm accounted for 61.34%±14. 13%, ≤2 mm accounted for 83.82%±7.35%, and ≤3 mm accounted for 93.94%± 2.87%.
CONCLUSION
The personalized mandibular reconstruction assisted by 3D retrieval model based on the 300-case mandible database and FCNN is feasible clinically.
Humans
;
Neural Networks, Computer
;
Mandibular Reconstruction/methods*
;
Mandible/diagnostic imaging*
;
Imaging, Three-Dimensional/methods*
;
Adult
;
Databases, Factual
;
Female
;
Male
;
Algorithms
;
Middle Aged
;
Cephalometry
7.3D Pulse Image Detection and Pulse Pattern Recognition Based on Subtle Motion Magnification Technology.
Chongyang YAO ; Yongxin CHOU ; Zhiwei LIANG ; Haiping YANG ; Jicheng LIU ; Dongmei LIN
Chinese Journal of Medical Instrumentation 2025;49(3):255-262
To address the problem of large reconstruction errors in 3D pulse signals caused by excessively small out-of-plane displacement of the contact membrane in the existing traditional Chinese medicine fingertip tactile binocular vision detection technology, this study proposes a 3D pulse image detection method based on subtle motion magnification technology and explores its application in pulse pattern recognition. Firstly, a 3D pulse image detection system based on binocular vision to obtain pulse image signals is developed as experimental data. Then, the phase motion video magnification algorithm is used to amplify the original signals, and the amplified signals are reconstructed in three dimensions to obtain 3D pulse signals. On this basis, nine features are extracted from the 3D pulse signals and features selection is performed using a two-sample Kolmogorov-Smirnov test. Finally, machine learning algorithms such as decision trees and random forests are used to identify the five types of pulse conditions: deep pulse, intermittent pulse, flooding pulse, slippery pulse, and rapid pulse. The experimental results show that compared to the methods without subtle motion magnification technology, the proposed method significantly improves waveform clarity, amplitude stability, and periodic regularity. Meanwhile, the average accuracy in pulse pattern recognition reaches 96.29%±0.26%.
Algorithms
;
Imaging, Three-Dimensional/methods*
;
Pattern Recognition, Automated
;
Medicine, Chinese Traditional
;
Motion
;
Humans
;
Pulse
;
Signal Processing, Computer-Assisted
;
Machine Learning
8.An Adaptive LSTM Method for Parameter Calibration of Medical Robotic Arms.
Chinese Journal of Medical Instrumentation 2025;49(5):473-478
Medical robotic arm often encounters multi-source and nonlinear errors during the calibration process, making it difficult for traditional mathematical modeling methods to fully characterize system error features, thereby limiting further improvement in calibration accuracy. In this study, a robotic arm parameter error identification model is established, and a calibration method based on an adaptive long short-term memory (ALSTM) neural network is proposed. The method incorporates a particle swarm optimization (PSO) algorithm to optimize the weights of each layer of the LSTM neural network, enabling more effective fitting of robotic arm kinematic errors and ultimately yielding more accurate Denavit-Hartenberg (D-H) parameters. To validate the proposed approach, 110 sets of experimental data are collected using the HSR-JR680 robotic arm calibration system. Experimental results demonstrate that the ALSTM model reduces the root mean square error (RMSE) by 23.07%-80.39% compared to traditional calibration methods, and shortens the convergence time by 32.44% compared to a standard LSTM model. The optimized D-H parameters obtained meet the high-precision calibration requirements of medical robotic arm, confirming the effectiveness of the proposed method.
Calibration
;
Neural Networks, Computer
;
Algorithms
;
Robotics
;
Robotic Surgical Procedures
;
Models, Theoretical
9.Radiogenomics-based prediction of KRAS and EGFR gene mutation in non-small cell lung cancer patients.
Jianing LIN ; Zhihang YAN ; Longyu HE ; Hao ZHANG ; Mingxuan XIE
Journal of Central South University(Medical Sciences) 2025;50(5):805-814
OBJECTIVES:
Non-small cell lung cancer (NSCLC) is associated with poor prognosis, with 30% of patients diagnosed at an advanced stage. Mutations in the EGFR and KRAS genes are important prognostic factors for NSCLC, and targeted therapies can significantly improve survival in these patients. Although tissue biopsy remains the gold standard for detecting gene mutations, it has limitations, including invasiveness, sampling errors due to tumor heterogeneity, and poor reproducibility. This study aims to develop machine learning models based on radiomic features to predict EGFR and KRAS gene mutation status in NSCLC patients, thereby providing a reference for precision oncology.
METHODS:
Imaging and mutation data from eligible NSCLC patients were obtained from the publicly available Lung-PET-CT-Dx dataset in The Cancer Imaging Archive (TCIA). A three-dimensional-convolutional neural network (3D-CNN) was used to extract imaging features from the regions of interest (ROI). The LightGBM algorithm was employed to build classification models for predicting EGFR and KRAS gene mutation status. Model performance was evaluated using 5-fold cross-validation, with receiver operator characteristic (ROC) curves, area under the curve (AUC), accuracy, sensitivity, and specificity used for validation.
RESULTS:
The models effectively predicted EGFR and KRAS mutations in NSCLC patients, achieving an AUC of 0.95 for EGFR mutations and 0.90 for KRAS. The models also demonstrated high accuracy (EGFR 89.66%; KRAS 87.10%), sensitivity (EGFR 93.33%; KRAS 87.50%), and specificity (EGFR 85.71%; KRAS 86.67%).
CONCLUSIONS
A radiogenomics-machine learning predictive model can serve as a non-invasive tool for anticipating EGFR and KRAS gene mutation status in NSCLC patients.
Humans
;
Carcinoma, Non-Small-Cell Lung/diagnostic imaging*
;
Lung Neoplasms/diagnostic imaging*
;
Mutation
;
Proto-Oncogene Proteins p21(ras)/genetics*
;
ErbB Receptors/genetics*
;
Machine Learning
;
Positron Emission Tomography Computed Tomography
;
Female
;
Male
;
Neural Networks, Computer
;
Middle Aged
;
Aged
10.An efficient and lightweight skin pathology detection method based on multi-scale feature fusion using an improved RT-DETR model.
Yuying REN ; Lingxiao HUANG ; Fang DU ; Xinbo YAO
Journal of Southern Medical University 2025;45(2):409-421
OBJECTIVES:
The presence of multi-scale skin lesion regions and image noise interference and limited resources of auxiliary diagnostic equipment affect the accuracy of skin disease detection in skin disease detection tasks. To solve these problems, we propose a highly efficient and lightweight skin disease detection model using an improved RT-DETR model.
METHODS:
A lightweight FasterNet was introduced as the backbone network and the FasterNetBlock module was parametrically refined. A Convolutional and Attention Fusion Module (CAFM) was used to replace the multi-head self-attention mechanism in the neck network to enhance the ability of the AIFI-CAFM module for capturing global dependencies and local detail information. The DRB-HSFPN feature pyramid network was designed to replace the Cross-Scale Feature Fusion Module (CCFM) to allow the integration of contextual information across different scales to improve the semantic feature expression capacity of the neck network. Finally, combining the advantages of Inner-IoU and EIoU, the Inner-EIoU was used to replace the original loss function GIOU to further enhance the model's inference accuracy and convergence speed.
RESULTS:
The experimental results on the HAM10000 dataset showed that the improved RT-DETR model, as compared with the original model, had increased mAP@50 and mAP@50:95 by 4.5% and 2.8%, respectively, with a detection speed of 59.1 frames per second (FPS). The improved model had a parameter count of 10.9 M and a computational load of 19.3 GFLOPs, which were reduced by 46.0% and 67.2% compared to those of the original model, validating the effectiveness of the improved model.
CONCLUSIONS
The proposed SD-DETR model significantly improves the performance of skin disease detection tasks by effectively extracting and integrating multi-scale features while reducing both parameter count and computational load.
Humans
;
Skin Diseases/diagnosis*
;
Skin/pathology*
;
Neural Networks, Computer
;
Algorithms

Result Analysis
Print
Save
E-mail