1.Research on motor imagery recognition based on feature fusion and transfer adaptive boosting.
Yuxin ZHANG ; Chenrui ZHANG ; Shihao SUN ; Guizhi XU
Journal of Biomedical Engineering 2025;42(1):9-16
This paper proposes a motor imagery recognition algorithm based on feature fusion and transfer adaptive boosting (TrAdaboost) to address the issue of low accuracy in motor imagery (MI) recognition across subjects, thereby increasing the reliability of MI-based brain-computer interfaces (BCI) for cross-individual use. Using the autoregressive model, power spectral density and discrete wavelet transform, time-frequency domain features of MI can be obtained, while the filter bank common spatial pattern is used to extract spatial domain features, and multi-scale dispersion entropy is employed to extract nonlinear features. The IV-2a dataset from the 4 th International BCI Competition was used for the binary classification task, with the pattern recognition model constructed by combining the improved TrAdaboost integrated learning algorithm with support vector machine (SVM), k nearest neighbor (KNN), and mind evolutionary algorithm-based back propagation (MEA-BP) neural network. The results show that the SVM-based TrAdaboost integrated learning algorithm has the best performance when 30% of the target domain instance data is migrated, with an average classification accuracy of 86.17%, a Kappa value of 0.723 3, and an AUC value of 0.849 8. These results suggest that the algorithm can be used to recognize MI signals across individuals, providing a new way to improve the generalization capability of BCI recognition models.
Brain-Computer Interfaces
;
Humans
;
Support Vector Machine
;
Algorithms
;
Neural Networks, Computer
;
Imagination/physiology*
;
Pattern Recognition, Automated/methods*
;
Electroencephalography
;
Wavelet Analysis
2.Research on emotion recognition methods based on multi-modal physiological signal feature fusion.
Zhiwen ZHANG ; Naigong YU ; Yan BIAN ; Jinhan YAN
Journal of Biomedical Engineering 2025;42(1):17-23
Emotion classification and recognition is a crucial area in emotional computing. Physiological signals, such as electroencephalogram (EEG), provide an accurate reflection of emotions and are difficult to disguise. However, emotion recognition still faces challenges in single-modal signal feature extraction and multi-modal signal integration. This study collected EEG, electromyogram (EMG), and electrodermal activity (EDA) signals from participants under three emotional states: happiness, sadness, and fear. A feature-weighted fusion method was applied for integrating the signals, and both support vector machine (SVM) and extreme learning machine (ELM) were used for classification. The results showed that the classification accuracy was highest when the fusion weights were set to EEG 0.7, EMG 0.15, and EDA 0.15, achieving accuracy rates of 80.19% and 82.48% for SVM and ELM, respectively. These rates represented an improvement of 5.81% and 2.95% compared to using EEG alone. This study offers methodological support for emotion classification and recognition using multi-modal physiological signals.
Humans
;
Emotions/physiology*
;
Electroencephalography
;
Support Vector Machine
;
Electromyography
;
Signal Processing, Computer-Assisted
;
Galvanic Skin Response/physiology*
;
Machine Learning
;
Male
3.Dynamic continuous emotion recognition method based on electroencephalography and eye movement signals.
Yangmeng ZOU ; Lilin JIE ; Mingxun WANG ; Yong LIU ; Junhua LI
Journal of Biomedical Engineering 2025;42(1):32-41
Existing emotion recognition research is typically limited to static laboratory settings and has not fully handle the changes in emotional states in dynamic scenarios. To address this problem, this paper proposes a method for dynamic continuous emotion recognition based on electroencephalography (EEG) and eye movement signals. Firstly, an experimental paradigm was designed to cover six dynamic emotion transition scenarios including happy to calm, calm to happy, sad to calm, calm to sad, nervous to calm, and calm to nervous. EEG and eye movement data were collected simultaneously from 20 subjects to fill the gap in current multimodal dynamic continuous emotion datasets. In the valence-arousal two-dimensional space, emotion ratings for stimulus videos were performed every five seconds on a scale of 1 to 9, and dynamic continuous emotion labels were normalized. Subsequently, frequency band features were extracted from the preprocessed EEG and eye movement data. A cascade feature fusion approach was used to effectively combine EEG and eye movement features, generating an information-rich multimodal feature vector. This feature vector was input into four regression models including support vector regression with radial basis function kernel, decision tree, random forest, and K-nearest neighbors, to develop the dynamic continuous emotion recognition model. The results showed that the proposed method achieved the lowest mean square error for valence and arousal across the six dynamic continuous emotions. This approach can accurately recognize various emotion transitions in dynamic situations, offering higher accuracy and robustness compared to using either EEG or eye movement signals alone, making it well-suited for practical applications.
Humans
;
Electroencephalography/methods*
;
Emotions/physiology*
;
Eye Movements/physiology*
;
Signal Processing, Computer-Assisted
;
Support Vector Machine
;
Algorithms
4.Research on multi-scale convolutional neural network hand muscle strength prediction model improved based on convolutional attention module.
Yihao DU ; Mengyu SUN ; Jingjin LI ; Xiaoran WANG ; Tianfu CAO
Journal of Biomedical Engineering 2025;42(1):90-95
In order to realize the quantitative assessment of muscle strength in hand function rehabilitation and then formulate scientific and effective rehabilitation training strategies, this paper constructs a multi-scale convolutional neural network (MSCNN) - convolutional block attention module (CBAM) - bidirectional long short-term memory network (BiLSTM) muscle strength prediction model to fully explore the spatial and temporal features of the data and simultaneously suppress useless features, and finally achieve the improvement of the accuracy of the muscle strength prediction model. To verify the effectiveness of the model proposed in this paper, the model in this paper is compared with traditional models such as support vector machine (SVM), random forest (RF), convolutional neural network (CNN), CNN - squeeze excitation network (SENet), MSCNN-CBAM and MSCNN-BiLSTM, and the effect of muscle strength prediction by each model is investigated when the hand force application changes from 40% of the maximum voluntary contraction force (MVC) to 60% of the MVC. The research results show that as the hand force application increases, the effect of the muscle strength prediction model becomes worse. Then the ablation experiment is used to analyze the influence degree of each module on the muscle strength prediction result, and it is found that the CBAM module plays a key role in the model. Therefore, by using the model in this article, the accuracy of muscle strength prediction can be effectively improved, and the characteristics and laws of hand muscle activities can be deeply understood, providing assistance for further exploring the mechanism of hand functions .
Neural Networks, Computer
;
Humans
;
Hand Strength/physiology*
;
Support Vector Machine
;
Muscle Strength/physiology*
;
Hand/physiology*
;
Convolutional Neural Networks
5.Correlation between oxidative balance score and benign prostatic hyperplasia assessed by machine learning.
Hao-Ran WANG ; Jia-Xin NING ; Hui-Min HOU ; Ming LIU ; Jian-Ye WANG
National Journal of Andrology 2025;31(2):121-130
OBJECTIVE:
The relationship between benign prostatic hyperplasia (BPH) and the oxidative balance score (OBS) will be discussed in this study.
METHODS:
The clinical data on 16 dimensions of diet and 4 dimensions of lifestyle from the National Health and Nutrition Examination Survey (NHANES) from 2001 to 2008 were used to calculate OBS. We considered BPH as the outcome and investigated the linear and nonlinear relationships between the two. Additionally, subgroup analyses and interaction tests were conducted as well. Furthermore, the methods of machine learning including XGBoost, support vector machine (SVM) and naive Bayes (NB) were used to establish a predictive model for BPH.
RESULTS:
Higher OBS was consistently associated with an increased prevalence of BPH, with Restricted Cubic Splines highlighting a significant positive nonlinear association (P=0.015). Subgroup analyses revealed differences and interactive relationships based on alcohol consumption. Among the seven machine learning models that we included the OBS score in, the XGBoost model emerged as the best, with an AUC value of 0.769.
CONCLUSION
There is a significant association between OBS and the prevalence of BPH in the American population, which provides a valuable insight for further diagnosis and research of the disease.
Humans
;
Male
;
Prostatic Hyperplasia/epidemiology*
;
Machine Learning
;
Bayes Theorem
;
Nutrition Surveys
;
Support Vector Machine
;
Life Style
;
Oxidative Stress
;
Aged
;
Diet
;
Prevalence
6.Construction of recognition models for subthreshold depression based on multiple machine learning algorithms and vocal emotional characteristics.
Meimei CHEN ; Yang WANG ; Huangwei LEI ; Fei ZHANG ; Ruina HUANG ; Zhaoyang YANG
Journal of Southern Medical University 2025;45(4):711-717
OBJECTIVES:
To construct vocal recognition classification models using 6 machine learning algorithms and vocal emotional characteristics of individuals with subthreshold depression to facilitate early identification of subthreshold depression.
METHODS:
We collected voice data from both normal individuals and participants with subthreshold depression by asking them to read specifically chosen words and texts. From each voice sample, 384-dimensional vocal emotional feature variables were extracted, including energy feature, Meir frequency cepstrum coefficient, zero cross rate feature, sound probability feature, fundamental frequency feature, difference feature. The Recursive Feature Elimination (RFE) method was employed to select voice feature variables. Classification models were then built using the machine learning algorithms Adaptive Boosting (AdaBoost), Random Forest (RF), Linear Discriminant Analysis (LDA), Logistic Regression (LR), Lasso Regression (LRLasso), and Support Vector Machine (SVM), and the performance of these models was evaluated. To assess generalization capability of the models, we used real-world speech data to evaluate the best speech recognition classification model.
RESULTS:
The AdaBoost, RF, and LDA models achieved high prediction accuracies of 100%, 100%, and 93.3% on word-reading speech test set, respectively. In the text-reading speech test set, the accuracies of the AdaBoost, RF, and LDA models were 90%, 80%, and 90%, respectively, while the accuracies of the other 3 models were all below 80%. On real-world word-reading and text-reading speech data, the classification models using AdaBoost and Random Forest still achieved high predictive accuracies (91.7% and 80.6% for AdaBoost and 86.1% and 77.8% for Random, respectively).
CONCLUSIONS
Analyzing vocal emotional characteristics allows effective identification of individuals with subthreshold depression. The AdaBoost and RF models show excellent performance for classifying subthreshold depression individuals, and may thus potentially offer valuable assistance in the clinical and research settings.
Humans
;
Machine Learning
;
Emotions
;
Depression/diagnosis*
;
Algorithms
;
Voice
;
Support Vector Machine
;
Male
;
Female
7.Construction of risk prediction models of hypothermia after transurethral holmium laser enucleation of the prostate based on three machine learning algorithms.
Jun JIANG ; Shuo FENG ; Yingui SUN ; Yan AN
Journal of Southern Medical University 2025;45(9):2019-2025
OBJECTIVES:
To develop risk prediction models for postoperative hypothermia after transurethral holmium laser enucleation of the prostate (HoLEP) using machine learning algorithms.
METHODS:
We retrospectively analyzed the clinical data of 403 patients from our center (283 patients in the training set and 120in the internal validation set) and 120 patients from Weifang People's Hospital (as the external validation set). The risk prediction models were built using logistic regression, decision tree and support vector machine (SVM), and model performance was evaluated in terms of accuracy, recall, precision, F1 score and AUC.
RESULTS:
Operation duration, prostate weight, intraoperative irrigation volume, and being underweight were identified as the predictors of postoperative hypothermia following HoLEP. Among the 3 algorithms, SVM showed the best precision rate and accuracy in all the 3 data sets and the best area under the ROC (AUC) in the training set and validation set, followed by logistic regression, which had a similar AUC in the two data sets. SVM outperformed logistic regression and decision tree models in the validation set in precision, accuracy, recall, F1 score, and AUC, and performed well in the external validation set with better precision rate and accuracy than logistic regression and decision tree models but slightly lower recall rate, F1 index, and AUC value than the decision tree model. SVM outperformed logistic regression and decision tree models in precision, accuracy, F1 score, and AUC in the training set, but had slightly lower recall rate than the decision tree.
CONCLUSIONS
Among the 3 models, SVM has the best performance and generalizability for predicting post-HoLEP hypothermia risk to provide support for clinical decisions.
Humans
;
Male
;
Retrospective Studies
;
Machine Learning
;
Transurethral Resection of Prostate/adverse effects*
;
Hypothermia/etiology*
;
Prostatic Hyperplasia/surgery*
;
Algorithms
;
Lasers, Solid-State
;
Risk Assessment
;
Postoperative Complications
;
Decision Trees
;
Logistic Models
;
Aged
;
Middle Aged
;
Support Vector Machine
8.Establishment and evaluation of a machine learning prediction model for sepsis-related encephalopathy in the elderly.
Xiao YUE ; Yiwen WANG ; Zhifang LI ; Lei WANG ; Li HUANG ; Shuo WANG ; Yiming HOU ; Shu ZHANG ; Zhengbin WANG
Chinese Critical Care Medicine 2025;37(10):937-943
OBJECTIVE:
To construct machine learning prediction model for sepsis-associated encephalopathy (SAE), and analyze the application value of the model on early identification of SAE risk in elderly septic patients.
METHODS:
Patients aged over 60 years with a primary diagnosis of sepsis admitted to intensive care unit (ICU) from 2008 to 2023 were selected from Medical Information Mart for Intensive Care-IV 2.2 (MIMIC-IV 2.2). Demographic variables, disease severity scores, comorbidities, interventions, laboratory indicators, and hospitalization details were collected. Key factors associated with SAE were identified using univariate Logistic regression analysis. The data were randomly divided into training and validation sets in a 7 : 3 ratio. Multivariable Logistic regression analysis was conducted in the training set and visualized using a nomogram model for prediction of SAE. The discrimination of the model was evaluated in the validation set using the receiver operator characteristic curve (ROC curve), and its calibration was assessed using calibration curve. Furthermore, multiple machine learning algorithms, including multi-layer perceptron (MLP), support vector machine (SVM), naive bayes (NB), gradient boosting machine (GBM), random forest (RF), and extreme gradient boosting (XGB), were constructed in the training set. Their predictive performance was subsequently evaluated on the validation set. Taking the XGB model as an example, the interpretability of the model through the SHapley Additive exPlanations (SHAP) algorithm was enhanced to identify the key predictive factors and their contributions.
RESULTS:
A total of 2 204 septic patients were finally enrolled, of whom 840 developed SAE (38.1%). A total of 21 variables associated with SAE were screened through univariate Logistic regression analysis. Multivariable Logistic regression analysis showed that endotracheal intubation [odds ratio (OR) = 0.40, 95% confidence interval (95%CI) was 0.19-0.88, P < 0.001], oxygen therapy (OR = 0.76, 95%CI was 0.53-0.95, P = 0.023), tracheotomy (OR = 0.20, 95%CI was 0.07-0.53, P < 0.001), continuous renal replacement therapy (CRRT; OR = 0.32, 95%CI was 0.15-0.70, P < 0.001), cerebrovascular disease (OR = 0.31, 95%CI was 0.16-0.60, P < 0.001), rheumatic disease (OR = 0.44, 95%CI was 0.19-0.99, P < 0.001), male (OR = 0.68, 95%CI was 0.54-0.86, P = 0.001), and maximum anion gap (AG; OR = 0.95, 95%CI was 0.93-0.97, P < 0.001) were associated with an decreased probability of SAE, and age (OR = 1.05, 95%CI was 1.03-1.06, P < 0.001), acute physiology score III (APSIII; OR = 1.02, 95%CI was 1.01-1.02, P < 0.001), Oxford acute severity of illness score (OASIS; OR = 1.04, 95%CI was 1.03-1.06, P < 0.001), and length of hospital stay (OR = 1.01, 95%CI was 1.01-1.02, P < 0.001) were associated with an increased probability of SAE. A nomogram model was constructed based on these variables. In the validation set, ROC curve analysis showed that the model achieved an area under the ROC curve (AUC) of 0.723, and the calibration curve showed good consistency between the predicted probability of the model and the observed probability. Among the machine learning algorithms, including MLP, SVM, NB, GBM, RF, and XGB, the SVM model and RF model demonstrated relatively good predictive performance, with AUC of 0.748 and 0.739, respectively, and the sensitivity was both exceeding 85%. The predictive performance of the XGB model was explained through SHAP analysis, and the results indicated that APSIII score (SHAP value was 0.871), age (SHAP value was 0.521), and OASIS score (SHAP value was 0.443) were important factors affecting the predictive performance of the model.
CONCLUSIONS
The machine learning-based SAE prediction model exhibits good predictive capability and holds significant application value for the early identification of SAE risk in elderly septic patients.
Humans
;
Machine Learning
;
Aged
;
Sepsis-Associated Encephalopathy
;
Sepsis/complications*
;
Intensive Care Units
;
Logistic Models
;
Middle Aged
;
Male
;
ROC Curve
;
Female
;
Bayes Theorem
;
Nomograms
;
Support Vector Machine
;
Algorithms
9.pLM4ACP: a model for predicting anticancer peptides based on machine learning and protein language models.
Yitong LIU ; Wenxin CHEN ; Juanjuan LI ; Xue CHI ; Xiang MA ; Yanqiong TANG ; Hong LI
Chinese Journal of Biotechnology 2025;41(8):3252-3261
Cancer is a serious global health problem and a major cause of human death. Conventional cancer treatments often run the risk of impairing vital organ functions. Anticancer peptides (ACPs) are considered to be one of the most promising therapeutic agents against common human cancers due to their small sizes, high specificity, and low toxicity. Since ACP recognition is highly limited to the laboratory, expensive, and time-consuming, we proposed pLM4ACP, a model for predicting ACPs based on machine learning and protein language models. In this model, the protein language model ProtT5 was used to extract the features of ACPs, and the extracted features were input into the support vector machine (SVM) classification algorithm for optimization and performance evaluation. The model showcased significantly higher accuracy than other methods, with the overall accuracy of 0.763, F1-score of 0.767, Matthews correlation coefficient of 0.527, and area under the curve of 0.827 on the independent test set. This study constructs an efficient anticancer peptide prediction model based on protein language models, further advancing the application of artificial intelligence in the biomedical field and promoting the development of precision medicine and computational biology.
Machine Learning
;
Antineoplastic Agents/chemistry*
;
Humans
;
Peptides/chemistry*
;
Support Vector Machine
;
Algorithms
;
Computational Biology/methods*
;
Neoplasms/drug therapy*
10.Geographical origin authentication of Gongju at different spatial scales based on hyperspectral technology.
Xue GUO ; Rui-Bin BAI ; Hui WANG ; Wei-Wen LI ; Ling DONG ; Jia-Hui SUN ; Xiao-Bo ZHANG ; Jian YANG
China Journal of Chinese Materia Medica 2024;49(22):6073-6081
Gongju(Chrysanthemum morifolium) is one of the five major medicinal Chrysanthemum varieties included in the Chinese Pharmacopoeia. In recent years, its cultivation areas have changed significantly, resulting in mixed quality of the medicinal herbs. In this study, Gongju cultivated in Anhui, Yunnan, Chongqing, and other places were selected as research objects. Hyperspectral data were collected in the visible-near-infrared(VNIR) and short-wave infrared(SWIR) bands using different modes, such as corolla facing up(A) and flower base facing up(B). After pre-processing the hyperspectral data using five methods, including multiplicative scatter correction(MSC), Savitzky-Golay smoothing(SG), first derivative(D1), second derivative(D2), and standard normal variate(SNV), partial least squares discriminant analysis(PLSDA), random forest(RF), and support vector machine(SVM) were used to establish origin identification models of Gongju at the two geographical scales of the province and the city-county in Anhui province. The accuracy of the prediction results was used as an evaluation index to select the optimal models, and the classification performance of the models was evaluated by confusion matrix. The results showed that the flower base facing up(B) collection model combined with second derivative pretreatment and RF method was the best model for both geographical scale identification models. The modeling effect of the full-band(VNIR + SWIR) was slightly better than that of the single band, with the accuracy of the prediction set in the province and city-county regions reaching 99.69% and 99.40%, respectively. The competitive adaptive reweighted sampling algorithm(CARS), successive projections algorithm(SPA), and variable iterative space shrinkage approach(VISSA) were further used to screen the feature wavelength modeling. The number of feature wavelengths screened by CARS was fewer, and the prediction set accuracy of the two geographical scales models after optimization could reach 99.56% and 98.65%, which was basically comparable to the full-band model. However, the removal of redundant variables could greatly reduce the complexity of the model. The hyperspectral technology combined with the chemometrics model established in this study can achieve the origin identification of Gongju at different geographical scales, providing a theoretical basis and technical reference for the construction of a rapid detection system for Gongju origin and the development of exclusive miniaturized instrumentation and equipment systems.
Chrysanthemum/growth & development*
;
China
;
Support Vector Machine
;
Geography
;
Discriminant Analysis
;
Spectroscopy, Near-Infrared/methods*
;
Spectrum Analysis/methods*
;
Drugs, Chinese Herbal/analysis*
;
Least-Squares Analysis

Result Analysis
Print
Save
E-mail