1.Alternative Polyadenylation in Mammalian
Yu ZHANG ; Hong-Xia CHI ; Wu-Ri-Tu YANG ; Yong-Chun ZUO ; Yong-Qiang XING
Progress in Biochemistry and Biophysics 2025;52(1):32-49
With the rapid development of sequencing technologies, the detection of alternative polyadenylation (APA) in mammals has become more precise. APA precisely regulates gene expression by altering the length and position of the poly(A) tail, and is involved in various biological processes such as disease occurrence and embryonic development. The research on APA in mammals mainly focuses on the following aspects:(1) identifying APA based on transcriptome data and elucidating their characteristics; (2) investigating the relationship between APA and gene expression regulation to reveal its important role in life regulation;(3) exploring the intrinsic connections between APA and disease occurrence, embryonic development, differentiation, and other life processes to provide new perspectives and methods for disease diagnosis and treatment, as well as uncovering embryonic development regulatory mechanisms. In this review, the classification, mechanisms and functions of APA were elaborated in detail and the methods for APA identifying and APA data resources based on various transcriptome data were systematically summarized. Moreover, we epitomized and provided an outlook on research on APA, emphasizing the role of sequencing technologies in driving studies on APA in mammals. In the future, with the further development of sequencing technology, the regulatory mechanisms of APA in mammals will become clearer.
2.Comparison of Logistic Regression and Machine Learning Approaches in Predicting Depressive Symptoms: A National-Based Study
Xing-Xuan DONG ; Jian-Hua LIU ; Tian-Yang ZHANG ; Chen-Wei PAN ; Chun-Hua ZHAO ; Yi-Bo WU ; Dan-Dan CHEN
Psychiatry Investigation 2025;22(3):267-278
Objective:
Machine learning (ML) has been reported to have better predictive capability than traditional statistical techniques. The aim of this study was to assess the efficacy of ML algorithms and logistic regression (LR) for predicting depressive symptoms during the COVID-19 pandemic.
Methods:
Analyses were carried out in a national cross-sectional study involving 21,916 participants. The ML algorithms in this study included random forest (RF), support vector machine (SVM), neural network (NN), and gradient boosting machine (GBM) methods. The performance indices were sensitivity, specificity, accuracy, precision, F1-score, and area under the receiver operating characteristic curve (AUC).
Results:
LR and NN had the best performance in terms of AUCs. The risk of overfitting was found to be negligible for most ML models except for RF, and GBM obtained the highest sensitivity, specificity, accuracy, precision, and F1-score. Therefore, LR, NN, and GBM models ranked among the best models.
Conclusion
Compared with ML models, LR model performed comparably to ML models in predicting depressive symptoms and identifying potential risk factors while also exhibiting a lower risk of overfitting.
3.Structural and Spatial Analysis of The Recognition Relationship Between Influenza A Virus Neuraminidase Antigenic Epitopes and Antibodies
Zheng ZHU ; Zheng-Shan CHEN ; Guan-Ying ZHANG ; Ting FANG ; Pu FAN ; Lei BI ; Yue CUI ; Ze-Ya LI ; Chun-Yi SU ; Xiang-Yang CHI ; Chang-Ming YU
Progress in Biochemistry and Biophysics 2025;52(4):957-969
ObjectiveThis study leverages structural data from antigen-antibody complexes of the influenza A virus neuraminidase (NA) protein to investigate the spatial recognition relationship between the antigenic epitopes and antibody paratopes. MethodsStructural data on NA protein antigen-antibody complexes were comprehensively collected from the SAbDab database, and processed to obtain the amino acid sequences and spatial distribution information on antigenic epitopes and corresponding antibody paratopes. Statistical analysis was conducted on the antibody sequences, frequency of use of genes, amino acid preferences, and the lengths of complementarity determining regions (CDR). Epitope hotspots for antibody binding were analyzed, and the spatial structural similarity of antibody paratopes was calculated and subjected to clustering, which allowed for a comprehensively exploration of the spatial recognition relationship between antigenic epitopes and antibodies. The specificity of antibodies targeting different antigenic epitope clusters was further validated through bio-layer interferometry (BLI) experiments. ResultsThe collected data revealed that the antigen-antibody complex structure data of influenza A virus NA protein in SAbDab database were mainly from H3N2, H7N9 and H1N1 subtypes. The hotspot regions of antigen epitopes were primarily located around the catalytic active site. The antibodies used for structural analysis were primarily derived from human and murine sources. Among murine antibodies, the most frequently used V-J gene combination was IGHV1-12*01/IGHJ2*01, while for human antibodies, the most common combination was IGHV1-69*01/IGHJ6*01. There were significant differences in the lengths and usage preferences of heavy chain CDR amino acids between antibodies that bind within the catalytic active site and those that bind to regions outside the catalytic active site. The results revealed that structurally similar antibodies could recognize the same epitopes, indicating a specific spatial recognition between antibody and antigen epitopes. Structural overlap in the binding regions was observed for antibodies with similar paratope structures, and the competitive binding of these antibodies to the epitope was confirmed through BLI experiments. ConclusionThe antigen epitopes of NA protein mainly ditributed around the catalytic active site and its surrounding loops. Spatial complementarity and electrostatic interactions play crucial roles in the recognition and binding of antibodies to antigenic epitopes in the catalytic region. There existed a spatial recognition relationship between antigens and antibodies that was independent of the uniqueness of antibody sequences, which means that antibodies with different sequences could potentially form similar local spatial structures and recognize the same epitopes.
4.Comparison of Logistic Regression and Machine Learning Approaches in Predicting Depressive Symptoms: A National-Based Study
Xing-Xuan DONG ; Jian-Hua LIU ; Tian-Yang ZHANG ; Chen-Wei PAN ; Chun-Hua ZHAO ; Yi-Bo WU ; Dan-Dan CHEN
Psychiatry Investigation 2025;22(3):267-278
Objective:
Machine learning (ML) has been reported to have better predictive capability than traditional statistical techniques. The aim of this study was to assess the efficacy of ML algorithms and logistic regression (LR) for predicting depressive symptoms during the COVID-19 pandemic.
Methods:
Analyses were carried out in a national cross-sectional study involving 21,916 participants. The ML algorithms in this study included random forest (RF), support vector machine (SVM), neural network (NN), and gradient boosting machine (GBM) methods. The performance indices were sensitivity, specificity, accuracy, precision, F1-score, and area under the receiver operating characteristic curve (AUC).
Results:
LR and NN had the best performance in terms of AUCs. The risk of overfitting was found to be negligible for most ML models except for RF, and GBM obtained the highest sensitivity, specificity, accuracy, precision, and F1-score. Therefore, LR, NN, and GBM models ranked among the best models.
Conclusion
Compared with ML models, LR model performed comparably to ML models in predicting depressive symptoms and identifying potential risk factors while also exhibiting a lower risk of overfitting.
5.Comparison of Logistic Regression and Machine Learning Approaches in Predicting Depressive Symptoms: A National-Based Study
Xing-Xuan DONG ; Jian-Hua LIU ; Tian-Yang ZHANG ; Chen-Wei PAN ; Chun-Hua ZHAO ; Yi-Bo WU ; Dan-Dan CHEN
Psychiatry Investigation 2025;22(3):267-278
Objective:
Machine learning (ML) has been reported to have better predictive capability than traditional statistical techniques. The aim of this study was to assess the efficacy of ML algorithms and logistic regression (LR) for predicting depressive symptoms during the COVID-19 pandemic.
Methods:
Analyses were carried out in a national cross-sectional study involving 21,916 participants. The ML algorithms in this study included random forest (RF), support vector machine (SVM), neural network (NN), and gradient boosting machine (GBM) methods. The performance indices were sensitivity, specificity, accuracy, precision, F1-score, and area under the receiver operating characteristic curve (AUC).
Results:
LR and NN had the best performance in terms of AUCs. The risk of overfitting was found to be negligible for most ML models except for RF, and GBM obtained the highest sensitivity, specificity, accuracy, precision, and F1-score. Therefore, LR, NN, and GBM models ranked among the best models.
Conclusion
Compared with ML models, LR model performed comparably to ML models in predicting depressive symptoms and identifying potential risk factors while also exhibiting a lower risk of overfitting.
6.Comparison of Logistic Regression and Machine Learning Approaches in Predicting Depressive Symptoms: A National-Based Study
Xing-Xuan DONG ; Jian-Hua LIU ; Tian-Yang ZHANG ; Chen-Wei PAN ; Chun-Hua ZHAO ; Yi-Bo WU ; Dan-Dan CHEN
Psychiatry Investigation 2025;22(3):267-278
Objective:
Machine learning (ML) has been reported to have better predictive capability than traditional statistical techniques. The aim of this study was to assess the efficacy of ML algorithms and logistic regression (LR) for predicting depressive symptoms during the COVID-19 pandemic.
Methods:
Analyses were carried out in a national cross-sectional study involving 21,916 participants. The ML algorithms in this study included random forest (RF), support vector machine (SVM), neural network (NN), and gradient boosting machine (GBM) methods. The performance indices were sensitivity, specificity, accuracy, precision, F1-score, and area under the receiver operating characteristic curve (AUC).
Results:
LR and NN had the best performance in terms of AUCs. The risk of overfitting was found to be negligible for most ML models except for RF, and GBM obtained the highest sensitivity, specificity, accuracy, precision, and F1-score. Therefore, LR, NN, and GBM models ranked among the best models.
Conclusion
Compared with ML models, LR model performed comparably to ML models in predicting depressive symptoms and identifying potential risk factors while also exhibiting a lower risk of overfitting.
7.Comparison of Logistic Regression and Machine Learning Approaches in Predicting Depressive Symptoms: A National-Based Study
Xing-Xuan DONG ; Jian-Hua LIU ; Tian-Yang ZHANG ; Chen-Wei PAN ; Chun-Hua ZHAO ; Yi-Bo WU ; Dan-Dan CHEN
Psychiatry Investigation 2025;22(3):267-278
Objective:
Machine learning (ML) has been reported to have better predictive capability than traditional statistical techniques. The aim of this study was to assess the efficacy of ML algorithms and logistic regression (LR) for predicting depressive symptoms during the COVID-19 pandemic.
Methods:
Analyses were carried out in a national cross-sectional study involving 21,916 participants. The ML algorithms in this study included random forest (RF), support vector machine (SVM), neural network (NN), and gradient boosting machine (GBM) methods. The performance indices were sensitivity, specificity, accuracy, precision, F1-score, and area under the receiver operating characteristic curve (AUC).
Results:
LR and NN had the best performance in terms of AUCs. The risk of overfitting was found to be negligible for most ML models except for RF, and GBM obtained the highest sensitivity, specificity, accuracy, precision, and F1-score. Therefore, LR, NN, and GBM models ranked among the best models.
Conclusion
Compared with ML models, LR model performed comparably to ML models in predicting depressive symptoms and identifying potential risk factors while also exhibiting a lower risk of overfitting.
8.Four Weeks of HIIT Modulates Lactate-mediated Synaptic Plasticity to Improve Depressive-like Behavior in CUMS Rats
Yu-Mei HAN ; Zi-Wei ZHANG ; Jia-Ren LIANG ; Chun-Hui BAO ; Jun-Sheng TIAN ; Shi ZHOU ; Huan XIANG ; Yong-Hong YANG
Progress in Biochemistry and Biophysics 2025;52(6):1499-1510
ObjectiveThis study aimed to investigate the effects of 4-week high-intensity interval training (HIIT) on synaptic plasticity in the prefrontal cortex (PFC) of rats exposed to chronic unpredictable mild stress (CUMS), and to explore its potential mechanisms. MethodsA total of 48 male Sprague-Dawley rats were randomly divided into 4 groups: control (C), model (M), control plus HIIT (HC), and model plus HIIT (HM). Rats in groups M and HM underwent 8 weeks of CUMS to establish depression-like behaviors, while groups HC and HM received HIIT intervention beginning from the 5th week for 4 consecutive weeks. The HIIT protocol consisted of repeated intervals of 3 min at high speed (85%-90% maximal training speed, Smax) alternated with one minute at low speed (50%-55% Smax), with 3 to 5 sets per session, conducted 5 d per week. Behavioral assessments and tail-vein blood lactate levels were measured at the end of the 4th and 8th weeks. After the intervention, rat PFC tissues were collected for Golgi staining to analyze synaptic morphology. Enzyme-linked immunosorbent assays (ELISA) were employed to detect brain-derived neurotrophic factor (BDNF), monocarboxylate transporter 1 (MCT1), lactate, and glutamate levels in the PFC, as well as serotonin (5-HT) levels in serum. Additionally, Western blot analysis was conducted to quantify the expression of synaptic plasticity-related proteins, including c-Fos, activity-regulated cytoskeleton-associated protein (Arc), and N-methyl-D-aspartate receptor 1 (NMDAR1). ResultsCompared to the control group (C), the CUMS-exposed rats (group M) exhibited significant reductions in sucrose preference rates, number of grid crossings, frequency of upright postures, and entries into and duration spent in open arms of the elevated plus maze, indicating marked depressive-like behaviors. Additionally, the group M showed significantly reduced dendritic spine density in the PFC, along with elevated levels of c-Fos, Arc, NMDAR1 protein expression, and increased concentrations of lactate and glutamate. Conversely, BDNF and MCT1 contents in the PFC and 5-HT levels in serum were significantly decreased. Following HIIT intervention, rats in the group HM displayed considerable improvement in behavioral indicators compared with the group M, accompanied by significant elevations in PFC MCT1 and lactate concentrations. Furthermore, HIIT notably normalized the expression levels of c-Fos, Arc, NMDAR1, as well as glutamate and BDNF contents in the PFC. Synaptic spine density also exhibited significant recovery. ConclusionFour weeks of HIIT intervention may alleviate depressive-like behaviors in CUMS rats by increasing lactate levels and reducing glutamate concentration in the PFC, thereby downregulating the overexpression of NMDAR, attenuating excitotoxicity, and enhancing synaptic plasticity.
9.An interpretable machine learning modeling method for the effect of manual acupuncture manipulations on subcutaneous muscle tissue.
Wenqi ZHANG ; Yanan ZHANG ; Yan SHEN ; Chun SUN ; Jie CHEN ; Yuhe WEI ; Jian KANG ; Ziyi CHEN ; Jingqi YANG ; Jingwen YANG ; Chong SU
Chinese Acupuncture & Moxibustion 2025;45(10):1371-1382
OBJECTIVE:
To investigate the effect of manual acupuncture manipulations (MAMs) on subcutaneous muscle tissue, by developing quantitative models of "lifting and thrusting" and "twisting and rotating", based on machine learning techniques.
METHODS:
A depth camera was used to capture the acupuncture operator's hand movements during "lifting and thrusting" and "twisting and rotating" of needle. Simultaneously, the ultrasound imaging was employed to record the muscle tissue responses of the participants. Amplitude and angular features were extracted from the movement data of operators, and muscle fascicle slope features were derived from the data of ultrasound images. The dynamic time warping barycenter averaging algorithm was adopted to align the dual-source data. Various machine learning techniques were applied to build quantitative models, and the performance of each model was compared. The most optimal model was further analyzed for its interpretability.
RESULTS:
Among the quantitative models built for the two types of MAMs, the random forest model demonstrated the best performance. For the quantitative model of the "lifting and thrusting" technique, the coefficient of determination (R2) was 0.825. For the "twisting and rotating" technique, R2 reached 0.872.
CONCLUSION
Machine learning can be used to effectively develop the models and quantify the effects of MAMs on subcutaneous muscle tissue. It provides a new perspective to understand the mechanism of acupuncture therapy and lays a foundation for optimizing acupuncture technology and designing personalized treatment regimen in the future.
Humans
;
Acupuncture Therapy/methods*
;
Machine Learning
;
Male
;
Adult
;
Female
;
Subcutaneous Tissue/diagnostic imaging*
;
Young Adult
10.Clinical course, causes of worsening, and outcomes of severe ischemic stroke: A prospective multicenter cohort study.
Simiao WU ; Yanan WANG ; Ruozhen YUAN ; Meng LIU ; Xing HUA ; Linrui HUANG ; Fuqiang GUO ; Dongdong YANG ; Zuoxiao LI ; Bihua WU ; Chun WANG ; Jingfeng DUAN ; Tianjin LING ; Hao ZHANG ; Shihong ZHANG ; Bo WU ; Cairong ZHU ; Craig S ANDERSON ; Ming LIU
Chinese Medical Journal 2025;138(13):1578-1586
BACKGROUND:
Severe stroke has high rates of mortality and morbidity. This study aimed to investigate the clinical course, causes of worsening, and outcomes of severe ischemic stroke.
METHODS:
This prospective, multicenter cohort study enrolled adult patients admitted ≤30 days after ischemic stroke from nine hospitals in China between September 2017 and December 2019. Severe stroke was defined as a score of ≥15 on the National Institutes of Health Stroke Scale (NIHSS). Clinical worsening was defined as an increase of 4 in the NIHSS score from baseline. Unfavorable functional outcome was defined as a modified Rankin scale score ≥3 at 3 months and 1 year after stroke onset, respectively. We performed Logistic regression to explore baseline features and reperfusion therapies associated with clinical worsening and functional outcomes.
RESULTS:
Among 4201 patients enrolled, 854 patients (20.33%) had severe stroke on admission. Of 3347 patients without severe stroke on admission, 142 (4.24%) patients developed severe stroke in hospital. Of 854 patients with severe stroke on admission, 33.95% (290/854) experienced clinical worsening (median time from stroke onset: 43 h, Q1-Q3: 20-88 h), with brain edema (54.83% [159/290]) as the leading cause; 24.59% (210/854) of these patients died by 30 days, and 81.47% (677/831) and 78.44% (633/807) had unfavorable functional outcomes at 3 months and 1 year respectively. Reperfusion reduced the risk of worsening (adjusted odds ratio [OR]: 0.24, 95% confidence interval [CI]: 0.12-0.49, P <0.01), 30-day death (adjusted OR: 0.22, 95% CI: 0.11-0.41, P <0.01), and unfavorable functional outcomes at 3 months (adjusted OR: 0.24, 95% CI: 0.08-0.68, P <0.01) and 1 year (adjusted OR: 0.17, 95% CI: 0.06-0.50, P <0.01).
CONCLUSIONS:
Approximately one-fifth of patients with ischemic stroke had severe neurological deficits on admission. Clinical worsening mainly occurred in the first 3 to 4 days after stroke onset, with brain edema as the leading cause of worsening. Reperfusion reduced the risk of clinical worsening and improved functional outcomes.
REGISTRATION
ClinicalTrials.gov , NCT03222024.
Humans
;
Male
;
Female
;
Prospective Studies
;
Ischemic Stroke/mortality*
;
Aged
;
Middle Aged
;
Aged, 80 and over
;
Stroke
;
Brain Ischemia

Result Analysis
Print
Save
E-mail