1.Construction and preliminary validation of machine learning predictive models for cervical cancer screening based on human DNA methylation
Yuan YANG ; Hang ZHOU ; Yakun WANG ; Yu DAI ; Ruoji PI ; Hua ZHANG ; Ziyue HUANG ; Ting WU ; Jinghong YANG ; Wen CHEN
Chinese Journal of Oncology 2025;47(2):193-200
Objective: To use the methylation characteristics of human genes to construct machine learning predictive models for screening cervical cancer and precancerous lesions. Methods: Human DNA methylation detection was performed on 224 cervical exfoliated cell specimens collected between April 2014 and March 2015 from the Cancer Hospital of the Chinese Academy of Medical Sciences, Tianjin Central Hospital of Gynecology Obstetrics, Xinmi Maternal and Child Health Hospital of Henan Province, West China Second Affiliated Hospital of Sichuan University, and Heping Hospital Affiliated to Changzhi Medical College. The hypermethylated gene fragments related to cervical cancer were selected by high-density, high-association, and hypermethylated gene fragment screening and the LASSO regression algorithm. Taking cervical intraepithelial neoplasia grade 2 (CIN2) or more severe lesions as the research outcome, machine learning predictive models based on the random forest (RF), naive Bayes (NB), and support vector machine (SVM) algorithms were constructed. A total of 144 outpatient specimens were used as the training set, and 80 cervical exfoliated cell specimens from women participating in the cervical cancer screening program were used as the test set to verify the predictive models. Using histological diagnosis as the gold standard, the detection efficacy of the three machine learning predictive models for CIN2 or more severe lesions was compared with that of human papillomavirus (HPV) detection and cytological diagnosis. Results: In the training set of 144 cases, there were 34 cases of HPV positivity, with a positive rate of 23.61%. Cytologically, there were 37 cases diagnosed as no intraepithelial lesion or malignancy (NILM) and 107 cases diagnosed as atypical squamous cells of undetermined significance (ASC-US) or above. Histologically, there were 28 cases without cervical intraepithelial neoplasia or with benign cervical lesions, 31 cases of CIN1, 18 cases of CIN2, 31 cases of CIN3, and 36 cases of squamous cell carcinoma. Seven hypermethylated gene fragments were selected from 45 genes, and three machine learning prediction models based on the RF, NB, and SVM algorithms were constructed. In the validation set of 80 cases, there were 28 cases of HPV positivity, with a positive rate of 35.00%. Cytologically, there were 65 cases diagnosed as NILM and 15 cases as ASC-US or above. Histologically, there were 39 cases without cervical intraepithelial neoplasia or with benign cervical lesions, 10 cases of CIN1, 10 cases of CIN2, 11 cases of CIN3, and 10 cases of squamous cell carcinoma. In the validation set, the area under the curve (AUC) values of the RF model, NB model, SVM model, HPV detection, and cytological diagnosis for CIN2 or above were 0.90, 0.88, 0.82, 0.68, and 0.45, respectively. The DeLong test showed no statistically significant difference in AUC values among the RF, NB, and SVM models (all P>0.05). The AUC values of the RF and NB models were higher than that of HPV detection (both P<0.01), and the AUC values of the RF, NB, and SVM models were higher than that of cytological diagnosis (all P<0.01). Compared with the NB model, the sensitivity of the RF model was similar (80.65% vs. 77.42%), but the specificity of the NB model was much higher than that of the RF model (93.88% vs. 73.47%). Conclusion: Among the machine learning prediction models for cervical cancer and precancerous lesions constructed based on human DNA methylation, the NB model has good predictive performance for CIN2 and above lesions and may be used for screening of cervical cancer and precancerous lesions.
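The sensitivity and specificity figures above can be checked against the validation-set histology counts. The sketch below is only an arithmetic cross-check, not the authors' analysis: the true-positive and true-negative counts are not given in the abstract and are inferred here from the reported percentages, and the helper names `implied_count` and `pct` are hypothetical.

```python
# Validation-set histology counts as reported in the abstract.
histology = {"normal_or_benign": 39, "CIN1": 10, "CIN2": 10, "CIN3": 11, "SCC": 10}

# Outcome definition from the abstract: CIN2 or more severe lesions.
positives = histology["CIN2"] + histology["CIN3"] + histology["SCC"]
negatives = histology["normal_or_benign"] + histology["CIN1"]


def implied_count(rate_percent, total):
    """Integer count that would produce a reported percentage (an inference)."""
    return round(rate_percent / 100 * total)


def pct(numerator, denominator):
    """Percentage rounded to two decimals, as reported in the abstract."""
    return round(100 * numerator / denominator, 2)


# Reported sensitivities and specificities; counts below are inferred, not reported.
for label, rate, total in [
    ("sensitivity", 80.65, positives),
    ("sensitivity", 77.42, positives),
    ("specificity", 93.88, negatives),
    ("specificity", 73.47, negatives),
]:
    count = implied_count(rate, total)
    print(f"{label} {rate}% ~ {count}/{total} = {pct(count, total)}%")
```

Under this reading, each reported percentage corresponds to a whole number of cases out of the CIN2-or-worse group or the remaining group, which is consistent with the histology breakdown given in the abstract.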
2.Associations between statins and all-cause mortality and cardiovascular events among peritoneal dialysis patients: A multi-center large-scale cohort study.
Shuang GAO ; Lei NAN ; Xinqiu LI ; Shaomei LI ; Huaying PEI ; Jinghong ZHAO ; Ying ZHANG ; Zibo XIONG ; Yumei LIAO ; Ying LI ; Qiongzhen LIN ; Wenbo HU ; Yulin LI ; Liping DUAN ; Zhaoxia ZHENG ; Gang FU ; Shanshan GUO ; Beiru ZHANG ; Rui YU ; Fuyun SUN ; Xiaoying MA ; Li HAO ; Guiling LIU ; Zhanzheng ZHAO ; Jing XIAO ; Yulan SHEN ; Yong ZHANG ; Xuanyi DU ; Tianrong JI ; Yingli YUE ; Shanshan CHEN ; Zhigang MA ; Yingping LI ; Li ZUO ; Huiping ZHAO ; Xianchao ZHANG ; Xuejian WANG ; Yirong LIU ; Xinying GAO ; Xiaoli CHEN ; Hongyi LI ; Shutong DU ; Cui ZHAO ; Zhonggao XU ; Li ZHANG ; Hongyu CHEN ; Li LI ; Lihua WANG ; Yan YAN ; Yingchun MA ; Yuanyuan WEI ; Jingwei ZHOU ; Yan LI ; Caili WANG ; Jie DONG
Chinese Medical Journal 2025;138(21):2856-2858
3.Epidemiological characteristics of surgical site infection outbreaks at home and abroad
Lanping SHI ; Mengru LI ; Ping ZHOU ; Jianyun CHEN ; Jinghong YU ; Yuhua GAO ; Yang LI
Chinese Journal of Nosocomiology 2025;35(20):3063-3067
OBJECTIVE To analyze the characteristics of global surgical site infection (SSI) outbreaks and provide references for targeted prevention and control measures. METHODS SSI outbreak events from Jan. 1, 1990 to Dec. 31, 2023 were searched from Wanfang Med, CNKI, VIP and PubMed. Data on department distribution during the outbreak, duration, investigation methods, main causes, transmission modes, pathogen composition and outbreak outcomes were summarized for analysis. RESULTS A total of 111 SSI outbreaks in 20 countries were identified, involving 1 382 patients and 24 deaths. The source of the outbreak was identified in 78 cases, mainly involving medical personnel in 27 cases (34.62%), hospital equipment in 19 cases (24.36%), environmental factors in 11 cases (14.10%), workflow factors in 10 cases (12.82%), hospital water factors in 7 cases (8.97%) and disinfectant factors in 4 cases (5.13%). The duration of SSI outbreaks abroad was 5.00 (1.50, 12.00) months, longer than that in China [1.00 (1.00, 2.00) month] (P<0.05). In China, the outbreaks mainly occurred in neurosurgery, cardiac surgery, orthopedics, obstetrics and general surgery, while abroad they mainly occurred in cardiac surgery and orthopedics and involved multiple surgical departments. Environmental hygiene methods were adopted in up to 90.63% of cases in China. Compared with China, analytical epidemiological methods and molecular epidemiological methods were more commonly used abroad. Targeted measures and strengthened basic measures were implemented for different outbreak sources to terminate the SSI outbreaks. Pulsed-field gel electrophoresis (34 times) was a widely used molecular typing method in outbreak investigations. CONCLUSIONS Based on the analysis of SSI outbreak characteristics and risk factors, it is necessary to strengthen medical personnel training and monitoring of special pathogens. At the same time, the epidemiological investigation capabilities of infection control professionals should be strengthened.
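The source breakdown above can be reproduced from the raw counts. The following is a minimal sketch that only recomputes the reported shares of the 78 outbreaks with an identified source; the dictionary keys are shortened labels chosen here for illustration.

```python
# Outbreak-source counts as reported in the abstract (78 outbreaks with an identified source).
sources = {
    "medical personnel": 27,
    "hospital equipment": 19,
    "environmental factors": 11,
    "workflow factors": 10,
    "hospital water": 7,
    "disinfectant": 4,
}

total = sum(sources.values())

# Share of each source among identified outbreaks, rounded to two decimals.
shares = {name: round(100 * count / total, 2) for name, count in sources.items()}

for name, share in shares.items():
    print(f"{name}: {sources[name]}/{total} = {share:.2f}%")
```

The six categories sum to 78, so the reported percentages are shares of outbreaks with an identified source rather than of all 111 outbreaks.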
6.Hospitalization costs of road traffic injury in Shanghai, 2017‒2020
Yan YU ; Naiting XU ; Jinghong LIU ; Deding ZHOU ; Ning GAO ; Juanjuan PENG
Shanghai Journal of Preventive Medicine 2024;36(7):686-691
Objective: To assess the direct economic burden and influencing factors of road traffic injuries in Shanghai, providing a basis for future prevention of road traffic injuries. Methods: Information on inpatient cases caused by road traffic injuries was collected from 22 pilot hospitals in Shanghai that participated in the registration and reporting system of injury hospitalizations from 2017 to 2020. The collected information mainly included basic demographic information, total hospitalization costs for injuries, mode of transportation, cause of injury, nature of injury, location of injury, and hospital level. Univariate analysis was used to analyze the average hospitalization costs and length of hospital stay between different years. The Kruskal-Wallis test was used to analyze the median hospitalization costs and the median length of hospital stay between different years, and the median hospitalization costs of different groups. A full subset regression model in multivariate regression was used to explore the main influencing factors of hospitalization costs. Results: A total of 4 655 patients were hospitalized with road traffic injuries, including 2 675 males (57.47%) and 1 980 females (42.53%). The average age was (53.84±15.37) years, with a minimum age of 4 years and a maximum age of 97 years. A total of 1 907 patients (40.97%) had non-motor vehicle accidents, and 2 748 patients (59.03%) had motor vehicle accidents. The average hospitalization cost was 23 245.71 yuan, with a median of 9 793.94 yuan. The average length of stay was 13.61 days, with a median stay of 10.00 days. The main factors affecting hospitalization costs included year, nature of injury, location of injury, length of hospital stay, and hospital level. Conclusion: Road traffic injuries seriously affect the health of middle-aged and young people, especially vulnerable road users such as pedestrians and cyclists, causing serious injuries and a direct economic burden that increases year by year. Therefore, road traffic injury prevention projects should be actively carried out to reduce the occurrence of injuries.
7.Correlation between serum vitamin D and thyroid function indicators among elderly patients with type 2 diabetes mellitus
LI Huan ; ZHANG Jinghong ; YU Dan ; JIN Wenshu ; NI Shaomei ; WU Tianfeng
Journal of Preventive Medicine 2024;36(8):702-705
Objective:
To examine the association between serum vitamin D level and thyroid function indicators among elderly patients with type 2 diabetes mellitus (T2DM), so as to provide the evidence for the prevention and treatment of thyroid function abnormality among elderly patients with T2DM.
Methods:
Inpatients aged 60 years and older who were admitted to the Department of Endocrinology of Zhejiang Hospital were selected as the study subjects. Gender, age, course of disease and other basic information were collected through questionnaire surveys. Serum 25-hydroxyvitamin D [25-(OH)D], thyroid stimulating hormone (TSH), free triiodothyronine (FT3), free thyroxine (FT4), total triiodothyronine (TT3), and total thyroxine (TT4) were measured. The correlation between serum vitamin D level and thyroid function indicators in elderly patients with T2DM was evaluated by a multiple linear regression model.
Results:
A total of 402 elderly patients with T2DM were surveyed, including 210 males (52.24%) and 192 females (47.76%), with a median age of 70.00 (interquartile range, 12.00) years and a median course of disease of 14.00 (interquartile range, 14.00) years. There were 162 patients with vitamin D insufficiency (40.30%) and 182 patients with vitamin D deficiency (45.27%). The levels of TSH and glycated hemoglobin in the vitamin D deficiency group were (2.34±1.66) μIU/mL and (8.83±2.14)%, respectively, which were higher than those in the normal group [(1.74±1.10) μIU/mL and (8.11±1.75)%; P<0.05]. The levels of FT3 and FT3/FT4 in the vitamin D deficiency group were (2.86±0.48) pg/mL and 2.85±0.71, respectively, which were lower than those in the vitamin D insufficiency group [(3.09±0.47) pg/mL and 3.14±0.81; P<0.05]. Multiple linear regression analysis showed a negative correlation between 25-(OH)D and TSH (β'=-0.159, P=0.001).
Conclusion:
Vitamin D deficiency may be associated with increased TSH levels among elderly patients with T2DM.
8.Characteristics of unintentional injuries reported to Shanghai hospitalized injury surveillance system, 2017‒2020
Naiting XU ; Juanjuan PENG ; Ning GAO ; Deding ZHOU ; Yan YU ; Jinghong LIU ; Yan SHI
Shanghai Journal of Preventive Medicine 2023;35(1):35-41
Objective: To analyze the characteristics of hospitalized unintentional injuries and to provide evidence for formulating injury prevention and control strategies. Methods: Descriptive analysis was conducted on the injury information of unintentional injury cases reported by 22 monitoring hospitals through the injury surveillance system from 2017 to 2020. The composition ratio index was used to analyze and explore the distribution (population, time and place) of injuries related to different causes. The disease burden was described by the length of hospital stay and hospitalization cost. Results: A total of 32 716 hospitalized unintentional injury cases were reported from 22 monitoring hospitals. The male to female ratio was 1.18∶1. The majority of males were aged 15‒64 years and the majority of females were aged over 45 years. The top three causes of injury were falls, traffic-related injuries and blunt injuries. Injuries occurred more frequently in July and August. 42.81% of the cases occurred at home. Fracture cases accounted for 75.79%. The median length of hospital stay was 10 days and the median cost was 15 431.50 yuan. The share of both falls and non-motor vehicle accidents increased year by year. Conclusion: Falls among elderly people and road traffic injuries are the main causes of hospitalized unintentional injuries, and sharp or blunt instrument injuries are more severe in the male workforce. Considering the high direct and indirect economic losses from injuries, steps should be taken to improve the injury surveillance system and to implement injury prevention and control strategies targeted at key groups and key injuries.
9.Correlation between soil environment and yield and quality of Sharen (Amomi Fructus) under different planting patterns
YIN Cuiyun ; LI Yihang ; YU Jing ; ZHAO Hongyou ; DENG Zhaoyou ; TANG Deying ; Aung Kyaw Oo ; ZHANG Lixia
Digital Chinese Medicine 2023;6(2):221-233
【Objective】 To study the effects of soil environment on the growth, yield, and quality of Sharen (Amomi Fructus) under different planting patterns.
【Methods】 Soil physical and chemical indices and enzyme activities in four periods including early flowering (March), full flowering (June), fruit ripening (September), and late fruit picking (December), were measured under three planting patterns including natural forest, greenhouse, and rubber forest in Xishuangbanna, China. The changes in soil indices during the growth periods of Sharen (Amomi Fructus) under different planting patterns were analyzed, and the differences in plant growth, yield, and quality under different planting patterns were explored. Pearson correlation analysis was used to analyze the relationship between soil indices and Sharen (Amomi Fructus) growth, yield, and quality. Principal component analysis was used to investigate the effects of soil environment under different planting patterns on Sharen (Amomi Fructus) growth, yield, and quality.
【Results】 The soil moisture, available potassium content, and urease activity of the three planting patterns of Sharen (Amomi Fructus) increased initially and decreased afterwards throughout the year; pH and organic matter content showed little change in the whole year. Exchangeable manganese content and acid phosphatase activity gradually increased throughout the year. Hydrolyzed nitrogen content, exchangeable calcium content, available zinc content, protease activity, and sucrase activity decreased initially and increased afterwards throughout the year. Exchangeable magnesium content, available iron content, and catalase activity decreased annually. Total nitrogen content, total phosphorus content, and available phosphorus content fluctuated throughout the year. The total potassium content under natural forest and greenhouse planting decreased throughout the year, while the total potassium content under rubber forest showed an upward trend all year round. The organic matter content, total nitrogen content, total potassium content, available potassium content, available zinc content, urease activity, acid phosphatase activity, and catalase activity under greenhouse were significantly lower than those under natural and rubber forests (P < 0.05). Correlation analysis showed that plant growth, yield, and quality of Sharen (Amomi Fructus) were significantly correlated with soil organic matter, total nitrogen, hydrolyzed nitrogen, total phosphorus, available phosphorus, total potassium, available potassium, exchangeable manganese, exchangeable magnesium, exchangeable calcium, available zinc, urease, acid phosphatase, and invertase (P < 0.05). The results of the principal component analysis indicated that the soil environment of Sharen (Amomi Fructus) under natural forest was the best, followed by rubber forest and greenhouse. This ranking is consistent with the growth index of Sharen (Amomi Fructus) but contrary to its yield, indicating that the soil environment directly affects the growth index and nutritional components of plants.
【Conclusion】 Soil nutrient content differs among planting patterns of Sharen (Amomi Fructus), and the patterns of change over the growth period also differ. Soil indices affect the plant growth, yield, and quality of Sharen (Amomi Fructus). The soil ecological environment is positively correlated with the growth characteristics of Sharen (Amomi Fructus) plants but has no direct correlation with yield and quality.
10.Principles and strategies for species identification based on analysis of whole-genome
Yu-tong GAN ; Tian-yi XIN ; Wen-jie XU ; Li-jun HAO ; Gui-hong QI ; Qian LOU ; Jing-yuan SONG
Acta Pharmaceutica Sinica 2023;58(8):2364-2374
The main sources of natural drugs include various biological species such as plants, animals, and microorganisms. The accurate identification of these species is the bedrock of natural drug development. In this paper, we propose a novel method of species identification: analysis of whole-genome (AGE), a molecular diagnostic method that identifies species by finding species-specific sequences in the whole genome and precisely recognizing the specific target sequences. We explain that the principle of species identification based on AGE is that the genome sequences of different species must differ, and we divide the implementation strategy of the method into two levels: research and application. Based on our analysis of its characteristics, the method has the potential advantages of a reliable principle, high specificity, and wide applicability. Moreover, three crucial concerns related to building the method system, namely genome acquisition, bioinformatics analysis, and database construction, are further discussed. In summary, we offer theoretical underpinnings and methodological guidance for the development of bioinformatics software and commercial kits, indicating that AGE has great application potential across objects, subjects, and industries.

