1.Genetic diversity analysis and fingerprinting of 175 Chimonanthus praecox germplasm based on SSR molecular marker.
Xiujun WANG ; Yanbei ZHAO ; Jing WANG ; Zihang LI ; Jitang ZHANG ; Qingwei LI
Chinese Journal of Biotechnology 2024;40(1):252-268
The elucidation of resources pertaining to the Chimonanthus praecox varieties and the establishment of a fingerprint serve as crucial underpinnings for advancing scientific inquiry and industrial progress in relation to C. praecox. Employing the SSR molecular marker technology, an exploration of the genetic diversity of 175 C. praecox varieties (lines) in the Yanling region was conducted, and an analysis of the genetic diversity among these varieties was carried out using the UPDM clustering method in NTSYSpc 2.1 software. We analyzed the genetic structure of 175 germplasm using Structure v2.3.3 software based on a Bayesian model. General linear model (GLM) association was utilized to analyze traits and markers. The genetic diversity analysis revealed a mean number of alleles (Na) of 6.857, a mean expected heterozygosity (He) of 0.496 3, a mean observed heterozygosity (Ho) of 0.503 7, a mean genetic diversity index of Nei՚s of 0.494 9, and a mean Shannon information index of 0.995 8. These results suggest that the C. praecox population in Yanling exhibits a rich genetic diversity. Additionally, the population structure and the UPDM clustering were examined. In the GLM model, a total of fifteen marker loci exhibited significant (P < 0.05) association with eight phenotypic traits, with the explained phenotypic variation ranging from 14.90% to 36.03%. The construction of fingerprints for C. praecox varieties (lines) was accomplished by utilizing eleven primer pairs with the highest polymorphic information content, resulting in the analysis of 175 SSR markers. The present study offers a thorough examination of the genetic diversity and SSR molecular markers of C. praecox in Yanling, and establishes a fundamental germplasm repository of C. praecox, thereby furnishing theoretical underpinnings for the selection and cultivation of novel and superior C. praecox varieties, varietal identification, and resource preservation and exploitation.
Bayes Theorem
;
Biomarkers
;
Phenotype
;
Cluster Analysis
;
Genetic Variation
2.Analysis and prospects of common problems in clinical data mining of traditional Chinese medicine prescriptions.
Wen-Chao DAN ; Guo-Zhen ZHAO ; Qing-Yong HE ; Hui ZHANG ; Bo LI ; Guang-Zhong ZHANG
China Journal of Chinese Materia Medica 2023;48(17):4812-4818
Mining data from traditional Chinese medicine(TCM) prescriptions is one of the important methods for inheriting the experience of famous doctors and developing new drugs. However, current research work has problems such as to be optimized research plans and non-standard statistics. The main problems and corresponding solutions summarized by the research mainly include four aspects.(1)The research plan design needs to consider the efficacy and quality of individual cases.(2)The significance of the difference in confidence order of association rules needs to be further considered, and the lift should not be ignored.(3)The clustering analysis steps are complex. The selection of clustering variables should comprehensively consider factors such as the frequency of TCM, network topology parameters, and practical application significance. The selection of distance calculation and clustering methods should be improved based on the characteristics of TCM clinical data. Jaccard distance and its improvement plan should be given attention in the future. A single, unexplained clustering result should not be presented, but the final clustering plan should be selected based on a comprehensive consideration of TCM clinical characteristics and objective evaluation indicators for clustering.(4)When calculating correlation coefficients, algorithms that are only suitable for continuous variables should not be applied to binary variables. This article explained the connotations of the above problems based on the characteristics of TCM clinical research and statistical principles and proposed corresponding suggestions to provide important references for future data mining research work.
Humans
;
Medicine, Chinese Traditional
;
Prescriptions
;
Data Mining
;
Cluster Analysis
;
Physicians
;
Drugs, Chinese Herbal/therapeutic use*
3.Imputation method for dropout in single-cell transcriptome data.
Chao JIANG ; Longfei HU ; Chunxiang XU ; Qinyu GE ; Xiangwei ZHAO
Journal of Biomedical Engineering 2023;40(4):778-783
Single-cell transcriptome sequencing (scRNA-seq) can resolve the expression characteristics of cells in tissues with single-cell precision, enabling researchers to quantify cellular heterogeneity within populations with higher resolution, revealing potentially heterogeneous cell populations and the dynamics of complex tissues. However, the presence of a large number of technical zeros in scRNA-seq data will have an impact on downstream analysis of cell clustering, differential genes, cell annotation, and pseudotime, hindering the discovery of meaningful biological signals. The main idea to solve this problem is to make use of the potential correlation between cells and genes, and to impute the technical zeros through the observed data. Based on this, this paper reviewed the basic methods of imputing technical zeros in the scRNA-seq data and discussed the advantages and disadvantages of the existing methods. Finally, recommendations and perspectives on the use and development of the method were provided.
Cluster Analysis
;
Transcriptome
4.Analysis of the epidemiological characteristics of scarlet fever in Yantai City, Shandong Province from 2015 to 2019.
Chang Lan YU ; Xiu Wei LIU ; Xiao Dong MU ; Xing Jie PAN
Chinese Journal of Preventive Medicine 2023;57(3):411-415
From 2015 to 2019, the annual average incidence rate of scarlet fever was 7.80/100 000 in Yantai City, which showed an increasing trend since 2017 (χ2trend=233.59, P<0.001). The peak period of this disease was from April to July and November to January of the next year. The ratio of male to female was 1.49∶1, with a higher prevalence among cases aged 3 to 9 years (2 357/2 552, 92.36%). Children in kindergartens, primary and middle school students, and scattered children were the high risk population, with the incidence rate of 159.86/100 000, 25.57/100 000 and 26.77/100 000, respectively. The global spatial auto-correlation analysis showed that the global Moran's I index of the reported incidence rate of scarlet fever in Yantai from 2015 to 2019 was 0.28, 0.29, 0.44, 0.48, and 0.22, respectively (all P values<0.05), suggesting that the incidence rate of scarlet fever in Yantai from 2015 to 2019 was spatial clustering. The local spatial auto-correlation analysis showed that the "high-high" clustering areas were mainly located in Laizhou City, Zhifu District, Haiyang City, Fushan District and Kaifa District, while the "low-high" clustering areas were mainly located in Haiyang City and Fushan District.
Child
;
Humans
;
Male
;
Female
;
Scarlet Fever/epidemiology*
;
Spatial Analysis
;
Cities/epidemiology*
;
Seasons
;
Risk Factors
;
Incidence
;
Cluster Analysis
;
China/epidemiology*
5.Simultaneous determination of eleven volatile components in Cinnamomi Oleum by GC-MS.
Yang ZHOU ; Ting YAN ; Lin ZHENG ; Ming-Yan CHI ; Zi-Peng GONG ; Yue-Ting LI ; Jie PAN ; Yong HUANG ; Qing-Bo YANG
China Journal of Chinese Materia Medica 2023;48(6):1568-1577
A gas chromatography-triple quadrupole mass spectrometry(GC-MS) method was established for the simultaneous determination of eleven volatile components in Cinnamomi Oleum and the chemical pattern recognition was utilized to evaluate the quality of essential oil obtained from Cinnamomi Fructus medicinal materials in various habitats. The Cinnamomi Fructus medicinal materials were treated by water distillation, analyzed using GC-MS, and detected by selective ion monitoring(SIM), and the internal standards were used for quantification. The content results of Cinnamomi Oleum from various batches were analyzed by hierarchical clustering analysis(HCA), principal component analysis(PCA), and orthogonal partial least squares-discriminant analysis(OPLS-DA) for the statistic analysis. Eleven components showed good linear relationships within their respective concentration ranges(R~2>0.999 7), with average recoveries of 92.41%-102.1% and RSD of 1.2%-3.2%(n=6). The samples were classified into three categories by HCA and PCA, and 2-nonanone was screened as a marker of variability between batches in combination with OPLS-DA. This method is specific, sensitive, simple, and accurate, and the screened components can be utilized as a basis for the quality control of Cinnamomi Oleum.
Gas Chromatography-Mass Spectrometry
;
Plant Oils
;
Oils, Volatile
;
Drugs, Chinese Herbal/analysis*
;
Cluster Analysis
6.Resting-state electroencephalogram relevance state recognition of Parkinson's disease based on dynamic weighted symbolic mutual information and k-means clustering.
Hao DING ; Jinhui WU ; Xudong TANG ; Jiangnan YU ; Xuanheng CHEN ; Zhanxiong WU
Journal of Biomedical Engineering 2023;40(1):20-26
At present, the incidence of Parkinson's disease (PD) is gradually increasing. This seriously affects the quality of life of patients, and the burden of diagnosis and treatment is increasing. However, the disease is difficult to intervene in early stage as early monitoring means are limited. Aiming to find an effective biomarker of PD, this work extracted correlation between each pair of electroencephalogram (EEG) channels for each frequency band using weighted symbolic mutual information and k-means clustering. The results showed that State1 of Beta frequency band ( P = 0.034) and State5 of Gamma frequency band ( P = 0.010) could be used to differentiate health controls and off-medication Parkinson's disease patients. These findings indicated that there were significant differences in the resting channel-wise correlation states between PD patients and healthy subjects. However, no significant differences were found between PD-on and PD-off patients, and between PD-on patients and healthy controls. This may provide a clinical diagnosis reference for Parkinson's disease.
Humans
;
Parkinson Disease/diagnosis*
;
Quality of Life
;
Cluster Analysis
;
Electroencephalography
;
Healthy Volunteers
7.Spatial distribution of cognitive dysfunction and its risk factors in Chinese population aged 45 years and above.
Shuning HE ; Jiahao ZHANG ; Ruonan YANG ; Ping YUAN
Journal of Southern Medical University 2023;43(4):611-619
OBJECTIVE:
To analyze the spatial distribution of the prevalence of cognitive dysfunction and its risk factors in Chinese population aged 45 years and above to provide evidence for formulating regional prevention and control strategies.
METHODS:
The study subjects with complete cognitive function data were selected from the follow-up data of the China Health and Retirement Longitudinal Study (CHARLS) Phase IV. ArcGis 10.4 software was used for spatial analysis of the prevalence of cognitive dysfunction in the population aged 45 years and above for each province based on the geographic information system (GIS) technology.
RESULTS:
In 2018, the overall prevalence of cognitive dysfunction was 33.59% (5951/17716) in individuals aged 45 and above in China. Global spatial autocorrelation analysis indicated a spatial clustering and a positive autocorrelation (P < 0.001) of the prevalence of cognitive dysfunction in the study subjects, with a Moran's I value of 0.333085. The results of local spatial autocorrelation analysis showed that the southwestern region of China was the main aggregation area of patients with cognitive dysfunction. Geographically weighted regression analysis suggested that a male gender, an advanced age, and illiteracy were the major risk factors for cognitive dysfunction (P < 0.05). These 3 risk factors showed a spatial distribution heterogeneity with greater impact in the northern, western, and northwestern regions of China, respectively.
CONCLUSION
The prevalence of cognitive dysfunction is relatively high in individuals aged 45 years and above in China. A male gender, an advanced age, and illiteracy are the major risk factors for cognitive dysfunction and show different spatial distribution patterns, with the northern, western and northwestern regions of China as the key areas for prevention and control, where the prevention and control measures should be designed based on local conditions.
Humans
;
Male
;
China/epidemiology*
;
Cluster Analysis
;
Cognitive Dysfunction/epidemiology*
;
East Asian People
;
Longitudinal Studies
;
Risk Factors
;
Middle Aged
8.Spatial clustering analysis of scarlet fever incidence in China from 2016 to 2020.
Jiahao ZHANG ; Ruonan YANG ; Shuning HE ; Ping YUAN
Journal of Southern Medical University 2023;43(4):644-648
OBJECTIVE:
To investigate the incidence trend and spatial clustering characteristics of scarlet fever in China from 2016 to 2020 to provide evidence for development of regional disease prevention and control strategies.
METHODS:
The incidence data of scarlet fever in 31 provinces and municipalities in mainland China from 2016 to 2020 were obtained from the Chinese Health Statistics Yearbook and the Public Health Science Data Center led by the Chinese Center for Disease Control and Prevention.The three-dimensional spatial trend map of scarlet fever incidence in China was drawn using ArcGIS to determine the regional trend of scarlet fever incidence.GeoDa spatial autocorrelation analysis was used to explore the spatial aggregation of scarlet fever in China in recent years.
RESULTS:
From 2016 to 2020, a total of 310 816 cases of scarlet fever were reported in 31 provinces, municipalities directly under the central government and autonomous regions, with an average annual incidence of 4.48/100 000.The reported incidence decreased from 4.32/100 000 in 2016 to 1.18/100 000 in 2020(Z=103.47, P < 0.001).The incidence of scarlet fever in China showed an obvious regional clustering from 2016 to 2019(Moran's I>0, P < 0.05), but was randomly distributed in 2020(Moran's I>0, P=0.16).The incidence of scarlet fever showed a U-shaped distribution in eastern and western regions of China, and increased gradually from the southern to northern regions.Inner Mongolia Autonomous Region and Hebei and Gansu provinces had the High-high (H-H) clusters of scarlet fever in China.
CONCLUSION
Scarlet fever still has a high incidence in China with an obvious spatial clustering.For the northern regions of China with H-H clusters of scarlet fever, the allocation of health resources and public health education dynamics should be strengthened, and local scarlet fever prevention and control policies should be made to contain the hotspots of scarlet fever.
Humans
;
Incidence
;
Scarlet Fever/epidemiology*
;
China/epidemiology*
;
Spatial Analysis
;
Cluster Analysis
;
Spatio-Temporal Analysis
9.Spatial and temporal distribution characteristics of seasonal A(H3N2) influenza in China, 2014-2019.
Ya Yun HAN ; Jing YANG ; Xiao Xu ZENG ; Jia Ying YANG ; Guang Xue HE ; Da Yan WANG ; Tao CHEN
Chinese Journal of Epidemiology 2023;44(6):937-941
Objective: To analyze the spatial and temporal distribution characteristics of seasonal A(H3N2) influenza [influenza A(H3N2)] in China and to provide a reference for scientific prevention and control. Methods: The influenza A(H3N2) surveillance data in 2014-2019 was derived from China Influenza Surveillance Information System. A line chart described the epidemic trend analyzed and plotted. Spatial autocorrelation analysis was conducted using ArcGIS 10.7, and spatiotemporal scanning analysis was conducted using SaTScan 10.1. Results: A total of 2 603 209 influenza-like case sample specimens were detected from March 31, 2014, to March 31, 2019, and the influenza A(H3N2) positive rate was 5.96%(155 259/2 603 209). The positive rate of influenza A(H3N2) was statistically significant in the north and southern provinces in each surveillance year (all P<0.05). The high incidence seasons of influenza A (H3N2) were in winter in northern provinces and summer or winter in southern provinces. Influenza A (H3N2) clustered in 31 provinces in 2014-2015 and 2016-2017. High-high clusters were distributed in eight provinces, including Beijing, Tianjin, Hebei, Shandong, Shanxi, Henan, Shaanxi, and Ningxia Hui Autonomous Region in 2014-2015, and high-high clusters were distributed in five provinces including Shanxi, Shandong, Henan, Anhui, and Shanghai in 2016-2017. Spatiotemporal scanning analysis from 2014 to 2019 showed that Shandong and its surrounding twelve provinces clustered from November 2016 to February 2017 (RR=3.59, LLR=9 875.74, P<0.001). Conclusion: Influenza A (H3N2) has high incidence seasons with northern provinces in winter and southern provinces in summer or winter and obvious spatial and temporal clustering characteristics in China from 2014-2019.
Humans
;
Influenza, Human/epidemiology*
;
China/epidemiology*
;
Influenza A Virus, H3N2 Subtype
;
Seasons
;
Cluster Analysis
10.Epidemiological characteristics and spatio-temporal distribution of pulmonary tuberculosis cases reported in students from Guizhou Province, 2011-2020.
Long LIAO ; Hui Juan CHEN ; Shi Lin FANG ; Xiao Qi ZENG ; Su Fang XIONG ; Yun WANG
Chinese Journal of Epidemiology 2023;44(6):966-973
Objective: To analyze the trend of epidemiological characteristics and spatiotemporal distribution of pulmonary tuberculosis (PTB) among smear-positive or other types of students in Guizhou Province from 2011 to 2020, and to provide a reference for improving prevention and control measures. Methods: Data were collected from the Chinese Information System's Notifiable Disease and Tuberculosis Management Information System for disease control and prevention, the Joinpoint 4.9.1.0 software was used to analyze the trend of registration rate; the ArcGIS 10.6 software was used to construct a ring map and to perform spatial autocorrelation analysis; the SaTScan 9.7 software was used for spatial-temporal scan statistics. Results: A total of 32 682 student PTB cases were reported in Guizhou Province from 2011 to 2020, including 5 949 (18.20%) smear-positive cases. Most cases occurred from high school students of 16 to 18 years old (43.99%, 14 376/32 682); the annual average registered rate was 36.22/100 000, the highest in 2018 (52.90/100 000), and the registration rate showed an increasing trend. Meanwhile, a similar trend of registration rate was observed among smear-positive or other types of students. The spatialtemporal heterogeneity was found that the "high-high" clustering patterns of smear-positive or other types were aggregated in Bijie City. Six spatialtemporal clusters with statistically significant (all P<0.001) were detected among smear-positive or other cases, respectively. Conclusions: Upward trend with spatial- temporal clusters of PTB cases reported in students from Guizhou Province from 2011 to 2020. Surveillance should be strengthened for high school students, and regular screening should be conducted in high-risk areas to control the source of infection and reduce the risk of transmission.
Humans
;
Adolescent
;
Tuberculosis, Pulmonary/epidemiology*
;
Asian People
;
Cluster Analysis
;
Software
;
Students

Result Analysis
Print
Save
E-mail