1.Genetic diversity analysis and fingerprints of Chrysanthemum×morifolium based on SSR molecular markers.
Zihang LI ; Lifei HE ; Xiujun WANG ; Linfan GUO ; Chunyan LUO ; Qingwei LI
Chinese Journal of Biotechnology 2023;39(7):2839-2860
The present study aims to explore the genetic diversity of germplasm resources of Chrysanthemum×morifolium (hereinafter, C.×morifolium) at the molecular level and to establish a fingerprint database of C.×morifolium varieties. We employed 12 pairs of primers with high levels of polymorphism, clear bands, and high degrees of reproducibility to analyze the SSR molecular markers and genetic diversity of 91 C.×morifolium materials and 14 chrysanthemum- related materials. With regard to constructing the fingerprints of the tested materials, we chose 9 pairs of core primers. The findings revealed that 12 primer pairs detected 104 alleles in 105 samples, ranging from 2 to 26. The average number of observed alleles (Na) per site was 9.25. The average number of effective alleles (Ne) per site was 2.745 6, with its range being 1.276 0 to 4.742 5. Shannon genetic diversity index (I) values ranged between 0.513 3 and 2.239 9 (M=1.209 0). Nei's gene diversity index (H) ranged between 0.216 3 and 0.789 1 (M=0.578 0). The observed heterozygosity (Ho) ranged between 0.223 3 and 0.895 2 (M=0.557 5). The expected heterozygosity (He) ranged between 0.217 4 and 0.793 3 (M=0.580 8). The polymorphism information content (PIC) ranged between 0.211 5 and 0.774 0 (M=0.532 9). The genetic similarity (GS) ranged between 0.228 5 and 1.000 0 (M=0.608 3). Cluster analysis revealed that when the genetic distance (GD) equals to 0.30, the tested materials can be classified into 2 groups. When the GD equals to 0.27, the first group can be divided into 6 subgroups; accordingly, 105 tested materials can be divided into 7 subgroups. The cophenetic correlation test was carried out based on the cluster analysis, and the corresponding results showed that the cluster map correlated with the genetic similarity coefficient (r=0.952 73). According to the results of Structure population analysis, we obtained the optimal population number, with the true number of populations (K) being 3 and the population being divided concerning Q≥0.5. Three subgroups, i.e., Q1, Q2 and Q3, included 34, 33 and 28 germplasms, respectively, and the remaining 10 germplasms were identified as the mixed population. During the experiment, 9 pairs of core primers were screened among the total of 12 for a complete differentiation regarding 105 tested materials, and the fingerprints of 91 C.×morifolium materials and 14 chrysanthemum-related materials were further constructed. Overall, there were significant genetic differences and rich genetic diversity among C.×morifolium materials, which would shed light on the garden application and variety selection fields of C.×morifolium. The fingerprint database of 105 C.×morifolium varieties and chrysanthemum-related species may provide technical support for future research regarding the identification and screening system of C.×morifolium varieties.
Genetic Variation
;
Chrysanthemum/genetics*
;
Reproducibility of Results
;
Microsatellite Repeats/genetics*
;
Polymorphism, Genetic
;
Biomarkers
;
Phylogeny
2.Characteristics of the chloroplast genome of Dracaena marginata and phylogenetic analysis.
Zihao WANG ; Jiale GUO ; Qi FAN ; Zeyuan TIAN ; Xueqing WANG ; Wei ZHENG ; Luodong HUANG
Chinese Journal of Biotechnology 2023;39(7):2926-2938
Dracaena marginata is a widely cultivated horticultural plant in the world, which has high ornamental and medicinal value. In this study, the whole genome of leaves from D. marginata was sequenced by Illumina HiSeq 4000 platform. The chloroplast genome were assembled for functional annotation, sequence characteristics and phylogenetic analysis. The results showed that the chloroplast genome of D. marginata composed of four regions with a size of 154 926 bp, which was the smallest chloroplast genome reported for Dracaena species to date. A total of 132 genes were identified, including 86 coding genes, 38 tRNA genes and 8 rRNA genes. Codon bias analysis found that the codon usage bias was weak and there was a bias for using A/U base endings. 46 simple sequence repeat and 54 repeats loci were detected in the chloroplast genome, with the maximum detection rate in the large single copy region and inverted repeat region, respectively. The inverted repeats boundaries of D. marginata and Dracaena were highly conserved, whereas gene location differences occurred. Phylogenetic analysis revealed that D. serrulata and D. cinnabari form a monophyletic clade, which was the closest relationship and conformed to the morphological classification characteristics. The analysis of the chloroplast genome of D. marginata provides important data basis for species identification, genetic diversity and chloroplast genome engineering of Dracaena.
Phylogeny
;
Dracaena
;
Genome, Chloroplast/genetics*
;
Base Sequence
;
Genes, Plant
4.Complete chloroplast genome sequencing and phylogeny of wild Atractylodes lancea from Yuexi, Anhui province.
Jian-Peng HU ; Lu JIANG ; Rui XU ; Jun-Xian WU ; Feng-Ya GUAN ; Jin-Chen YAO ; Jun-Ling LIU ; Ya-Zhong ZHANG ; Liang-Ping ZHA
China Journal of Chinese Materia Medica 2023;48(1):52-59
This study investigated the choroplast genome sequence of wild Atractylodes lancea from Yuexi in Anhui province by high-throughput sequencing, followed by characterization of the genome structure, which laid a foundation for the species identification, analysis of genetic diversity, and resource conservation of A. lancea. To be specific, the total genomic DNA was extracted from the leaves of A. lancea with the improved CTAB method. The chloroplast genome of A. lancea was sequenced by the high-throughput sequencing technology, followed by assembling by metaSPAdes and annotation by CPGAVAS2. Bioiformatics methods were employed for the analysis of simple sequence repeats(SSRs), inverted repeat(IR) border, codon bias, and phylogeny. The results showed that the whole chloroplast genome of A. lancea was 153 178 bp, with an 84 226 bp large single copy(LSC) and a 18 658 bp small single copy(SSC) separated by a pair of IRs(25 147 bp). The genome had the GC content of 37.7% and 124 genes: 87 protein-coding genes, 8 rRNA genes, and 29 tRNA genes. It had 26 287 codons and encoded 20 amino acids. Phylogenetic analysis showed that Atractylodes species clustered into one clade and that A. lancea had close genetic relationship with A. koreana. This study established a method for sequencing the chloroplast genome of A. lancea and enriched the genetic resources of Compositae. The findings are expected to lay a foundation for species identification, analysis of genetic diversity, and resource conservation of A. lancea.
Phylogeny
;
Atractylodes/genetics*
;
Genome, Chloroplast
;
Whole Genome Sequencing
;
Microsatellite Repeats
;
Lamiales
5.Specific DNA barcodes, germplasm resources, and genetic diversity of Eleutherococcus senticosus.
Zhi-Fei ZHANG ; Zhong-Zhen ZHAO ; Xin WANG ; Guang-Yao YIN ; Ying CHEN ; Jin-Hui MAN ; Yue SHI ; Yu-Ying HUANG ; Shan-Hu LIU ; Zi-Qi LIU ; Xiao-Hui WANG ; Sheng-Li WEI
China Journal of Chinese Materia Medica 2023;48(5):1229-1237
Eleutherococcus senticosus is one of the Dao-di herbs in northeast China. In this study, the chloroplast genomes of three E. senticosus samples from different genuine producing areas were sequenced and then used for the screening of specific DNA barcodes. The germplasm resources and genetic diversity of E. senticosus were analyzed basing on the specific DNA barcodes. The chloroplast genomes of E. senticosus from different genuine producing areas showed the total length of 156 779-156 781 bp and a typical tetrad structure. Each of the chloroplast genomes carried 132 genes, including 87 protein-coding genes, 37 tRNAs, and 8 rRNAs. The chloroplast genomes were relatively conserved. Sequence analysis of the three chloroplast genomes indicated that atpI, ndhA, ycf1, atpB-rbcL, ndhF-rpl32, petA-psbJ, psbM-psbD, and rps16-psbK can be used as specific DNA barcodes of E. senticosus. In this study, we selected atpI and atpB-rbcL which were 700-800 bp and easy to be amplified for the identification of 184 E. senticosus samples from 13 genuine producing areas. The results demonstrated that 9 and 10 genotypes were identified based on atpI and atpB-rbcL sequences, respectively. Furthermore, the two barcodes identified 23 genotypes which were named H1-H23. The haplotype with the highest proportion and widest distribution was H10, followed by H2. The haplotype diversity and nucleotide diversity were 0.94 and 1.82×10~(-3), respectively, suggesting the high genetic diversity of E. senticosus. The results of the median-joining network analysis showed that the 23 genotypes could be classified into 4 categories. H2 was the oldest haplotype, and it served as the center of the network characterized by starlike radiation, which suggested that population expansion of E. senticosus occurred in the genuine producing areas. This study lays a foundation for the research on the genetic quality and chloroplast genetic engineering of E. senticosus and further research on the genetic mechanism of its population, providing new ideas for studying the genetic evolution of E. senticosus.
DNA Barcoding, Taxonomic
;
Eleutherococcus/genetics*
;
Base Sequence
;
Chloroplasts/genetics*
;
Genetic Variation
;
Phylogeny
6.A robust microsatellite instability detection model for unpaired colorectal cancer tissue samples.
Zili ZHANG ; Hua WAN ; Bing XU ; Hongyang HE ; Guangyu SHAN ; Jingbo ZHANG ; Qixi WU ; Tong LI
Chinese Medical Journal 2023;136(9):1082-1088
BACKGROUND:
Microsatellite instability (MSI) is a key biomarker for cancer immunotherapy and prognosis. Integration of MSI testing into a next-generation-sequencing (NGS) panel could save tissue sample, reduce turn-around time and cost, and provide MSI status and comprehensive genomic profiling in single test. We aimed to develop an MSI calling model to detect MSI status along with the NGS panel-based profiling test using tumor-only samples.
METHODS:
From January 2019 to December 2020, a total of 174 colorectal cancer (CRC) patients were enrolled, including 31 MSI-high (MSI-H) and 143 microsatellite stability (MSS) cases. Among them, 56 paired tumor and normal samples (10 MSI-H and 46 MSS) were used for modeling, and another 118 tumor-only samples were used for validation. MSI polymerase chain reaction (MSI-PCR) was performed as the gold standard. A baseline was built for the selected microsatellite loci using the NGS data of 56 normal blood samples. An MSI detection model was constructed by analyzing the NGS data of tissue samples. The performance of the model was compared with the results of MSI-PCR.
RESULTS:
We first intersected the target genomic regions of the NGS panels used in this study to select common microsatellite loci. A total of 42 loci including 23 mononucleotide repeat sites and 19 longer repeat sites were candidates for modeling. As mononucleotide repeat sites are more sensitive and specific for detecting MSI status than sites with longer length motif and the mononucleotide repeat sites performed even better than the total sites, a model containing 23 mononucleotide repeat sites was constructed and named Colorectal Cancer Microsatellite Instability test (CRC-MSI). The model achieved 100% sensitivity and 100% specificity when compared with MSI-PCR in both training and validation sets. Furthermore, the CRC-MSI model was robust with the tumor content as low as 6%. In addition, 8 out of 10 MSI-H samples showed alternations in the four mismatch repair genes ( MLH1 , MSH2 , MSH6 , and PMS2 ).
CONCLUSION
MSI status can be accurately determined along the targeted NGS panels using only tumor samples. The performance of mononucleotide repeat sites surpasses loci with longer repeat motif in MSI calling.
Humans
;
Microsatellite Instability
;
Colorectal Neoplasms/diagnosis*
;
Microsatellite Repeats/genetics*
;
DNA Mismatch Repair
7.Advances on the treatment of Fusobacterium nucleatum-promoted colorectal cancers using nanomaterials.
Hang WANG ; Xiaoxue HOU ; Jianfeng LIU ; Cuihong YANG
Chinese Journal of Biotechnology 2023;39(9):3670-3680
Fusobacterium nucleatum (Fn) is an oral anaerobic bacterium that has recently been found to colonize on the surface of colorectal cancer cells in humans, and its degree of enrichment is highly negatively correlated with the prognosis of tumor treatment. Numerous studies have shown that Fn is involved in the occurrence and development of colorectal cancer (CRC), and Fn interacts with multiple components in the tumor microenvironment to increase tumor resistance. In recent years, researchers have begun using nanomedicine to inhibit Fn's proliferation at the tumor site or directly target Fn to treat CRC. This review summarizes the mechanism of Fn in promoting CRC and the latest research progress on Fn-related CRC therapy using different nanomaterials. Finally, the applications perspective of nanomaterials in Fn-promoted CRC therapy was prospected.
Humans
;
Colorectal Neoplasms/pathology*
;
Fusobacterium nucleatum/genetics*
;
Base Composition
;
RNA, Ribosomal, 16S
;
Phylogeny
;
Sequence Analysis, DNA
;
Tumor Microenvironment
8.Exploration of transcriptome SSR markers and its application in genetic diversity assessment of Asarum sieboldii.
Meng-Ying CHEN ; Rui-Xian DAI ; Yu-Ling FAN ; Zhong LIU
China Journal of Chinese Materia Medica 2023;48(20):5519-5530
To explore the genetic diversity of Asarum sieboldii this study developed SSR markers based on transcriptome sequencing results and five populations of A.sieboldii from different regions were used as samples for genetic diversity assessment using software such as GenALEx 6.5, NTSYS 2.1, and Structure 2.3.4. The results showed that 16 SSR markers with high polymorphism and good repeatability were selected from the A.sieboldii transcriptome. Primers designed based on the flanking sequences of these markers successfully amplified 56 polymorphic fragments from 150 individual samples of the five A.sieboldii populations. On average, each primer amplified 3.5 polymorphic fragments, ranging from 2 to 8. The mean values of expected heterozygosity(H_e), Shannon's diversity index(I), Nei's gene diversity index(H), and the polymorphic information content(PIC) were 0.172, 0.281, 0.429, and 0.382, respectively. The mean population differentiation coefficient(F_(ST)) was 0.588, consistent with the analysis of molecular variance(AMOVA) results, which indicated greater genetic variation among A.sieboldii populations(69%) than that within populations(31%). The percentage of polymorphic loci(PPL) ranged from highest to lowest as SNJ>LN>SY>SZ>TB. Principal coordinate analysis(PCoA) and UPGMA clustering analysis further revealed genetic clustering of A.sieboldii individuals based on their geographical distribution, consistent with the results of the structure clustering analysis. In summary, the SSR markers developed from the transcriptome effectively assessed the genetic differentiation and population structure of natural A.sieboldii populations, revealing a relatively low genetic diversity in A.sieboldii, with genetic variation primarily observed at the population level and a correlation between population differentiation and geographic distance.
Humans
;
Genetic Variation
;
Asarum
;
Transcriptome/genetics*
;
Microsatellite Repeats/genetics*
;
Phylogeny
9.Clinical features and genetic analysis of a child with 3-methylglutenedioic aciduria type VII due to novel variants of CLPB gene.
Pengwu LIN ; Xuan FENG ; Shengju HAO ; Ling HUI ; Chuan ZHANG ; Bingbo ZHOU ; Lian WANG ; Jingyun SHI ; Qinghua ZHANG
Chinese Journal of Medical Genetics 2023;40(11):1377-1381
OBJECTIVE:
To explore the clinical features and genetic basis for a child with 3-methylglutaconic aciduria type VII.
METHODS:
A child who was diagnosed at the Gansu Provincial Maternity and Child Health Care Hospital on August 9, 2019 was selected as the study subject. Clinical data of the child, including urine gas chromatography and mass spectrometry, were collected. The child and her parents were subjected to whole exome sequencing.
RESULTS:
The child, a female neonate, had presented mainly with intermittent skin cyanosis, convulsions, hypomagnesemia, apnea, neutropenia after birth. Her urine 3-methylpentenedioic acid has increased to 17.53 μmol/L. DNA sequencing revealed that she has harbored compound heterozygous variants of the CLPB gene, namely c.1016delT (p.L339Rfs*5) and c.1087A>G (p.R363G), which were respectively inherited from her mother and father. Both variants were unreported previously. Based on the standards from the American College of Medical Genetics and Genomics (ACMG), the variants were respectively predicted to be pathogenic and likely pathogenic.
CONCLUSION
The child was diagnosed with 3-methylglutenedioic aciduria type VII. Discovery of the c.1016delT and c.1087A>G variants has enriched the mutational spectrum of the CLPB gene.
Female
;
Humans
;
Infant, Newborn
;
Pregnancy
;
Base Sequence
;
Metabolism, Inborn Errors/diagnosis*
;
Mutation
;
Neutropenia/genetics*
;
Sequence Analysis, DNA
10.Analysis of microsatellite instability in endometrial cancer: The significance of minimal microsatellite shift.
Li LIANG ; Xin LI ; Lin NONG ; Ying DONG ; Ji Xin ZHANG ; Dong LI ; Ting LI
Journal of Peking University(Health Sciences) 2023;55(2):254-261
OBJECTIVE:
To analyze the differences and characteristics of microsatellite instability (MSI) in endometrial cancer (EMC), by using colorectal cancer (CRC) as control.
METHODS:
In the study, 228 cases of EMC were collected. For comparative analysis, 770 cases of CRC were collected. Mismatch repair (MMR) expression was detected by immunohistochemistry (IHC), and microsatellite instability (MSI) was analyzed by PCR and capillary electrophoresis fragment analysis (MSI-PCR). MSI-PCR was detected using five mononucleotide repeat markers: BAT-25, BAT-26, NR-21, NR-24, and MONO-27.
RESULTS:
In EMC, we found 27.19% (62/228) of deficient mismatch repair (dMMR) using IHC, significantly higher than CRC (7.79%, 60/770). Meanwhile, subclonal expression of MMR protein was found in 4 cases of dMMR-EMC and 2 cases of dMMR-CRC. According to the criteria of major micro-satellite shift, we found 16.23% (37/228) of MSI-high (MSI-H), 2.63% (6/228) of MSI-low (MSI-L), and 81.14% (185/228) of microsatellite stability (MSS) in EMC using MSI-PCR. The discor-dance rate between MMR-IHC and MSI-PCR in EMC was 11.84% (27/228). In CRC, we found 8.05% (62/770) of MSI-H, 0.13% (1/770) of MSI-L, and 91.82% (707/770) of MSS. The discordance rate between MMR-IHC and MSI-PCR in CRC was only 0.52% (4/770). However, according to the criteria of minimal microsatellite shift, 12 cases of EMC showed minimal microsatellite shift including 8 cases of dMMR/MSS and 4 cases of dMMR/MSI-L and these cases were ultimately evaluated as dMMR/MSI-H. Then, 21.49% (49/228) of EMC showed MSI-H and the discordance rate MMR-IHC and MSI-PCR in EMC decreased to 6.58% (15/228). No minimal microsatellite shift was found in CRC. Compared with EMC group with major microsatellite shift, cases with minimal microsatellite shift showed younger age, better tumor differentiation, and earlier International Federation of Gynecology and Obstetrics (FIGO) stage. There were significant differences in histological variant and FIGO stage between the two groups (P < 0.001, P=0.006).
CONCLUSION
EMC was more prone to minimal microsatellite shift, which should not be ignored in the interpretation of MSI-PCR results. The combined detection of MMR-IHC and MSI-PCR is the most sensitive and specific method to capture MSI tumors.
Female
;
Humans
;
Microsatellite Instability
;
Colorectal Neoplasms
;
Microsatellite Repeats
;
Endometrial Neoplasms
;
DNA Mismatch Repair

Result Analysis
Print
Save
E-mail