1.Intraspecific variation of Forsythia suspensa chloroplast genome.
Yu-Han LI ; Lin-Lin CAO ; Chang GUO ; Yi-Heng WANG ; Dan LIU ; Jia-Hui SUN ; Sheng WANG ; Gang-Min ZHANG ; Wen-Pan DONG
China Journal of Chinese Materia Medica 2025;50(8):2108-2115
Forsythia suspensa is a traditional Chinese medicine and a commonly used landscaping plant. Its dried fruit is used in medicine for its functions of clearing heat, removing toxins, reducing swelling, dissipating masses, and dispersing wind and heat. It possesses extremely high medicinal and economic value. However, the genetic differentiation and diversity of its wild populations remain unclear. In this study, chloroplast genome sequences were obtained from 15 wild individuals of F. suspensa using high-throughput sequencing technology. The sequence characteristics and intraspecific variations were analyzed. The results were as follows:(1) The full length of the F. suspensa chloroplast genome ranged from 156 184 to 156 479 bp, comprising a large single-copy region, a small single-copy region, and two inverted repeat regions. The chloroplast genome encoded a total of 132 genes, including 87 protein-coding genes, 37 tRNA genes, and 8 rRNA genes.(2) A total of 166-174 SSR loci, 792 SNV loci, and 63 InDel loci were identified in the F. suspensa chloroplast genome, indicating considerable genetic variation among individuals.(3) Population structure analysis revealed that F. suspensa could be divided into five or six groups. Both the population structure analysis and phylogenetic reconstruction results indicated significant genetic variation within the wild populations of F. suspensa, with no obvious correlation between intraspecific genetic differentiation and geographical distribution. This study provides new insights into the genetic diversity and differentiation within F. suspensa species and offers additional references for the conservation of species diversity and the utilization of germplasm resources in wild F. suspensa.
Genome, Chloroplast
;
Forsythia/classification*
;
Phylogeny
;
Genetic Variation
;
Chloroplasts/genetics*
;
Microsatellite Repeats
2.Characteristics of the chloroplast genome of Camellia insularis.
Jin ZHANG ; Yongbiao DENG ; Bo ZHAO
Chinese Journal of Biotechnology 2024;40(1):280-291
In this study, the chloroplast genome of Camellia insularis Orel & Curry was sequenced using high-throughput sequencing technology. The results showed that the chloroplast genome of C. insularis was 156 882 bp in length with a typical tetrad structure, encoding 132 genes, including 88 protein-coding genes, 36 tRNA genes, and 8 rRNA genes. Codon preference analysis revealed that the highest number of codons coded for leucine, with a high A/U preference in the third codon position. Additionally, 67 simple sequence repeats (SSR) loci were identified, with a preference for A and T bases. The inverted repeat (IR) boundary regions of the chloroplast genome of C. insularis were relatively conserved, except for a few variable regions. Phylogenetic analysis indicated that C. insularis was most closely related to C. fascicularis. Yellow camellia is a valuable material for genetic engineering breeding. This study provides fundamental genetic information on chloroplast engineering and offers valuable resources for conducting in-depth research on the evolution, species identification, and genomic breeding of yellow Camellia.
Genome, Chloroplast/genetics*
;
Phylogeny
;
Plant Breeding
;
Camellia/genetics*
;
Chloroplasts/genetics*
3.Establishment and application of chloroplast genome database with the largest number of species in world.
Zi-Yuan CHEN ; Zhong-Yi HUA ; Yuan YUAN
China Journal of Chinese Materia Medica 2024;49(23):6257-6263
The chloroplast genome is an important tool for studying plant classification, evolution, and the heterologous production of secondary metabolites and protein drugs. With advancements in sequencing technology and reductions in sequencing costs, chloroplast genome data have rapidly accumulated. However, existing chloroplast genome databases suffer from issues such as incomplete data, inadequate management, and inconsistent, inaccurate information, posing significant challenges for the development and utilization of the chloroplast genome. Therefore, it is urgently necessary to establish a database that provides comprehensive and reliable chloroplast genome information. This article provides a brief introduction to the Chloroplast Genome Information Resource(CGIR), the most comprehensive chloroplast genome database globally in terms of species coverage. The database, consisting of five modules, i.e.,(1) genomes,(2) genes,(3) simple sequence repeats(SSRs),(4) DNA barcodes, and(5) DNA signature sequences(DSSs), currently includes 34 923 chloroplast genome assemblies from 16 717 species. Based on the functionalities of these modules, the article systematically summarizes the progress in the application of the database in plant phylogenetic analysis, species identification, and chloroplast genetic engineering. The chloroplast genome database will be continuously updated in the future to provide a solid and reliable data foundation for chloroplast genome research, further promoting studies on traditional Chinese medicine(TCM)identification, resource conservation, and germplasm innovation.
Genome, Chloroplast
;
Plants/classification*
;
Databases, Genetic
;
Phylogeny
;
Chloroplasts/genetics*
4.Chloroplast genomic characterization and phylogenetic analysis of Pellionia scabra.
Li YAN ; Xuelian YANG ; Yongfei WU ; Xia WANG ; Xiaojing HU
Chinese Journal of Biotechnology 2023;39(7):2914-2925
Pellionia scabra belongs to the genus Pellionia in the family of Urticaceae, and is a high-quality wild vegetables with high nutritional value. In this study, high-throughput techniques were used to sequence, assemble and annotate the chloroplast genome. We also analyzed its structure, and construct the phylogenetic trees from the P. scabra to further study the chloroplast genome characteristics. The results showed that the chloroplast genome size was 153 220 bp, and the GC content was 36.4%, which belonged to the typical tetrad structure in P. scabra. The chloroplast genome encodes 130 genes, including 85 protein-coding genes, 37 tRNA genes, and 8 rRNA genes in P. scabra. Among them, 15 genes contained 1 intron, 2 genes contained 2 introns, and rps12 had trans-splicing, respectively. In P. scabra, chloroplast genomes could be divided into four categories, including 43 photosynthesis, 64 self-replication, other 7 coding proteins, and 4 unknown functions. A total of 51 073 codons were detected in the chloroplast genome, among which the codon encoding leucine (Leu) accounted for the largest proportion, and the codon preferred to use A and U bases. There were 72 simple sequence repeats (SSRs) in the chloroplast genome of P. scabra, containing 58 single nucleotides, 12 dinucleotides, 1 trinucleotide, and 1 tetranucleotide. The ycf1 gene expansion was present at the IRb/SSC boundary. The phylogenetic trees showed that P. scabra (OL800583) was most closely related to Elatostema stewardii (MZ292972), Elatostema dissectum (MK227819) and Elatostema laevissimum var. laevissimum (MN189961). Taken together, our results provide worthwhile information for understanding the identification, genetic evolution, and genomics research of P. scabra species.
Phylogeny
;
Genome, Chloroplast/genetics*
;
Genomics
;
Chloroplasts/genetics*
;
Codon
;
Urticaceae/genetics*
5.Characteristics of the chloroplast genome of Dracaena marginata and phylogenetic analysis.
Zihao WANG ; Jiale GUO ; Qi FAN ; Zeyuan TIAN ; Xueqing WANG ; Wei ZHENG ; Luodong HUANG
Chinese Journal of Biotechnology 2023;39(7):2926-2938
Dracaena marginata is a widely cultivated horticultural plant in the world, which has high ornamental and medicinal value. In this study, the whole genome of leaves from D. marginata was sequenced by Illumina HiSeq 4000 platform. The chloroplast genome were assembled for functional annotation, sequence characteristics and phylogenetic analysis. The results showed that the chloroplast genome of D. marginata composed of four regions with a size of 154 926 bp, which was the smallest chloroplast genome reported for Dracaena species to date. A total of 132 genes were identified, including 86 coding genes, 38 tRNA genes and 8 rRNA genes. Codon bias analysis found that the codon usage bias was weak and there was a bias for using A/U base endings. 46 simple sequence repeat and 54 repeats loci were detected in the chloroplast genome, with the maximum detection rate in the large single copy region and inverted repeat region, respectively. The inverted repeats boundaries of D. marginata and Dracaena were highly conserved, whereas gene location differences occurred. Phylogenetic analysis revealed that D. serrulata and D. cinnabari form a monophyletic clade, which was the closest relationship and conformed to the morphological classification characteristics. The analysis of the chloroplast genome of D. marginata provides important data basis for species identification, genetic diversity and chloroplast genome engineering of Dracaena.
Phylogeny
;
Dracaena
;
Genome, Chloroplast/genetics*
;
Base Sequence
;
Genes, Plant
6.Characteristics and phylogenetic analysis of chloroplast genome of a new type of fruit Rubus rosaefolius.
Yongfei WU ; Xuelian YANG ; Xia WANG ; Li YAN ; Wanping ZHANG
Chinese Journal of Biotechnology 2023;39(7):2939-2953
The genomic DNA of Rubus rosaefolius was extracted and sequenced by Illumina NovaSeq platform to obtain the complete chloroplast genome sequence, and the sequence characteristics and phylogenetic analysis of chloroplast genes were carried out. The results showed that the complete chloroplast genome of the R. rosaefolius was 155 650 bp in length and had a typical tetrad structure, including two reverse repeats (25 748 bp each), a large copy region (85 443 bp) and a small copy region (18 711 bp). A total of 131 genes were identified in the whole genome of R. rosaefolius chloroplast, including 86 protein coding genes, 37 tRNA genes and 8 rRNA genes. The GC content of the whole genome was 36.9%. The genome of R. rosaefolius chloroplast contains 47 scattered repeats and 72 simple sequence repeating (SSR) loci. The codon preference is leucine codon, and the codon at the end of A/U is preferred. Phylogenetic analysis showed that R. rosaefolius had the closest relationship with R. taiwanicola, followed by R. rubraangustifolius and R. glandulosopunctatus. The chloroplast genome characteristics and phylogenetic analysis of R. rosaefolius provide a theoretical basis for its genetic diversity research and chloroplast development and utilization.
Phylogeny
;
Rubus/genetics*
;
Genome, Chloroplast
;
Fruit/genetics*
;
Codon/genetics*
7.Chloroplast genomic characterization and phylogenetic analysis of Castanopsis hystrix.
Guangyu XUE ; Zhiwen DENG ; Xueping ZHU ; Junduo WU ; Shitao DONG ; Xianjin XIE ; Ji ZENG
Chinese Journal of Biotechnology 2023;39(2):670-684
The structure and size of the chloroplast genome of Castanopsis hystrix was determined by Illumina HiSeq 2500 sequencing platform to understand the difference between C. hystrix and the chloroplast genome of the same genus, and the evolutionary position of C. hystrix in the genus, so as to facilitate species identification, genetic diversity analysis and resource conservation of the genus. Bioinformatics analysis was used to perform sequence assembly, annotation and characteristic analysis. R, Python, MISA, CodonW and MEGA 6 bioinformatics software were used to analyze the genome structure and number, codon bias, sequence repeats, simple sequence repeat (SSR) loci and phylogeny. The genome size of C. hystrix chloroplast was 153 754 bp, showing tetrad structure. A total of 130 genes were identified, including 85 coding genes, 37 tRNA genes and 8 rRNA genes. According to codon bias analysis, the average number of effective codons was 55.5, indicating that the codons were highly random and low in bias. Forty-five repeats and 111 SSR loci were detected by SSR and long repeat fragment analysis. Compared with the related species, chloroplast genome sequences were highly conserved, especially the protein coding sequences. Phylogenetic analysis showed that C. hystrix is closely related to the Hainanese cone. In summary, we obtained the basic information and phylogenetic position of the chloroplast genome of red cone, which will provide a preliminary basis for species identification, genetic diversity of natural populations and functional genomics research of C. hystrix.
Phylogeny
;
Genome, Chloroplast
;
Codon/genetics*
;
Genomics
;
Chloroplasts/genetics*
8.Complete chloroplast genome sequencing and phylogeny of wild Atractylodes lancea from Yuexi, Anhui province.
Jian-Peng HU ; Lu JIANG ; Rui XU ; Jun-Xian WU ; Feng-Ya GUAN ; Jin-Chen YAO ; Jun-Ling LIU ; Ya-Zhong ZHANG ; Liang-Ping ZHA
China Journal of Chinese Materia Medica 2023;48(1):52-59
This study investigated the choroplast genome sequence of wild Atractylodes lancea from Yuexi in Anhui province by high-throughput sequencing, followed by characterization of the genome structure, which laid a foundation for the species identification, analysis of genetic diversity, and resource conservation of A. lancea. To be specific, the total genomic DNA was extracted from the leaves of A. lancea with the improved CTAB method. The chloroplast genome of A. lancea was sequenced by the high-throughput sequencing technology, followed by assembling by metaSPAdes and annotation by CPGAVAS2. Bioiformatics methods were employed for the analysis of simple sequence repeats(SSRs), inverted repeat(IR) border, codon bias, and phylogeny. The results showed that the whole chloroplast genome of A. lancea was 153 178 bp, with an 84 226 bp large single copy(LSC) and a 18 658 bp small single copy(SSC) separated by a pair of IRs(25 147 bp). The genome had the GC content of 37.7% and 124 genes: 87 protein-coding genes, 8 rRNA genes, and 29 tRNA genes. It had 26 287 codons and encoded 20 amino acids. Phylogenetic analysis showed that Atractylodes species clustered into one clade and that A. lancea had close genetic relationship with A. koreana. This study established a method for sequencing the chloroplast genome of A. lancea and enriched the genetic resources of Compositae. The findings are expected to lay a foundation for species identification, analysis of genetic diversity, and resource conservation of A. lancea.
Phylogeny
;
Atractylodes/genetics*
;
Genome, Chloroplast
;
Whole Genome Sequencing
;
Microsatellite Repeats
;
Lamiales
9.Characterization and phylogenetic analysis of complete chloroplast genome of cultivated Qinan agarwood.
Qiao-Zhen LIU ; Jiang-Peng DAI ; Peng-Jian ZHU ; Yue-Xia LIN ; Xiao-Xia GAO ; Shuang ZHU
China Journal of Chinese Materia Medica 2023;48(20):5531-5539
"Tangjie" leaves of cultivated Qinan agarwood were used to obtain the complete chloroplast genome using high-throughput sequencing technology. Combined with 12 chloroplast genomes of Aquilaria species downloaded from NCBI, bioinformatics method was employed to determine the chloroplast genome characteristics and phylogenetic relationships. The results showed that the chloroplast genome sequence length of cultivated Qinan agarwood "Tangjie" leaves was 174 909 bp with a GC content of 36.7%. A total of 136 genes were annotated, including 90 protein-coding genes, 38 tRNA genes, and 8 rRNA genes. Sequence repeat analysis detected 80 simple sequence repeats(SSRs) and 124 long sequence repeats, with most SSRs composed of A and T bases. Codon preference analysis revealed that AUU was the most frequently used codon, and codons with A and U endings were preferred. Comparative analysis of Aquilaria chloroplast genomes showed relative conservation of the IR region boundaries and identified five highly variable regions: trnD-trnY, trnT-trnL, trnF-ndhJ, petA-cemA, and rpl32, which could serve as potential DNA barcodes specific to the Aquilaria genus. Selection pressure analysis indicated positive selection in the rbcL, rps11, and rpl32 genes. Phylogenetic analysis revealed that cultivated Qinan agarwood "Tangjie" and Aquilaria agallocha clustered together(100% support), supporting the Chinese origin of Qinan agarwood from Aquilaria agallocha. The chloroplast genome data obtained in this study provide a foundation for studying the genetic diversity of cultivated Qinan agarwood and molecular identification of the Aquilaria genus.
Phylogeny
;
Genome, Chloroplast
;
Codon
;
Molecular Sequence Annotation
;
Thymelaeaceae/genetics*
10.Genome structure and variation of Reynoutria japonica Houtt. chloroplast genome.
Mengtao SUN ; Junxin ZHANG ; Tiran HUANG ; Mingfeng YANG ; Lanqing MA ; Liusheng DUAN
Chinese Journal of Biotechnology 2022;38(5):1953-1964
Reynoutria japonica Houtt., belonging to Polygoneae of Polygonaceae, is a Chinese medicinal herb with the functions of draining dampness and relieving jaundice, clearing heat and detoxifying, dispersing blood stasis and relieving pain, and relieving cough and resolving phlegm. In this study, we carried out high-throughput sequencing for the chloroplast genome sequences of five cultivars of R. japonica and analyzed the genome structure and variations. The chloroplast genomes of the five R. japonica cultivars had two sizes (163 376 bp and 163 371 bp) and a typical circular tetrad structure composed of a large single-copy (LSC) region of 85 784 bp, a small single-copy (SSC) region of 18 616 bp, and a pair of inverted repeat (IR) regions (IRa/IRb) which are spaced apart. A total of 161 genes were obtained by annotation, which consisted of 106 protein-coding genes, 10 rRNA-coding genes, and 45 tRNA-coding genes. The total GC content was 36.7%. Specifically, the GC content in the LSC, SSC, and IR regions were 34.8%, 30.7%, and 42.7%, respectively. Comparison of the whole chloroplast genome among the five cultivars showed that trnk-UUU, rpoC1, petD, rpl16, ndhA, and rpl12 in coding regions had sequence variations. In the phylogenetic tree constructed for the 11 samples of Polygoneae, the five cultivars of R. japonica clustered into one clade near the root and was a sister group of Fallopia multiflora (Thunb.).
Base Composition
;
Genome, Chloroplast/genetics*
;
Open Reading Frames
;
Phylogeny
;
Reynoutria

Result Analysis
Print
Save
E-mail