1.High-quality Arabidopsis thaliana Genome Assembly with Nanopore and HiFi Long Reads
Wang BO ; Yang XIAOFEI ; Jia YANYAN ; Xu YU ; Jia PENG ; Dang NINGXIN ; Wang SONGBO ; Xu TUN ; Zhao XIXI ; Gao SHENGHAN ; Dong QUANBIN ; Ye KAI
Genomics, Proteomics & Bioinformatics 2022;20(1):4-13
Arabidopsis thaliana is an important and long-established model species for plant molec-ular biology,genetics,epigenetics,and genomics.However,the latest version of reference genome still contains a significant number of missing segments.Here,we reported a high-quality and almost complete Col-0 genome assembly with two gaps(named Col-XJTU)by combining the Oxford Nanopore Technologies ultra-long reads,Pacific Biosciences high-fidelity long reads,and Hi-C data.The total genome assembly size is 133,725,193 bp,introducing 14.6 Mb of novel sequences compared to the TAIR1 0.1 reference genome.All five chromosomes of the Col-XJTU assembly are highly accurate with consensus quality(QV)scores>60(ranging from 62 to 68),which are higher than those of the TAIR10.1 reference(ranging from 45 to 52).We completely resolved chro-mosome(Chr)3 and Chr5 in a telomere-to-telomere manner.Chr4 was completely resolved except the nucleolar organizing regions,which comprise long repetitive DNA fragments.The Chr1 cen-tromere(CEN1),reportedly around 9 Mb in length,is particularly challenging to assemble due to the presence of tens of thousands of CEN180 satellite repeats.Using the cutting-edge sequencing data and novel computational approaches,we assembled a 3.8-Mb-long CEN1 and a 3.5-Mb-long CEN2.We also investigated the structure and epigenetics of centromeres.Four clusters of CEN180 monomers were detected,and the centromere-specific histone H3-like protein(CENH3)exhibited a strong preference for CEN 180 Cluster 3.Moreover,we observed hypomethylation patterns in CENH3-enriched regions.We believe that this high-quality genome assembly,Col-XJTU,would serve as a valuable reference to better understand the global pattern of centromeric polymorphisms,as well as the genetic and epigenetic features in plants.
2.Genome Assembly of Alfalfa Cultivar Zhongmu-4 and Identification of SNPs Associated with Agronomic Traits
Long RUICAI ; Zhang FAN ; Zhang ZHIWU ; Li MINGNA ; Chen LIN ; Wang XUE ; Liu WENWEN ; Zhang TIEJUN ; Yu LONG-XI ; He FEI ; Jiang XUEQIAN ; Yang XIJIANG ; Yang CHANGFU ; Wang ZHEN ; Kang JUNMEI ; Yang QINGCHUAN
Genomics, Proteomics & Bioinformatics 2022;20(1):14-28
Alfalfa(Medicago sativa L.)is the most important legume forage crop worldwide with high nutritional value and yield.For a long time,the breeding of alfalfa was hampered by lacking reliable information on the autotetraploid genome and molecular markers linked to important agro-nomic traits.We herein reported the de novo assembly of the allele-aware chromosome-level genome of Zhongmu-4,a cultivar widely cultivated in China,and a comprehensive database of genomic variations based on resequencing of 220 germplasms.Approximate 2.74 Gb contigs(N50 of 2.06 Mb),accounting for 88.39%of the estimated genome,were assembled,and 2.56 Gb contigs were anchored to 32 pseudo-chromosomes.A total of 34,922 allelic genes were identified from the allele-aware genome.We observed the expansion of gene families,especially those related to the nitrogen metabolism,and the increase of repetitive elements including transposable elements,which probably resulted in the increase of Zhongmu-4 genome compared with Medicago truncatula.Population structure analysis revealed that the accessions from Asia and South America had rela-tively lower genetic diversity than those from Europe,suggesting that geography may influence alfalfa genetic divergence during local adaption.Genome-wide association studies identified 101 sin-gle nucleotide polymorphisms(SNPs)associated with 27 agronomic traits.Two candidate genes were predicted to be correlated with fall dormancy and salt response.We believe that the allele-aware chromosome-level genome sequence of Zhongmu-4 combined with the resequencing data of the diverse alfalfa germplasms will facilitate genetic research and genomics-assisted breeding in variety improvement of alfalfa.
3.Resequencing 250 Soybean Accessions:New Insights into Genes Associated with Agronomic Traits and Genetic Networks
Yang CHUNMING ; Yan JUN ; Jiang SHUQIN ; Li XIA ; Min HAOWEI ; Wang XIANGFENG ; Hao ONGYUN
Genomics, Proteomics & Bioinformatics 2022;20(1):29-41
The limited knowledge of genomic diversity and functional genes associated with the traits of soybean varieties has resulted in slow progress in breeding.In this study,we sequenced the genomes of 250 soybean landraces and cultivars from China,America,and Europe,and inves-tigated their population structure,genetic diversity and architecture,and the selective sweep regions of these accessions.Five novel agronomically important genes were identified,and the effects of functional mutations in respective genes were examined.The candidate genes GSTT1,GL3,and GSTL3 associated with the isoflavone content,CKX3 associated with yield traits,and CYP85A2 associated with both architecture and yield traits were found.The phenotype-gene network analysis revealed that hub nodes play a crucial role in complex phenotypic associations.This study describes novel agronomic trait-associated genes and a complex genetic network,providing a valuable resource for future soybean molecular breeding.
4.A Chromosome-level Genome Assembly of Wild Castor Provides New Insights into Its Adaptive Evolution in Tropical Desert
Lu JIANJUN ; Pan CHENG ; Fan WEI ; Liu WANFEI ; Zhao HUAYAN ; Li DONGHAI ; Wang SEN ; Hu LIANLIAN ; He BING ; Qian KUN ; Qin RUI ; Ruan JUE ; Lin QIANG ; Lü SHIYOU ; Cui PENG
Genomics, Proteomics & Bioinformatics 2022;20(1):42-59
Wild castor grows in the high-altitude tropical desert of the African Plateau,a region known for high ultraviolet radiation,strong light,and extremely dry condition.To investigate the potential genetic basis of adaptation to both highland and tropical deserts,we generated a chromosome-level genome sequence assembly of the wild castor accession WT05,with a genome size of 316 Mb,a scaffold N50 of 31.93 Mb,and a contig N50 of 8.96 Mb,respectively.Compared with cultivated castor and other Euphorbiaceae species,the wild castor exhibits positive selection and gene family expansion for genes involved in DNA repair,photosynthesis,and abiotic stress responses.Genetic variations associated with positive selection were identified in several key genes,such as LIG1,DDB2,and RECGI,involved in nucleotide excision repair.Moreover,a study of genomic diversity among wild and cultivated accessions revealed genomic regions containing selection signatures associated with the adaptation to extreme environments.The identification of the genes and alleles with selection signatures provides insights into the genetic mechanisms under-lying the adaptation of wild castor to the high-altitude tropical desert and would facilitate direct improvement of modern castor varieties.
5.Genomic Perspectives on the Emerging SARS-CoV-2 Omicron Variant
Ma WENTAI ; Yang JING ; Fu HAOYI ; Su CHAO ; Yu CAIXIA ; Wang QIHUI ; Ana Tereza Ribeiro de Vasconcelos ; A.Bazykin GEORGII ; Bao YIMING ; Li MINGKUN
Genomics, Proteomics & Bioinformatics 2022;20(1):60-69
A new variant of concern for SARS-CoV-2,Omicron(B.1.1.529),was designated by the World Health Organization on November 26,2021.This study analyzed the viral genome sequenc-ing data of 108 samples collected from patients infected with Omicron.First,we found that the enrichment efficiency of viral nucleic acids was reduced due to mutations in the region where the primers anneal to.Second,the Omicron variant possesses an excessive number of mutations compared to other variants circulating at the same time(median:62 vs.45),especially in the Spike gene.Mutations in the Spike gene confer alterations in 32 amino acid residues,more than those observed in other SARS-CoV-2 variants.Moreover,a large number of nonsynonymous mutations occur in the codons for the amino acid residues located on the surface of the Spike protein,which could potentially affect the replication,infectivity,and antigenicity of SARS-CoV-2.Third,there are 53 mutations between the Omicron variant and its closest sequences available in public databases.Many of these mutations were rarely observed in public databases and had a low muta-tion rate.In addition,the linkage disequilibrium between these mutations was low,with a limited number of mutations concurrently observed in the same genome,suggesting that the Omicron vari-ant would be in a different evolutionary branch from the currently prevalent variants.To improve our ability to detect and track the source of new variants rapidly,it is imperative to further strengthen genomic surveillance and data sharing globally in a timely manner.
6.Single-cell Transcriptomic Analysis Reveals the Cellular Heterogeneity of Mesenchymal Stem Cells
Zhang CHEN ; Han XUESHUAI ; Liu JINGKUN ; Chen LEI ; Lei YING ; Chen KUNYING ; Si JIA ; Wang TIAN-YI ; Zhou HUI ; Zhao XIAOYUN ; Zhang XIAOHUI ; An YIHUA ; Li YUEYING ; Wang QIAN-FEI
Genomics, Proteomics & Bioinformatics 2022;20(1):70-86
Ex vivo-expanded mesenchymal stem cells(MSCs)have been demonstrated to be a heterogeneous mixture of cells exhibiting varying proliferative,multipotential,and immunomodu-latory capacities.However,the exact characteristics of MSCs remain largely unknown.By single-cell RNA sequencing of 61,296 MSCs derived from bone marrow and Wharton's jelly,we revealed five distinct subpopulations.The developmental trajectory of these five MSC subpopulations was mapped,revealing a differentiation path from stem-like active proliferative cells(APCs)to multipotent progenitor cells,followed by branching into two paths:1)unipotent preadipocytes or 2)bipotent prechondro-osteoblasts that were subsequently differentiated into unipotent prechondro-cytes.The stem-like APCs,expressing the perivascular mesodermal progenitor markers CSPG4/MCAM/NES,uniquely exhibited strong proliferation and stemness signatures.Remarkably,the prechondrocyte subpopulation specifically expressed immunomodulatory genes and was able to sup-press activated CD3+T cell proliferation in vitro,supporting the role of this population in immunoregulation.In summary,our analysis mapped the heterogeneous subpopulations of MSCs and identified two subpopulations with potential functions in self-renewal and immunoregulation.Our findings advance the definition of MSCs by identifying the specific functions of their heteroge-neous cellular composition,allowing for more specific and effective MSC application through the purification of their functional subpopulations.
7.Defining Proximity Proteome of Histone Modifications by Antibody-mediated Protein A-APEX2 Labeling
Li XINRAN ; Zhou JIAQI ; Zhao WENJUAN ; Wen QING ; Wang WEIJIE ; Peng HUIPAI ; Gao YUAN ; J.Bouchonville KELLY ; M.Offer STEVEN ; Chan KUIMING ; Wang ZHIQUAN ; Li NAN ; Gan HAIYUN
Genomics, Proteomics & Bioinformatics 2022;20(1):87-100
Proximity labeling catalyzed by promiscuous enzymes,such as APEX2,has emerged as a powerful approach to characterize multiprotein complexes and protein-protein interactions.How-ever,current methods depend on the expression of exogenous fusion proteins and cannot be applied to identify proteins surrounding post-translationally modified proteins.To address this limitation,we developed a new method to label proximal proteins of interest by antibody-mediated protein A-ascorbate peroxidase 2(pA-APEX2)labeling(AMAPEX).In this method,a modified protein is bound in situ by a specific antibody,which then tethers a pA-APEX2 fusion protein.Activation of APEX2 labels the nearby proteins with biotin;the biotinylated proteins are then purified using streptavidin beads and identified by mass spectrometry.We demonstrated the utility of this approach by profiling the proximal proteins of histone modifications including H3K27me3,H3K9me3,H3K4me3,H4K5ac,and H4K12ac,as well as verifying the co-localization of these iden-tified proteins with bait proteins by published ChIP-seq analysis and nucleosome immunoprecipi-tation.Overall,AMAPEX is an efficient method to identify proteins that are proximal to modified histones.
8.Epithelial Cells in 2D and 3D Cultures Exhibit Large Differences in Higher-order Genomic Interactions
Liu XIN ; Sun QIU ; Wang QI ; Hu CHUANSHENG ; Chen XUECHENG ; Li HUA ; M.Czajkowsky DANIEL ; Shao ZHIFENG
Genomics, Proteomics & Bioinformatics 2022;20(1):101-109
Recent studies have characterized the genomic structures of many eukaryotic cells,often focusing on their relation to gene expression.However,these studies have largely investigated cells grown in 2D cultures,although the transcriptomes of 3D-cultured cells are generally closer to their in vivo phenotypes.To examine the effects of spatial constraints on chromosome conformation,we investigated the genomic architecture of mouse hepatocytes grown in 2D and 3D cultures using in situ Hi-C.Our results reveal significant differences in higher-order genomic interactions,notably in compartment identity and strength as well as in topologically associating domain(TAD)-TAD interactions,but only minor differences are found at the TAD level.Our RNA-seq analysis reveals up-regulated expression of genes involved in physiological hepatocyte functions in the 3D-cultured cells.These genes are associated with a subset of structural changes,suggesting that differences in genomic structure are critically important for transcriptional regulation.However,there are also many structural differences that are not directly associated with changes in gene expression,whose cause remains to be determined.Overall,our results indicate that growth in 3D significantly alters higher-order genomic interactions,which may be consequential for a subset of genes that are impor-tant for the physiological functioning of the cell.
9.Npac Is A Co-factor of Histone H3K36me3 and Regulates Transcriptional Elongation in Mouse Embryonic Stem Cells
Yu SUE ; Li JIA ; Ji GUANXU ; Ng Long ZHEN ; Siew JIAMIN ; Lo Ning WAN ; Ye YING ; Chew Yuan YUAN ; Long Chau YUN ; Zhang WENSHENG ; Guccione ERNESTO ; Loh Han YUIN ; Jiang ZHI-HONG ; Yang HENRY ; Wu QIANG
Genomics, Proteomics & Bioinformatics 2022;20(1):110-128
Chromatin modification contributes to pluripotency maintenance in embryonic stem cells(ESCs).However,the related mechanisms remain obscure.Here,we show that Npac,a"reader"of histone H3 lysine 36 trimethylation(H3K36me3),is required to maintain mouse ESC(mESC)pluripotency since knockdown of Npac causes mESC differentiation.Depletion of Npac in mouse embryonic fibroblasts(MEFs)inhibits reprogramming efficiency.Furthermore,our chromatin immunoprecipitation followed by sequencing(ChIP-seq)results of Npac reveal that Npac co-localizes with histone H3K36me3 in gene bodies of actively transcribed genes in mESCs.Interestingly,we find that Npac interacts with positive transcription elongation factor b(p-TEFb),Ser2-phosphorylated RNA Pol Ⅱ(RNA Pol Ⅱ Ser2P),and Ser5-phosphorylated RNA Pol Ⅱ(RNA Pol Ⅱ Ser5P).Furthermore,depletion of Npac disrupts transcriptional elongation of the pluripotency genes Nanog and Rifl.Taken together,we propose that Npac is essential for the transcriptional elongation of pluripotency genes by recruiting p-TEFb and interact-ing with RNA Pol Ⅱ Ser2P and Ser5P.
10.SLM2 Is A Novel Cardiac Splicing Factor Involved in Heart Failure due to Dilated Cardiomyopathy
Boeckel JES-NIELS ; M?bius-Winkler MAXIMILIAN ; Müller MARION ; Rebs SABINE ; Eger NICOLE ; Schoppe LAURA ; Tappu REWATI ; E.Kokot KAROLINE ; M.Kneuer JASMIN ; Gaul SUSANNE ; M.Bordalo DIANA ; Lai ALAN ; Haas JAN ; Ghanbari MAHSA ; Drewe-Boss PHILIPP ; Liss MARTIN ; A.Katus HUGO ; Ohler UWE ; Gotthardt MICHAEL ; Laufs ULRICH ; Streckfuss-B?meke KATRIN ; Meder BENJAMIN
Genomics, Proteomics & Bioinformatics 2022;20(1):129-146
Alternative mRNA splicing is a fundamental process to increase the versatility of the gen-ome.In humans,cardiac mRNA splicing is involved in the pathophysiology of heart failure.Mutations in the splicing factor RNA binding motif protein 20(RBM20)cause severe forms of cardiomyopathy.To identify novel cardiomyopathy-associated splicing factors,RNA-seq and tissue-enrichment anal-yses were performed,which identified up-regulated expression of Sam68-Like mammalian protein 2(SLM2)in the left ventricle of dilated cardiomyopathy(DCM)patients.In the human heart,SLM2 binds to important transcripts of sarcomere constituents,such as those encoding myosin light chain 2(MYL2),troponin I3(TNNI3),troponin T2(TNNT2),tropomyosin 1/2(TPM1/2),and titin(TTN).Mechanistically,SLM2 mediates intron retention,prevents exon exclusion,and thereby medi-ates alternative splicing of the mRNA regions encoding the variable proline-,glutamate-,valine-,and lysine-rich(PEVK)domain and another part of the I-band region of titin.In summary,SLM2 is a novel cardiac splicing regulator with essential functions for maintaining cardiomyocyte integrity by binding to and processing the mRNAs of essential cardiac constituents such as titin.
Result Analysis
Print
Save
E-mail