Search Results

1.The Genome Sequence Archive Family:Toward Explosive Data Growth and Diverse Data Types

Chen TINGTING ; Chen XU ; Zhang SISI ; Zhu JUNWEI ; Tang BIXIA ; Wang ANKE ; Dong LILI ; Zhang ZHEWEN ; Yu CAIXIA ; Sun YANLING ; Chi LIANJIANG ; Chen HUANXIN ; Zhai SHUANG ; Sun YUBIN ; Lan LI ; Zhang XIN ; Xiao JINGFA ; Bao YIMING ; Wang YANQING ; Zhang ZHANG ; Zhao WENMING

Genomics, Proteomics & Bioinformatics 2021;19(4):578-583

2.Genome Warehouse: A Public Repository Housing Genome-scale Data

Chen MEILI ; Ma YINGKE ; Wu SONG ; Zheng XINCHANG ; Kang HONGEN ; Sang JIAN ; Xu XINGJIAN ; Hao LILI ; Li ZHAOHUA ; Gong ZHENG ; Xiao JINGFA ; Zhang ZHANG ; Zhao WENMING ; Bao YIMING

Genomics, Proteomics & Bioinformatics 2021;19(4):584-589

3.The Elements of Data Sharing.

Zhang ZHANG ; Shuhui SONG ; Jun YU ; Wenming ZHAO ; Jingfa XIAO ; Yiming BAO

Genomics, Proteomics & Bioinformatics 2020;18(1):1-4

4.Compositional Variability and MutationSpectra of Monophyletic SARS-CoV-2 Clades

Teng XUFEI ; Li QIANPENG ; Li ZHAO ; Zhang YUANSHENG ; Niu GUANGYI ; Xiao JINGFA ; Yu JUN ; Zhang ZHANG ; Song SHUHUI

Genomics, Proteomics & Bioinformatics 2020;18(6):648-663

COVID-19 and its causative pathogen SARS-CoV-2 have rushed the world into a stag-gering pandemic in a few months, and a global fight against both has been intensifying. Here, we describe an analysis procedure where genome composition and its variables are related, through the genetic code to molecular mechanisms, based on understanding of RNA replication and its feed-back loop from mutation to viral proteome sequence fraternity including effective sites on the replicase-transcriptase complex. Our analysis starts with primary sequence information, identity-based phylogeny based on 22,051 SARS-CoV-2 sequences, and evaluation of sequence variation patterns as mutation spectra and its 12 permutations among organized clades. All are tailored to two key mechanisms: strand-biased and function-associated mutations. Our findings are listed as follows: 1) The most dominant mutation is C-to-U permutation, whose abundant second-codon-position counts alter amino acid composition toward higher molecular weight and lower hydropho-bicity, albeit assumed most slightly deleterious. 2) The second abundance group includes three negative-strand mutations (U-to-C, A-to-G, and G-to-A) and a positive-strand mutation (G-to-U) due to DNA repair mechanisms after cellular abasic events. 3) A clade-associated biased muta-tion trend is found attributable to elevated level of negative-sense strand synthesis. 4) Within-clade permutation variation is very informative for associating non-synonymous mutations and viral pro-teome changes. These findings demand a platform where emerging mutations are mapped onto mostly subtle but fast-adjusting viral proteomes and transcriptomes, to provide biological and clinical information after logical convergence for effective pharmaceutical and diagnostic applica-tions. Such actions are in desperate need, especially in the middle of the War against COVID-19.

5.The Global Landscape of SARS-CoV-2 Genomes, Variants, and Haplotypes in 2019nCoVR

Song SHUHUI ; Ma LINA ; Zou DONG ; Tian DONGMEI ; Li CUIPING ; Zhu JUNWEI ; Chen MEILI ; Wang ANKE ; Ma YINGKE ; Li MENGWEI ; Teng XUFEI ; Cui YING ; Duan GUANGYA ; Zhang MOCHEN ; Jin TONG ; Shi CHENGMIN ; Du ZHENGLIN ; Zhang YADONG ; Liu CHUANDONG ; Li RUJIAO ; Zeng JINGYAO ; Hao LILI ; Jiang SHUAI ; Chen HUA ; Han DALI ; Xiao JINGFA ; Zhang ZHANG ; Zhao WENMING ; Xue YONGBIAO ; Bao YIMING

Genomics, Proteomics & Bioinformatics 2020;18(6):749-759

6.Whole Genome Analyses of Chinese Population and De Novo Assembly of A Northern Han Genome.

Zhenglin DU ; Liang MA ; Hongzhu QU ; Wei CHEN ; Bing ZHANG ; Xi LU ; Weibo ZHAI ; Xin SHENG ; Yongqiao SUN ; Wenjie LI ; Meng LEI ; Qiuhui QI ; Na YUAN ; Shuo SHI ; Jingyao ZENG ; Jinyue WANG ; Yadong YANG ; Qi LIU ; Yaqiang HONG ; Lili DONG ; Zhewen ZHANG ; Dong ZOU ; Yanqing WANG ; Shuhui SONG ; Fan LIU ; Xiangdong FANG ; Hua CHEN ; Xin LIU ; Jingfa XIAO ; Changqing ZENG

Genomics, Proteomics & Bioinformatics 2019;17(3):229-247

To unravel the genetic mechanisms of disease and physiological traits, it requires comprehensive sequencing analysis of large sample size in Chinese populations. Here, we report the primary results of the Chinese Academy of Sciences Precision Medicine Initiative (CASPMI) project launched by the Chinese Academy of Sciences, including the de novo assembly of a northern Han reference genome (NH1.0) and whole genome analyses of 597 healthy people coming from most areas in China. Given the two existing reference genomes for Han Chinese (YH and HX1) were both from the south, we constructed NH1.0, a new reference genome from a northern individual, by combining the sequencing strategies of PacBio, 10× Genomics, and Bionano mapping. Using this integrated approach, we obtained an N50 scaffold size of 46.63 Mb for the NH1.0 genome and performed a comparative genome analysis of NH1.0 with YH and HX1. In order to generate a genomic variation map of Chinese populations, we performed the whole-genome sequencing of 597 participants and identified 24.85 million (M) single nucleotide variants (SNVs), 3.85 M small indels, and 106,382 structural variations. In the association analysis with collected phenotypes, we found that the T allele of rs1549293 in KAT8 significantly correlated with the waist circumference in northern Han males. Moreover, significant genetic diversity in MTHFR, TCN2, FADS1, and FADS2, which associate with circulating folate, vitamin B12, or lipid metabolism, was observed between northerners and southerners. Especially, for the homocysteine-increasing allele of rs1801133 (MTHFR 677T), we hypothesize that there exists a "comfort" zone for a high frequency of 677T between latitudes of 35-45 degree North. Taken together, our results provide a high-quality northern Han reference genome and novel population-specific data sets of genetic variants for use in the personalized and precision medicine.

7.A Brief Review of Software Tools for Pangenomics

Xiao JINGFA ; Zhang ZHEWEN ; Wu JIAYAN ; Yu JUN

Genomics, Proteomics & Bioinformatics 2015;(1):73-76

8.Ribogenomics:the Science and Knowledge of RNA

Wu JIAYAN ; Xiao JINGFA ; Zhang ZHANG ; Wang XUMIN ; Hu SONGNIAN ; Yu JUN

Genomics, Proteomics & Bioinformatics 2014;(2):57-63