1. KaKs_Calculator: Calculating Ka and Ks Through Model Selection and Model Averaging
Zhang ZHANG ; Li JUN ; Zhao XIAO-QIAN ; Wang JUN ; Wong Ka-Shu GANE ; Yu JUN
Genomics, Proteomics & Bioinformatics 2006;4(4):259-263
KaKs_Calculator is a software package that calculates nonsynonymous (Ka) and synonymous (Ks) substitution rates through model selection and model averaging. Existing methods for this estimation each adopt a specific mutation (substitution) model that considers different evolutionary features, which leads to diverse estimates. KaKs_Calculator therefore implements a set of candidate models in a maximum likelihood framework and adopts the Akaike information criterion to measure the fit between models and data, aiming to include as many features as needed to accurately capture evolutionary information in protein-coding sequences. In addition, several existing methods for calculating Ka and Ks are incorporated into the software. KaKs_Calculator, including source code, compiled executables, and documentation, is freely available for academic use at http://evolution.genomics.org.cn/software.htm.
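The model selection and model averaging idea described in this abstract can be illustrated with a minimal sketch. It is not the KaKs_Calculator implementation: the model names, log-likelihoods, parameter counts, and Ka estimates below are hypothetical values chosen for illustration, and the sketch assumes the standard definitions AIC = -2 lnL + 2k and Akaike weights proportional to exp(-ΔAIC/2).

```python
import math

def aic(log_likelihood, n_params):
    """Akaike information criterion: lower means a better fit/complexity trade-off."""
    return -2.0 * log_likelihood + 2.0 * n_params

def akaike_weights(aic_scores):
    """Convert AIC scores into normalized weights that sum to 1."""
    best = min(aic_scores)
    relative = [math.exp(-0.5 * (score - best)) for score in aic_scores]
    total = sum(relative)
    return [r / total for r in relative]

def model_average(estimates, weights):
    """Weighted mean of per-model estimates."""
    return sum(w * e for w, e in zip(weights, estimates))

# Hypothetical candidates: (name, maximized lnL, number of parameters, Ka estimate)
candidates = [
    ("model_A", -1050.0, 1, 0.10),
    ("model_B", -1040.0, 5, 0.12),
    ("model_C", -1039.0, 9, 0.13),
]

scores = [aic(lnl, k) for _, lnl, k, _ in candidates]
weights = akaike_weights(scores)

# Model selection: pick the candidate with the smallest AIC.
best_model = candidates[scores.index(min(scores))][0]

# Model averaging: combine all candidates' estimates using their Akaike weights.
averaged_ka = model_average([ka for _, _, _, ka in candidates], weights)

print(best_model, round(averaged_ka, 4))
```

In this toy case the extra parameters of model_C buy too little likelihood to beat model_B, so selection picks model_B while averaging still lets the other candidates contribute in proportion to their weights.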
2. Evolutionary Transients in the Rice Transcriptome
Wang JUN ; Zhang JIANGUO ; Li RUIQIANG ; Zheng HONGKUN ; Li JUN ; Zhang YONG ; Li HENG ; Ni PEIXIANG ; Li SONGGANG ; Li SHENGTING ; Wang JINGQIANG ; Liu DONGYUAN ; McDermott JASON ; Samudrala RAM ; Liu SIQI ; Wang JIAN ; Yang HUANMING ; Yu JUN ; Wong Ka-Shu GANE
Genomics, Proteomics & Bioinformatics 2010;8(4):211-228
In the canonical version of evolution by gene duplication, one copy is kept unaltered while the other is free to evolve. This process of evolutionary experimentation can persist for millions of years. Since it is so short-lived in comparison to the lifetime of the core genes that make up the majority of most genomes, a substantial fraction of the genome and the transcriptome may, in principle, be attributable to what we will refer to as "evolutionary transients", referring here to both the process and the genes that have gone through or are undergoing this process. Using the rice gene set as a test case, we argue that this phenomenon goes a long way towards explaining why there are so many more rice genes than Arabidopsis genes, and why most excess rice genes show low similarity to eudicots.
3. A genome sequence of novel SARS-CoV isolates: the genotype, GD-Ins29, leads to a hypothesis of viral transmission in South China.
E'de QIN ; Xionglei HE ; Wei TIAN ; Yong LIU ; Wei LI ; Jie WEN ; Jingqiang WANG ; Baochang FAN ; Qingfa WU ; Guohui CHANG ; Wuchun CAO ; Zuyuan XU ; Ruifu YANG ; Jing WANG ; Man YU ; Yan LI ; Jing XU ; Bingyin SI ; Yongwu HU ; Wenming PENG ; Lin TANG ; Tao JIANG ; Jianping SHI ; Jia JI ; Yu ZHANG ; Jia YE ; Cui'e WANG ; Yujun HAN ; Jun ZHOU ; Yajun DENG ; Xiaoyu LI ; Jianfei HU ; Caiping WANG ; Chunxia YAN ; Qingrun ZHANG ; Jingyue BAO ; Guoqing LI ; Weijun CHEN ; Lin FANG ; Changfeng LI ; Meng LEI ; Dawei LI ; Wei TONG ; Xiangjun TIAN ; Jin WANG ; Bo ZHANG ; Haiqing ZHANG ; Yilin ZHANG ; Hui ZHAO ; Xiaowei ZHANG ; Shuangli LI ; Xiaojie CHENG ; Xiuqing ZHANG ; Bin LIU ; Changqing ZENG ; Songgang LI ; Xuehai TAN ; Siqi LIU ; Wei DONG ; Jun WANG ; Gane Ka-Shu WONG ; Jun YU ; Jian WANG ; Qingyu ZHU ; Huanming YANG
Genomics, Proteomics & Bioinformatics 2003;1(2):101-107
We report a complete genomic sequence of rare isolates (minor genotype) of the SARS-CoV from SARS patients in Guangdong, China, where the first few cases emerged. The most striking discovery from the isolate is an extra 29-nucleotide sequence located between nucleotide positions 27,863 and 27,864 (relative to the complete sequence of BJ01), within an overlapping region composed of BGI-PUP5 (BGI-postulated uncharacterized protein 5) and BGI-PUP6, upstream of the N (nucleocapsid) protein. The discovery of this minor genotype, GD-Ins29, suggests a significant genetic event and differentiates it from the previously reported genotype, the dominant form among all sequenced SARS-CoV isolates. A 17-nt segment of this extra sequence is identical to a segment of the same size in two human mRNA sequences that may interfere with viral replication and transcription in the cytosol of the infected cells. It provides a new avenue for the exploration of the virus-host interaction in viral evolution, host pathogenesis, and vaccine development.
Base Sequence; China; Cluster Analysis; Gene Components; Genetic Variation; Genome, Viral; Genotype; Molecular Sequence Data; Phylogeny; Reverse Transcriptase Polymerase Chain Reaction; SARS Virus/genetics; Sequence Analysis, DNA; Severe Acute Respiratory Syndrome/genetics