1.The path to enlightenment: making sense of genomic and proteomic information.
Genomics, Proteomics & Bioinformatics 2004;2(2):123-131
Whereas genomics describes the study of genome, mainly represented by its gene expression on the DNA or RNA level, the term proteomics denotes the study of the proteome, which is the protein complement encoded by the genome. In recent years, the number of proteomic experiments increased tremendously. While all fields of proteomics have made major technological advances, the biggest step was seen in bioinformatics. Biological information management relies on sequence and structure databases and powerful software tools to translate experimental results into meaningful biological hypotheses and answers. In this resource article, I provide a collection of databases and software available on the Internet that are useful to interpret genomic and proteomic data. The article is a toolbox for researchers who have genomic or proteomic datasets and need to put their findings into a biological context.
Computational Biology
;
Databases, Protein
;
Genomics
;
Internet
;
Proteomics
;
Software
2.Construction of a non-redundant human SH2 domain database.
Haiming HUANG ; Yuchen JIAO ; Rui XU ; Youhe GAO
Genomics, Proteomics & Bioinformatics 2004;2(2):119-122
Domain database is essential for domain property research. Eliminating redundant information in database query is very important for database quality. Here we report the manual construction of a non-redundant human SH2 domain database. There are 119 human SH2 domains in 110 SH2-containing proteins. Human SH2s were aligned with ClustalX, and a homologous tree was generated. In this tree, proteins with similar known function were classified into the same group. Some proteins in the same group have been reported to have similar binding motifs experimentally. The tree might provide clues about possible functions of hypothetical proteins for further experimental verification.
Amino Acid Sequence
;
Computational Biology
;
Databases, Protein
;
Humans
;
Molecular Sequence Data
;
Sequence Alignment
;
src Homology Domains
;
genetics
3.Identification and characterization of 177 unreported genes associated with liver regeneration.
Cunshuan XU ; Cuifang CHANG ; Jinyun YUAN ; Hongpeng HAN ; Kejin YANG ; Lifeng ZHAO ; Wenqiang LI ; Yuchang LI ; Huiyong ZHANG ; Salman RAHMAN ; Jingbo ZHANG
Genomics, Proteomics & Bioinformatics 2004;2(2):109-118
The mammalian liver has a very strong regeneration capacity after partial hepatectomy (PH). To further learn the genes participating in the liver regeneration (LR), 551 cDNAs selected from subtracted cDNA libraries of the regenerating rat liver were screened by microarray, and their expression profiles were studied by cluster and generalization analyses. Among them, 177 genes were identified unreported and up- or down-regulated more than twofold at one or more time points after PH, of which 62 genes were down-regulated to less than 0.5; 99 genes were up-regulated to 2-10 folds, and 16 genes were either up- or down-regulated at different time points during LR. By using BLAST and GENSCAN, these genes were located on responsible chromosomes with 131 genes on the long arms of the chromosomes. The cluster and generalization analyses showed that the gene expression profiles are similar in 2 and 4, 12 and 16, 96 and 144 h respectively after PH, suggesting that the actions of the genes expressed in the same profiles are similar, and those expressed in different profiles have less similarity. However, the types, characteristics and functions of the 177 genes remain to be further studied.
Animals
;
Chromosome Mapping
;
Expressed Sequence Tags
;
Gene Expression Profiling
;
Liver Regeneration
;
genetics
;
physiology
;
RNA, Messenger
;
metabolism
;
Rats
;
Rats, Sprague-Dawley
;
Sequence Analysis, DNA
;
Time Factors
4.Post-transcriptional gene silencing induced by short interfering RNAs in cultured transgenic plant cells.
Wei TANG ; Vanessa SAMUELS ; Nicki WHITLEY ; Nicole BLOOM ; Tinya DELAGARZA ; Ronald J NEWTON
Genomics, Proteomics & Bioinformatics 2004;2(2):97-108
Short interfering RNA (siRNA) is widely used for studying post-transcriptional gene silencing and holds great promise as a tool for both identifying function of novel genes and validating drug targets. Two siRNA fragments (siRNA-a and -b), which were designed against different specific areas of coding region of the same target green fluorescent protein (GFP) gene, were used to silence GFP expression in cultured gfp transgenic cells of rice (Oryza sativa L.; OS), cotton (Gossypium hirsutum L.; GH), Fraser fir [Abies fraseri (Pursh) Poir; AF], and Virginia pine (Pinus virginiana Mill.; PV). Differential gene silencing was observed in the bombarded transgenic cells between two siRNAs, and these results were consistent with the inactivation of GFP confirmed by laser scanning microscopy, Northern blot, and siRNA analysis in tested transgenic cell cultures. These data suggest that siRNA-mediated gene inactivation can be the siRNA specific in different plant species. These results indicate that siRNA is a highly specific tool for targeted gene knockdown and for establishing siRNA-mediated gene silencing, which could be a reliable approach for large-scale screening of gene function and drug target validation.
Cells, Cultured
;
Gene Transfer Techniques
;
Green Fluorescent Proteins
;
genetics
;
metabolism
;
Plants
;
genetics
;
metabolism
;
RNA Interference
;
physiology
;
RNA, Small Interfering
;
metabolism
;
Time Factors
5.Recent applications of Hidden Markov Models in computational biology.
Khar Heng CHOO ; Joo Chuan TONG ; Louxin ZHANG
Genomics, Proteomics & Bioinformatics 2004;2(2):84-96
This paper examines recent developments and applications of Hidden Markov Models (HMMs) to various problems in computational biology, including multiple sequence alignment, homology detection, protein sequences classification, and genomic annotation.
Computational Biology
;
Markov Chains
;
Models, Biological
;
Protein Conformation
;
Sequence Alignment
;
Sequence Homology
6.Microarray and proteomic analysis of brassinosteroid- and gibberellin-regulated gene and protein expression in rice.
Guangxiao YANG ; Setsuko KOMATSU
Genomics, Proteomics & Bioinformatics 2004;2(2):77-83
Brassinosteroid (BR) and gibberellin (GA) are two groups of plant growth regulators essential for normal plant growth and development. To gain insight into the molecular mechanism by which BR and GA regulate the growth and development of plants, especially the monocot plant rice, it is necessary to identify and analyze more genes and proteins that are regulated by them. With the availability of draft sequences of two major types, japonica and indica rice, it has become possible to analyze expression changes of genes and proteins at genome scale. In this review, we summarize rice functional genomic research by using microarray and proteomic approaches and our recent research results focusing on the comparison of cDNA microarray and proteomic analyses of BR- and GA-regulated gene and protein expression in rice. We believe our findings have important implications for understanding the mechanism by which BR and GA regulate the growth and development of rice.
Gene Expression Profiling
;
Gene Expression Regulation, Plant
;
physiology
;
Gibberellins
;
metabolism
;
Oligonucleotide Array Sequence Analysis
;
Oryza
;
genetics
;
physiology
;
Proteomics
7.Emerging genomic and proteomic evidence on relationships among the animal, plant and fungal kingdoms.
Genomics, Proteomics & Bioinformatics 2004;2(2):70-76
Sequence-based molecular phylogenies have provided new models of early eukaryotic evolution. This includes the widely accepted hypothesis that animals are related most closely to fungi, and that the two should be grouped together as the Opisthokonta. Although most published phylogenies have supported an opisthokont relationship, a number of genes contain a tree-building signal that clusters animal and green plant sequences, to the exclusion of fungi. The alternative tree-building signal is especially intriguing in light of emerging data from genomic and proteomic studies that indicate striking and potentially synapomorphic similarities between plants and animals. This paper reviews these new lines of evidence, which have yet to be incorporated into models of broad scale eukaryotic evolution.
Animals
;
Biological Evolution
;
Cell Differentiation
;
physiology
;
Fungi
;
cytology
;
genetics
;
Genomics
;
Humans
;
Phylogeny
;
Plant Cells
;
Plants
;
genetics
;
Proteomics
;
RNA Caps
;
biosynthesis
;
RNA, Messenger
;
metabolism
9.Preliminary study on the detection of the SARS-CoV specific target cDNA fragments by multiplex PCR.
Wenbing CHEN ; Shousong LI ; Biying SHAO ; Teng ZHENG ; Shuxun JIANG ; Xiaorong HUANG ; Kaizhen CAI ; Zhideng ZHANG
Genomics, Proteomics & Bioinformatics 2004;2(1):55-58
The multiplex polymerase chain reaction (PCR) technique was applied to detect the SARS-CoV (severe acute respiratory syndrome-associated coronavirus) specific target cDNA fragments in the present study. The target cDNA fragments of SARS-CoV were synthesized artificially according to the genome sequence of SARS-CoV in GenBank submitted by The Chinese University of Hong Kong, and were used as simulated positive samples. Five primers recommended by World Health Organization (WHO) were used to amplify the fragments by single PCR and multiplex PCR. Three target cDNA fragments (121, 182 and 302 bp), as well as the three different combinations of any two of these fragments, were amplified by single PCR. The combination of these three fragments was amplified by multiplex PCR. The results indicated that the multiplex PCR technique could be applied to detect the SARS-CoV specific target cDNA fragments successfully.
Base Sequence
;
Cloning, Molecular
;
DNA Primers
;
DNA, Complementary
;
genetics
;
Molecular Sequence Data
;
Polymerase Chain Reaction
;
methods
;
SARS Virus
;
genetics
10.The structure, expression and function prediction of DAZAP2, a down-regulated gene in multiple myeloma.
Yiwu SHI ; Saiqun LUO ; Jianbin PENG ; Chenghan HUANG ; Daren TAN ; Weixin HU
Genomics, Proteomics & Bioinformatics 2004;2(1):47-54
In our previous studies, DAZAP2 gene expression was down-regulated in untreated patients of multiple myeloma (MM). For better studying the structure and function of DAZAP2, a full-length cDNA was isolated from mononuclear cells of a normal human bone marrow, sequenced and deposited to Genbank (AY430097). This sequence has an identical ORF (open reading frame) as the NM_014764 from human testis and the D31767 from human cell line KG-1. Phylogenetic analysis and structure prediction reveal that DAZAP2 homologues are highly conserved throughout evolution and share a polyproline region and several potential SH2/SH3 binding sites. DAZAP2 occurs as a single-copy gene with a four-exon organization. We further noticed that the functional DAZAP2 gene is located on Chromosome 12 and its pseudogene gene is on Chromosome 2 with electronic location of human chromosome in Genbank, though no genetic abnormalities of MM have been reported on Chromosome 12. The ORF of human DAZAP2 encodes a 17-kDa protein, which is highly similar to mouse Prtb. The DAZAP2 protein is mainly localized in cytoplasm with a discrete pattern of punctuated distribution. DAZAP2 may associate with carcinogenesis of MM and participate in yet-to-be identified signaling pathways to regulate proliferation and differentiation of plasma cells.
Amino Acid Sequence
;
Base Sequence
;
Chromosomes, Human, Pair 12
;
genetics
;
Chromosomes, Human, Pair 2
;
genetics
;
Cytoplasm
;
metabolism
;
DNA Primers
;
DNA, Complementary
;
genetics
;
Down-Regulation
;
Gene Components
;
Humans
;
Likelihood Functions
;
Models, Genetic
;
Molecular Sequence Data
;
Multiple Myeloma
;
genetics
;
metabolism
;
Phylogeny
;
Pseudogenes
;
genetics
;
RNA-Binding Proteins
;
genetics
;
metabolism
;
Sequence Alignment
;
Sequence Analysis, DNA