1.Technological advances of serial analysis of gene expression.
Chinese Journal of Biotechnology 2002;18(3):377-380
Serial analysis of gene expression (SAGE) is an effective method of determining gene expression profiles of tissues and organs under different conditions. In this paper, the detail protocol of SAGE was introduced and some modified procedure of SAGE was reviewed.
Gene Expression Profiling
;
methods
2.Impact of Time Delay in Processing Blood Sample on Next Generation Sequencing for Transcriptome Analysis.
Jae Eun LEE ; So Young JUNG ; So Youn SHIN ; Young Youl KIM
Osong Public Health and Research Perspectives 2018;9(3):130-132
No abstract available.
Gene Expression Profiling*
;
RNA
;
Transcriptome*
4.Bioinformatics Analysis of Core Genes and Key Pathways in Myelodysplastic Syndrome.
Yan WANG ; Ying-Shao WANG ; Nai-Bo HU ; Guang-Shuai TENG ; Yuan ZHOU ; Jie BAI
Journal of Experimental Hematology 2022;30(3):804-812
OBJECTIVE:
To screen differentially expressed gene (DEG) related to myelodysplastic syndrome (MDS) based on Gene Expression Omnibus (GEO) database, and explore the core genes and pathogenesis of MDS by analyzing the biological functions and related signaling pathways of DEG.
METHODS:
The expression profiles of GSE4619, GSE19429, GSE58831 including MDS patients and normal controls were downloaded from GEO database. The gene expression analysis tool (GEO2R) of GEO database was used to screen DEG according to | log FC (fold change) |≥1 and P<0.01. David online database was used to annotate gene ontology function (GO). Metascape online database was used to enrich and analyze differential genes in Kyoto Encyclopedia of Genes and Genomes (KEGG). The protein-protein interaction network (PPI) was constructed by using STRING database. CytoHubba and Mcode plug-ins of Cytoscape were used to analyze the key gene clusters and hub genes. R language was used to diagnose hub genes and draw the ROC curve. GSEA enrichment analysis was performed on GSE19429 according to the expression of LEF1.
RESULTS:
A total of 74 co-DEG were identified, including 14 up-regulated genes and 60 down regulated genes. GO enrichment analysis indicated that BP of down regulated genes was mainly enriched in the transcription and regulation of RNA polymerase II promoter, negative regulation of cell proliferation, and immune response. CC of down regulated genes was mainly enriched in the nucleus, transcription factor complexes, and adhesion spots. MF was mainly enriched in protein binding, DNA binding, and β-catenin binding. KEGG pathway was enriched in primary immunodeficiency, Hippo signaling pathway, cAMP signaling pathway, transcriptional mis-regulation in cancer and hematopoietic cell lineage. BP of up-regulated genes was mainly enriched in type I interferon signaling pathway and viral response. CC was mainly enriched in cytoplasm. MF was mainly enriched in RNA binding. Ten hub genes and three important gene clusters were screened by STRING database and Cytoscape software. The functions of the three key gene clusters were closely related to immune regulation. ROC analysis showed that the hub genes had a good diagnostic significance for MDS. GSEA analysis indicated that LEF1 may affect the normal function of hematopoietic stem cells by regulating inflammatory reaction, which further revealed the pathogenesis of MDS.
CONCLUSION
Bioinformatics can effectively screen the core genes and key signaling pathways of MDS, which provides a new strategy for the diagnosis and treatment of MDS.
Computational Biology
;
Gene Expression Profiling
;
Gene Expression Regulation, Neoplastic
;
Gene Ontology
;
Humans
;
Myelodysplastic Syndromes/genetics*
5.Transcriptome profiling and analysis of Panax japonicus var. major.
Shao-peng ZHANG ; Jian JIN ; Bing-xiong HU ; Ya-yun WU ; Qi YAN ; Wan-yong ZENG ; Yong-lian ZHENG ; Zhang XI-FENG ; Ping CHEN
China Journal of Chinese Materia Medica 2015;40(11):2084-2089
The rhizome of Panax japonicus var. major have been used as the natural medicinal agent by Chinese traditional doctors for more than thousand years. Most of the therapeutic effects of P. japonicus var. major had been reported due to the presence of tetracyclic or pentacyclic triterpene saponins. In this study, Illumina pair-end RNA-sequencing and de novo splicing were done in order to understand the pathway of triterpenoid saponins in this species. The valid reads data of 15. 6 Gb were obtained. The 62 240 unigenes were finally obtained by de novo splicing. After annotation, we discovered 19 unigenes involved in ginsenoside backbone biosynthesis. Additionally, 69 unigenes and 18 unigenes were predicted to have potential function of cytochrome P450 and UDP-glycosyltransferase based on the annotation results, which may encode enzymes responsible for ginsenoside backbone modification. This study provides global expressed datas for P. japonicus var. major, which will contribute significantly to further genome-wide research and analysis for this species.
Gene Expression Profiling
;
Panax
;
genetics
;
Saponins
;
biosynthesis
;
Sequence Analysis, RNA
6.Evaluation of clustering algorithms for gene expression data using gene ontology annotations.
Chinese Medical Journal 2012;125(17):3048-3052
BACKGROUNDClustering is a useful exploratory technique for interpreting gene expression data to reveal groups of genes sharing common functional attributes. Biologists frequently face the problem of choosing an appropriate algorithm. We aimed to provide a standalone, easily accessible and biologically oriented criterion for expression data clustering evaluation.
METHODSAn external criterion utilizing annotation based similarities between genes is proposed in this work. Gene ontology information is employed as the annotation source. Comparisons among six widely used clustering algorithms over various types of gene expression data sets were carried out based on the criterion proposed.
RESULTSThe rank of these algorithms given by the criterion coincides with our common knowledge. Single-linkage has significantly poorer performance, even worse than the random algorithm. Ward's method archives the best performance in most cases.
CONCLUSIONSThe criterion proposed has a strong ability to distinguish among different clustering algorithms with different distance measurements. It is also demonstrated that analyzing main contributors of the criterion may offer some guidelines in finding local compact clusters. As an addition, we suggest using Ward's algorithm for gene expression data analysis.
Algorithms ; Cluster Analysis ; Gene Expression Profiling ; Humans ; Molecular Sequence Annotation
7.Preparation of gene chip for detecting different expression genes involved in aflatoxin biosynthesis.
Chinese Journal of Preventive Medicine 2009;43(5):423-427
OBJECTIVETo develop the methodology of gene chip to analyse genes involved in aflatoxin biosynthesis.
METHODSIn comparing reversed transcriptional PCR with gene chip, the gene chip was used to detect genes involved in aflatoxin biosynthesis.
RESULTSAfter arrayed the slide was incubated in water for 2 hours, exposed to a 650 mJ/cm2 of ultraviolet irradiation in the strata-linker for 30 s, roasted under 80 degrees C for 2 hours in oven, pre-hybridized for 45 minutes and dealt with other procedures. Finally, the slide was hybridized with fluor-derivatized sample at 42 degrees C for 16 hours.
CONCLUSIONWith the reasonable probe design and applicable protocol, the gene chip was prepared effectively for research on genes involved in aflatoxin biosynthesis.
Aflatoxins ; biosynthesis ; Gene Expression Profiling ; Oligonucleotide Array Sequence Analysis ; methods
9.Significant genes extraction and analysis of gene expression data based on matrix factorization techniques.
Wei KONG ; Juan WANG ; Xiaoyang MOU
Journal of Biomedical Engineering 2014;31(3):662-670
It is generally considered that various regulatory activities between genes are contained in the gene expression datasets. Therefore, the underlying gene regulatory relationship and the biologically useful information can be found by modeling the gene regulatory network from the gene expression data. In our study, two unsupervised matrix factorization methods, independent component analysis (ICA) and nonnegative matrix factorization (NMF), were proposed to identify significant genes and model the regulatory network using the microarray gene expression data of Alzheimer's disease (AD). By bio-molecular analyzing of the pathways, the differences between ICA and NMF have been explored and the fact, which the inflammatory reaction is one of the main pathological mechanisms of AD, is also emphasized. It was demonstrated that our study gave a novel and valuable method for the research of early detection and pathological mechanism, biomarkers' findings of AD.
Algorithms
;
Alzheimer Disease
;
genetics
;
Gene Expression Profiling
;
methods
;
Humans
10.Synthesis of triterpenoid saponins from Aesculus chinensis based on transcriptome data.
Yi-Ding WEI ; Chao XIONG ; Tian-Yuan ZHANG ; Han GAO ; Qing-Gang YIN ; Hui YAO ; Wei SUN ; Zhi-Gang HU ; Shi-Lin CHEN
China Journal of Chinese Materia Medica 2019;44(6):1135-1144
Aesculus chinensis belongs to Hippocastanaceae family,bears medicinal and ornamental values. The oleanane type triterpenoid saponin aescin is regarded as active ingredient and accumulated in seed. In order to understand its molecular basis of the triterpenoid biosynthesis,we used high-throughput sequencing under Illumina Hi Seq 2000 platform to obtain the transcriptome data of seed and flower from A. chinensis to further mine the genes involved in its metabolic pathway. Unigene's de novo splicing was performed using Trinity software; the transcriptome results were annotated with KEGG database to predict the specific pathways of the aescin triterpenoid metabolism. Terpenoid and triterpenoid pathways were found from transcriptome data,and forty seven and twenty seven corresponding genes were uncovered respectively. It was found that there are eight kinds of enzymes related to the terpenoid metabolism pathway precursors and three kinds of enzymes related to the triterpenoid metabolism pathway. In this study,five genes corresponding to triterpene cyclase were analyzed in A. chinensis for the first time,which may participate in the synthesis of triterpenoid. It' s revealed that there were thirty three differential genes associated with the ko00900 and ko00909 pathways by analysis on the difference in transcriptome expression between seeds and flowers; seventeen unigenes were up-regulated and sixteen unigenes were down-regulated in the seeds relative to flowers. In this study, qRT-PCR experiments were used to verify the expression of three key enzyme genes of SQE( Unigene25806),HMGS( Unigene36710),and β-AS( Unigene33291). The results of qRT-PCR were consistent with the transcriptome data. The candidate genes related to triterpenoid saponin aescin synthesis in A. chinensis found in this study can provide theoretical basis for the metabolism synthesis and regulation of aescin.
Aesculus
;
Flowers
;
Gene Expression Profiling
;
Saponins
;
Transcriptome
;
Triterpenes