Integration-based co-expression network analysis to investigate tumor-associated modules across three cancer types.
- Author:
Mengnan WANG
1
;
Mingfei HAN
1
;
Binghui LIU
2
;
Chunyan TIAN
1
;
Yunping ZHU
1
Author Information
- Publication Type:Journal Article
- Keywords: cancer; differential co-expression; differential expression; gene co-expression module; integration-based analytical framework
- MeSH: Gene Expression Profiling; Gene Regulatory Networks; Humans; Microarray Analysis; Neoplasms/genetics*
- From: Chinese Journal of Biotechnology 2021;37(11):4111-4123
- CountryChina
- Language:Chinese
- Abstract: In case/control gene expression data, differential expression (DE) represents changes in gene expression levels across various biological conditions, whereas differential co-expression (DC) represents an alteration of correlation coefficients between gene pairs. Both DC and DE genes have been studied extensively in human diseases. However, effective approaches for integrating DC-DE analyses are lacking. Here, we report a novel analytical framework named DC&DEmodule for integrating DC and DE analyses and combining information from multiple case/control expression datasets to identify disease-related gene co-expression modules. This includes activated modules (gaining co-expression and up-regulated in disease) and dysfunctional modules (losing co-expression and down-regulated in disease). By applying this framework to microarray data associated with liver, gastric and colon cancer, we identified two, five and two activated modules and five, five and one dysfunctional module(s), respectively. Compared with the other methods, pathway enrichment analysis demonstrated the superior sensitivity of our method in detecting both known cancer-related pathways and those not previously reported. Moreover, we identified 17, 69, and 11 module hub genes that were activated in three cancers, which included 53 known and three novel cancer prognostic markers. Random forest classifiers trained by the hub genes showed an average of 93% accuracy in differentiating tumor and adjacent normal samples in the TCGA and GEO database. Comparison of the three cancers provided new insights into common and tissue-specific cancer mechanisms. A series of evaluations demonstrated the framework is capable of integrating the rapidly accumulated expression data and facilitating the discovery of dysregulated processes.