1.Cancer systems biology: signal processing for cancer research.
Olli YLI-HARJA ; Antti YLIPÄÄ ; Matti NYKTER ; Wei ZHANG
Chinese Journal of Cancer 2011;30(4):221-225
In this editorial we introduce the research paradigms of signal processing in the era of systems biology. Signal processing is a field of science traditionally focused on modeling electronic and communications systems, but recently it has turned to biological applications with astounding results. The essence of signal processing is to describe the natural world by mathematical models and then, based on these models, develop efficient computational tools for solving engineering problems. Here, we underline, with examples, the endless possibilities which arise when the battle-hardened tools of engineering are applied to solve the problems that have tormented cancer researchers. Based on this approach, a new field has emerged, called cancer systems biology. Despite its short history, cancer systems biology has already produced several success stories tackling previously impracticable problems. Perhaps most importantly, it has been accepted as an integral part of the major endeavors of cancer research, such as analyzing the genomic and epigenomic data produced by The Cancer Genome Atlas (TCGA) project. Finally, we show that signal processing and cancer research, two fields that are seemingly distant from each other, have merged into a field that is indeed more than the sum of its parts.
Brain Neoplasms
;
genetics
;
Computational Biology
;
Genomics
;
methods
;
Glioblastoma
;
genetics
;
Humans
;
Neoplasms
;
genetics
;
Signal Processing, Computer-Assisted
;
Signal Transduction
;
Systems Biology
;
methods
2.A systems biological approach to identify key transcription factors and their genomic neighborhoods in human sarcomas.
Antti YLIPÄÄ ; Olli YLI-HARJA ; Wei ZHANG ; Matti NYKTER
Chinese Journal of Cancer 2011;30(1):27-40
Identification of genetic signatures is the main objective for many computational oncology studies. The signature usually consists of numerous genes that are differentially expressed between two clinically distinct groups of samples, such as tumor subtypes. Prospectively, many signatures have been found to generalize poorly to other datasets and, thus, have rarely been accepted into clinical use. Recognizing the limited success of traditionally generated signatures, we developed a systems biology-based framework for robust identification of key transcription factors and their genomic regulatory neighborhoods. Application of the framework to study the differences between gastrointestinal stromal tumor (GIST) and leiomyosarcoma (LMS) resulted in the identification of nine transcription factors (SRF, NKX2-5, CCDC6, LEF1, VDR, ZNF250, TRIM63, MAF, and MYC). Functional annotations of the obtained neighborhoods identified the biological processes which the key transcription factors regulate differently between the tumor types. Analyzing the differences in the expression patterns using our approach resulted in a more robust genetic signature and more biological insight into the diseases compared to a traditional genetic signature.
Gastrointestinal Neoplasms
;
genetics
;
metabolism
;
Gastrointestinal Stromal Tumors
;
genetics
;
metabolism
;
Gene Expression Profiling
;
methods
;
Gene Expression Regulation, Neoplastic
;
Humans
;
Leiomyosarcoma
;
genetics
;
metabolism
;
Protein Interaction Domains and Motifs
;
Systems Biology
;
methods
;
Transcription Factors
;
metabolism
3.Cancer research in the era of next-generation sequencing and big data calls for intelligent modeling.
Jari YLI-HIETANEN ; Antti YLIPÄÄ ; Olli YLI-HARJA
Chinese Journal of Cancer 2015;34(10):423-426
We examine the role of big data and machine learning in cancer research. We describe an example in cancer research where gene-level data from The Cancer Genome Atlas (TCGA) consortium is interpreted using a pathway-level model. As the complexity of computational models increases, their sample requirements grow exponentially. This growth stems from the fact that the number of combinations of variables grows exponentially as the number of variables increases. Thus, a large sample size is needed. The number of variables in a computational model can be reduced by incorporating biological knowledge. One particularly successful way of doing this is by using available gene regulatory, signaling, metabolic, or context-specific pathway information. We conclude that the incorporation of existing biological knowledge is essential for the progress in using big data for cancer research.
Computer Simulation
;
Genome
;
High-Throughput Nucleotide Sequencing
;
Humans
;
Neoplasms