On the reproducibility of results of pathway analysis in genome-wide expression studies of colorectal cancers |
| |
Authors: | Rosalia Maglietta Angela Distaso Ada Piepoli Orazio Palumbo Massimo Carella Annarita D’Addabbo Sayan Mukherjee Nicola Ancona |
| |
Affiliation: | 1. Istituto di Studi sui Sistemi Intelligenti per l’Automazione, CNR, Via Amendola 122/D-I, Bari, Italy;2. Unità Operativa di Gastroenterologia, IRCCS, “Casa Sollievo della Sofferenza”-Ospedale, 71013 San Giovanni Rotondo (FG), Italy;3. Servizio di Genetica Medica, IRCCS, “Casa Sollievo della Sofferenza”-Ospedale, 71013 San Giovanni Rotondo (FG), Italy;4. Institute for Genome Science and Policy, Duke University, Durham, NC, USA;1. Center for Biomedical Informatics; UC Institute for Translational Medicine; Institute for Genomics and Systems Biology; University of Chicago, Chicago, IL 60634, USA;2. Departments of Pediatrics and Medicine, Stanford University, Stanford, CA 94305, USA;3. Center for Computational Pharmacology, Computational Bioscience Program; Departments of Pharmacology, Preventive Medicine & Biometrics, Computer Science, and Biology; University of Colorado Health Sciences Center, Denver, CO 80262, USA |
| |
Abstract: | One of the major problems in genomics and medicine is the identification of gene networks and pathways deregulated in complex and polygenic diseases, like cancer. In this paper, we address the problem of assessing the variability of results of pathways analysis identified in different and independent genome wide expression studies, in which the same phenotypic conditions are assayed. To this end, we assessed the deregulation of 1891 curated gene sets in four independent gene expression data sets of subjects affected by colorectal cancer (CRC). In this comparison we used two well-founded statistical models for evaluating deregulation of gene networks. We found that the results of pathway analysis in expression studies are highly reproducible. Our study revealed 53 pathways identified by the two methods in all the four data sets analyzed with high statistical significance and strong biological relevance with the pathology examined. This set of pathways associated to single markers as well as to whole biological processes altered constitutes a signature of the disease which sheds light on the genetics bases of CRC. |
| |
Keywords: | |
本文献已被 ScienceDirect 等数据库收录! |
|