Consensus and Meta-analysis regulatory networks for combining multiple microarray gene expression datasets |
| |
Authors: | Emma Steele; Allan Tucker |
| |
Affiliation: | Centre for Intelligent Data Analysis, Department of Information Systems and Computing, Brunel University, Kingston Lane, Uxbridge, Middlesex UB8 3PH, UK |
| |
Abstract: | Microarray data is a key source of experimental data for modelling gene regulatory interactions from expression levels. With the rapid increase of publicly available microarray data comes the opportunity to produce regulatory network models based on multiple datasets. Such models are potentially more robust, carry greater confidence, and rely less on any single dataset. However, combining datasets directly can be difficult, as experiments are often conducted on different microarray platforms and in different laboratories, leading to inherent biases in the data that are not always removed through pre-processing such as normalisation. In this paper we compare two frameworks for combining microarray datasets to model regulatory networks: pre- and post-learning aggregation. In pre-learning approaches, such as simple scale-normalisation followed by concatenation of the datasets, a single model is learnt from the combined dataset, whilst in post-learning aggregation an individual model is learnt from each dataset and the models are then combined. We present two novel approaches for post-learning aggregation, each based on aggregating high-level features of Bayesian network models that have been generated from different microarray expression datasets. Meta-analysis Bayesian networks are based on combining statistical confidences attached to network edges, whilst Consensus Bayesian networks identify network features that are consistent across all datasets. We apply both approaches to multiple datasets from synthetic and real (Escherichia coli and yeast) networks and demonstrate that both methods can improve on networks learnt from a single dataset or from an aggregated dataset formed using standard scale-normalisation. |
| |
Keywords: | Bayesian networks; Gene regulatory networks; Consensus algorithms; Meta-analysis; Microarray gene expression data |
This article is indexed in ScienceDirect, PubMed and other databases. |
|
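The two aggregation frameworks named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm. The abstract does not say how edge confidences are combined or how Bayesian networks are learnt, so the sketch makes these assumptions: each dataset is a list of samples with one column per gene, each learnt network is already available as a set of directed edges, and a plain mean with a hypothetical `threshold` parameter stands in for the meta-analysis combination rule. All function names are illustrative.

```python
from statistics import mean, pstdev


def scale_normalise(dataset):
    """Standardise each gene (column) to zero mean and unit variance within one dataset."""
    columns = list(zip(*dataset))
    normalised = []
    for col in columns:
        mu, sigma = mean(col), pstdev(col)
        # A constant column carries no information; map it to zeros.
        normalised.append([(x - mu) / sigma if sigma > 0 else 0.0 for x in col])
    return [list(row) for row in zip(*normalised)]


def pre_learning_aggregate(datasets):
    """Pre-learning aggregation: normalise each dataset, then concatenate the samples."""
    combined = []
    for dataset in datasets:
        combined.extend(scale_normalise(dataset))
    return combined  # a single model would then be learnt from this combined dataset


def consensus_edges(edge_sets):
    """Post-learning (consensus-style): keep edges found in the network from every dataset."""
    return set.intersection(*edge_sets)


def mean_confidence_edges(edge_confidences, threshold=0.5):
    """Post-learning (meta-analysis-style stand-in): average per-dataset edge confidences.

    An edge missing from a dataset's network counts as confidence 0 (an assumption).
    """
    all_edges = set().union(*edge_confidences)
    n = len(edge_confidences)
    combined = {e: sum(c.get(e, 0.0) for c in edge_confidences) / n for e in all_edges}
    return {e: v for e, v in combined.items() if v >= threshold}


if __name__ == "__main__":
    # Toy edge sets, one per dataset (made-up gene names, not data from the paper).
    networks = [
        {("A", "B"), ("B", "C")},
        {("A", "B"), ("C", "D")},
        {("A", "B"), ("B", "C")},
    ]
    print(consensus_edges(networks))

    confidences = [
        {("A", "B"): 0.9, ("B", "C"): 0.6},
        {("A", "B"): 0.8, ("B", "C"): 0.3},
        {("A", "B"): 0.7},
    ]
    print(mean_confidence_edges(confidences))
```

The difference is where combination happens. `pre_learning_aggregate` merges raw expression values before any model exists, so platform or laboratory biases that normalisation misses flow into the single learnt network. The two post-learning functions merge only high-level network features, after a separate model has been learnt from each dataset.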