Integrative analysis of multiple cancer genomic datasets under the heterogeneity model
Authors:Jin Liu  Jian Huang  Shuangge Ma
Institution:1. Department of Biostatistics, School of Public Health, Yale University, New Haven, CT 06520, U.S.A.;2. Department of Statistics & Actuarial Science, Department of Biostatistics, University of Iowa, Iowa City, IA 52242, U.S.A.;3. VA Cooperative Studies Program Coordinating Center, West Haven, CT, U.S.A.
Abstract:In the analysis of cancer studies with high‐dimensional genomic measurements, integrative analysis provides an effective way of pooling information across multiple heterogeneous datasets. The genomic basis of multiple independent datasets, which can be characterized by their sets of genomic markers, can be described using either the homogeneity model or the heterogeneity model. Under the homogeneity model, all datasets share the same set of markers associated with the responses. In contrast, under the heterogeneity model, different studies have overlapping but possibly different sets of markers. The heterogeneity model contains the homogeneity model as a special case and can be much more flexible. Marker selection under the heterogeneity model calls for bi‐level selection, which determines both whether a covariate is associated with the response in any study at all and in which studies it is associated with the response. In this study, we consider two minimax concave penalty‐based penalization approaches for marker selection under the heterogeneity model. For each approach, we describe its rationale and an effective computational algorithm. We conduct simulations to investigate their performance and compare them with existing alternatives. We also apply the proposed approaches to the analysis of gene expression data on multiple cancers. Copyright © 2013 John Wiley & Sons, Ltd.
Keywords:integrative analysis  heterogeneity model  marker selection
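
The abstract names the minimax concave penalty (MCP) and bi‐level selection without giving formulas. As a rough illustration only, the sketch below writes out the standard scalar MCP, its closed-form thresholding rule for a single standardized coefficient, and a helper that reads off the two selection levels from a marker-by-dataset coefficient table. The function names, the parameters `lam` and `gamma`, and the toy coefficient table are assumptions made for this example; the sketch does not reproduce the two penalization approaches or the algorithms proposed in the paper.

```python
def mcp_penalty(t, lam, gamma):
    """Standard scalar MCP: lam*|t| - t^2/(2*gamma) up to |t| = gamma*lam, then constant."""
    a = abs(t)
    if a <= gamma * lam:
        return lam * a - a * a / (2.0 * gamma)
    return 0.5 * gamma * lam * lam


def mcp_threshold(z, lam, gamma):
    """Minimizer over b of 0.5*(z - b)**2 + mcp_penalty(b, lam, gamma), assuming gamma > 1."""
    a = abs(z)
    if a <= lam:
        return 0.0  # small signals are set exactly to zero
    if a <= gamma * lam:
        sign = 1.0 if z > 0 else -1.0
        return sign * (a - lam) / (1.0 - 1.0 / gamma)  # rescaled soft threshold
    return float(z)  # large signals are left unshrunk


def bilevel_pattern(beta):
    """For each marker (row), report (selected in any dataset?, indices of datasets where selected)."""
    return [
        (any(b != 0 for b in row), [m for m, b in enumerate(row) if b != 0])
        for row in beta
    ]


# Hypothetical coefficients: 3 markers (rows) by 2 datasets (columns).
toy_beta = [[0.0, 0.0], [1.2, 0.0], [0.5, -0.3]]
pattern = bilevel_pattern(toy_beta)
```

In the toy table, the second marker is selected in only one dataset, which the heterogeneity model permits; under the homogeneity model every row would be either entirely zero or entirely nonzero.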