共查询到20条相似文献,搜索用时 15 毫秒
1.
We have used the unblinded MG1/Q1 Genetic Analysis Workshop 12 simulated data as a model system for investigating the use of linkage disequilibrium structure and simple genotype‐phenotype associations to identify candidate functional mutations within a gene of interest. Analysis of the pattern of pair‐wise linkage disequilibrium indicated three groups of single‐nucleotide polymorphisms for which the linkage disequilibrium was high between sites within a group, but lower between sites of different groups. Using linear regression to predict levels of the trait Q1 showed that the known functional site, 5782, was usually not the best genetic predictor of Q1, but sites belonging to the same group as 5782 (i.e., group 2) were always included in the prediction model. In 49 out of the 50 replicates, the functional site was not the best predictor of the trait. Finally, more detailed analyses demonstrate that the relationship between the adjusted R2 for the marker in the prediction model and its disequilibrium with 5782 was linear with the intercept at the origin and terminating at the R2 value for the known functional mutation when the disequilibrium is maximal. These data indicate that simple association studies will not identify the functional mutation, but rather will identify candidate functional mutations that are in very tight linkage disequilibrium with the functional mutation. © 2001 Wiley‐Liss, Inc. 相似文献
2.
Transmission disequilibrium test (TDT) is a nuclear family-based analysis that can test linkage in the presence of association. It has gained extensive attention in theoretical investigation and in practical application; in both cases, the accuracy and generality of the power computation of the TDT are crucial. Despite extensive investigations, previous approaches for computing the statistical power of the TDT are neither accurate nor general. In this paper, we develop a general and highly accurate approach to analytically compute the power of the TDT. We compare the results from our approach with those from several other recent papers, all against the results obtained from computer simulations. We show that the results computed from our approach are more accurate than or at least the same as those from other approaches. More importantly, our approach can handle various situations, which include (1) families that consist of one or more children and that have any configuration of affected and nonaffected sibs; (2) families ascertained through the affection status of parent(s); (3) any mixed sample with different types of families in (1) and (2); (4) the marker locus is not a disease susceptibility locus; and (5) existence of allelic heterogeneity. We implement this approach in a user-friendly computer program: TDT Power Calculator. Its applications are demonstrated. The approach and the program developed here should be significant for theoreticians to accurately investigate the statistical power of the TDT in various situations, and for empirical geneticists to plan efficient studies using the TDT. 相似文献
3.
When studying either qualitative or quantitative traits, tests of association in the presence of linkage are necessary for fine-mapping. In a previous report, we suggested a polytomous logistic approach to testing linkage and association between a di-allelic marker and a quantitative trait locus, using genotyped triads, consisting of an individual whose quantitative trait has been measured and his or her two parents. Here we extend that approach to incorporate marker information from entire nuclear families. By computing a weighted score function instead of a maximum likelihood test, we allow for both an unspecified correlation structure between siblings and \"informative\" family size. Both this approach and our original approach allow for population admixture by conditioning on parental genotypes. The proposed method allows for missing parental genotype data through a multiple imputation procedure. We use simulations based on a population with admixture to compare our method to a popular non-parametric family-based association test (FBAT), testing the null of no association in the presence of linkage. 相似文献
4.
The transmission disequilibrium test (TDT) based on case-parents trios is a powerful tool in linkage analysis and association studies. When only one parent is available, the 1-TDT is applicable in the absence of imprinting. In the presence of imprinting, a statistic is proposed, based on case-mother pairs and case-father pairs to test for linkage when association is present as well as to test for association when linkage is present. The recombination fractions are allowed to be sex-specific in this test statistic. Meanwhile, a statistic based on case-parent pairs is proposed to test for imprinting. Both test statistics can be extended to include families with more than one affected offspring. A number of simulation studies are conducted to investigate the validity of the proposed tests. The effects of different ratios of the numbers of case-mother pairs and case-father pairs on the powers of the proposed tests are studied through simulation. The results show that the optimal ratio is 1:1. How to combine case-parents, case-mother pairs, and case-father pairs jointly in testing for linkage, association, and imprinting is addressed. 相似文献
5.
Sarah W. Hardy Bruce S. Weir Norman L. Kaplan Eden R. Martin 《Genetic epidemiology》2001,21(Z1):S441-S446
The pedigree disequilibrium test (PDT) has been proposed recently as a test for association in general pedigrees [Martin et al., Am J Hum Genet 67:146–54, 2000]. The Genetic Analysis Workshop (GAW) 12 simulated data, with many extended pedigrees, is an example the type of data to which the PDT is ideally suited. In replicate 42 from the general population the PDT correctly identifies candidate genes 1, 2, and 6 as containing single nucleotide polymorphisms (SNPs) that arc significantly associated with the disease. We also applied the truncated product method (TPM) [Zaykin et al., Genet Epidemiol, in press] to combine p‐values in overlapping windows across the genes. Our results show that the TPM is helpful in identifying significant SNPs as well as removing spurious false positives. Our results indicate that, using the PDT, functional disease‐associated SNPs can be successfully identified with a dense map of moderately polymorphic SNPs. © 2001 Wiley‐Liss, Inc. 相似文献
6.
The corporation of a linkage disequilibrium parameter, delta, into linkage analysis is illustrated for data from Genetic Analysis Workshop II. Points from a joint likelihood surface are calculated and displayed on a recombination fraction-linkage disequilibrium grid using a simple modification of LIPED. The approach is shown to increase the power of linkage analysis and the power of tests for heterogeneity of linkage for the simulated examples. 相似文献
7.
Laura E. Mitchell 《Genetic epidemiology》1995,12(6):647-651
A sequential scheme for identifying genetic markers, in linkage disequilibrium with disease susceptibility loci, was utilized to evaluate potential associations between a rare oligogenic disease and genetic variation at 360 anonymous DNA markers. ©1995 Wiley-Liss, Inc. 相似文献
8.
Zaykin DV 《Genetic epidemiology》2004,27(3):252-257
The composite linkage disequilibrium (LD) measure is often calculated for two-locus genotypic data, especially when coupling and repulsion double heterozygotes cannot be distinguished. This measure was reported to have good statistical properties and was suggested for routine testing of LD, regardless of Hardy-Weinberg equilibrium at either of two loci. However, the bounds for this measure have not been yet reported. These bounds are derived here as functions of one-locus genotype or allele frequencies. They provide standardized measures of composite linkage disequilibrium, defined as the proportion of its maximum attainable value, given observed allele or genotype frequencies. 相似文献
9.
In genetic association studies, quality control (QC) filters are applied to remove potentially problematic markers before the markers are tested for statistical associations. However, spurious associations can still occur after QC. We introduce Post-Association Cleaning (PAC) approach that can complement QC by capturing spurious associations using the information in the post-association results. Specifically, we propose a PAC filter based on the linkage disequilibrium (LD) information. The intuition is that if the association is caused by a true genetic effect, neighboring markers in LD should show comparably significant P-values. If not, it may be evidence of spurious association. Previous studies have applied the same idea but only manually without a formal statistical framework. Our proposed method LD-PAC provides a systematic framework to quantitatively measure the evidence of spurious associations based on the likelihood ratio. Simulations show that LD-PAC can detect spurious associations with high detection rate (84%). In addition to detecting spurious associations, our method can also be used to \"rescue\" candidate associations from the supposedly unclean data such as the markers excluded by QC. Although the additional associations must be treated with care, they can suggest interesting regions. The application of our method to the Wellcome Trust Case Control Consortium (WTCCC) data led to the discovery of an additional candidate association for type 1 diabetes among the QC-excluded markers. This locus turns out to be in a region recently identified as significant by a meta-analysis performed after the WTCCC study was published. 相似文献
10.
Meta‐analyses of genetic association studies are usually performed using a single polymorphism at a time, even though in many cases the individual studies report results from partially overlapping sets of polymorphisms. We present here a multipoint (or multilocus) method for multivariate meta‐analysis of published population‐based case‐control association studies. The method is derived by extending the general method for multivariate meta‐analysis and allows for multivariate modelling of log(odds ratios (OR)) derived from several polymorphisms that are in linkage disequilibrium (LD). The method is presented in a genetic model‐free approach, although it can also be used by assuming a genetic model of inheritance beforehand. Furthermore, the method is presented in a unified framework and is easily applied to both discrete outcomes (using the OR), as well as to meta‐analyses of a continuous outcome (using the mean difference). The main innovation of the method is the analytical calculation of the within‐studies covariances between estimates derived from linked polymorphisms. The only requirement is that of an external estimate for the degree of pairwise LD between the polymorphisms under study, which can be obtained from the same published studies, from the literature or from HapMap. Thus, the method is quite simple and fast, it can be extended to an arbitrary set of polymorphisms and can be fitted in nearly all statistical packages (Stata, R/Splus and SAS). Applications in two already published meta‐analyses provide encouraging results concerning the robustness and the usefulness of the method and we expect that it would be widely used in the future. Genet. Epidemiol. 34: 702‐715, 2010. © 2010 Wiley‐Liss, Inc. 相似文献
11.
A number of tests for linkage and association with qualitative traits have been developed, with the most well-known being the transmission/disequilibrium test (TDT). For quantitative traits, varying extensions of the TDT have been suggested. The quantitative trait approach we propose is based on extending the log-linear model for case-parent trio data (Weinberg et al. [1998] Am. J. Hum. Genet. 62:969-978). Like the log-linear approach for qualitative traits, our proposed polytomous logistic approach for quantitative traits allows for population admixture by conditioning on parental genotypes. Compared to other methods, simulations demonstrate good power and robustness of the proposed test under various scenarios of the genotype effect, distribution of the quantitative trait, and population stratification. In addition, missing parental genotype data can be accommodated through an expectation-maximization (EM) algorithm approach. The EM approach allows recovery of most of the lost power due to incomplete trios. 相似文献
12.
Jeanette McClintick Daniel L. Koller Nathan Pankratz Sandra C. Kirkwood Brian Naughton Tatiana Foroud 《Genetic epidemiology》2001,21(Z1):S528-S533
A two‐step process was used to find loci contributing to the qualitative disease phenotype in the Genetic Analysis Workshop (GAW) 12 simulated data. The first step used parametric linkage analysis with a limited number of dominant and recessive models to detect linkage to chromosomal regions. Subsequently, a subset of the simulated biallelic sequence polymorphisms was used for transmission/disequilibrium tests and to build haplotypes to fine map the disease‐predisposing polymorphism(s). A haplotype, strongly associated with the disease phenotype whose proximal end was within 39 base pairs of the functional allele for simulated major gene 6, was identified in the isolated population. © 2001 Wiley‐Liss, Inc. 相似文献
13.
Based on the symmetry of transmitted/nontransmitted alleles from heterozygous parents under the null hypothesis of no association, the work proposed here establishes a general statistical framework for constructing association tests with data from nuclear families with multiple affected children. A class of association tests is proposed for both diallelic and multiallelic markers. The proposed test statistics reduce to the transmission disequilibrium test for trios, to T(su) by Martin et al. ([1997] Am. J. Hum. Genet. 61:439-448) for affected sib pairs, and to the pedigree disequilibrium test by Martin et al. ([2000] Am. J. Hum. Genet. 67:146-154); [2001] Am. J. Hum. Genet. 68:1065-1067) when using affected sibships only. The association test used in simulation and for real data (sitosterolemia) is the one which has the best overall power in detecting association. This association test is generally more powerful than the association tests proposed by Martin et al. ([2000] Am. J. Hum. Genet. 67:146-154); [2001] Am. J. Hum. Genet. 68:1065-1067) when using only affected sibships. For the sitosterolemia data set, the association test has its most significant result (P-value=0.0012) for the marker locus on the same bacterial artificial chromosome as the disease locus. 相似文献
14.
Pau Navarro Véronique Vitart Caroline Hayward Albert Tenesa Lina Zgaga Danica Juricic Ozren Polasek Nicholas D. Hastie Igor Rudan Harry Campbell Alan F. Wright Chris S. Haley Sara A. Knott 《Genetic epidemiology》2010,34(2):140-145
Human isolates have been postulated as a good resource for the identification of QTL due to reduced genetic diversity and a more homogeneous environment. Isolates may also have increased linkage disequilibrium (LD) due to small effective population size and, either loss or increase in frequency of alleles that are rare in the general population from which they originate. Here we investigate the difference in allele and genotype frequencies, LD and homozygous tracts between an isolate—several villages from the island of Vis in Croatia—and an outbred population of European origin: the Hapmap CEPH founders. Using the HumanHap300 v1 Genotyping BeadChip, we show that our population does not differ greatly from the reference CEU outbred population despite having a slightly higher proportion of monomorphic loci, a slightly higher long‐range LD, and a greater proportion of individuals with long homozygous tracts. We conclude that genotyping arrays should perform equally well in our isolate as in outbred European populations for disease mapping studies and that SNP–trait associations discovered in our well‐characterized Croatian isolate should be valid in the general European population from which they descend. Genet. Epidemiol. 34: 140–145, 2010. © 2009 Wiley‐Liss, Inc. 相似文献
15.
Eden R. Martin Ilker Tunc Zhi Liu Susan H. Slifer Ashley H. Beecham Gary W. Beecham 《Genetic epidemiology》2018,42(2):214-229
Population substructure can lead to confounding in tests for genetic association, and failure to adjust properly can result in spurious findings. Here we address this issue of confounding by considering the impact of global ancestry (average ancestry across the genome) and local ancestry (ancestry at a specific chromosomal location) on regression parameters and relative power in ancestry‐adjusted and ‐unadjusted models. We examine theoretical expectations under different scenarios for population substructure; applying different regression models, verifying and generalizing using simulations, and exploring the findings in real‐world admixed populations. We show that admixture does not lead to confounding when the trait locus is tested directly in a single admixed population. However, if there is more complex population structure or a marker locus in linkage disequilibrium (LD) with the trait locus is tested, both global and local ancestry can be confounders. Additionally, we show the genotype parameters of adjusted and unadjusted models all provide tests for LD between the marker and trait locus, but in different contexts. The local ancestry adjusted model tests for LD in the ancestral populations, while tests using the unadjusted and the global ancestry adjusted models depend on LD in the admixed population(s), which may be enriched due to different ancestral allele frequencies. Practically, this implies that global‐ancestry adjustment should be used for screening, but local‐ancestry adjustment may better inform fine mapping and provide better effect estimates at trait loci. 相似文献
16.
Lately, many different methods of linkage, association or joint analysis for family data have been invented and refined. Common to most of those is that they require a map of markers that are in linkage equilibrium. However, at the present day, high-density single nucleotide polymorphisms (SNPs) maps are both more inexpensive to create and they have lower genotyping error. When marker data is incomplete, the crucial and computationally most demanding moment in the analysis is to calculate the inheritance distribution at a certain position on the chromosome. Recently, different ways of adjusting traditional methods of linkage analysis to denser maps of SNPs in linkage disequilibrium (LD) have been proposed. We describe a hidden Markov model which generalizes the Lander-Green algorithm. It combines Markov chain for inheritance vectors with a Markov chain modelling founder haplotypes and in this way takes account for LD between SNPs. It can be applied to association, linkage or combined association and linkage analysis, general phenotypes and arbitrary score functions. We also define a joint likelihood for linkage and association that extends an idea of Kong and Cox (1997 Am. J. Hum. Genet. 61: 1179-1188) for pure linkage analysis. 相似文献
17.
Weir BL 《Genetic epidemiology》2001,21(Z1):S415-S420
A range of study designs, using unrelated or family controls, were used to investigate the pattern of association with disease of single nucleotide polymorphisms (SNPs) within candidate gene 1 (simulated data). Strong evidence of disease association at the functional locus was detected using all study designs, and in the "general" but not the "isolated" population the functional polymorphism displayed considerably higher association than surrounding SNPs. There was much variation in the strength of association of SNPs with disease, up to 70% of which was explained by SNP allele frequency and distance from the functional polymorphism. Some common polymorphisms very close to the functional locus however showed no association with disease. Analysis of short haplotypes of SNPs reduced but did not totally remove this feature. 相似文献
18.
Peter Kraft 《Genetic epidemiology》2001,21(Z1):S447-S452
Many family‐based tests of linkage disequilibrium are not valid when related nuclear families from larger pedigrees are used, or when independent nuclear families with multiple cases are used. The Pedigree Disequilibrium Test (PDT) proposed by Martin et al. [Am J Hum Genet 67:146–54, 2000] avoids these problems. This paper sketches an extension of the PDT that can account for measured covariates. Where the PDT is based on allele‐counting methods, this extension is based on conditional logistic regression. Versions of these statistics were used to test for association between disease and two known functional single nucleotide polymorphisms (SNPs) on gene 1 and gene 6 and one inert SNP on gene 7 in the first 25 replicates of the simulated population‐isolate data. The new method was also used to test for linkage disequilibrium after correcting for the effect of the environmental factor E1. The PDT and the conditional logistic extension had similar power to detect the functional SNPs (100% for gene 1, approximately 50% for gene 6) and appropriate type I error rates for the inert SNP. Correcting for E1 slightly increased power to detect the association between gene 6 and disease. © 2001 Wiley‐Liss, Inc. 相似文献
19.
Chunyu Liu Qiong Yang L. Adrienne Cupples James B. Meigs Josée Dupuis 《Genetic epidemiology》2009,33(4):299-307
Association analyses may follow an initial linkage analysis for mapping and identifying genes underlying complex quantitative traits and may be conducted on unrelated subsets of individuals where only one member of a family is included. We evaluate two methods to select one sibling per sibship when multiple siblings are available: (1) one sibling with the most extreme trait value; and (2) one sibling using a combination score statistic based on extreme trait values and identity‐by‐descent sharing information. We compare the type I error and power. Furthermore, we compare these selection strategies with a strategy that randomly selects one sibling per sibship and with an approach that includes all siblings, using both simulation study and an application to fasting blood glucose in the Framingham Heart Study. When genetic effect is homogeneous, we find that using the combination score can increase power by 30–40% compared to a random selection strategy, and loses only 8–13% of power compared to the full sibship analysis, across all additive models considered, but offers at least 50% genotyping cost saving. In the presence of genetic heterogeneity, the score offers a 50% increase in power over a random selection strategy, but there is substantial loss compared to the full sibship analysis. In application to fasting blood sample, two SNPs are found in common for the selection strategies and the full sample among the 10 highest ranked single nucleotide polymorphisms. The EV strategy tends to agree with the IBD‐EV strategy and the analysis of the full sample. Genet. Epidemiol. 2009. © 2008 Wiley‐Liss, Inc. 相似文献
20.
Though multiple interacting loci are likely involved in the etiology of complex diseases, early genome-wide association studies (GWAS) have depended on the detection of the marginal effects of each locus. Here, we evaluate the power of GWAS in the presence of two linked and potentially associated causal loci for several models of interaction between them and find that interacting loci may give rise to marginal relative risks that are not generally considered in a one-locus model. To derive power under realistic situations, we use empirical data generated by the HapMap ENCODE project for both allele frequencies and linkage disequilibrium (LD) structure. The power is also evaluated in situations where the causal single nucleotide polymorphisms (SNPs) may not be genotyped, but rather detected by proxy using a SNP in LD. A common simplification for such power computations assumes that the sample size necessary to detect the effect at the tSNP is the sample size necessary to detect the causal locus directly divided by the LD measure r(2) between the two. This assumption, which we call the \"proportionality assumption\", is a simplification of the many factors that contribute to the strength of association at a marker, and has recently been criticized as unreasonable (Terwilliger and Hiekkalinna [2006] Eur J Hum Genet 14(4):426-437), in particular in the presence of interacting and associated loci. We find that this assumption does not introduce much error in single locus models of disease, but may do so in so in certain two-locus models. 相似文献