Previous Article |
Table of Contents
| Next Article
BIOLOGICAL SCIENCES / GENETICS
Runs of homozygosity reveal highly penetrant recessive loci in schizophrenia
,
,
,
,
,
*Department of Psychiatry Research, Zucker Hillside Hospital, North Shore–Long Island Jewish Health System, 75-59 263rd Street, Glen Oaks, NY 11004;
The Feinstein Institute for Medical Research, 350 Community Drive, Manhasset, NY 11030;
Department of Psychiatry and Behavioral Science, Albert Einstein College of Medicine of Yeshiva University, 1300 Morris Park Avenue, Belfer Room 403, Bronx, NY 10461; ¶Golden Helix, Inc., 716 South 20th Avenue, Suite 102, Bozeman, MT 59718; ||Harvard Partners Center for Genetics and Genomics, 65 Landsdowne Street, Cambridge, MA 02139; and **Department of Genetics, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115
Communicated by James D. Watson, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, October 22, 2007 (received for review July 10, 2007)
| Abstract |
|---|
|
|
|---|
genomewide | selection | haplotype | HapMap | susceptibility
500,000 SNPs, to detect novel susceptibility loci for SCZ. SCZ is a disease with estimated lifetime morbid risk approaching 1% worldwide. Although genetic epidemiologic studies have revealed high heritability estimates (70–80%) for SCZ, identification of susceptibility genes remains challenging. As with other complex diseases, linkage studies have revealed multiple candidate regions with modest LOD scores (4), whereas studies of individual candidate genes are inherently limited in scope. By contrast, WGHA (described in detail below) presents an opportunity for rapidly identifying susceptibility loci broadly across the genome, yet with resolution sufficient to implicate a circumscribed set of candidate genes. WGHA is designed to be sensitive for detecting loci under selective pressure, and recent data suggest that signatures of evolutionary selection may be strongly observed in genes regulating neurodevelopment (5, 6). Thus, WGHA may be particularly effective for SCZ, which is thought to have a primary pathophysiological basis in abnormal neurodevelopmental processes (7).
Regions of extended homozygosity across large numbers of consecutive SNPs form the basis of WGHA analysis. In general, extent of homozygosity is a function of LD within a chromosomal region, which in turn is a function of recombination rates and population history (8–10). Size and structure of LD blocks vary widely across the genome and across populations (11), and regions of extensive long-range LD may be indicative of partially complete selective sweeps of functional significance (12). For example, variants of the extended haplotype homozygosity test (13) have been used to examine identity-by-descent across unrelated chromosomes in HapMap (14) and other population samples, identifying known loci under selection (e.g., LCT in Europeans, see refs. 15 and 16). A logical consequence of such identity across unrelated chromosomes is that long stretches of homozygosity may be observed in healthy individuals from outbred populations lacking any known consanguineous parentage (17, 18). However, the relative commonality of this phenomenon has not been systematically documented in large datasets at high resolution. Moreover, although homozygosity mapping has successfully identified disease loci in pedigrees marked by Mendelian illness (19), the ability of such a method to detect susceptibility loci in common disease has not been examined in a case-control study. We present data addressing both normal patterns of homozygosity and use of these patterns in WGHA mapping of SCZ.
| Results |
|---|
|
|
|---|
Identification of Common ROHs in ZHH Subjects.
The first step of WGHA analysis is the identification of runs of homozygosity (ROHs) in each subject, defined in the present study as any window of 100 or more consecutive SNPs on a single chromosome not receiving a heterozygous call (see Methods for details). Because WGHA seeks to identify frequently observed variants that can be statistically compared in a case-control design, only those ROHs in which 10 or more subjects share
100 identical homozygous calls were retained for further analysis. Each common ROH was then scored "present" or "absent" for each subject.
A total of 339 common ROHs were thus identified [supporting information (SI) Table 4], encompassing
12–13% of the genome as measured both by number of included SNPs and total chromosomal length. The six longest ROHs, ranging from 6 to 15.6 mb, encompass the centromeres of chromosomes 3, 5, 8, 11, 16, and 19. In part, this is a function of long regions with no SNPs ascertained; nevertheless, in each case, these centromeric gene deserts are flanked by homozygous regions containing hundreds of SNPs. This phenomenon may reflect meiotic drive, the selective bias toward transmission of a single meiotic product that has been observed at centromeric DNA across species (5, 21). The greatest number of consecutive SNPs (852) is found in roh172, spanning the centromere of chromosome 8; this region, which contains the gene encoding syntrophin
1 (SNTG1), has been highlighted in several genomewide studies of selective sweeps (5, 14–16), providing a positive control for our method.
There are nine ROHs that were very common (>25% frequency) in ZHH healthy controls. As displayed in Table 1, publicly available data indicate that these regions are not marked by excessive copy number variation or segmental duplication, nor do they appear to have abnormally low recombination rates (14). However, examination of Haplotter data (ref. 15, accessed at http://hg-wen.uchicago.edu/selection/haplotter.htm) indicates high scores for each of these regions on one or more measures of positive selection in Caucasian samples. Several gene categories identified in studies of selective pressure (5, 14–16) are evident in these regions, including genes involved in the immune system (on chromosomes 6p, 12q, and 5q), olfactory receptors (6p and 11p), members of the dystrophin protein complex (SNTG1 and DGKZ), and many other CNS-expressed genes (e.g., GPHN, UNC5D, and ATXN2). Across all 339 regions, ROH frequency in controls was significantly correlated with maximal integrated haplotype score (iHS, ref. 15; r = 0.33, P = 3.4 x 10–10) and Tajima's D (r = 0.30, P = 2.8 x 10–8); these correlations are comparable to the intercorrelation of maximal iHS and D for the same regions (r = 0.30, P = 1.3 x 10–8).
|
2 comparison (within the ZHH healthy control cohort) of carriers vs. noncarriers of each ROH (10–4 < P values <10–21). Notably, for each core SNP, ROH carriers bear the derived (nonancestral) allele, consistent with an incomplete selective sweep.
Validation of ROH Methodology in HapMap Samples.
Using publicly available data (www.affymetrix.com), we applied analogous methods to Affymetrix 500K data derived from all unrelated individuals in each of the three major HapMap populations (Caucasian, African, and Asian). For each population, we identified all ROHs of length
100 SNPs that were present in at least 20% of subjects using all available Affymetrix SNPs (no filtering applied). As described below, we tested a series of hypotheses, to support our interpretation that common ROHs indicate loci under selective pressure as well as to eliminate the possibility that biased SNP selection on the Affymetrix array might have served to confound this interpretation. Specifically, we predicted considerable overlap between ROHs identified in our control cohort and Caucasian HapMap samples and considerable disjunction with African and Asian samples. Moreover, we predicted that the African cohort would possess fewer ROHs, whereas the Asian cohort would demonstrate a greater frequency of ROHs, based on relative age and homogeneity of the respective lineages (9, 22).
Of the 32 ROHs that were found in the HapMap CEU (CEPH Utah residents with ancestry from Northern and Western Europe) sample (n = 60 founders), all but one overlapped with a ROH that was common (>5%) in our Caucasian controls. Moreover, the four most common ROHs in the CEU sample coincided with four of the five most common ROHs in our control sample (roh172, roh134, roh89, and roh291). By contrast, no common ROHs were identified in the YRI (Yoruba from Ibadan, Nigeria) founder sample (n = 60) using the same 20% threshold, consistent with their much more ancient lineage and resultant increase in recombination events. Results from the YRI sample also provided an important test of a potential artifact. We examined the heterozygosity (in YRI founders) of the 1,673 SNPs that were constituents of the most commonly found in ROHs in our Caucasian control sample. In YRI founders, the heterozygosity of these 1,673 SNPs (31.2%) was higher than the mean heterozygosity across the remainder of the array (28.8%). This result demonstrates that the identification of ROHs is not driven by artifactual properties of specific SNPs on the array (i.e., these are not SNPs that always lead to low heterozygosity calls due to poor signal/noise characteristics or absolute rarity).
Consistent with very recent data on allele frequency spectra (22), the Asian HapMap samples show greater long-range LD relative to the CEU samples, despite the fact that the Asian samples combine two distinct subgroups [CHB and JPT (Han Chinese from Beijing and Japanese from Tokyo)]. By using the same 20% frequency threshold, more than three times as many ROHs were identified as in the CEU sample. Moreover, the most common ROH in the Asian sample (53.3% frequency) was not among the common ROHs identified in the CEU sample. Located in the centromeric region of chromosome 16, this region overlapped with roh306 (SI Table 4), which was only the 40th most common ROH in the Caucasian ZHH control sample.
Comparison of ROH Frequency in ZHH Patients and Controls. The total number of common ROHs marked "present" was summed for each ZHH subject to permit genomewide comparison across diagnostic groups. Of a total possible sum of 339, patients with schizophrenia demonstrated a significantly greater number of common ROHs (mean = 31.7, SD = 12.3) relative to healthy volunteers (mean = 28.0, SD = 12.8; t320 = 2.62, P = 0.009). Nine individual ROHs significantly (P < 0.01) differed in frequency between cases and controls (Table 2); each was more common in SCZ cases.
|
2 = 44.7, df = 1, P = 2.3 x 10–11; permuted P = 0.0022; odds ratio = 5.15, 95% CI = 3.13–8.46). Moreover, as the number of risk ROHs increases, risk of illness increases dramatically. Using logistic regression, the total number of risk ROHs significantly predicted group status (
2 = 62.6, df = 1, P = 2.51 x 10–15; permuted P = 0.00095), with each additional risk ROH imparting a hazard ratio of 2.83 (95% CI = 2.10–3.81; see also Table 3).
|
2 = 8.1, df = 1, P = 0.0045). The odds ratio for this ROH was moderate (1.93; 95% CI = 1.22–3.04), although population attributable risk was 12% because of its commonality. This ROH is centered on the very large (
675 kb) gene GPHN, which codes for gephyrin, a protein scaffold that serves to anchor GABA receptors in the postsynaptic membrane. Patients with schizophrenia who exhibited this ROH tended to carry the same derived allele as was noted in those controls carrying the ROH (rs2053149 C). Comparison of CC genotype frequency for this core allele in patients carrying the ROH to control non-ROH carriers was strongly significant (P = 1.37 x 10–18). Core SNP for other risk ROHs in Table 2 was determined by the homozygous allele that was most common to patients carrying the ROH yet least common among controls not carrying the ROH. For six of the nine risk ROHs, all or nearly all patients (0–2 exceptions) carried the same core allele, which was the derived allele; however, a sizable fraction of patients carrying roh55, roh314, and roh321 demonstrated homozygosity at the alternate alleles. Genes Within Risk ROHs. Four of the nine ROHs contain or immediately neighbor genes that have been linked to schizophrenia, a result that is significantly unlikely by chance (binomial distribution P < 0.01) even if a 10% prior probability is assigned to each region (4, 23). Specifically, roh15 on chromosome 1q contains NOS1AP (formerly CAPON), which has been related to schizophrenia in both genetic linkage and association studies, as well as in postmortem gene-expression studies (24). This protein competes with PSD95 for binding to neuronal nitric oxide synthase (nNOS), thereby disrupting neuronal NMDA receptor transmission at the postsynaptic density. Similarly, roh52 contains ATF2, a downstream target of the mitogen-activated protein kinase/extracellular signal-regulated kinase signaling pathway triggered by nNOS; protein levels of activating transcription factor 2 have been reported to be elevated in postmortem SCZ brain tissue (25). Further, roh314 contains NSF (encoding N-ethylmaleimide sensitive fusion), which regulates dissociation of the SNARE complex and binds to the GluR2 subunit of AMPA glutamate receptors. Abnormalities in this gene have been also linked with schizophrenia in both gene-expression and genetic-association studies (23, 26). In addition to NSF, roh314 (at chromosome 17q21) contains MAPT (microtubule-associated protein tau). MAPT has been reported to contain a common inversion under selective pressure, resulting in a distinctive haplotypic genealogy that has been associated with multiple neurological disorders (27).
Two ROHs that were significantly overrepresented in patients with SCZ contained no known genes (roh321 on chromosome 18q and roh129 on 5q). Although both regions include one or more ESTs and may harbor as-yet-unknown regulatory elements, it is also possible that the extent of allelic hitchhiking is not fully captured by our ROH methodology and may impact genes immediately neighboring these regions (8). Consequently, the first gene located within 500 kb in either direction of these ROHs is listed in parentheses in Table 2. PIK3C3 (adjacent to roh129) encodes phosphoinositide-3-kinase, class 3. A promoter region variant in this gene has been associated with SCZ in three studies to date (23).
Finally, exploratory analyses examining binarized individual SNP data revealed subregions of two additional ROHs that were significantly overrepresented in SCZ cases relative to controls (SI Table 5). Segments of the very large ROH on chromosome 8 (roh172), demonstrated a strong differentiation between cases and controls (maximal
2 = 12.9, df = 1, P = 3.28 x 10–4) occurring directly in the coding region of SNTG1 (Fig. 1). Notably, SNTG1 is expressed exclusively in neurons, including hippocampal pyramidal cells, cerebellar Purkinje cells, and multiple cortical regions, where it binds to dystrophin, the dystrobrevins, and diacylglycerol kinase,
(DGKZ) in the postsynaptic density.
|
| Discussion |
|---|
|
|
|---|
Nevertheless, ROH frequency is a readily available measure for statistical comparisons in a case-control design. In case-control comparison, we observed that ROHs were overrepresented in SCZ at a genomewide level. The effect size (Cohen's d) was
0.30, a small to moderate effect comparable with the effect size seen across many biological studies of schizophrenia. Although subtle differences in ascertainment between groups cannot be fully ruled out, the finding of increased homozygosity associated with heightened disease risk is predicted by classical genetic models (30) and is supported by empirical data from Drosophila and other organisms (31). Intriguingly, studies of population isolates and consanguineous families demonstrate elevated rates of schizophrenia (32, 33).
The presence of nine specific ROHs was associated with illness susceptibility both individually and cumulatively. Four of these regions implicated genes related to postsynaptic (largely glutamatergic) receptor complexes implicated in SCZ pathophysiology. These genes include NOS1AP and NSF, each of which has been associated with schizophrenia, as well as GPHN and SGCD, which have not been previously examined in SCZ association studies. A fifth region spanning the coding region of SNTG1 was associated with SCZ in exploratory analyses; syntrophin abnormalities in SCZ are consistent with the accumulating evidence associating DTNBP1 haplotypic variation with SCZ susceptibility (23).
It should be noted that results for at least one risk region (roh314) may be influenced by the frequent presence of copy number variation at chromosome 17q21 (34); however, it is unlikely that results of the present study are primarily reflective of copy number variation, for four reasons. First, HapMap data suggest that duplications in this region are far more common than deletions (34), whereas deletions are more likely to create a spurious pattern of homozygous calls (35). Second, deletions in this region have been associated with mental retardation (36), which is not observed in our study. Third, chromosomal locations containing highly common ROHs (Table 1) are not generally marked by frequent copy number variation in publicly available databases (34). Fourth, inspection of raw intensity plots from microarrays analyzed for the present study are not consistent with frequent, large regions of copy number variation in the neighborhood of common ROHs (data not shown). Further research is needed to carefully examine the role of copy number variation in SCZ.
It is noteworthy that most of the risk ROHs demonstrated low frequencies in the general population. Future studies may determine whether these rare variants, conferring high risk ratios in small subpopulations, demarcate dissociable subtypes of illness at the genetic level. It is possible that this form of genetic heterogeneity coexists with the multifactorial, common-disease/common-variant mode of inheritance that is generally studied in whole-genome association. Twin studies of heritability of schizophrenia demonstrate considerable heterogeneity in MZ/DZ concordance rates (37), which may be consistent with a disease that can follow either multifactoral polygenicity or oligogenic heterogeneity modes of transmission in different families (38). As a simplified example, a single allele with 10% frequency (1% homozygosity) in the general population, conveying 10-fold increased risk under a recessive model, could account for a large portion of the sibling recurrent risk (estimated at 10%) in a small number of families with schizophrenia (10%). Such an allele would likely be missed by other methodologies, including standard WGA and linkage designs.
Finally, at least two risk ROHs were relatively common in controls, possibly reflecting positive selection. It is perhaps counterintuitive that such ROHs would be commonly observed in patients with schizophrenia. However, results are consistent with a model of rare, deleterious recessive effects associated with an allele or haplotype with overdominant properties (15, 39). These balancing effects may either be the result of the same allele, as in HBB and malaria, or from distal alleles that have hitchhiked on a region undergoing selection (5). Although WGHA currently lacks the spatial resolution to identify the causative allele(s), regions reported in the present study provide fairly narrow windows containing highly plausible candidates for further investigation.
| Methods |
|---|
|
|
|---|
Healthy controls (n = 144) were recruited by use of local newspaper advertisements, flyers, and community internet resources. After written informed consent was provided, the nonpatient SCID (SCID-NP) was administered to rule out the presence of an Axis I psychiatric disorder; a urine toxicology screen for drug use and an assessment of the subject's family history of psychiatric disorders were also performed. Exclusion criteria included (current or past) Axis I psychiatric disorder, psychotropic drug treatment, substance abuse, a first-degree family member with an Axis I psychiatric disorder, or the inability to provide written informed consent. Patients (65 female/113 male) and controls (63 female/81 male) did not significantly differ in sex distribution (P > 0.05).
All subjects self-identified as Caucasian, non-Hispanic. As described in ref. 20, population structure was tested by examination of 210 ancestry informative markers (AIMs). AIMs included all SNPs on the array that passed initial quality control procedures and demonstrated a frequency difference of
0.5 in comparisons between Caucasian individuals and Asians or African-Americans in data made publicly available by Shriver and colleagues (40) (http://146.186.95.23/biolab/voyage/psa.html). Two tests of structure were performed, both of which indicated no significant stratification. First, analysis with the STRUCTURE (41) program (using multiple levels of K) confirmed that all subjects were drawn from a single population; second, comparison of cases and controls on allelic frequency across the 210 AIMs revealed no differences beyond those expected by chance.
Genotyping.
Genomic DNA extracted from whole blood was hybridized to two oligonucleotide microarrays (42) containing
262,000 and
238,000 SNPs (mean spacing = 5.8 kb; mean heterozygosity = 27%) as per manufacturer's specifications (Affymetrix). Patients and controls were proportionally distributed on each plate and were processed together to minimize confounding plate artifacts. Genotype calls were obtained by using the Bayesian Robust Linear Model with Mahalanobis distance classifier (BRLMM) algorithm thresholded at 0.5 applied to batches of 100 samples. Quality control procedures followed several steps (20). First, samples that obtained mean call rates <90% across both chips (or <85% for a single chip) were rejected. Mean call rate of remaining samples (total n = 322) was 97%. Twenty two of these cases were successfully repeated, and concordance of the two calls (reliability) for each SNP was evaluated. SNPs with more than one discrepancy were excluded from further analyses. Concordance across the remaining 454,699 SNPs exceeded 99.4%. Additionally, 9,936 SNPs in the sex-linked (i.e., nonpseudoautosomal) portion of the X chromosome were deleted, yielding 444,763 SNPs available for WGHA analysis. For WGHA, individual SNPs with low call rates even in valid cases were included, as were SNPs not in Hardy–Weinberg equilibrium in the control sample, because SNPs with these properties may be indicative of structural genomic variation of interest (35). It should be noted that the major results reported in Tables 1 and 2 were not substantively changed when analyses were performed on only the 439,511 SNPs that met strict QC criteria (Hardy–Weinberg equilibrium P > 0.001 in controls, and call rate >85). Specifically, patients still exhibited an average of four more ROHs compared with controls (P = 0.006). Each of the nine "risk ROHs" described in Table 2 remained significant at the P < 0.01 level. Additionally, each of the nine most common ROHs in healthy controls (Table 1) remained prevalent at a frequency
24%. All statistical analyses described above were conducted by using HelixTree software (Golden Helix).
WGHA: Construction of ROHs. WGHA analysis entails several within-subject and across-subject analytic steps, each performed with customized python scripting in the HelixTree environment: First, SNP data from each chromosome of each subject were interrogated for runs of homozygosity (ROHs), which are long series of consecutive SNPs that are homozygous (uncalled SNPs are permitted within a run, because these may indicate genomic phenomena of interest). A fixed threshold of 100 consecutive SNPs was selected in a manner analogous to a recently published study that used the Affymetrix 10K chip to study recessive effects in known consanguineous families (28). It is acknowledged that patterns of LD are quite variable across the genome and that the threshold could be dynamically adjusted to account for this regional variability. However, dynamically adjusting ROH requirements for regional LD would confound the primary goal of the ROH approach, which is the identification of regions of strong, extended LD.
Additionally, it is possible that variable SNP properties on the Affymetrix array can result in a nonuniform distribution of heterozygosity; for example, locally dense marker spacing or lower minor allele frequency could result in ROHs that do not reflect meaningful biological phenomena. We addressed this potential confound to the interpretability of ROHs in two ways: (i) Mean SNP density across all 339 ROHs in SI Table 4 (
7.1 kb) was lower than SNP density than the average across the entire 500K array (
5.8 kb). Even excluding seven ROHs that span centromeres, which might artificially inflate the SNP spacing, the average marker spacing in the remaining ROHs is 6.0 kb, which is still slightly greater than mean spacing across the array. (ii) Minor allele frequency (and thus, heterozygosity) of SNPs in common ROHs (as identified in ZHH controls) was higher than the array average when measured in HapMap YRI samples (see Results).
Our criterion of 100 consecutive SNPs was selected to be more than an order of magnitude larger than the mean haploblock size in the human genome, without being so large as to be very rare, which would prohibit meaningful group comparisons. As an approximation, putting aside regional variability in LD and heterozygosity, the likelihood of observing 100 consecutive chance events can be described as follows: Because mean heterozygosity across all SNPs in the ZHH was observed to be 27%, any given SNP has, on average, a 0.73 chance of being called homozygous. Given 444,763 reliable SNPs and 322 subjects, a minimum run length of 70 would be required to produce <5% randomly generated ROHs across all subjects (0.7370 x 444,763 x 322 = 0.04), assuming complete independence of all SNPs. Because of linkage disequilibrium, SNP calls are not fully independent, thereby inflating the likelihood of chance occurrence of biologically meaningless ROHs. Genomewide identification of tag SNPs within windows of 70 markers by using the Carlson method (2) as implemented in HelixTree revealed 314,869 separable tag groups, representing a 29.3% reduction of information compared with the total number of original SNPs. Thus, run size of 100 SNPs was selected to approximate the degrees of freedom of 70 independent SNP calls.
Each subject's SNP data were then converted to binary calls (0 or 1) at each position indicating whether that SNP is a member of a ROH for that individual. Next, at each position, data from all subjects were examined to determine whether a minimum number of individuals share a ROH call at a given position. Because the purpose of this investigation was the identification of statistical differences between biologically meaningful ROHs in a case-control design, SNPs with <10 ROH calls across the entire sample were eliminated, resulting in 65,422 SNPs with 10 or more ROH calls, an 85% reduction from the original pool of SNPs. Taking this strategy a step further, "common" ROHs were identified that contained a minimum of 100 consecutive ROH calls across 10 or more subjects. A total of 339 such ROHs were identified across the genome, ranging in size from 100 to 852 SNPs in length (mean = 161, SD = 82, median = 133, see SI Table 4). A subject whose individual ROH calls overlapped with a common ROH was called present for that common ROH. Thus, each subject could have a total (sum) score ranging from 0 to 339.
WGHA Statistical Plan.
Based on these definitions, the statistical plan followed several steps for the identification of differences between cases and controls. First, this total score for common ROHs was compared between cases and controls by using Student's t test; this constituted a single genomewide test for difference in ROH frequency, with
set to 0.05. Next, as a planned post hoc examination of any significant genomewide difference, case-control comparisons of frequency of presence for each common ROH were examined by using
2 tests (or Fisher's exact test when expected values <10 were found for any cell); although
would be protected by the preceding genomewide comparison, the threshold for significance for this analysis was set to P < 0.01 to further reduce the risk of false positives. Third, the cumulative effect of these risk-imparting ROHs (i.e., the dose-dependence of the presence of "risk ROHs") was tested with logistic regression. Because the predictor variables for these logistic regression analyses were the ROHs already identified as significantly differentiating cases and controls, the raw P values for these regressions should be considered as strongly anticonservative. Therefore, empirical P values were calculated by using 100,000 permutations of the full ROH dataset for each regression analysis.
Finally, as an exploratory analysis to potentially identify smaller regions of difference between cases and controls,
2 tests were performed on the 54,600 binarized SNP calls within common ROHs. Analogous to the dual-thresholding procedures commonly used in voxelwise brain imaging studies (43), statistical significance for these exploratory analyses was defined as 50 or more consecutive SNPs significantly differing between cases and controls at the P < 0.01 level (see SI Text for WGHA methods summary).
| Acknowledgements |
|---|
|
|
|---|
| Footnotes |
|---|
To whom correspondence should be addressed. E-mail: lencz{at}lij.eduFreely available online through the PNAS open access option.
Author contributions: T.L., J.M.K., and A.K.M. designed research; T.L., T.V.M., R.K., and A.K.M. performed research; T.L. and C.L. contributed new reagents/analytic tools; T.L., C.L., P.D., K.E.B., and A.K.M. analyzed data; and T.L. and A.K.M. wrote the paper.
Conflict of interest statement: C.L. is employed with Golden Helix and holds >5% equity in the company. Golden Helix subsequently developed a commercial version of the data analysis methodologies described in this paper.
This article contains supporting information online at www.pnas.org/cgi/content/full/0710021104/DC1.
© 2007 by The National Academy of Sciences of the USA
| References |
|---|
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||