New Research In
Physical Sciences
Social Sciences
Featured Portals
Articles by Topic
Biological Sciences
Featured Portals
Articles by Topic
- Agricultural Sciences
- Anthropology
- Applied Biological Sciences
- Biochemistry
- Biophysics and Computational Biology
- Cell Biology
- Developmental Biology
- Ecology
- Environmental Sciences
- Evolution
- Genetics
- Immunology and Inflammation
- Medical Sciences
- Microbiology
- Neuroscience
- Pharmacology
- Physiology
- Plant Biology
- Population Biology
- Psychological and Cognitive Sciences
- Sustainability Science
- Systems Biology
The human gene damage index as a gene-level approach to prioritizing exome variants
Contributed by Jean-Laurent Casanova, September 22, 2015 (sent for review June 26, 2015); reviewed by Jay Shendure and David B. Goldstein

Significance
The protein-coding exome of a patient with a monogenic disease contains about 20,000 variations, of which only one or two are disease causing. When attempting to select disease-causing candidate mutation(s), a challenge is to filter out as many false-positive (FP) variants as possible. In this study, we describe the gene damage index (GDI), a metric for the nonsynonymous mutational load in each protein-coding gene in the general population. We show that the GDI is an efficient gene-level method for filtering out FP variants in genes that are highly damaged in the general population.
Abstract
The protein-coding exome of a patient with a monogenic disease contains about 20,000 variants, only one or two of which are disease causing. We found that 58% of rare variants in the protein-coding exome of the general population are located in only 2% of the genes. Prompted by this observation, we aimed to develop a gene-level approach for predicting whether a given human protein-coding gene is likely to harbor disease-causing mutations. To this end, we derived the gene damage index (GDI): a genome-wide, gene-level metric of the mutational damage that has accumulated in the general population. We found that the GDI was correlated with selective evolutionary pressure, protein complexity, coding sequence length, and the number of paralogs. We compared GDI with the leading gene-level approaches, genic intolerance, and de novo excess, and demonstrated that GDI performed best for the detection of false positives (i.e., removing exome variants in genes irrelevant to disease), whereas genic intolerance and de novo excess performed better for the detection of true positives (i.e., assessing de novo mutations in genes likely to be disease causing). The GDI server, data, and software are freely available to noncommercial users from lab.rockefeller.edu/casanova/GDI.
Footnotes
- ↵1To whom correspondence may be addressed. Email: yitan{at}rockefeller.edu or casanova{at}rockefeller.edu.
↵2L.Q.-M. and J.-M.C. contributed equally to this work.
↵3S.-Y.Z., L.A., and J.-L.C. contributed equally to this work.
Author contributions: Y.I., B.B., A. Bolze, J.G.M., R.M.-B., S.B.-D., D.N.C., L.Q.-M., J.-M.C., S.-Y.Z., L.A., and J.-L.C. designed research; Y.I., L.S., E.P., M.M.-V., E.S., M.J.C., F.G.L., S.J.d.J., X.-F.K., J.B., A.P., P.D.S., J.G.G., and S.-Y.Z. performed research; Y.I., B.B., E.P., M.M.-V., E.S., J.G.M., S.J.d.J., P.N., J.B., S.B.-D., P.D.S., J.G.G., D.N.C., L.Q.-M., J.-M.C., S.-Y.Z., and L.A. contributed new reagents/analytic tools; Y.I., L.S., E.P., E.S., P.N., A. Belkadi, and P.D.S. analyzed data; and Y.I., A. Bolze, M.J.C., R.M.-B., X.-F.K., L.Q.-M., J.-M.C., S.-Y.Z., L.A., and J.-L.C. wrote the paper.
Reviewers: J.S., University of Washington; D.B.G., Columbia University.
The authors declare no conflict of interest.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1518646112/-/DCSupplemental.
Freely available online through the PNAS open access option.
Citation Manager Formats
Sign up for Article Alerts
Jump to section
You May Also be Interested in
More Articles of This Classification
Biological Sciences
Related Content
- No related articles found.
Cited by...
- NFIL3 mutations alter immune homeostasis and sensitise for arthritis pathology
- Blacklisting variants common in private cohorts but not in public databases optimizes human exome analysis
- Human IFN-{gamma} immunity to mycobacteria is governed by both IL-12 and IL-23
- Life-threatening influenza pneumonitis in a child with inherited IRF9 deficiency
- The human CIB1-EVER1-EVER2 complex governs keratinocyte-intrinsic immunity to {beta}-papillomaviruses
- Analysis of the recovery of CD247 expression in a PID patient: insights into the spontaneous repair of defective genes
- Combined immunodeficiency and Epstein-Barr virus-induced B cell malignancy in humans with inherited CD70 deficiency
- Dual T cell- and B cell-intrinsic deficiency in humans with biallelic RLTPR mutations
- Whole-exome sequencing to analyze population structure, parental inbreeding, and familial linkage