Abstract
There is growing recognition that epivariations, most often recognized as promoter hypermethylation events that lead to gene silencing, are associated with a number of human diseases. However, little information exists on the prevalence and distribution of rare epigenetic variation in the human population. In order to address this, we performed a survey of methylation profiles from 23,116 individuals using the Illumina 450k array. Using a robust outlier approach, we identified 4,452 unique autosomal epivariations, including potentially inactivating promoter methylation events at 384 genes linked to human disease. For example, we observed promoter hypermethylation of BRCA1 and LDLR at population frequencies of ∼1 in 3,000 and ∼1 in 6,000, respectively, suggesting that epivariations may underlie a fraction of human disease which would be missed by purely sequence-based approaches. Using expression data, we confirmed that many epivariations are associated with outlier gene expression. Analysis of variation data and monozygous twin pairs suggests that approximately two-thirds of epivariations segregate in the population secondary to underlying sequence mutations, while one-third are likely sporadic events that occur post-zygotically. We identified 25 loci where rare hypermethylation coincided with the presence of an unstable CGG tandem repeat, validated the presence of CGG expansions at several loci, and identified the putative molecular defect underlying most of the known folate-sensitive fragile sites in the genome. Our study provides a catalog of rare epigenetic changes in the human genome, gives insight into the underlying origins and consequences of epivariations, and identifies many hypermethylated CGG repeat expansions.
Original language | English |
---|---|
Pages (from-to) | 654-669 |
Number of pages | 16 |
Journal | American Journal of Human Genetics |
Volume | 107 |
Issue number | 4 |
DOIs | |
Publication status | Published - 1 Oct 2020 |
Keywords
- DNA methylation
- epimutation
- epivariation
- folate-sensitive fragile site
- repeat expansion
ASJC Scopus subject areas
- Genetics
- Genetics(clinical)
Access to Document
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver
}
A Survey of Rare Epigenetic Variation in 23,116 Human Genomes Identifies Disease-Relevant Epivariations and CGG Expansions. / Garg, Paras; Jadhav, Bharati; Rodriguez, Oscar L.; Patel, Nihir; Martin-Trujillo, Alejandro; Jain, Miten; Metsu, Sofie; Olsen, Hugh; Paten, Benedict; Ritz, Beate; Kooy, R. Frank; Gecz, Jozef; Sharp, Andrew J.
In: American Journal of Human Genetics, Vol. 107, No. 4, 01.10.2020, p. 654-669.Research output: Contribution to journal › Article › peer-review
TY - JOUR
T1 - A Survey of Rare Epigenetic Variation in 23,116 Human Genomes Identifies Disease-Relevant Epivariations and CGG Expansions
AU - Garg, Paras
AU - Jadhav, Bharati
AU - Rodriguez, Oscar L.
AU - Patel, Nihir
AU - Martin-Trujillo, Alejandro
AU - Jain, Miten
AU - Metsu, Sofie
AU - Olsen, Hugh
AU - Paten, Benedict
AU - Ritz, Beate
AU - Kooy, R. Frank
AU - Gecz, Jozef
AU - Sharp, Andrew J.
N1 - Funding Information: The Biobank-Based Integrative Omics Studies (BIOS) Consortium is funded by BBMRI-NL, a research infrastructure financed by the Dutch government (NWO 184.021.007). The Parkinson disease patient and control study was funded by NIEHS grants ES024356, R01ES10544, and P01ES016732. The Framingham Heart Study is conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with Boston University (Contract No. N01-HC-25195 and HHSN268201500001I). Additional funding for SABRe was provided by Division of Intramural Research, NHLBI, and Center for Population Studies, NHLBI. The Women’s Health Initiative (WHI) program is funded by the National Heart, Lung, and Blood Institute, National Institutes of Health, U.S. Department of Health and Human Services through contracts HHSN268201600018C, HHSN268201600001C, HHSN268201600002C, HHSN268201600003C, and HHSN268201600004C. This manuscript was not prepared in collaboration with investigators of the Framingham Heart Study or WHI and does not necessarily reflect the opinions or views of the Framingham Heart Study, WHI investigators, Boston University, or NHLBI. Funding Information: This work was supported by NIH grant NS105781 to A.J.S., NIH predoctoral fellowship NS108797 to O.R., and American Heart Association Postdoctoral Fellowship 18POST34080396 to A.M.T. R.F.K. acknowledges support of the Research Fund of the University of Antwerp (Methusalem-OEC grant – “ GENOMED ”). Research reported in this paper was supported by the Office of Research Infrastructure of the National Institutes of Health under award number S10OD018522 . The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. This work was supported in part through the computational resources and staff expertise provided by Scientific Computing at the Icahn School of Medicine at Mount Sinai. Funding Information: This work was supported by NIH grant NS105781 to A.J.S. NIH predoctoral fellowship NS108797 to O.R. and American Heart Association Postdoctoral Fellowship 18POST34080396 to A.M.T. R.F.K. acknowledges support of the Research Fund of the University of Antwerp (Methusalem-OEC grant ? ?GENOMED?). Research reported in this paper was supported by the Office of Research Infrastructure of the National Institutes of Health under award number S10OD018522. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. This work was supported in part through the computational resources and staff expertise provided by Scientific Computing at the Icahn School of Medicine at Mount Sinai. Data used in the preparation of this article were obtained from the Parkinson's Progression Markers Initiative (PPMI) database (see Web Resources). PPMI, a public-private partnership, is funded by the Michael J. Fox Foundation for Parkinson's Research and funding partners, a full list of which can be found online. The Biobank-Based Integrative Omics Studies (BIOS) Consortium is funded by BBMRI-NL, a research infrastructure financed by the Dutch government (NWO 184.021.007). The Parkinson disease patient and control study was funded by NIEHS grants ES024356, R01ES10544, and P01ES016732. The Framingham Heart Study is conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with Boston University (Contract No. N01-HC-25195 and HHSN268201500001I). Additional funding for SABRe was provided by Division of Intramural Research, NHLBI, and Center for Population Studies, NHLBI. The Women's Health Initiative (WHI) program is funded by the National Heart, Lung, and Blood Institute, National Institutes of Health, U.S. Department of Health and Human Services through contracts HHSN268201600018C, HHSN268201600001C, HHSN268201600002C, HHSN268201600003C, and HHSN268201600004C. This manuscript was not prepared in collaboration with investigators of the Framingham Heart Study or WHI and does not necessarily reflect the opinions or views of the Framingham Heart Study, WHI investigators, Boston University, or NHLBI. Funding Information: Data used in the preparation of this article were obtained from the Parkinson’s Progression Markers Initiative (PPMI) database (see Web Resources ). PPMI, a public-private partnership, is funded by the Michael J. Fox Foundation for Parkinson’s Research and funding partners, a full list of which can be found online.
PY - 2020/10/1
Y1 - 2020/10/1
N2 - There is growing recognition that epivariations, most often recognized as promoter hypermethylation events that lead to gene silencing, are associated with a number of human diseases. However, little information exists on the prevalence and distribution of rare epigenetic variation in the human population. In order to address this, we performed a survey of methylation profiles from 23,116 individuals using the Illumina 450k array. Using a robust outlier approach, we identified 4,452 unique autosomal epivariations, including potentially inactivating promoter methylation events at 384 genes linked to human disease. For example, we observed promoter hypermethylation of BRCA1 and LDLR at population frequencies of ∼1 in 3,000 and ∼1 in 6,000, respectively, suggesting that epivariations may underlie a fraction of human disease which would be missed by purely sequence-based approaches. Using expression data, we confirmed that many epivariations are associated with outlier gene expression. Analysis of variation data and monozygous twin pairs suggests that approximately two-thirds of epivariations segregate in the population secondary to underlying sequence mutations, while one-third are likely sporadic events that occur post-zygotically. We identified 25 loci where rare hypermethylation coincided with the presence of an unstable CGG tandem repeat, validated the presence of CGG expansions at several loci, and identified the putative molecular defect underlying most of the known folate-sensitive fragile sites in the genome. Our study provides a catalog of rare epigenetic changes in the human genome, gives insight into the underlying origins and consequences of epivariations, and identifies many hypermethylated CGG repeat expansions.
AB - There is growing recognition that epivariations, most often recognized as promoter hypermethylation events that lead to gene silencing, are associated with a number of human diseases. However, little information exists on the prevalence and distribution of rare epigenetic variation in the human population. In order to address this, we performed a survey of methylation profiles from 23,116 individuals using the Illumina 450k array. Using a robust outlier approach, we identified 4,452 unique autosomal epivariations, including potentially inactivating promoter methylation events at 384 genes linked to human disease. For example, we observed promoter hypermethylation of BRCA1 and LDLR at population frequencies of ∼1 in 3,000 and ∼1 in 6,000, respectively, suggesting that epivariations may underlie a fraction of human disease which would be missed by purely sequence-based approaches. Using expression data, we confirmed that many epivariations are associated with outlier gene expression. Analysis of variation data and monozygous twin pairs suggests that approximately two-thirds of epivariations segregate in the population secondary to underlying sequence mutations, while one-third are likely sporadic events that occur post-zygotically. We identified 25 loci where rare hypermethylation coincided with the presence of an unstable CGG tandem repeat, validated the presence of CGG expansions at several loci, and identified the putative molecular defect underlying most of the known folate-sensitive fragile sites in the genome. Our study provides a catalog of rare epigenetic changes in the human genome, gives insight into the underlying origins and consequences of epivariations, and identifies many hypermethylated CGG repeat expansions.
KW - DNA methylation
KW - epimutation
KW - epivariation
KW - folate-sensitive fragile site
KW - repeat expansion
UR - http://www.scopus.com/inward/record.url?scp=85091662642&partnerID=8YFLogxK
U2 - 10.1016/j.ajhg.2020.08.019
DO - 10.1016/j.ajhg.2020.08.019
M3 - Article
C2 - 32937144
AN - SCOPUS:85091662642
VL - 107
SP - 654
EP - 669
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
SN - 0002-9297
IS - 4
ER -