Abstract
Background/Aims
Phenotypic discordance in monozygotic (MZ) twin pairs can have an epigenetic or genetic basis. Although age-related macular degeneration (AMD) has a strong genetic component, few studies have addressed it’s epigenetic basis.
Methods
Using SNP arrays, we evaluated differences in copy-number variation (CNV) and allele-specific methylation (ASM) patterns (via methyl-sensitive restriction enzyme digestion of DNA) in MZ twin pairs from the US Twin Study of AMD. Further analyses examined the relationship between ASM and CNVs with AMD by both case/control analysis of ASM at candidate regions and by analysis of ASM and CNVs in twins discordant for AMD.
Results
The frequency of ASM sites differ between cases and controls in regions surrounding the AMD-candidate genes CFH, C2 and CFB. While ASM patterns show substantial dependence on local sequence polymorphisms, we observe dissimilar patterns of ASM between MZ twins. Genes closest to sites where discordant MZ twins have dissimilar patterns of ASM are enriched for genes implicated in gliosis, a process associated with neovascular AMD. Similar twin-based analyses revealed no AMD associated CNVs.
Conclusions
Our results provide evidence of epigenetic influences beyond the known genetic susceptibility and implicate inflammatory responses and gliosis in the etiology of AMD.
Keywords: allele-specific methylation, AMD, CNV, epigenetic, monozygotic, twin
BACKGROUND
The US Twin Study of Age-Related Macular Degeneration (AMD), a population-based study of monozygotic (MZ) and dizygotic (DZ) twin pairs who are concordant and discordant for AMD, was established in 1988 to evaluate the relative effects of genetics and the environment on AMD[1]. The results of this study concluded that the heritability of AMD is between 46% and 71%, with more advanced cases having higher heritability of between 67% and 71% [1]. As shown in our previous studies of twins including the study of monozygotic twin pairs discordant for AMD, environmental factors, including smoking and diet, also contribute to this disease, and together with genetic factors are predictive of the onset and progression of the disease [2–4].
Since genetic susceptibility is a strong determinant of the development and progression of AMD, identical MZ twin pairs in which one twin is affected with intermediate or advanced disease, and the co-twin is free of disease or in the early stages, are a unique population to assess somatic differences and how they contribute to AMD. One unexplored source of these differences is post-twinning de-novo Copy Number Variation (CNV), and MZ twins have been reported to display different CNV profiles [5]. More commonly, differential environmental exposure is thought to cause MZ twin differences, as suggested in our previous report showing that smoking and dietary intake of betaine, methionine, and vitamin D differ between MZ twins discordant for AMD [3]. It is possible that environmental exposures exert their influence through molecular mechanisms such as DNA methylation [6], an epigenetic process that can influence gene expression [7].
Previous studies of DNA methylation in MZ twins have demonstrated DNA methylation differences between twins [8], particularly as the twins aged [9]. Studies of DNA methylation have traditionally focused on two types of genomic regions 1) CpG islands which are statistically improbable clusters of CpGs occurring in gene promoters, and 2) differentially methylated regions associated with genomic imprinting, where methylation is associated with one allele in a parent of origin manner. Our recent studies [10,11] as well as others [12,13] have demonstrated a novel type of differential methylation where the methylation mark is consistently associated with one allele. Often termed allele-specific methylation (or ASM), the phenomenon can be influenced to varying degrees by DNA sequence within a population, ranging from complete association of methylation and genotype (i.e. cis-regulated ASM) [11–13], to more stochastic associations, where either allele may be associated with the methylation mark. As ASM is associated with expression changes in nearby genes, variation in differential methylation among individuals, including MZ twins, has the potential to affect phenotypic variation [12–15].
In this report, we examined whether differences in CNVs and ASM profiles correlated with AMD phenotype between twins utilizing two sources of data: 1) a standard analysis of genome-wide copy-number variation among 66 individuals (a total of 33 twin pairs; 7 case pairs, 16 control pairs, and 10 phenotypically discordant pairs) using Affymetrix 6.0 SNP arrays and 2) an assay of ASM, using methyl-sensitive restriction enzyme (MSRE) digestions paired with Affymetrix 6.0 SNP arrays, at almost 100,000 genomic regions in a subset of these samples. The subset for differential methylation analyses included 11 MZ twin pairs: 2 with phenotypically discordant AMD status, 4 case pairs and 5 control pairs, and fourteen singleton samples.
METHODS
For further details on methods development, see Supplemental Methods in the Supplementary Material.
Study Population
The study population for the US Twin Study of Macular Degeneration was derived from the National Academy of Sciences-National Research Council World War II Veteran Twin Registry which includes medical information for 15,924 white male twin pairs born between 1917 and 1927 and was used for both CNV and methylation analyses [16]. Details of the twin study methods have been previously described [1,4]. Briefly, among 12,126 individuals surveyed, 684 (5.6%) reported having been diagnosed with AMD, and 820 individual twins were enrolled in the Twin Study of Age-Related Macular Degeneration including 381 MZ and DZ pairs and 58 singletons [1]. From this population we selected MZ twin pairs discordant for stage of AMD. We defined discordant pairs as each co-twin having a different Clinical Age-Related Maculopathy Staging System (CARMS) grade [17] at last known assessment, with one twin being affected with intermediate AMD (large drusen and/or drusenoid retinal pigment epithelial detachment or advanced AMD (either geographic atrophy (GA) or neovascular (NV) disease), and the other co-twin with early stages or no AMD. Blood samples were drawn, DNA extracted and cell lines established using standard procedures. Institutional review boards approved the research protocols and all participants signed consent statements. Research adhered to the tenets of the Declaration of Helsinki.
Macular Characteristics
Fundus examination and standard fundus photography were performed to record the stage of AMD without knowledge of the zygosity of the twins. Photographs of the macula were graded using a grid according to the Wisconsin Age-Related Macular Degeneration Grading System [18] for detailed evaluation of macular drusen and retinal pigment characteristics. Study examination data and photographs were also graded by JMS and classified into normal or no AMD, (grade 1) early (grade 2), intermediate (grade 3), and advanced AMD stages of geographic atrophy (grade 4) and neovascular disease (grade 5) according to the CARMS grading system [17] (Table 1 and Supplemental Table 1).
Table 1 -. Sample AMD grades.
Grades of matched twins (A and B) and singletons (C) used for CNV analyses (A) and methylation analyses (B and C), using the Clinical Age-Related Maculopathy Staging System [17].
A - CNV Analysis Twins | |||||||||
---|---|---|---|---|---|---|---|---|---|
Twin-II | |||||||||
Affected | Unaffected | ||||||||
Grade | CARMS 5 | CARMS 4 | CARMS 3 | CARMS 2 | CARMS 1 | Total | |||
Twin-I | CARMS 5 | 4 | 3 | 6 | 13 | ||||
Affected | CARMS 4 | 3 | 1 | 4 | |||||
CARMS 3 | 0 | ||||||||
Unaffected | CARMS 2 | 0 | |||||||
CARMS 1 | 16 | 16 | |||||||
Total | 4 | 3 | 0 | 9 | 17 | 33 | |||
B - Methylation Analysis Twins | |||||||||
Twin-II | |||||||||
Affected | Unaffected | ||||||||
Grade | CARMS 5 | CARMS 4 | CARMS 3 | CARMS 2 | CARMS 1 | Total | |||
Twin-I | Affected | CARMS 5 | 1 | 3 | 2 | 6 | |||
CARMS 4 | 0 | ||||||||
CARMS 3 | 0 | ||||||||
Unaffected | CARMS 2 | 0 | |||||||
CARMS 1 | 5 | 5 | |||||||
Total | 1 | 3 | 0 | 2 | 5 | 11 | |||
C - Methylation Analysis Singletons | |||||||||
Affected | Unaffected | ||||||||
Grade | CARMS5 | CARMS4 | CARMS3 | CARMS2 | CARMS1 | Total | |||
Total | 2 | 1 | 3 | 8 | 14 |
Zygosity Status and Genotyping
Each MZ twin pair was genotyped on the Affymetrix 6.0 (Affymetrix Inc., Santa Clara, California) platform at the Broad Institute Center for Genotyping and Analysis (Cambridge, MA). The Affymetrix 6.0 has probesets against 906,600 SNPs and 940,000 CNV probes. Data for each twin pair was analyzed using PLINK [19] for basic quality assurance and quality control (QAQC) metrics. Samples that performed at acceptable QAQC levels in the SNP-scan part of our GWAS were used for the CNV part of our analysis. Zygosity was determined by an identity-by-descent score (IBD), which looks at overall genotypic sharing as a function of relatedness. To pass zygosity checks, we required twin pairs to have IBD scores of more than 0.95. Known AMD genes were also genotyped and twins in each pair had identical genotypes.
CNV analyses
The initial study population consisted of 35 twin pairs. Two of these twin pairs failed the initial genotyping based zygosity checks so 66 individuals (total of 33 twin pairs; 7 case pairs, 16 control pairs, and 10 phenotypically discordant pairs) were passed through to the CNV twin pair analyses.
CNVs were identified using Birdseye [20] which identifies rare CNVs by integrating intensity data from neighboring probes using a Hidden Markov Model (HMM) on a per-individual basis. Performance is dependent on a number of factors including: SNP and copy number probe density, mean intra-individual probe variance and CNV frequency. For each CNV a LOD (logarithm of odds) score was generated that describes the likelihood of the CNV relative to no CNV over the given interval.
Within the 66 samples, we observed 297,345 unfiltered (i.e. without Birdseye baseline filters such as length of deletion event, number of probes identified in the region, contiguity of probes identified in a region) regions of copy number deletions that differed between twins. We restricted our search to deletions as they have much stronger signal-to-noise signal properties than duplications. A CNV duplication, unless there are multiple duplications, will only provide a 3:2 (one chromosomal duplication) or 4:2 (two chromosomal duplications) ratio, which is much harder to see than a 2:1 (one chromosomal deletion) or 2:0 (two chromosomal deletions. In order to obtain a high-quality dataset, we restricted our analysis to the 307 CNVs with a LOD > 10 and physical length greater than 100kb within our analysis set. We imposed a ~1% frequency threshold by removing any CNV with greater than 50% of its length spanning a region with more than 65 out of ~6500 CNVs in the total sample [21]. We also removed events spanning regions of common CNV (>1%) in the CEU HapMap individuals identified using Affymetrix array data generated in the same laboratory and called with the same analytic pipeline. Additionally, we removed regions identified in the Database of Genomic Variants [22]. As a final step, we joined any CNVs that appeared to be artificially split by the HMM used in analysis, and also removed any CNVs that spanned known large gaps (> 200kb) in the hg18 reference genome or regions of known rearrangement (e.g. hg18: chr2:88695164–95087413; chr14:104644530–106268819; chr22:20797557–21512883) to yield novel CNV deletion events that differ between twins [23].
Methylation Analyses - Samples
DNA samples for methylation analyses were all derived from whole blood. Cell lines were not considered for these analyses since we found that, in agreement with recent reports [24], methylation patterns differ between lymphoblast cell lines and whole blood DNA derived from the same subjects. Among the 36 samples, 14 were singletons used only for case-control analyses, and 22 were twins used in the twin pair methylation analyses. Of the 14 singletons, 12 samples were derived from twins pairs for which one twin had only cell line DNA and 2 were derived from the two twin pairs which failed the zygosity checks in the CNV QAQC steps above. The other 22 samples were from 11 twin pairs (Table 1 and Supplemental Table 1) consisting of 5 concordant control twin pairs (both twins grade 1), 4 concordant case twin pairs and 2 twin pairs discordant for AMD (both twins grade 5 for one pair and 3 pairs with one twin grade 4 and the other twin grade 5) and 2 twin pairs discordant for AMD (one twin grade 5 and other twin grade 2). These samples were used in both case-control and twin pair methylation analyses. To adjust for technical variation. all methylation samples were analyzed as technical replicates on two independetn Affymetrix arrays. Probe intensities used in further analyses represent the mean values from these technical replicates.
Methylation Analyses - MSRE Digest and Validation
High throughput methylation assay: genomic DNA was genotyped on the Affymetrix 6.0 SNP mapping arrays according to the manufacturer’s manual (www.affymetrix.com). For methylation analysis, 3 μg of genomic DNA was digested at 37 degrees Celsius for16 hours with a methyl sensitive restriction enzyme (MSRE) cocktail including Aci I (60 units), BsaH I (3.9 units), Hha I (7.5 units), Hpa II (7.5 units), and HpyCH4 IV (30 units) (New England Biolabs), in a 200 μL reaction volume with 1% BSA and 10% NEB buffer #4 (New England Biolabs) and heat inactivated for 20 minutes at 60°C. Samples were ethanol precipitated, washed in 70% ethanol, and resuspended in reduced EDTA TE (5mM Tris, 0.1mM EDTA) at 50 ng/μL. After this pre-treatment, the DNA enters the mapping array procedure; it is further digested with the Nsp I and Sty I restriction enzymes, and fragments of 100–1,100 bp (containing the polymorphic sites to be assessed) are PCR amplified, with the resulting amplicons then end-labeled and hybridized to the array.
Efficacy of the independent MSRE digestions was determined by examining MSRE action on amplicons with solitary cut sites for a component MSRE in a control mix run alongside the AMD samples. Our results showed that amplicons with single MSRE cut sites, with the exception of HhaI, exhibited greatly reduced combined probe intensities as compared to amplicons with no MSRE sites (MNRs) (Supplemental Figure 1). Any amplicons with only HhaI sites (6,132 in total) were ignored in further analyses.
Methylation Analyses - Array Quality Control
To eliminate replicates with poor array hybridizations, the un-normalized probe intensities of the replicates of MSRE digested samples were compared by linear regression. All replicate coefficients of determination were within 2.5 standard deviations of the mean of the entire sample set (mean=0.919, standard deviation=0.125). After normalization, the mean coefficient of determination was 0.94 with a standard deviation of 0.028. All but two samples had coefficients within 2.5 standard deviations of this mean (T39B-0.86 and T27B-0.87).
Methylation Analyses - Array Normalization and ASM Detection
Briefly, invariant probe sets were first quantile normalized between arrays and the normalized values for variant probe sets then interpolated from the normalized invariant probe sets derived from their respective arrays. After various levels of technical cleanup, including eliminating probe sets that performed poorly, allele-specific methylation at heterozygous SNPs was detected as a sample and SNP-specific normalized deviation from the HapMap derived heterozygote relative probe intensity (see Supplemental Methods for more details).
The potential for artifacts arising from polymorphisms in MSRE recognition sites was eliminated as described previously [11]. Briefly, we excluded all SNPs on the Affymetrix array residing on amplicons containing any known polymorphism in an MSRE recognition site in the dbSNP129 database [25]. These filters were used for all such polymorphisms present at any frequency in the population. Unless otherwise noted, this filter was used in all analyses. Although this filter discards loci that may not be polymorphic for the individuals used in this study, we conservatively chose to ensure robust analyses.
For validation of these methods on known ASM regions se the Supplemental Results section of the Supplemental Materials.
Methylation Analyses – Independent Confirmation
We employed 454 large-scale parallel pyrosequencing to examine SNP and CpG site methylation status. Primers for bisulfite PCR (BSP) amplification were designed with Methprimer [26] and BiSearch [27] to uniquely amplify fragments of less than 500 bp encompassing the MSRE sites and target SNP. All amplifications were strand specific and designed to allow observation of the SNP status after bisulfite conversion and amplification; for C/T SNPs, the guanine strand was amplified. Primer sequences (against bisulfite converted DNA) and amplicon genomic locations for all amplicons can be found in Supplemental Table 2. DNA samples were bisulfite converted with the Qiagen EpiTect 96 Bisulfite Kit according to manufacturer’s instructions, quantitated by Nanodrop and amplified in 96 well plates for in 20 ul PCR reactions comprised of 10ng of converted DNA, 1 unit of Qiagen HotStarTaq Plus, 50 picomoles of each primer (2.5 μM final concentration), 1X PCR buffer, and 200 nanomoles of dNTPs (10 mM final concentration). All amplifications were run for 35 cycles. Product sizes were visually confirmed by agarose gel electrophoresis, quantitated by Picogreen, and equimolar amounts from each target’s amplification products mixed for each individual before bar-code indexed 454 Pyrosequencing. After sequencing, pooled reads were deindexed and adapters and low quality sequences clipped; reads shorter than 100 were removed. Reads were non-directionally aligned to amplicon sequences with Bismark [28] with the Bowtie 2 gapped. Bisulfite conversion rates of above 95% were confirmed and reads not extending over both the amplicon’s target SNP and MSRE sites discarded. Genotypes at target SNPs were called in VCF format with samtools mpileup [29] with extended BAQ, and a BAQ cutoff of 10. All amplicons had coverage depth of at least 20 reads and genotype qualities of at least 20 in a majority (>50%) of samples. SNPs did not significantly deviate (p>0.01) from the expected Hardy-Weinberg equilibrium within the sample population. Agreement with the expected minor allele frequency as defined by a relative mean difference less than 0.2 from the expected CEU minor allele frequency (HapMap release 27) was also confirmed for all amplicons. For the results of these validation experiments see the Supplemental Results section of the Supplemental Materials.
All unprocessed data are available by request from Dr. J. Seddon.
RESULTS
CNV Analyses
After quality control and filtering, we observed a total of 307 CNVs that were not shared between twins of a twin pair. These 307 unshared CNVs were distributed among the different twin pair types (e.g. case-case, control-control, case-control) rather than concentrated in any twin pair type. While the majority of these 307 unshared CNVs were commonly observed deletions, 8 were novel CNV deletion events. None of these 8 events were seen in more than one twin pair, and they were also distributed throughout the twin pair types. These eight CNV deletion events occur in genomic regions that are mostly devoid of genes, and where there are genes present, they are neither genes implicated in AMD by previous association studies nor are they paralogs of the previously implicated genes.
Allele-Specific Methylation at Regions Implicated in Age-Related Macular Degeneration
Chi-square analysis of individual MPRs between individuals classified as cases (13 individuals, CARMS grades 3,4, and 5) [17] and controls (23 individuals, CARMS 1 and 2) (Supplemental Table 1) did not yield any significant results after correction for multiple hypothesis testing (Supplemental Figure 2, Bonferroni adjusted p>0.05, Supplemental Table 3). As ASM changes can correlate across larger genomic regions than assayed by a single MPR [14] we also examined the grouped ASM status of all MPRs in the 1Mb windows around each of 12 genetic variants previously implicated in AMD:
rs1061170, or CFHY402H, affects exon 9 of the CFH gene on chromosome 1q32, resulting in a substitution of histidine for tyrosine at codon 402 of the CFH protein[32–34],
rs1410996 an independently associated SNP variant within intron 14 of CFH [35],
rs10033900, an independently associated SNP located in the linkage peak region of chromosome 4, 2781 base pairs downstream of the 3’ untranslated region of CFI [36],
rs9332739, or C2 E318D, a non-synonymous coding SNP variant in exon 7 of C2 resulting in a substitution of aspartic acid for glutamic acid at codon 318 [35,37],
rs641153 or CFB R32Q, a non-synonymous coding SNP variant in exon 2 of CFB resulting in the substitution of the amino acid glutamine for arginine at codon 32 [35,36],
rs4711751, a noncoding variant downstream of VEGFA[38],
rs1999930, a noncoding variant nearby FRK/COL10A1 [38],
rs10490924, a non-synonymous coding SNP variant in exon 1 of ARMS2 on chromosome 10 resulting in a substitution of the amino acid serine for alanine at codon 69 [39,40],
rs10468017 a promoter variant of the Hepatic Lipase C gene LIPC on chromosome 15q22 [21]
rs3764261, a noncoding variant ~2.5kb upstream of CETP on chromosome 16 [21,38]
rs2230199, or C3 R102G, the non-synonymous coding SNP variant in exon 3 of C3 resulting in the substitution of the amino acid glycine for arginine at codon 102 [41,42] and
While the majority of windows (8/12) showed no significant association of ASM occurrence with either case or control status, two largely overlapping windows centered on the CFH locus (rs1061170 and rs1410996) had significant differences (Bonferroni adjusted p-values<0.05, at 0.0072 and 0.0336 respectively). Two overlapping windows centered on SNPs within the C2 and CFB loci (rs9332739 and rs641153) also showed significant differences (Bonferroni adjusted p-values <0.1, at 0.0972 and 0.0912, respectively) in levels of allele-specific methylation between cases and controls. To account for any correlation structure within our samples which might confound these results, we compared them to results from identically structured tests of 10,000 randomly drawn windows of equal base pair size and equal or higher MPR counts; the AMD SNP containing windows represented genomic regions with some of the highest differences in allele-specific methylation levels between cases and controls (Table 2 and Figure 1). Taken together, these results and the large overlap of the two windows within each genomic region indicate the presence of two genomic regions with differential levels of ASM in patients with differential AMD status.
Table 2 -. Analyses of allele-specific methylation in cases and controls in regions surrounding known AMD variants.
For the two window sizes shown, 500kb and 1Mb centered on the AMD associated variant, allele-specific methylation counts were summed for all MPRs within the window in both cases and controls and Fisher’s exact tests performed. These values were compared to values derived from 10,000 identically sized, randomly located windows drawn from the same data. One region surrounding two variants (rs1061170 and rs1410996 located ~38kb apart on chromosome 1) showed more differences between cases and controls than 99% of randomly drawn windows of 1Mb in size and more than 95% of randomly drawn windows of 500kb in size. Another region surrounding two variants (rs9332739 and rs6411536 located ~10kb apart on chromosome 6) shows more significant differences between cases and controls than more than 99% of randomly drawn windows of 500kb in size and almost 95% of randomly drawn windows of 1Mb in size.
SNP | rs1061170 | rs1410996 | rs10033900 | rs9332739 | rs641153 | rs4711751 | rs1999930 | rs10490924 | rs10468017 | rs3764261 | |
---|---|---|---|---|---|---|---|---|---|---|---|
Proximal Gene | CFH | CFH | CFI | C2 | CFB | VEGFA | FRK/C0L10A1 | ARMS2/HTRA1 | LIPC | CETP | |
Chromosome | 1 | 1 | 4 | 6 | 6 | 6 | 6 | 10 | 15 | 16 | |
Position | 194925860 | 194963556 | 110878516 | 32011783 | 32022159 | 43936560 | 116493827 | 124204438 | 56465804 | 55550825 | |
5e5 bp Window Size | Case MPRs | 37 | 37 | 127 | 128 | 137 | 213 | 133 | 188 | 391 | 181 |
Case ASM MPRs | 4 | 4 | 5 | 3 | 3 | 17 | 12 | 14 | 27 | 18 | |
Control MPRs | 59 | 60 | 149 | 158 | 166 | 266 | 140 | 263 | 496 | 274 | |
Control ASM MPRs | 0 | 0 | 16 | 16 | 16 | 15 | 11 | 25 | 32 | 26 | |
Pvalue | 0.0199 | 0.0191 | 0.0404 | 0.0085 | 0.0082 | 0.3586 | 0.8286 | 0.4994 | 0.7879 | 0.8726 | |
Percentile | 2.02 | 1.97 | 4.07 | 0.81 * | 0.76 * | 32.15 | 75.00 | 45.23 | 71.50 | 80.09 | |
1e6 bp Window Size | Case MPRs | 84 | 84 | 350 | 346 | 350 | 331 | 333 | 382 | 561 | 389 |
Case ASM MPRs | 11 | 9 | 28 | 16 | 16 | 23 | 31 | 45 | 35 | 30 | |
Control MPRs | 145 | 144 | 430 | 425 | 439 | 431 | 460 | 567 | 664 | 511 | |
Control ASM MPRs | 2 | 2 | 39 | 35 | 36 | 20 | 33 | 64 | 41 | 37 | |
Pvalue | 0.0004 | 0.0025 | 0.6103 | 0.0574 | 0.0438 | 0.2052 | 0.2924 | 0.8359 | 1.0000 | 0.7989 | |
Percentile | 0.06 ** | 0.28 ** | 57.83 | 5.95 | 4.58 | 20.24 | 28.83 | 79.34 | 99.98 | 75.64 |
significant at p<0.1 after multiple testing adjustment
significant at p<0.05 after multiple testing adjustment
Figure 1 -. Allele-specific methylation differences between cases and controls in regions surrounding genetic variants with known AMD associations.
Distributions of p-values from Fisher’s-exact tests of allele-specific methylation counts in cases and controls as derived from randomly drawn MPR containing windows of 500kb (A) and 1Mb (B) are shown. Vertical red lines indicate the positions of p-values derived from windows surrounding known AMD variants found to be within the 5th percentile of the distribution. The respective variant identities and percentiles are noted in text to the left of the gray lines.
Monozygotic Twins Exhibit Different Allele-Specific Methylation Patterns
We examined the overall level of disagreement between twins for allele-specific methylation. There are two types of methylation scenarios we observed: 1) allelic differences, where both twins showed allele-specific methylation at an MPR, but for different alleles and 2) ASM occurrence differences, where one twin exhibits allele-specific methylation at an MPR and the remaining twin exhibits biallelic methylation (Figure 2). We observed very few cases of the first scenario, less than one would expect were allele preference random (Table 3). This supports our prior work showing the prevalence of SNP influenced allele-specific methylation across the genome [11]. The second scenario is more prevalent; while most ASM events are present in both twins, we did observe instances where, for up to 5.2% of MPRs, one twin exhibits ASM and the other does not. To account for the possibility of these instances resulting from technical variation, we leveraged the two technical replicate arrays, comparing the level of disagreement found between all replicates between twins (four comparisons) to those found when comparing replicates from the same twin (two comparisons). We observed that for the majority of whole blood samples examined, biological ASM differences exceed technical noise, i.e. the percentage of ASM disagreements between twins at these MPRs (median 2.5% of MPRs) exceeds the percentage of ASM disagreements observed when comparing replicates from an individual twin (median 2.05% of MPRs) (Figure 3). Thus, while most of the disagreements are due to technical noise, some of the differences reflect stable differences between the twins or alternatively could reflect differences we would have observed if we had compared samples taken at different time points from the same individual.
Figure 2 -. Differential ASM scenarios in monozygotic twins.
Potential modes of differing ASM calls between monozygotic twins are shown. In the first model (A), the identity of the allele associated with methylation distinguishes the twins, Twin-I or Twin-II exhibits methylation associated with the A-allele (highlighted in red) and the remaining twin exhibits methylation associated with the B-allele (highlighted in blue). Alternatively, in a second model (B), the differential occurrence of ASM distinguishes the twins, and Twin-I or Twin-II shows allele specific methylation (associated with either one of the A- or B-alleles), and the remaining twin exhibits biallelic methylation.
Table 3 -. Analyses of allele-specific methylation allele preference agreement between twins.
Results are shown for ASM allele preferences for informative MPRs (heterozygous genotype, above intensity cutoff) in Twin-I and Twin-II of a twin pair; the number of ASM events associated either allele A (A-ASM) or allele B (B-ASM) for each twin are shown (the remaining informative MPRs show biallelic methylation in at least one twin). Observed numbers of matching (Agree) or opposite allele preference (Oppose) between twins are shown as compared to the null hypothesis of random preference by Fisher’s Exact tests.
Informative Events | Twin-I | Twin-II | Expected | Observed | Pvalue (fisher’s exact) | |||||
---|---|---|---|---|---|---|---|---|---|---|
Twin pair | A-ASM | B-ASM | A-ASM | B-ASM | Agree | Oppose | Agree | Oppose | ||
T04 | 53004 | 301 | 308 | 617 | 486 | 6 | 6 | 132 | 5 | 2.58E-05 |
T12 | 53386 | 536 | 593 | 585 | 569 | 12 | 12 | 644 | 0 | 1.81E-19 |
T14 | 53367 | 273 | 283 | 341 | 386 | 4 | 4 | 268 | 4 | 1.87E-05 |
T16 | 52897 | 1091 | 1046 | 842 | 687 | 31 | 31 | 952 | 22 | 4.74E-28 |
T20 | 51969 | 1278 | 1345 | 482 | 427 | 23 | 23 | 270 | 32 | 2.77E-09 |
T26 | 51923 | 430 | 502 | 925 | 1032 | 18 | 17 | 438 | 0 | 7.27E-22 |
T27 | 53600 | 341 | 336 | 444 | 407 | 5 | 5 | 274 | 4 | 1.88E-06 |
T29 | 53116 | 1548 | 1415 | 422 | 423 | 24 | 24 | 351 | 30 | 5.80E-12 |
T30 | 52707 | 661 | 657 | 804 | 769 | 20 | 20 | 695 | 27 | 4.73E-16 |
T38 | 51995 | 644 | 664 | 574 | 546 | 14 | 14 | 782 | 0 | 7.48E-23 |
T39 | 52639 | 705 | 785 | 384 | 359 | 10 | 11 | 190 | 2 | 3.18E-11 |
Figure 3 -. ASM patterns between monozygotic twins.
Each circle represents the percentage of regions found to disagree for allele-specific methylation between the two sample comparisons. These results are derived from either technical replicates drawn from an individual twin (orange and red circles for Twin-I and Twin-II respectively) or comparisons between the replicates from each of the different twins (four blue circles). Results are shown for an approach in which ASM calls were more extreme than a Z-score cutoff excluding 99% of MNRs (1% false positive rate) and non-ASM calls less extreme than 95% of MNRs (i.e. included within 95% of the MNR distribution). In this approach, SNPs with Z-score values more extreme than the 99% Z-score cutoff in one twin (i.e. ASM) and more than 95% but less than 99% in the other twin (i.e. neither ASM or non-ASM) were excluded as indeterminate. Biological differences between twins that cannot be explained by technical noise are consistent with the blue circles having greater values than the red and orange circles. The first five twin pairs are concordant controls, the next five are concordant cases and the final two twin pairs are discordant twins.
Allele-Specific Methylation Differs in Twins with Discordant AMD Status at Regions Proximal to Genes Implicated in Gliosis
We extended these results to search for genomic regions with consistent allele-specific methylation differences between discordant twins. For these twins, we did not observe any instances where both twins showed allele-specific methylation at an MPR, but for different alleles (Figure 2A) (for instance, regions where, for both discordant twin pairs, the cases exhibit methylation associated with the A- allele and the controls exhibited methylation associated with the B-allele). Instead we observed the differential occurrence of allele-specific methylation between the discordant twins (Figure 2B), where at 42 MPRs, for both discordant twin pairs, one twin shows biallelic methylation and the other had allele-specific methylation (for instance, regions where for both discordant twin pairs, the cases showed allele-specific methylation (either ASM-A or ASM-B) and the controls exhibited biallelic methylation) (Table 4). To determine how likely we were to see this number of discordant ASM MPRs at random, we performed a permutation test of the data set, shuffling the discordant and concordant assignments of the twin pairs; 2217 of the 10,000 of the permuted datasets resulted in at least this number of methylation differences between the “discordant” twin pairs to yield a p-value of 0.2217. To determine if any biological functions might be associated with these particular MPRs we performed a gene ontology analysis of the genes closest to these 42 MPRs (as determined by the Affymetrix SNP6.0 hg19 annotations). Our results showed this list of genes to be enriched for genes involved in the process of gliogenesis. Specifically, 36 of the 42 genes had a gene ontology annotation, and 4 of these 36 genes were contained within the gliogenesis annotation, yielding an adjusted p-value of 0.0045 (via the g:SCS algorithm, a multiple test corrected p-value which accounts for the hierarchical structure within gene ontology terms [44]).
Table 4 -. SNPs discordant for allele-specific methylation incidence in monozygotic twins discordant for AMD.
Results are shown for an approach in which ASM calls were based on Z-score cutoffs excluding 99% of MNRs (1% false positive rate) and any non-ASM calls less extreme than 95% of MNRs (i.e. included within 95% of the MNR distribution). In this approach, regions with Z-score values more extreme than a Z-score cutoff in one twin (i.e. ASM) and more than 95% but less than 99% in the other twin (i.e. neither ASM or non-ASM) were excluded as indeterminate. The 42 MPRs shown exhibit consistent ASM incidence discordance in twin pairs discordant for AMD, where the ASM call for the twins within the twin pair are separated by a dash. A “BI” indicates biallelic methylation, an “A”: indicates allele-specific methylation of the A allele and a “B” indicates allele-specific methylation of the B allele. Thus, “BI – BI” denotes biallelic methylation in both twins, “A – A” denotes allele-specific methylation of the A allele in both twins and “B – B” denotes allele-specific methylation of the B allele in both twins. The code before the dash indicates the ASM status of the unaffected twin (control), and the code after the dash, that of the affected twin (case). Affymetrix annotations for genomic location and relationship to the closest gene are also shown for each of these variants.
Discordant Twin Status | SNP | Closest Gene | ||||||
---|---|---|---|---|---|---|---|---|
ProbeID | T30 | T29 | ID | Chr | Position | Relation to Gene | Distance from TSS (if not genic) | Symbol |
SNP_A-2083845 | BI-B | BI-B | rs13411523 | 2 | 114454164 | downstream | 21527 | ACTR3 |
SNP_A-2021856 | A-BI | A-BI | rs11679258 | 2 | 241441734 | upstream | 15100 | AGXT |
SNP_A-2058335 | B-BI | B-BI | rs1993329 | 6 | 68992781 | upstream | 409571 | BAI3 |
SNP_A-2045223 | B-BI | B-BI | rs12323921 | 14 | 98427236 | downstream | 278140 | BCL11B |
SNP_A-8484764 | BI-B | BI-B | rs6140071 | 20 | 6654493 | upstream | 42251 | BMP2 |
SNP_A-2210596 | B-BI | B-BI | rs7909808 | 10 | 78062324 | downstream | 75194 | C10orf11 |
SNP_A-8559771 | BI-A | BI-A | rs9384701 | 6 | 109664235 | downstream | 72429 | C6orf182 |
SNP_A-8717023 | A-BI | A-BI | rs8063808 | 16 | 47795251 | downstream | 74460 | CBLN1 |
SNP_A-2133676 | A-BI | A-BI | rs10470437 | 3 | 133265423 | upstream | 28889 | CPNE4 |
SNP_A-8573108 | B-BI | B-BI | rs2693022 | 2 | 15842155 | downstream | 153479 | DDX1 |
SNP_A-1896123 | A-BI | A-BI | rs10930538 | 2 | 172813725 | upstream | 138001 | DLX2 |
SNP_A-1793639 | B-BI | B-BI | rs12691430 | 5 | 107116609 | upstream | 82114 | EFNA5 |
SNP_A-1896576 | A-BI | A-BI | rs11865876 | 16 | 13787369 | upstream | 134145 | ERCC4 |
SNP_A-1888329 | B-BI | B-BI | rs10988682 | 9 | 132099642 | downstream | 60238 | FREQ |
SNP_A-4282113 | A-BI | A-BI | rs9309883 | 3 | 82097118 | upstream | 203478 | GBE1 |
SNP_A-8392053 | A-BI | A-BI | rs2017257 | 17 | 40352675 | upstream | 4235 | GFAP |
SNP_A-4284358 | B-BI | B-BI | rs12478085 | 2 | 206766767 | intron | GPR1 | |
SNP_A-8321092 | B-BI | B-BI | rs8076416 | 17 | 74822048 | intron | hCG_1776007 | |
SNP_A-1867638 | A-BI | A-BI | rs2461056 | 8 | 80836175 | downstream | 2625 | HEY1 |
SNP_A-8425957 | A-BI | A-BI | rs11207325 | 1 | 59136207 | upstream | 113834 | JUN |
SNP_A-2246766 | BI-B | BI-B | rs6985117 | 8 | 111591041 | upstream | 534906 | KCNV1 |
SNP_A-8529083 | A-BI | A-BI | rs10446676 | 4 | 36562606 | upstream | 360478 | KIAA1239 |
SNP_A-4196589 | B-BI | B-BI | rs10457178 | 6 | 108948401 | intron | LACE1 | |
SNP_A-8564552 | A-BI | A-BI | rs130993 | 22 | 32897763 | upstream | 251347 | LARGE |
SNP_A-1855257 | B-BI | B-BI | rs962885 | 17 | 41291420 | intron | LOC10012897’ | |
SNP_A-2274687 | A-BI | A-BI | rs4142502 | 6 | 143917664 | exon | LOC285740 | |
SNP_A-2162548 | A-BI | A-BI | rs12361464 | 11 | 41476061 | upstream | 1203821 | LRRC4C |
SNP_A-8498579 | BI-A | BI-A | rs4715127 | 6 | 49396584 | downstream | 110366 | MUT |
SNP_A-8338412 | BI-A | BI-A | rs8134891 | 21 | 28888157 | downstream | 53610 | NCRNA00161 |
SNP_A-2001046 | A-BI | A-BI | rs7917091 | 10 | 55684541 | intron | PCDH15 | |
SNP_A-8682352 | B-BI | B-BI | rs11949422 | 5 | 59292928 | upstream | 67550 | PDE4D |
SNP_A-1909129 | A-BI | A-BI | rs532095 | 13 | 74708951 | downstream | 940 | RP11–159J2.1 |
SNP_A-4276454 | A-BI | A-BI | rs941655 | 14 | 94393085 | upstream | 55496 | RPL15 |
SNP_A-4298178 | B-BI | B-BI | rs1200470 | 13 | 31010348 | upstream | 201330 | RXFP2 |
SNP_A-8634510 | A-BI | A-BI | rs941597 | 14 | 93883114 | upstream | 23673 | SERPINA6 |
SNP_A-8698392 | A-BI | A-BI | rs7824948 | 8 | 14987154 | intron | SGCZ | |
SNP_A-8607148 | A-BI | A-BI | rs1731847 | 7 | 155348283 | upstream | 50555 | SHH |
SNP_A-8434018 | A-BI | A-BI | rs13179872 | 5 | 128177775 | upstream | 151333 | SLC27A6 |
SNP_A-4280941 | BI-B | BI-B | rs454693 | 5 | 87579969 | intron | TMEM161B | |
SNP_A-2065284 | A-BI | A-BI | rs955336 | 1 | 100881570 | upstream | 76314 | VCAM1 |
SNP_A-1788455 | A-BI | A-BI | rs16963005 | 19 | 34756795 | downstream | 9729 | VSTM2B |
SNP A-1953456 | A-BI | A-BI | rs4796421 | 17 | 7320374 | intron | ZBTB4 |
DISCUSSION
This work represents one of the first genome-wide analysis of differential ASM and DNA methylation within several MZ twin pairs participating in an AMD study. This work also represents the first evaluation of CNVs in monozygotic twin pairs with concordant and discordant AMD phenotypes. In addition to identifying smaller (~100bp to 1.1kb) differentially methylated genomic regions in monozygotic twins discordant for AMD we also demonstrate that larger genomic regions (0.5 to 1 Mb) containing known AMD-associated variants are differentially methylated according to AMD status. Our results extend our previous AMD studies with this twin cohort and while there is no incontrovertible evidence of an epigenetic signature for AMD, our results do suggest candidate epigenetic targets underlying the discordant phenotypes observed in twins [3].
There are few previous studies examining CNV differences between MZ twins. While Bruder et al. [5] claimed they could see many somatic differences between subjects, we took a more conservative approach to setting thresholds for calling these differential CNV events. Upon deeper examination of the underlying data we found that frequently, when one twin reached the calling threshold, the other twin showed sub-threshold levels of signal. These results indicate that what differences in CNV deletions we do observe between twins for CNV deletions are likely driven by small differences in signal between twins, and may not represent true CNV differences. In support of our conservative approach and the resultant low number of MZ twin pair CNV differences, somatic mutations have historically been estimated to occur once every 100 million base pairs, yielding only about 30 events per individual across the genome. While it is difficult to determine when these events occur, we would expect any of mutations occurring very early during embryogenesis prior to the “twinning” event to be shared between monozygotic twins. Therefore, the expectation, which is fulfilled by our results, is for relatively few of these events to be seen between MZ twins.
Previous attempts to associate methylation changes with phenotypic traits in discordant twins have demonstrated differences between MZ twins within a pair but have largely been limited in genomic scope. Multiple studies targeting CpG islands have shown suggestive results; Javierre et al. [45] observed widespread changes in the DNA methylation status of a significant number of CpG-containing promoters of genes associated with immune function in 5 monozygotic twin pairs discordant for systemic lupus erythematosus. Using the Illumina 27K platform (which largely targets CpG islands of cancer-associated genes), Dempster et al. [46] found significant enrichment of epigenetic changes in biological pathways relevant to psychiatric disorder and neurodevelopment in 22 twin pairs discordant for major psychosis while Wang et al. [47] observed methylation differences in two genes between 7 twin pairs discordant for obesity. In contrast, a study of three monozygotic twins discordant for multiple sclerosis examined genome-wide DNA methylation using next-generation bisulfite sequencing did not find any methylation changes associated with the phenotype [48]. Far fewer studies have examined the association of allele-specific methylation changes with phenotypic traits and none have done so genome-wide. Stepanow et al. [49] showed allele-specific methylation of MCHR1 to be associated with body mass index. In cancer progression, Kang et al. report an association of p14ARF (CDKN2A) polymorphisms with the likelihood of methylation of this gene within colorectal cancers [50], and Boumber et al. found an indel polymorphism in PDLIM4 which influences the methylation of this gene in leukemia and colon cancer [51]. A study by Wei et al. [52] examined methylation patterns in twins discordant for AMD, assaying methylation levels of promoter regions genes in PBMCs using MeDip-Chip and NimbleGen Human DNA Methylation 2.1M Deluxe Promoter Arrays. In contrast to our results, they report consistent differential methylation patterns upstream of IL17RC in 3 twin pairs discordant for AMD. Though other groups have failed to replicate these results [53], differences between our reported results are due to differing definitions of differential methylation. While Wei et al. define differential methylation as a binary difference in overall methylation between discordant twins (i.e. 0% methylation in one twin versus >0% methylation in the other), we searched for changes in allele-specific methylation patterns which present themselves as changes from 100% methylation (biallelic methylation) to 50% methylation (ASM). As such, Wei et al. would not have detected our loci. Furthermore, as we screen out loci with overall low intensity levels, we do not examine the unmethylated loci described by Wei et al.
Given our focus on rare deletion events, it is perhaps not surprising that we found only 8 CNV differences between MZ twin pairs. Even though we were conservative in calling differential CNV events, we did find several instances of unique CNV events in one twin of a pair. However, due to the rarity of these events and the relatively large amount of the variance already explained for this disease, we found no statistically significant result showing any of these CNVs to be associated with AMD.
In contrast, our analysis of differential methylation revealed some methylation events associated with AMD. In particular, analyses of large genomic regions from 0.5–1Mb in size near complement factor genes uncovered evidence of differential allele-specific methylation in these regions between cases and controls. We also found smaller genomic regions with ASM differences between MZ twins discordant for AMD near genes involved in gliogenesis. Taken together, these results further implicate the role of inflammation and glial cell proliferation in AMD progression.
A recent study by Hunter et al. [54] used the Illumina 27K platform to assess methylation in a case-control study of postmortem retina pigment epithelium (RPE)/choroid samples. In contrast to the results we report here, they do not report methylation changes upstream of either CFH, or CFB. However, we report results for large regions (0.5–1MB) composed of multiple MPRs that only partially overlap CpGs assayed by both studies. As such, it is possible that the signal we observe is a) driven by CpGs not located within the targets assayed by Hunter et al. or b) our signal represents an aggregate of signals from multiple amplicons that would not be detected as individual results from CpGs. This latter hypothesis is supported by our failure to find significant differences in ASM in individual MPRs (Supplemental Table 3).
It is possible that the methylation changes we observe in whole blood directly influence AMD progression through this cell population’s known effects on both the inflammatory and complement responses. Given previous observations of the lack of tissue specificity of ASM [11] it is possible that these methylation differences reflect a broader methylation status within multiple tissues that only manifests itself in combination with other factors within a particular susceptible cell type, such as the retinal pigment epithelium. However, given the demonstrated existence of tissue specific allele-specific methylation [13,55], it is possible the observed methylation changes are the result of differences in the proportions of cell types in the samples examined. While this is unlikely to be the case in our case-control analyses, which involve multiple independent samples, the differences observed between discordant twins could be the result of different cell type proportions within the twin blood samples. Lacking prior data on tissue representation, we resorted to examining whether any of the loci we identify in twin-pairs with discordant ASM also show tissue specific methylation. None of the 100 tissue-specific loci identified by Houseman et al. [56] or of the 679 tissue specific loci identified by Glossup et al. [57] are located within the amplicons identified in our discordant twin study. Of course, this does not eliminate the possibility that these discordant MPRs are as yet unidentified tissue specific loci, and our results should be interpreted with this in mind.
One smaller region of interest that was identified by our analysis of the discordant MZ twins that of the MPR encompassing rs2017257, located downstream of kinesin family member 18B and ~4.2kb upstream of the glial fibrillary acidic protein gene (GFAP). GFAP is an intermediate filament protein expressed in numerous cell types of the central nervous system and is a primary marker of astrocytes [58]. Gliosis, an inflammatory injury response involving astrocyte proliferation and glial scarring during wound healing, is associated with increased levels of GFAP expression [59]. Importantly, both gliosis and increased GFAP levels have been reported to be associated with AMD [58,60–62]. In this context, the discordant ASM signal observed in the region encompassing rs11949422 is also interesting. This region overlaps some isoforms of the gene for cAMP-specific phosphodiesterase4D (PDE4D). Rolipram, an inhibitor of phosphodiesterase 4, can attenuate gliosis [63] indicating a potential role for this gene in gliosis and AMD progression. While the exact molecular effect of the differential methylation event near GFAP awaits further research, ENCODE data do identify the MPR’s genomic coordinates as a strong CTCF based insulator in multiple cell types [64]; DNA methylation of a CTCF binding site can disrupt its insulator function [65]. It is possible that the biallelic methylation observed at this MPR in discordant twin cases results in lower levels of CTCF binding as compared to that of the discordant twin controls, which have allele-specific methylation at this MPR. This in turn could result in decreased CTCF insulator function and increased expression of GFAP within the discordant case twins.
Findings of a genetic influence on differential methylation raise the possibility that ASM differences observed between cases and controls at larger genomic regions near known AMD variants originate from these variants themselves. In this respect, we note that the rs1410996 variant near CFH is noncoding [35], suggesting its potential regulatory role in AMD progression. The risk allele frequencies of rs1410996 in cases (73.1%) and controls (50%) vary significantly (p<0.05), raising the possibility of genetic influence on the observed ASM differences. In contrast, the risk allele frequencies for AMD variants near C2 and CFB (rs9332739 and rs641153) are essentially equal between cases (92.3% and 100% respectively) and controls (100% for both), suggesting a non-genetic origin for the observed ASM differences between cases and controls.
While the ASM differences observed in genetically identical MZ twins discordant for AMD do not appear to be the result of genetic differences, it is difficult to differentiate between the observed ASM changes as either a cause or effect of AMD (much like with expression changes). In this respect, as noted above, detection of these methylation differences in whole blood derived DNA could be interpreted as an organism-wide epigenetic response to an external stimulus that manifests within a susceptible cell type. Attempting to examine this possibility, we investigated whether the presence of external stimuli in our patients, such as smoking history and body mass index (BMI), known risk factors for AMD [2–4] correlated with twin discordance and case/control differences in outcome and ASM patterns/levels. We were, however, unable to detect any statistically significant patterns or changes in ASM that correlated with these external events within our small sample set (data not shown).
This study has the advantages of a well-characterized population examined according to the rigorous standard protocols with longitudinal follow-up to obtain the most recent AMD grade. Although we took a very conservative approach towards CNV detection and calling, we did see unique events in one MZ twin and it remains possible that these early somatic mutations could give rise to some of the phenotypic discordance observed in MZ twins. Although our results do not explain AMD discordance between MZ twins, we have developed a foundation on which the field can build its knowledge of somatic structural variation in genetically identical subjects. Further improvements upon previous studies include its genome-wide, unbiased examination of allele-specific methylation through the use of a platform specifically designed for differential allele detection. We observed many suggestive results, particularly those involving genomic regions proximal to loci involved in the inflammatory and wound healing responses. Although this is the largest twin study of AMD done for these types of analyses, the sample size was small and some associations may have been missed.
CONCLUSIONS
In summary, we examined the role of copy-number variation and differential methylation in AMD in a genetically constrained cohort of monozygotic twins and identified differentially methylated regions in discordant twins with biologically suggestive roles. Our results provide no evidence that epigenetics play a central role in AMD, but do provide some evidence of epigenetic influences beyond the known genetic susceptibility and further implicate inflammatory responses and gliosis in the etiology of AMD. Larger studies of carefully selected twins or singletons targeted to these genomic regions may be an effective way of further assessing the effect of epigenetic modifications at these locations in AMD. Given the malleable nature of epigenetic changes, these results may eventually provide therapeutic targets for this important and increasingly prevalent disease.
Supplementary Material
ACKNOWLEDGEMENTS
We thank Yi Yu and Lillian Merriam for constructive criticism of this manuscript. This work was supported in part by Grants R01-EY11309 from the National Institutes of Health, Bethesda, MD; Massachusetts Lions Eye Research Fund, Inc.; Unrestricted grant from Research to Prevent Blindness, Inc., New York, NY; the American Macular Degeneration Foundation, Northampton, MA; donations to the Macular Degeneration Research Fund of the Ophthalmic Epidemiology and Genetics Service, New England Eye Center, Tufts Medical Center, Tufts University School of Medicine, Boston, MA. All authors read and approved the final manuscript.
LIST OF ABBREVIATIONS USED
- ASM
allele-specific methylation
- CNV
copy number variation
- AMD
age-related macular degeneration
- MZ
monozygotic
- MSRE
methyl sensitive restriction enzyme
- MPRs
MSRE Positive Regions, Affymetrix 6.0 SNP array amplicon target regions with at least one MSRE site
- MNRs
MSRE Negative Regions, Affymetrix 6.0 SNP array amplicon target regions without MSRE sites
- LBL
lymphoblast line
- SNP
single nucleotide polymorphism
Footnotes
COMPETING INTERESTS
The author(s) declare that they have no competing interests.
REFERENCES
- 1.Seddon JM, Cote J, Page WF, Aggen SH, Neale MC: The US Twin Study of Age-Related Macular Degeneration: Relative Roles of Genetic and Environmental Influences. Arch Ophthalmol 2005. March 1;123:321–327. [DOI] [PubMed] [Google Scholar]
- 2.Seddon JM, George S, Rosner B: Cigarette smoking, fish consumption, omega-3 fatty acid intake, and associations with age-related macular degeneration: the US Twin Study of Age-Related Macular Degeneration. Arch Ophthalmol 2006. July;124:995–1001. [DOI] [PubMed] [Google Scholar]
- 3.Seddon JM, Reynolds R, Shah HR, Rosner B: Smoking, dietary betaine, methionine, and vitamin D in monozygotic twins with discordant macular degeneration: epigenetic implications. Ophthalmology 2011. July;118:1386–1394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Seddon JM, Reynolds R, Yu Y, Daly MJ, Rosner B: Risk models for progression to advanced age-related macular degeneration using demographic, environmental, genetic, and ocular factors. Ophthalmology 2011. November;118:2203–2211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Bruder CEG, Piotrowski A, Gijsbers AACJ, Andersson R, Erickson S, Diaz de Ståhl T, et al. : Phenotypically concordant and discordant monozygotic twins display different DNA copy-number-variation profiles. Am J Hum Genet 2008. March;82:763–771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Dolinoy DC, Weidman JR, Jirtle RL: Epigenetic gene regulation: linking early developmental environment to adult disease. Reprod Toxicol 2007. April;23:297–307. [DOI] [PubMed] [Google Scholar]
- 7.Jaenisch R, Bird A: Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet 2003. March;33 Suppl:245–254. [DOI] [PubMed] [Google Scholar]
- 8.Fraga MF, Ballestar E, Paz MF, Ropero S, Setien F, Ballestar ML, et al. : Epigenetic differences arise during the lifetime of monozygotic twins. Proc Natl Acad Sci USA 2005. July 26;102:10604–10609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Boks MP, Derks EM, Weisenberger DJ, Strengman E, Janson E, Sommer IE, et al. : The relationship of DNA methylation with age, gender and genotype in twins and healthy controls. PLoS ONE 2009;4:e6767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Hellman A, Chess A: Gene body-specific methylation on the active X chromosome. Science 2007. February 23;315:1141–1143. [DOI] [PubMed] [Google Scholar]
- 11.Hellman A, Chess A: Extensive sequence-influenced DNA methylation polymorphism in the human genome. Epigenetics & Chromatin 2010;3:11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kerkel K, Spadola A, Yuan E, Kosek J, Jiang L, Hod E, et al. : Genomic surveys by methylation-sensitive SNP analysis identify sequence-dependent allele-specific DNA methylation. Nat Genet 2008. July;40:904–908. [DOI] [PubMed] [Google Scholar]
- 13.Schalkwyk LC, Meaburn EL, Smith R, Dempster EL, Jeffries AR, Davies MN, et al. : Allelic skewing of DNA methylation is widespread across the genome. Am J Hum Genet 2010. February 12;86:196–212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Gertz J, Varley KE, Reddy TE, Bowling KM, Pauli F, Parker SL, et al. : Analysis of DNA methylation in a three-generation family reveals widespread genetic influence on epigenetic regulation. PLoS Genet 2011. August;7:e1002228. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zhang Y, Rohde C, Tierling S, Jurkowski TP, Bock C, Santacruz D, et al. : DNA methylation analysis of chromosome 21 gene promoters at single base pair and single allele resolution. PLoS Genet 2009. March;5:e1000438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Jablon S, Neel JV, Gershowitz H, Atkinson GF: The NAS-NRC twin panel: methods of construction of the panel, zygosity diagnosis, and proposed use. Am J Hum Genet 1967. March;19:133–161. [PMC free article] [PubMed] [Google Scholar]
- 17.Seddon JM, Sharma S, Adelman RA: Evaluation of the clinical age-related maculopathy staging system. Ophthalmology 2006. February;113:260–266. [DOI] [PubMed] [Google Scholar]
- 18.Klein R, Davis MD, Magli YL, Segal P, Klein BE, Hubbard L: The Wisconsin age-related maculopathy grading system. Ophthalmology 1991. July;98:1128–1134. [DOI] [PubMed] [Google Scholar]
- 19.Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. : PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007. September;81:559–575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Korn JM, Kuruvilla FG, McCarroll SA, Wysoker A, Nemesh J, Cawley S, et al. : Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs. Nat Genet 2008. October;40:1253–1260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Neale BM, Fagerness J, Reynolds R, Sobrin L, Parker M, Raychaudhuri S, et al. : Genome-wide association study of advanced age-related macular degeneration identifies a role of the hepatic lipase gene (LIPC). Proc Natl Acad Sci USA 2010. April 20;107:7395–7400. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Iafrate AJ, Feuk L, Rivera MN, Listewnik ML, Donahoe PK, Qi Y, et al. : Detection of large-scale variation in the human genome. Nat Genet 2004. September;36:949–951. [DOI] [PubMed] [Google Scholar]
- 23.International Schizophrenia Consortium: Rare chromosomal deletions and duplications increase risk of schizophrenia. Nature 2008. September 11;455:237–241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Åberg K, Khachane AN, Rudolf G, Nerella S, Fugman DA, Tischfield JA, et al. : Methylome-wide comparison of human genomic DNA extracted from whole blood and from EBV-transformed lymphocyte cell lines. Eur J Hum Genet 2012. September;20:953–955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Database of Single Nucleotide Polymorphisms (dbSNP). Bethesda (MD): National Center for Biotechnology Information, National Library of Medicine; (dbSNP Build ID:129). [Google Scholar]
- 26.Li L-C, Dahiya R: MethPrimer: designing primers for methylation PCRs. Bioinformatics 2002. November;18:1427–1431. [DOI] [PubMed] [Google Scholar]
- 27.Arányi T, Váradi A, Simon I, Tusnády GE: The BiSearch web server. BMC Bioinformatics 2006;7:431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Krueger F, Andrews SR: Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 2011. June 1;27:1571–1572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. : The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009. August 15;25:2078–2079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Rachmilewitz J, Goshen R, Ariel I, Schneider T, de Groot N, Hochberg A: Parental imprinting of the human H19 gene. FEBS Lett 1992. August 31;309:25–28. [DOI] [PubMed] [Google Scholar]
- 31.Williams CA, Zori RT, Stone JW, Gray BA, Cantu ES, Ostrer H: Maternal origin of 15q11–13 deletions in Angelman syndrome suggests a role for genomic imprinting. Am J Med Genet 1990. March;35:350–353. [DOI] [PubMed] [Google Scholar]
- 32.Narayanan R, Butani V, Boyer DS, Atilano SR, Resende GP, Kim DS, et al. : Complement factor H polymorphism in age-related macular degeneration. Ophthalmology 2007. July;114:1327–1331. [DOI] [PubMed] [Google Scholar]
- 33.Edwards AO, Ritter R, Abel KJ, Manning A, Panhuysen C, Farrer LA: Complement factor H polymorphism and age-related macular degeneration. Science 2005. April 15;308:421–424. [DOI] [PubMed] [Google Scholar]
- 34.Haines JL, Hauser MA, Schmidt S, Scott WK, Olson LM, Gallins P, et al. : Complement factor H variant increases the risk of age-related macular degeneration. Science 2005. April 15;308:419–421. [DOI] [PubMed] [Google Scholar]
- 35.Maller J, George S, Purcell S, Fagerness J, Altshuler D, Daly MJ, et al. : Common variation in three genes, including a noncoding variant in CFH, strongly influences risk of age-related macular degeneration. Nat Genet 2006. September;38:1055–1059. [DOI] [PubMed] [Google Scholar]
- 36.Fagerness JA, Maller JB, Neale BM, Reynolds RC, Daly MJ, Seddon JM: Variation near complement factor I is associated with risk of advanced AMD. Eur J Hum Genet 2009. January;17:100–104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Gold B, Merriam JE, Zernant J, Hancox LS, Taiber AJ, Gehrs K, et al. : Variation in factor B (BF) and complement component 2 (C2) genes is associated with age-related macular degeneration. Nat Genet 2006. April;38:458–462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Yu Y, Bhangale TR, Fagerness J, Ripke S, Thorleifsson G, Tan PL, et al. : Common variants near FRK/COL10A1 and VEGFA are associated with advanced age-related macular degeneration. Hum Mol Genet 2011. September 15;20:3699–3709. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Rivera A, Fisher SA, Fritsche LG, Keilhauer CN, Lichtner P, Meitinger T, et al. : Hypothetical LOC387715 is a second major susceptibility gene for age-related macular degeneration, contributing independently of complement factor H to disease risk. Hum Mol Genet 2005. November 1;14:3227–3236. [DOI] [PubMed] [Google Scholar]
- 40.Dewan A, Liu M, Hartman S, Zhang SS-M, Liu DTL, Zhao C, et al. : HTRA1 promoter polymorphism in wet age-related macular degeneration. Science 2006. November 10;314:989–992. [DOI] [PubMed] [Google Scholar]
- 41.Maller JB, Fagerness JA, Reynolds RC, Neale BM, Daly MJ, Seddon JM: Variation in complement factor 3 is associated with risk of age-related macular degeneration. Nat Genet 2007. October;39:1200–1201. [DOI] [PubMed] [Google Scholar]
- 42.Yates JRW, Sepp T, Matharu BK, Khan JC, Thurlby DA, Shahid H, et al. : Complement C3 variant and the risk of age-related macular degeneration. N Engl J Med 2007. August 9;357:553–561. [DOI] [PubMed] [Google Scholar]
- 43.Chen W, Stambolian D, Edwards AO, Branham KE, Othman M, Jakobsdottir J, et al. : Genetic variants near TIMP3 and high-density lipoprotein-associated loci influence susceptibility to age-related macular degeneration. Proc Natl Acad Sci USA 2010. April 20;107:7401–7406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Reimand J, Kull M, Peterson H, Hansen J, Vilo J: g:Profiler--a web-based toolset for functional profiling of gene lists from large-scale experiments. Nucleic Acids Res 2007. July;35:W193–200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Javierre BM, Fernandez AF, Richter J, Al-Shahrour F, Martin-Subero JI, Rodriguez-Ubreva J, et al. : Changes in the pattern of DNA methylation associate with twin discordance in systemic lupus erythematosus. Genome Res 2010. February;20:170–179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Dempster EL, Pidsley R, Schalkwyk LC, Owens S, Georgiades A, Kane F, et al. : Disease-associated epigenetic changes in monozygotic twins discordant for schizophrenia and bipolar disorder. Hum Mol Genet 2011. December 15;20:4786–4796. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Wang X, Zhu H, Snieder H, Su S, Munn D, Harshfield G, et al. : Obesity related methylation changes in DNA of peripheral blood leukocytes. BMC Med 2010;8:87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Baranzini SE, Mudge J, van Velkinburgh JC, Khankhanian P, Khrebtukova I, Miller NA, et al. : Genome, epigenome and RNA sequences of monozygotic twins discordant for multiple sclerosis. Nature 2010. April 29;464:1351–1356. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Stepanow S, Reichwald K, Huse K, Gausmann U, Nebel A, Rosenstiel P, et al. : Allele-specific, age-dependent and BMI-associated DNA methylation of human MCHR1. PLoS ONE 2011;6:e17711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Kang MY, Lee BB, Ji YI, Jung EH, Chun H-K, Song SY, et al. : Association of interindividual differences in p14ARF promoter methylation with single nucleotide polymorphism in primary colorectal cancer. Cancer 2008. April 15;112:1699–1707. [DOI] [PubMed] [Google Scholar]
- 51.Boumber YA, Kondo Y, Chen X, Shen L, Guo Y, Tellez C, et al. : An Sp1/Sp3 binding polymorphism confers methylation protection. PLoS Genet 2008;4:e1000162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Wei L, Liu B, Tuo J, Shen D, Chen P, Li Z, et al. : Hypomethylation of the IL17RC promoter associates with age-related macular degeneration. CellReports 2012. November 29;2:1151–1158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Oliver VF, Franchina M, Jaffe AE, Branham KE, Othman M, Heckenlively JR, et al. : Hypomethylation of the IL17RC Promoterin Peripheral Blood Leukocytes Is NotA Hallmark of Age-Related Macular Degeneration. CellReports 2013. December 26;5:1527–1535. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Hunter A, Spechler PA, Cwanger A, Song Y, Zhang Z, Ying G-S, et al. : DNA methylation is associated with altered gene expression in AMD. IOVS 2012. April;53:2089–2105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Shoemaker R, Deng J, Wang W, Zhang K: Allele-specific methylation is prevalent and is contributed by CpG-SNPs in the human genome. Genome Res 2010. July;20:883–889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Houseman E, Accomando WP, Koestler DC, Christensen BC, Marsit CJ, Nelson HH, et al. : DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinformatics 2012;13:86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Glossop JR, Nixon NB, Emes RD, Haworth KE, Packham JC, Dawes PT, et al. : Epigenome-wide profiling identifies significant differences in DNA methylation between matched-pairs of T- and B-lymphocytes from healthy individuals. Epigenetics 2013. November 1;8:1188–1197. [DOI] [PubMed] [Google Scholar]
- 58.Ramirez JM, Triviño A, Ramirez AI, Salazar JJ, Garcia-Sanchez J: Immunohistochemical study of human retinal astroglia. Vision Res 1994. August;34:1935–1946. [DOI] [PubMed] [Google Scholar]
- 59.Pekny M, Nilsson M: Astrocyte activation and reactive gliosis. Glia 2005. June;50:427–434. [DOI] [PubMed] [Google Scholar]
- 60.Madigan MC, Penfold PL, Provis JM, Balind TK, Billson FA: Intermediate filament expression in human retinal macroglia. Histopathologic changes associated with age-related macular degeneration. Retina (Philadelphia, Pa) 1994;14:65–74. [PubMed] [Google Scholar]
- 61.Ramirez JM, Ramirez AI, Salazar JJ, de Hoz R, Triviño A: Changes of astrocytes in retinal ageing and age-related macular degeneration. Exp Eye Res 2001. November;73:601–615. [DOI] [PubMed] [Google Scholar]
- 62.Wu KHC, Madigan MC, Billson FA, Penfold PL: Differential expression of GFAP in early v late AMD: a quantitative analysis. Br J Ophthalmol 2003. September;87:1159–1166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Nikulina E, Tidwell JL, Dai HN, Bregman BS, Filbin MT: The phosphodiesterase inhibitor rolipram delivered after a spinal cord lesion promotes axonal regeneration and functional recovery. Proc Natl Acad Sci USA 2004. June 8;101:8786–8790. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.ENCODE Project Consortium: A user’s guide to the encyclopedia of DNA elements (ENCODE). PLoS Biol 2011. April;9:e1001046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Bell AC, Felsenfeld G: Methylation of a CTCF-dependent boundary controls imprinted expression of the Igf2 gene. Nature 2000. May 25;405:482–485. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.