Abstract
SCNN1B encodes the beta subunit of the epithelial sodium channel ENaC. Previously, we reported an association between SNP markers of SCNN1B gene and disease severity in cystic fibrosis-affected sibling pairs. We hypothesized that factors interacting with the SCNN1B genomic sequence are responsible for intrapair discordance. Concordant and discordant pairs differed at six SCNN1B markers (Praw = 0.0075, Pcorr = 0.0397 corrected for multiple testing). To identify the factors binding to these six SCNN1B SNPs, we performed an electrophoretic mobility shift assay and captured the DNA–protein complexes. Based on protein mass spectrometry data, the epithelial splicing regulatory protein ESRP2 was identified when using SCNN1B-derived probes and the ESRP2-SCNN1B interaction was independently confirmed by coimmunoprecipitation assays. We observed an alternative SCNN1B transcript and demonstrated in 16HBE14o− cells that levels of this transcript are decreased upon ESRP2 silencing by siRNA. Furthermore, we confirmed that mildly and severely affected siblings have different ESPR2 genetic backgrounds and that ESRP2 markers are linked to the response of CF patients’ nasal epithelium to amiloride, indicating ENaC involvement (Pbest = 0.0131, Pcorr = 0.068 for multiple testing). Our findings demonstrate that sibling pairs clinically discordant for CF can be used to identify meaningful DNA regulatory elements and interacting factors.
Subject terms: Gene regulation, Genetic association study
Introduction
Genetic variation in humans contributes significantly to phenotypic variation. The question of which single nucleotide polymorphism (SNP) determines disease outcome and/or severity, has been addressed in more than 2000 genome-wide association studies (GWAS)1. However, the results have revealed that more than 90% of the polymorphisms identified by GWAS do not directly alter the gene’s coding sequence. This has led to the conclusion that clinically relevant variation of the human genome mediates gene regulation, i.e., the transcript expression level or the composition of transcript isoforms2. Recent genome and epigenome studies have substantiated this hypothesis3. Enrichment of elements known for covalent modifications of DNA bases or their associated nucleosomes4 among disease- and trait-associated genetic variants determined by GWAS has also been noted.
The gene affected in cystic fibrosis (CF), cystic fibrosis transmembrane conductance regulator (CFTR), encodes a chloride- and bicarbonate channel of epithelia5–7 that localizes with the epithelial sodium channel ENaC at the apical membrane of many8, albeit not all9, epithelial cells. Both CFTR and ENaC act synergistically to regulate salt and fluid transport across the epithelium8,10, and the SCNN1B gene, encoding the beta subunit of ENaC, is a highly plausible modifier gene of CF.
Previously, we focused on the three genes encoding the subunits of ENaC as candidate genes in the European CF twin and sibling study11. In an association study on extreme phenotypes, anthropometry and lung function data were used to select patients whose clinical data fell below the 25th centile (severely affected) or above the 75th centile (mildly affected) for both clinical parameters12. Three groups of affected patient pairs were defined as follows by a ranking algorithm used to describe the severity of CF: concordant mildly affected sibling pairs, concordant severely affected sibling pairs and discordant sibling pairs. Discordant sibling pairs were composed of one mildly and one severely affected sibling12.
When discordant sibling pairs were compared to concordant sibling pairs, one SCNN1B haplotype defined by SNPs rs238547–rs152730–rs250563 occurred more frequently among discordant than among concordant siblings11. We concluded that this signal cannot be fully explained by a variant observed within SCNN1B because discordant siblings have a dissimilar phenotype by definition12, and yet these siblings share an SCNN1B intragenic haplotype11. Our working hypothesis relied on the idea that the association signal in SCNN1B delineates a functional regulatory element, whereby a DNA-binding protein encoded in trans interacts with this regulatory element, stably binding to the haplotype of the regulatory element that is predominant among discordant siblings. Hence, the genetic variation of the interaction partner can determine the phenotype causing intrapair discordance in affected sibling pairs.
In association studies involving affected patient pairs, interaction between a regulatory element and a DNA-binding protein may result in a paradoxical situation. The regulatory element is recognized through an INTERpair comparison by an association with the phenotype “discordance of sibs”. However, genetic information at the regulatory element is shared by both siblings within a pair, which provides an opportunity to identify the DNA-binding protein encoded in trans by an INTRApair comparison. If the phenotype is caused by interaction of the DNA-binding protein with the regulatory element, mildly and severely affected siblings of discordant pairs must carry different genetic information at the locus encoding the DNA-binding protein.
Results
Six SNPs within SCNN1B differ between concordant and discordant CF patient pairs
We previously reported that intrapair discordance for CF disease severity is associated with three intragenic markers spanning SCNN1B from codon 3 to codon 293. To describe the genomic fragment for which concordant and discordant pairs carry different genetic information, we analyzed 7 previously typed markers11 and 49 SNPs genotyped for fine-mapping in the 16p12 region, encompassing the entire SCNN1G/SCNN1B-locus (Fig. 1). Next, we employed a haplotype-based fine-mapping strategy previously used to identify causative variants within this cohort13. To determine whether concordant and discordant pairs carried the same or different genetic information, we employed the software package FAMHAP to construct two-marker haplotypes composed of two informative markers. By using this approach, we found a significant difference in two-marker-haplotype distributions for two adjacent genomic fragments defined by markers rs152730–rs152745 and rs152745–rs152740 (Praw = 0.0075 and Praw = 0.00869, respectively; corrected for multiple testing of all informative markers at the SCNN1G/SCNN1B-locus Pcorr = 0.0397, Fig. 1). We concluded that the variant(s) that determine intrapair discordance are located on the genomic fragment between rs152730 and rs152740. Based on the allele frequency distribution among concordant and discordant pairs, we selected representatives for the contrasting haplotypes for Sanger resequencing of the mapped genomic fragment (Table 1). We chose three homozygotes for the haplotype TTAGA, two homozygotes for the haplotype GGAAT and one homozygote for the haplotype GTCAT for sequencing of the rs152730–rs152740 genomic fragment on contrasting alleles at markers rs152730–rs8044970–rs63982–rs152745–rs152740. We used long-range PCR to amplify an 8269 bp and an 8856 bp product encompassing the sequence of interest (Table 2). Sanger sequencing was performed using internal primers positioned every 500 bp on the forward and reverse strands. Using the software CodonCode Aligner, 476 primary reads with a median length of 737 bp were aligned to the reference sequence, assuring coverage of at least 4 reads per haplotype at each genomic position. Based on this alignment, we identified 6 SNP positions for carriers of the contrasting haplotypes for which concordant and discordant pairs had different genetic information. At the six SNPs rs152730–rs152731–rs152745–rs152744–rs152741–rs152740, alleles associated with intrapair concordance carried the haplotype GCAGTT; in contrast, alleles associated with intrapair discordance carried the haplotype TTGACA. None of these six SNPs reside within the coding sequence of SCNN1B. However, according to in silico analyses, they possibly alter the secondary structure of the pre-mRNA (SupplTab. 1, SupplTab. 2, SupplFig. 1).
Table 1.
Freq. among 14 discordant pairs | Freq. among 23 concordant pairs | ||
---|---|---|---|
rs152730–rs8044970–rs63982–rs152745–rs152740 haplotype | |||
TTAGA | 0.839 | 0.599 | Pcorr = 0.0397 |
GTCAT | 0.051 | 0.185 | |
GGAAT | 0.036 | 0.129 | |
Other pooleda | 0.074 | 0.087 |
rs152730–rs8044970–rs63982–rs152745–rs152740 haplotype | |||
---|---|---|---|
TTAGA/TTAGAb | 0.714 | 0.400 | Pcorr = 0.049920 |
TTAGA/GTCAT | 0.071 | 0.201 | |
TTAGA/GGAAT | 0.071 | 0.127 | |
GGAAT/GGAATb | 0.000 | 0.020 | |
GTCAT/GTCATb | 0.000 | 0.059 | |
Other pooled | 0.144 | 0.193 |
aFour rare haplotypes (Freq. < 0.05) were observed among discordant pairs, 8 rare haplotypes (Freq. < 0.05) were observed among concordant pairs.
bTo identify all genetic variants associated with intrapair discordance on the genomic fragment rs12730–rs152740, the entire 8000 bp genomic fragment was compared by Sanger re-sequencing for three homozygotes for TTAGA, two homozygotes for GGAAT and one homozygote for GTCAT (see Table 2).
Table 2.
Haplotype at rs152730–rs8044970–rs63982–rs152745–rs152740 | Associated with intrapair disease manifestation | rs152730a | rs62029384 | rs8044970a | rs8044984 | rs80443907 | rs152731 | rs152732 | rs180878 | rs152733 |
---|---|---|---|---|---|---|---|---|---|---|
TTAGA | Discordant | T | C | T | T | C | T | C | T | T |
GGAAT | Concordant | G | T | G | G | T | C | C | T | T |
GTCAT | Concordant | G | C | T | T | C | C | T | G | C |
Different between TTAGA and GGAAT as well as GTCAT? | Yes | No | No | No | No | Yes | No | No | No |
Haplotype at rs152730–rs8044970–rs63982–rs152745–rs152740 | Associated with intrapair disease manifestation | rs63982a | rs152745a | rs8062922 | rs152744 | rs62029385 | rs152743 | rs152741 | rs57406669 | rs152740a |
---|---|---|---|---|---|---|---|---|---|---|
TTAGA | Discordant | A | G | C | A | C | G | C | C | A |
GGAAT | Concordant | A | A | T | G | T | G | T | G | T |
GTCAT | Concordant | C | A | C | G | C | A | T | C | T |
Different between TTAGA and GGAAT as well as GTCAT? | No | Yes | No | Yes | No | No | Yes | No | Yes |
ars152730–rs8044970–rs63982–rs152745–rs152740 were used to map the fragment associated with intrapair discordance and define the contrasting haplotypes TTAGA (associated with intrapair discordance) and GGAAT as well as GTCAT (both associated with intrapair concordance); see Table 1.
An uncommon alternative SCNN1B transcript generated by intron retention in epithelial cell lines
Because none of the six identified SNPs affect the amino acid sequence of the SCNN1B protein, we next aimed to determine whether SCNN1B undergoes alternative splicing. Alternative transcripts were inferred from mapped expressed sequence tags (ESTs, SupplFig. 2). As a source for polyA + RNA, we used T84 cells, which are derived from colon carcinoma, 16HBE14o-cells, which are virus-transformed non-CF respiratory epithelial cells, and CFBE41o- and CFTE29o-cells, both of which are immortalized respiratory epithelial cells derived from F508del-CFTR homozygous CF patients. Primers for combinatorial reverse-transcription PCR were designed to reflect ESTs reported for SCNN1B in the area of interest defined by SNPs rs152730–rs152740 (SupplFig. 2A). By using primers located in exons 3 and 4 or exons 3 and 5, we detected wild-type SCNN1B in all four epithelial cell lines (Fig. 1C). Additionally, we amplified a 280 bp product using one primer located within exon 3 and one primer located 100 bp 3′ of the splice site at the end of exon 3 (Fig. 1C and SupplFig. 2). We specifically investigated this intronic sequence because it has been reported to be retained in EST clones BM694355 and BU730506, which are generated from a cDNA library prepared from retinal pigment epithelium of a healthy adult male. Primers designed to detect ESTs AW844136, CV337204 and BX485038 did not amplify a product (SupplFig. 2). The 280 bp product, derived from the alternative SCNN1B transcript generated by exon read-through, was reliably amplified from T84 derived cDNA even when the RNA was pretreated with DNAse (SupplFig 2). Moreover, Sanger sequencing of the alternative product confirmed its identity at the exon 3/intron 3 border. Hence, the alternative mRNA was generated by an exon read-through event and matched the genomic sequence by the base (SupplFig 2E). Furthermore, primers placed upstream in exon 1 encoding the 5′ UTR of SCNN1B in combination with a primer placed on the retained intron sequence yielded a product from cDNA (Fig. 1C). Conversely, no signal was observed when using a primer located in the downstream exon 5 in combination with the retained intron sequence (data not shown). To summarize, cancer-derived intestinal epithelial cells as well as virus-immortalized respiratory epithelial cells expressed an alternative SCNN1B transcript in which intron 3 was partially retained. If translated, this alternative transcript would preserve the reading frame at the end of exon 3 and would be translated into a protein that terminates prematurely after an 26 additional amino acids derived from the retained intron sequence, producing a severely truncated SCNN1B protein of 221 amino acids.
The SCNN1B haplotype found in discordant pairs is enriched for predicted transcription factor binding sites
We next assumed that an allelic association with a discordant manifestation of CF severity is mediated by factors that recognize the allele enriched among discordant pairs. To test our hypothesis, we assessed whether the 6-marker-haplotype that is associated with intrapair discordance for CF severity, i.e., whether TTGACA at the six SNPs rs152730–rs152731–rs152745–rs152744–rs152741–rs152740 attracts different DNA-binding proteins compared to the GCAGTT allele that is observed among concordant sibling pairs. To identify potential transcription factor binding sites, we used the tool “Match” (available at http://www.gene-regulation.com/)14, which is based on a library of mononucleotide weight matrices from TRANSFAC6.0. The settings were restricted to vertebrate transcription factors and limited to minimize false negatives (estimated error rate of 10% for training data set). As an input sequence, we used both alleles at each of the six divergent SNPs and + /− 20 bp flanking sequences. Next, we compared the list of putative transcription factor binding sites between the input sequences derived from haplotypes associated with concordance and discordance and noted those predicted to interact with only one of the two contrasting alleles at each SNP. Surprisingly, only 6 binding sites were predicted for the haplotype observed among concordant sibling pairs; 21 predicted interactions were exclusively related to the six-marker-haplotype associated with intrapair discordance (Fig. 2, SupplTab 3a). Different from concordant sibling pairs, the haplotype observed among discordant sibling pairs had significantly more opportunities to interact with DNA-binding proteins (p = 0.048; in comparison to the expectancy value derived from 26 binding sites distributed equally between both haplotypes).
Next, we evaluated whether predicted interacting proteins of the SCNN1B haplotype TTGACA (SupplTab 3a) are associated with the response to amiloride upon superfusion of the nasal epithelium. The function of ENaC in vivo can be assessed based on the potential difference between the nasal epithelium and the subcutaneous space15. According to the nasal potential difference with the use of amiloride (indicative of ENaC-mediated sodium transport), except for GATA2Sat (Praw = 0.0456), none of the genes encoding predicted interaction partners showed an association with ENaC function (SupplTab 3b).
Interaction partners of double-stranded DNA sequences can be captured with an electrophoretic mobility shift assay following protein sequencing (EMSA-PSeq)
As our in silico analysis did not extend to DNA-binding proteins with unknown binding motifs, we aimed to identify interacting proteins by performing a modified electrophoretic mobility shift assay followed by protein sequencing of the DNA–protein complex (EMSA-PSeq; see supplement for experimental details). Briefly, we used nuclear extracts derived from epithelial cells and biotinylated 35-mer dsDNA probes that centrally carry one of the contrasting alleles of the SNPs as bait. To separate the unbound probe from the probe-protein-complexes, we performed native polyacrylamide gel electrophoresis, and the probe-protein-complexes were visualized after transfer of the separated samples to a membrane. The gel fragment corresponding to the signal generated by the probe-protein-complex was excised, and proteins within the excised gel fragment were identified by protein mass spectrometry. We used the NFkappaB-P65-consensus sequence16 for optimization of the experimental setup. Protein mass spectrometry and MASCOT analysis identified several hundred proteins per high-molecular weight complex. To enable the recognition of proteins that incidentally comigrate together with the probe/protein complex and/or that bind to any DNA unspecifically, and/or are introduced to the sample as contaminants during handling of the gel fragment, a set of 25 EMSA-PSeq experiments were performed in parallel for noise filtering (SupplFig 3). In the EMSA-PSeq sample obtained with the NFkappaB-P65-consensus probe (Fig. 3), we detected P65 (score 78, tagged by three specific peptides) as a unique signal within a total of 25 evaluated EMSA-PSeq data sets. Additionally, the EMSA-PSeq sample baited with the NFkappaB-P65-consensus probe uniquely attracted STAT3 and STAT6, both of which were not observed in any other of the 25 EMSA-PSeq datasets.
Nucleic acid binding proteins attracted by probes of SNPs rs152730, rs152731 and rs152744 and identified as unique by EMSA-PSeq
To detect interaction partners of SCNN1B SNPs associated with intrapair discordance, we used certain experimental conditions to detect NFkappaB-P65 using a probe with a p65 consensus sequence as bait (Fig. 3). Although we varied the conditions for electrophoresis, electrotransfer and detection (see supplement for details), we were not able to obtain a reproducible high-molecular-weight complex for either allele of rs152745, rs152741 and rs152740. However, high-molecular-weight DNA–protein-complexes were observed with probes representing the two contrasting alleles at SNPs rs152730, rs152731 and rs152744. As described in detail within the Supplemental material, we have filtered the raw data set of 25 EMSA-PSeq experiments for low-expressed proteins annotated to have nucleic acid binding capabilities and attracted to only one SNP (SupplFig 3).
Even when considering the inaccuracy of complex sizes after separation on the native polyacrylamide gel, high-molecular-weight complexes were incompatible with the interaction of a single protein found by EMSA-PSeq (SupplTab 4). Thus, several independent comigrating protein–protein and protein–protein-nucleic acid complexes were likely subjected to protein sequencing. Among the proteins identified as specific for the P65 consensus probe, P65 was one of 11 proteins (Fig. 4, SupplTab 4). Sixteen, five and eight proteins were identified uniquely for SNPs rs152730, rs152731 and rs152744, respectively, by EMSA-PSeq and data mining (Fig. 4, SupplTab 4). Since P65 was found with the same experimental and data evaluation strategy used for the data sets for the SCNN1B SNPs, we assume that our true-positive protein of interest has been captured as well.
The ESRP2 genetic background is associated with the manifestation of the amiloride-sensitive sodium current in the nasal epithelium
To prove or disprove that a protein identified by EMSA-PSeq can influence ENaC function, we performed a candidate-gene-based association study among CF patients in which one of the studied phenotypes addresses ENaC function in the nasal epithelium in vivo17. To select a plausible genetic locus based on the list of 29 candidate proteins (Fig. 4, SupplTab 4), we excluded components of the spliceosome multiprotein complex. Among the remaining EMSA-PSeq-derived proteins, the epithelial-specific splicing regulatory protein 2 ESRP218,19 was the most reasonable candidate. First, CF is an epithelial disease and ESRP2 is consistent with this feature20,21. Second, an alternative SCNN1B transcript was observed (Fig. 1C, SupplFig 2) and ESRP2 is plausible based on its role in transcript processing.
We found ESRP2 exclusively on probes representing rs152731 whereas ESRP1 was detected on probes derived from the three EMSA-PSeq SNPs (Fig. 5). In the association study, the ESRP1 marker ESRP1-Sat1 showed no association with the phenotype or severity of CF (Praw > 0.2). In contrast, ESRP2-Sat1 exhibited an association signal (Pbest = 0.04) that was confirmed with a second microsatellite and 5 SNPs (Pbest = 0.0131, Pcorr = 0.068 for multiple testing of 7 markers; Fig. 5) for the manifestation of amiloride-sensitive sodium conductance, a hallmark of ENaC function.
The two rs152731 alleles are nonequivalent concerning ESRP2 binding
ESRP2 was detected as a factor binding to the C to T SNP rs152731 by EMSA-PSeq (Fig. 4) by using polyacrylamide gels with a separating distance of 4 cm. To detect ESRP2-rs152731 binding complexes, we employed EMSAs using polyacrylamide gels with a separating distance of 20 cm (Fig. 6A,B). To exclude nonspecific binding, signals obtained with an antibody directed against ESRP2 were compared to protein-DNA-complexes probed with IgG (isotype control). Signal intensities for rs152731-C were comparable in IgG and anti-ESRP2-Ab lanes. In contrast, signals obtained for rs152731-T were stronger with anti-ESRP2-Ab than with IgG. Normalized signal intensities for rs152731-T were higher than those for rs152731-C (p = 0.013).
Next, we addressed whether ESRP2 recognizes rs152731 directly. We used a coimmunoprecipitation (co-IP) experiment using a biotinylated rs152731 EMSA probe as bait with a raw nuclear extract. After precipitation with an anti-biotin-Ab, detection of ESRP2 by Western blotting in co-IP samples verified that all components sufficient for ESRP2 to bind rs152731 are present within the nuclear extract and/or the components used for the EMSA preceding co-IP (Fig. 6C,D). In three independent co-IP experiments comparing the two contrasting rs152731 alleles, ESRP2 signals derived from probes with rs152731-T yielded stronger signals than probes for rs152731-C. In summary, the results from two different techniques confirmed that rs152731-T binds ESRP2 better than does rs152731-C. Thus, the allele rs152731-T associated with intrapair discordance (Tables 1, 2) attracts ESRP2 to a greater extent and is more vulnerable to its genetic variations (Fig. 5).
ESRP2 knockdown alters global SCNN1B expression in 16HBE14o-cells
To determine whether ESRP2 influences the transcript species generated from SCNN1B, we downregulated ESRP2 by siRNA in T84 and 16HBE14o-cells and quantified the wild-type and the alternative transcripts by qPCR (Fig. 7). Silencing of ESRP2 in T84 cells resulted in highly variable changes in SCNN1B transcripts. Moreover, the observed changes were comparable to those using scrambled control siRNA. In contrast, no systematic effect of scrambled control siRNA was detected in 16HBE 14o-cells (p = 0.50 for wild-type, p = 0.32 for alternative SCNN1B transcript; Wilcoxon signed-rank test). Moreover, the amounts of wild-type and alternative SCNN1B were increased in 16HBE14o-cells upon treatment with siRNA directed against ESRP2 (p = 0.015 for wild-type; p = 0.054 for alternative SCNN1B transcript). Comparison of changes in wild-type and alternative SCNN1B transcript levels assessed by paired ΔΔCt levels for both amplicons indicated that downregulation of ESRP2 induced expression of functional wild-type SCNN1B in 16HBE14o-cells (p = 0.081, Wilcoxon signed rank test).
The upper respiratory tract is the origin of 16HBE14o-cells while T84 cells are derived from intestinal cells; 16HBE14o-cells are virus-immortalized and T84 are colon cancer cells. Because ESRP2 plays a prominent role in cancer18,19, it is not surprising that these two cell lines behaved differently in the ESRP2 gene silencing assay. Furthermore, the baseline expression of both SCNN1B transcripts was lower in 16HBE14o than in T84 cells. Interestingly, T84 cells carry only one, but 16HBE14o-cells carry two of the ESRP2-receptive alleles T–G–A–C at markers rs152731–rs152745–rs152744–rs152741. This may explain the observed differences between 16HBE14o and T84 cells in response to ESRP2 silencing. Homozygosity for T–G–A–C is associated with intrapair discordance among CF twins and siblings.
Discussion
The design of the European CF twin and sibling study was inspired by Risch and Zhang22 who proposed that the use of sibling pairs with extremely concordant or discordant phenotypes will advance the discovery of quantitative trait loci in humans22–24. Due to the high power of this approach, it was estimated that the genotyping load for studies undertaken with an extreme sib-pair design, selecting for patient pairs who exhibit phenotypes below the 30th or above the 70th centile, can be reduced by up to 40-fold22. Based on this strategy for patient recruitment, 37 F508del-CFTR homozygous sibling pairs of 318 cystic fibrosis affected patient pairs were selected for the association study by a ranking algorithm12. The selected sibling pairs were comparable in terms of their birth cohort11. This strategy helped us to minimize the influence of a major nongenetic confounder25, i.e., complex therapeutic management which has improved the life expectancy of CF patients by several decades. Pulmonary and gastrointestinal disease manifestations were assessed quantitatively by CF population centiles for the normalized forced expiratory volume in 1 s (FEV1) and by weight as a percentage of predicted weight for height. For these two parameters, we selected sibling pairs with extreme phenotypes in the upper and lower 25%17,26. Our selection criteria were in line with recommendations proposed by Risch and Zhang22, however, these criteria resulted in a small study population, thus limiting the power of the genetic association study. Moreover, since our study population is of white European descent, a group in which F508del-CFTR is the most common mutation causing CF, we cannot be certain that our findings can be applied to other populations.
Risch and Zhang concluded from their simulation studies that “extremely discordant sibling pairs represent a powerful design for the association studies of candidate genes”22, and our findings fully support this idea. The use of sibling pairs with extreme clinical phenotypes has been applied before27,28, and our data support the notion that gene–gene interactions mediated by factors encoded in trans of the studied locus can be distinguished in an association study when mildly and severely affected siblings of discordant pairs are compared (Fig. 8).
Hence, in this study we identified a haplotype within SCNN1B associated with intrapair discordance in CF sibling pairs (Fig. 1, Tables 1, 2). We investigated in silico (Fig. 2) and experimentally (Figs. 3, 4) the occurrence of DNA-binding proteins interacting differentially with single or multiple SNPs within the haplotype. We were able to recognize ESRP2 as a candidate for validation among nucleic acid binding proteins, which showed an association with SCNN1B/ENaC function (Fig. 5). We further employed EMSA and co-IP as two different experimental approaches to support the allele-dependent interaction between rs152731 in SCNN1B and the nucleic acid binding protein ESRP2 (Fig. 6). Moreover, we demonstrated that siRNA mediated silencing of ESRP2 in respiratory epithelial cells causes an alteration in global SCNN1B expression (Fig. 7). Altogether, our findings consistently support the idea that SCNN1B and ESRP2 are interacting partners and that ESRP2 is capable of altering the SCNN1B transcript repertoire. It is plausible that this interaction can alter ENaC function and has an influence on the manifestation of CF.
Our proof-of-principle study has several limitations. Specifically the findings cannot fully explain whether the regulatory element within SCNN1B leads to the alternative SCNN1B transcript, from which a severely truncated, 221 amino acid SCNN1B may be translated. Similar truncated SCNN1B isoforms of 217 and 306 amino acids have been observed in patients with systemic pseudohypoaldosteronism48. In addition, heterologous expression of these truncated SCNN1B mutants in Xenopus oocytes showed that they can assemble with wild-type alpha- and gamma ENaC subunits48. These SCNN1B mutants resulted in lower ENaC activity (by 3–7%) than the wild-type protein48. Under physiological conditions, parallel expression of two SCNN1B transcript species, one of which yields a truncated SCNN1B isoform upon translation, can reduce SCNN1B function but the extent remains unclear.
Using EMSA-Pseq for the identification of DNA-binding proteins has previously been conceived29–31. To examine whether our EMSA conditions allowed the formation of high-molecular-weight multiprotein complexes with coherent DNA–protein-interactions in vivo, we investigated protein-DNA-complexes using an NFkappaB-P65-consensus as bait. In line with published data32–35, this probe attracted NFkappaB-p65 and its known interaction partners, such as STAT3 and STAT6. From the EMSA-Pseq of SCNN1B probes, we selected ESRP2 as a candidate for further validation experiments. This selection was based on the fact that this protein is characteristically expressed in epithelial cells and that similar to other SNP-specific proteins recognized by EMSA-PSeq, ESRP2 has been well-characterized as an RNA-binding protein18,19.
The defining border between RNA- and DNA-binding proteins has recently softened because typical DNA-binding proteins have become known to target long noncoding RNAs, defining dual-recognition nucleic acid binding proteins36,37. A growing number of nucleic acid binding proteins have been recognized to interact with both nucleic acid species37–44 and genomic DNA45. The ability to bind to DNA and RNA simultaneously designates a dual recognition protein capable of shuttling between both nucleic acid types during transcription38. During transcription, DNA and RNA are physically close, and therefore, cotranscriptional processes, such as pre-mRNA splicing, can be mediated by putative dual-recognition proteins, such as hnRNP splicing regulatory factors46,47.
Nevertheless, we cannot exclude that SCNN1B may have other important interacting partners in addition to ESRP2 that were not discovered in this study. In this work, we analyzed only three of six SNPs by EMSA-PSeq. Furthermore, while we filtered our primary protein sequencing data using a positive control (NFkappaB-P65) and several technical controls to recognize contamination, we did not incorporate a protein–probe-interaction with low binding affinity, which might enable the detection of weak interacting partners. In the future, the resolution of the EMSA-Pseq can be improved by using scrambled probes to control for nonspecific binding and by incorporating the false-positives captured in the data evaluation strategy. Moreover, EMSA-PSeq utilizes mass spectrometry to identify proteins. In contrast, nucleic acids such as long noncoding RNAs with the potential of interacting with the DNA sequence cannot be identified in this experimental setting, and thus, their relevance needs to be verified by other methods.
The haplotype associated with intrapair discordance covers a genomic segment of 8 kb, implying that a synergistic relationship of more than one interaction partner is responsible for the selective advantage that underlies the maintenance of linkage disequilibrium over such a distance. Thus, it is unlikely that the SCNN1B function can be fully understood based on studying single SNPs. Regardless, we are convinced that the methodology proposed herein—analysis of clinically discordant sibling pairs in combination with EMSA-PSeq—aids in our understanding of how some of the 10,000 SNPs identified by GWAS as being meaningful (www.genome.gov/gwastudies. Accessed at 03.02.2015) contribute to the manifestation of phenotypes in humans. As gene–gene interactions have been suggested to account for the phenomenon termed “missing heritability”49, the discovery of regulatory interactions such as those between ESRP2 and SCNN1B might help to annotate existing GWAS data sets that have been performed with sibling pairs50–52.
Methods
Details on the experimental procedures are provided in the supplement.
Cell culture
Biomaterials were derived from T84 colon cancer cells and immortalized respiratory epithelial 16HBE14o-, CFBE41o- and CFTE29o-cells.
RNA preparation
For RNA isolation, cells were cultured in plates, grown to confluency, snap-frozen in the gaseous phase of liquid N2 and stored at − 80 °C. RNA was extracted using QIAamp RNA Blood Mini Kit (52,304, Qiagen, Hilden, Germany) and RNase-free DNase Set (79,254, Qiagen, Hilden, Germany).
Oligonucleotides for PCR
Sequences of primers used for genotyping and combinatorial PCR are listed in the supplement (SupplTab 5).
Extraction of nuclear proteins
Nuclear extracts were prepared according to published standard methods53 (SupplTab 6). To prevent carryover of the high-salt buffer used for lysis of nuclei, nuclear proteins were dialyzed against low-salt HEPES buffer. The completeness of dialysis was verified by measuring the conductivity of the nuclear extract with a needle probe (customized, Technische Forschungswerkstätten of the Hannover Medical School). The quality of nuclear proteins was ascertained by verifying the conductivity of the final extract after dialysis and by noting the absence of degradation by SDS-electrophoresis followed by Coomassie staining.
EMSA-PSeq
To identify proteins that interact with a specific DNA sequence, we performed an EMSA experiment, visualized the shifted band, captured the DNA–protein complexes by excising the corresponding region of the polyacrylamide gel and then performed protein mass spectrometry. The composition of the EMSA binding buffer was adjusted to reflect the nuclear milieu54 (SupplTab 7). All experimental details and data analysis methods are provided in the supplement (SupplTab 8, SupplTab 9, SupplFig4, SupplFig5, SupplFig6).
Coimmunoprecipitation
Biotinylated 35-mer double-stranded DNA probes for the rs152731 allele C and allele T were incubated with nuclear extract in an EMSA experiment. The DNA–protein-complexes were precipitated using an anti-biotin antibody and protein G agarose beads. The protein–DNA-complexes were eluted from the beads in three consecutive steps, and Western blotting with anti-ESRP2 was used to detect ESRP2 in the precipitated protein–DNA-complexes. IgG instead of the anti-biotin antibody served as a negative control in all experiments. ESRP2 was identified using a signal from an unpurified nuclear extract developed in parallel in each Western blot experiment.
siRNA-mediated downregulation of ESRP2 in epithelial model cell lines and SCNN1B transcript analysis by real-time RT-PCR
siRNA directed against ESRP2 and scrambled control siRNA was purchased from GE Healthcare (mixture of four siRNAs, on-target plus pool, GE Healthcare). T84 and 16HBE14o-cells were transfected for 24 h and 48 h using a protocol supplied by the manufacturer with 10 µl of Dharmafect 1 and 100 pmol siRNA in 2 ml of cell culture medium per well of a 6-well plate. Commercially available kits were used according to the manufacturer’s instructions. RNA was isolated using the RNA-easy-mini-kit (Qiagen), transcribed into cDNA with the High-Capacity cDNA Reverse Transcription kit with RNase inhibitor (Applied Biosystems) and used as a template for real-time PCR with PowerUp SYBR Green Master Mix (Applied Biosystems) to target wild-type and read-through alternative SCNN1B transcripts with the StepOnePlus real-time PCR system (ThermoFisherScientific). The housekeeping gene aldolase was amplified from 5 ng cDNA with 400 nM forward and reverse primers. SCNN1B transcripts were amplified from 30 ng of cDNA with 400 nM (read-through alternative transcript) and 700 nM (wild-type transcript) forward and reverse primers, respectively. Amplification was carried out using annealing at 60 °C. Threshold cycle (Ct) values were retrieved using StepOne-Software (Thermo FisherScientific).
Genetic markers
Except for 7 previously typed markers11, genetic markers were developed de novo for this project. Genotyping was performed by the SNPstream assay (technology by Beckman Coulter, used at Cologne Center of Genomics, Cologne, Germany), by microsatellite genotyping using direct blotting electrophoresis17 or by PCR–RFLP (see SupplTab 5a).
Evaluation of genetic data in the association study on European cystic fibrosis twins and siblings
The work presented here derived data from an association study on European CF twins and siblings17. The study was approved by the ethics committee of Hannover Medical School and written informed consent was obtained from all participants or their parental guardians. All methods were performed in accordance with relevant guidelines and regulations. The clinical characteristics of the patients have been described in detail elsewhere12,15,17. Briefly, the 12% most informative pairs from the entire sample of 318 CF twin and sibling pairs for whom pulmonary function data and weight and height were available in 1996 were selected by a ranking algorithm12. To study genetic modifiers, we aimed to reduce the effect of the disease-causing CFTR gene on the disease phenotype, thus deciding to study only one CFTR mutation genotype. F508del-CFTR, present on 70% of CF chromosomes from white populations of Central and West-European countries, is the only CFTR mutation for which such an approach is feasible. Moreover, patient subsamples were examined to assess the manifestation of the basic defect of impaired ion conductance in the respiratory tissue, as determined in vivo by nasal potential difference measurement15, and in intestinal tissue, as determined ex vivo by intestinal current measurement15. Genetic information obtained from the case and reference populations with contrasting phenotypes was compared using the software package FAMHAP55, which allows family-based analysis56,57, accepts data evaluation in association studies on unrelated individuals as well as on affected sibling pairs55 and is adapted to handle intrapair comparison of genotype data in sibling pairs55. Correction for multiple testing at loci typed with more than one marker was performed by haplotype permutation56. For this purpose, the entire data set of cases and references was used to estimate haplotype frequencies55. To ensure a consistent assignment of rare haplotypes in small subsamples, the genotype data of 101 families with a total of 171 patients from the European CF twin and sibling study were used as a training set in all comparisons. Haplotype, or, in cases of noninformative phase or haplotype uncertainty, weighted haplotype explanation lists were assigned to each individual whereby the haplotype frequencies of the entire data set were taken into account to compute conditional likelihood weights55. Permutation was performed by randomly assigning the affection status to the individuals in each replication55. For the comparison of case sibling pairs to reference sibling pairs, the affection status was permuted or not with an equal chance for both siblings simultaneously55–57. For all comparisons described herein, the phenotypes and sample sizes of the case and reference populations are detailed within the legends, in Figs. 1, 5 and 8 as well as in Table 1.
Statistical analyses
The algorithms of Sham and Curtis58 were used to compare the observed occupancy of SCNN1B haplotypes associated with concordance vs discordance with unique transcription factors to the expectancy value derived from binding partners distributed equally between both haplotypes.
The EMSA band intensity between rs152731-C and rs152731-T probes was compared using a Mann–Whitney-U-Test in technically (electrophoresis) and biologically (cell culture and preparation of nuclear extract) independent experiments.
Changes in the expression levels of SCNN1B transcripts were judged from threshold cycle Ct values obtained by qPCR using the ΔΔCt method comparing siRNA or treated cells to untreated controls. To test against the hypothesis that no change in the SCNN1B transcript was observed, the Wilcoxon signed rank test was used to assess whether or not equal proportions of samples showed an increase or decrease in SCNN1B transcript species upon treatment with siRNA. For this analysis, technically (qPCR) and biologically (cell culture and experimental intervention) independent Ct values were used. For statistical analysis, ΔΔCt values derived from independent siRNA-treated or control samples were used.
Supplementary Information
Acknowledgements
We thank the late Dieter Gruenert for providing the 16HBE14o-, CFBE41o- and CFTE29o-cell lines and Geoffrey Sargent for providing expert advice on the culture conditions. We are grateful to Melanie Lenz for assistance with preparing the nuclear extracts and to Jörg Viering for customizing the device that measures conductivity in volumes < 100 µl (quality control of nuclear extracts).
Author contributions
B.T. and F.S. conceived and designed the project. Funding was acquired by F.S., T.B. and B.T. A.P., S.T., S.H., M.I., J.A. and N.D. performed the experiments. A.P., J.A., M.R.T., S.J. and F.S. analyzed the primary data. T.B. developed and implemented the program to perform the interpair- and intrapair comparisons of genetic data of affected sib pairs. Formal analysis of genetic data was done by F.S. and T.B. T.B., A.P., S.T., S.H., M.I., J.A., M.T., S.J., B.T. and F.S. drafted the manuscript and/or revised the draft critically for content. F.S. visualized the data and wrote, edited and revised the final manuscript.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Data availability
Supplemental Information is provided with this manuscript. Primary data will be shared with interested parties upon reasonable request.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-020-79804-y.
References
- 1.Welter D, et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 2014;42:D1001–1006. doi: 10.1093/nar/gkt1229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Zhang F, Lupski JR. Non-coding genetic variants in human disease. Hum. Mol. Genet. 2015;24:R102–110. doi: 10.1093/hmg/ddv259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Birney E, et al. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature. 2007;447:799–816. doi: 10.1038/nature05874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Tak YG, Farnham PJ. Making sense of GWAS: using epigenomics and genome engineering to understand the functional relevance of SNPs in non-coding regions of the human genome. Epigenetics Chromatin. 2015;8:57. doi: 10.1186/s13072-015-0050-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Ratjen F, et al. Cystic fibrosis. Nat. Rev. Dis. Primer. 2015;1:15010. doi: 10.1038/nrdp.2015.10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Strausbaugh SD, Davis PB. Cystic fibrosis: a review of epidemiology and pathobiology. Clin. Chest Med. 2007;28:279–288. doi: 10.1016/j.ccm.2007.02.011. [DOI] [PubMed] [Google Scholar]
- 7.Elborn JS. Cystic fibrosis. Lancet Lond. Engl. 2016;388:2519–2531. doi: 10.1016/S0140-6736(16)00576-6. [DOI] [PubMed] [Google Scholar]
- 8.Berdiev BK, Qadri YJ, Benos DJ. Assessment of the CFTR and ENaC association. Mol. Biosyst. 2009;5:123–127. doi: 10.1039/B810471A. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Enuka Y, Hanukoglu I, Edelheit O, Vaknine H, Hanukoglu A. Epithelial sodium channels (ENaC) are uniformly distributed on motile cilia in the oviduct and the respiratory airways. Histochem. Cell Biol. 2012;137:339–353. doi: 10.1007/s00418-011-0904-1. [DOI] [PubMed] [Google Scholar]
- 10.Tarran R, Trout L, Donaldson SH, Boucher RC. Soluble mediators, not cilia, determine airway surface liquid volume in normal and cystic fibrosis superficial airway epithelia. J. Gen. Physiol. 2006;127:591–604. doi: 10.1085/jgp.200509468. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Stanke F, et al. The TNFalpha receptor TNFRSF1A and genes encoding the amiloride-sensitive sodium channel ENaC as modulators in cystic fibrosis. Hum. Genet. 2006;119:331–343. doi: 10.1007/s00439-006-0140-2. [DOI] [PubMed] [Google Scholar]
- 12.Mekus F, et al. Categories of deltaF508 homozygous cystic fibrosis twin and sibling pairs with distinct phenotypic characteristics. Twin Res. Off. J. Int. Soc. Twin Stud. 2000;3:277–293. doi: 10.1375/136905200320565256. [DOI] [PubMed] [Google Scholar]
- 13.Stanke F, et al. Hierarchical fine mapping of the cystic fibrosis modifier locus on 19q13 identifies an association with two elements near the genes CEACAM3 and CEACAM6. Hum. Genet. 2010;127:383–394. doi: 10.1007/s00439-009-0779-6. [DOI] [PubMed] [Google Scholar]
- 14.Matys V, et al. TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic Acids Res. 2006;34:D108–D110. doi: 10.1093/nar/gkj143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Bronsveld I, et al. Chloride conductance and genetic background modulate the cystic fibrosis phenotype of Delta F508 homozygous twins and siblings. J. Clin. Invest. 2001;108:1705–1715. doi: 10.1172/JCI12108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Schreck R, Zorbas H, Winnacker EL, Baeuerle PA. The NF-kappa B transcription factor induces DNA bending which is modulated by its. Nucleic Acids Res. 1990;18:6497–6502. doi: 10.1093/nar/18.22.6497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Stanke F, et al. Genes that determine immunology and inflammation modify the basic defect of impaired ion conductance in cystic fibrosis epithelia. J. Med. Genet. 2011;48:24–31. doi: 10.1136/jmg.2010.080937. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Warzecha CC, Shen S, Xing Y, Carstens RP. The epithelial splicing factors ESRP1 and ESRP2 positively and negatively regulate diverse types of alternative splicing events. RNA Biol. 2009;6:546–562. doi: 10.4161/rna.6.5.9606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Warzecha CC, et al. An ESRP-regulated splicing programme is abrogated during the epithelial-mesenchymal transition. EMBO J. 2010;29:3286–3300. doi: 10.1038/emboj.2010.195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Haq IJ, Gray MA, Garnett JP, Ward C, Brodlie M. Airway surface liquid homeostasis in cystic fibrosis: pathophysiology and therapeutic targets. Thorax. 2016;71:284–287. doi: 10.1136/thoraxjnl-2015-207588. [DOI] [PubMed] [Google Scholar]
- 21.Stoltz DA, Meyerholz DK, Welsh MJ. Origins of cystic fibrosis lung disease. N. Engl. J. Med. 2015;372:351–362. doi: 10.1056/NEJMra1300109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Risch N, Zhang H. Extreme discordant sib pairs for mapping quantitative trait loci in humans. Science. 1995;268:1584–1589. doi: 10.1126/science.7777857. [DOI] [PubMed] [Google Scholar]
- 23.Eaves L, Meyer J. Locating human quantitative trait loci: guidelines for the selection of sibling pairs for genotyping. Behav. Genet. 1994;24:443–455. doi: 10.1007/BF01076180. [DOI] [PubMed] [Google Scholar]
- 24.Risch NJ, Zhang H. Mapping quantitative trait loci with extreme discordant sib pairs: sampling considerations. Am. J. Hum. Genet. 1996;58:836–843. [PMC free article] [PubMed] [Google Scholar]
- 25.Stanke F, et al. An informative intragenic microsatellite marker suggests the IL-1 receptor as a genetic modifier in cystic fibrosis. Eur. Respir. J. 2017;50:1700426. doi: 10.1183/13993003.00426-2017. [DOI] [PubMed] [Google Scholar]
- 26.Mekus F, Laabs U, Veeze H, Tummler B. Genes in the vicinity of CFTR modulate the cystic fibrosis phenotype in highly concordant or discordant F508del homozygous sib pairs. Hum. Genet. 2003;112:1–11. doi: 10.1007/s00439-002-0839-7. [DOI] [PubMed] [Google Scholar]
- 27.Rogus JJ, et al. High-density single nucleotide polymorphism genome-wide linkage scan for susceptibility genes for diabetic nephropathy in type 1 diabetes: discordant sibpair approach. Diabetes. 2008;57:2519–2526. doi: 10.2337/db07-1086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Santangelo SL, et al. A discordant sib-pair linkage analysis of age-related macular degeneration. Ophthalmic Genet. 2005;26:61–67. doi: 10.1080/13816810490967944. [DOI] [PubMed] [Google Scholar]
- 29.Jiang D, Jia Y, Jarrett HW. Transcription factor proteomics: identification by a novel gel mobility shift-three-dimensional electrophoresis method coupled with southwestern blot and high-performance liquid chromatography–electrospray-mass spectrometry analysis. J. Chromatogr. A. 2011;1218:7003–7015. doi: 10.1016/j.chroma.2011.08.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Stead JA, Keen JN, McDowall KJ. The identification of nucleic acid-interacting proteins using a simple proteomics-based approach that directly incorporates the electrophoretic mobility shift assay. Mol. Cell. Proteomics MCP. 2006;5:1697–1702. doi: 10.1074/mcp.T600027-MCP200. [DOI] [PubMed] [Google Scholar]
- 31.Woo AJ, Dods JS, Susanto E, Ulgiati D, Abraham LJ. A proteomics approach for the identification of DNA binding activities observed in the electrophoretic mobility shift assay. Mol. Cell. Proteomics MCP. 2002;1:472–478. doi: 10.1074/mcp.T200003-MCP200. [DOI] [PubMed] [Google Scholar]
- 32.Dhar K, Rakesh K, Pankajakshan D, Agrawal DK. SOCS3 promotor hypermethylation and STAT3-NF-kappaB interaction downregulate SOCS3 expression in human coronary artery smooth muscle cells. Am. J. Physiol. Heart Circ. Physiol. 2013;304:H776–785. doi: 10.1152/ajpheart.00570.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Shen CH, Stavnezer J. Interaction of stat6 and NF-kappaB: direct association and synergistic activation of interleukin-4-induced transcription. Mol. Cell. Biol. 1998;18:3395–3404. doi: 10.1128/MCB.18.6.3395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Yoshida Y, et al. Interleukin 1 activates STAT3/nuclear factor-kappaB cross-talk via a unique. J. Biol. Chem. 2004;279:1768–1776. doi: 10.1074/jbc.M311498200. [DOI] [PubMed] [Google Scholar]
- 35.Yu Z, Kone BC. The STAT3 DNA-binding domain mediates interaction with NF-kappaB p65 and inducible nitric oxide synthase transrepression in mesangial cells. J. Am. Soc. Nephrol. JASN. 2004;15:585–591. doi: 10.1097/01.ASN.0000114556.19556.F9. [DOI] [PubMed] [Google Scholar]
- 36.Connolly KM, Wojciak JM, Clubb RT. Site-specific DNA binding using a variation of the double stranded RNA binding motif. Nat. Struct. Biol. 1998;5:546–550. doi: 10.1038/799. [DOI] [PubMed] [Google Scholar]
- 37.Hudson WH, Ortlund EA. The structure, function and evolution of proteins that bind DNA and RNA. Nat. Rev. Mol. Cell Biol. 2014;15:749–760. doi: 10.1038/nrm3884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Gaillard C, Cabannes E, Strauss F. Identity of the RNA-binding protein K of hnRNP particles with protein H16, a sequence-specific single strand DNA-binding protein. Nucleic Acids Res. 1994;22:4183–4186. doi: 10.1093/nar/22.20.4183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Dempsey LA, Hanakahi LA, Maizels N. A specific isoform of hnRNP D interacts with DNA in the LR1 heterodimer: canonical RNA binding motifs in a sequence-specific duplex DNA binding protein. J. Biol. Chem. 1998;273:29224–29229. doi: 10.1074/jbc.273.44.29224. [DOI] [PubMed] [Google Scholar]
- 40.Chennathukuzhi VM, Kurihara Y, Bray JD, Yang J, Hecht NB. Altering the GTP binding site of the DNA/RNA-binding protein, Translin/TB-RBP, decreases RNA binding and may create a dominant negative phenotype. Nucleic Acids Res. 2001;29:4433–4440. doi: 10.1093/nar/29.21.4433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Zhao H, Yang Y, Zhou Y. Structure-based prediction of RNA-binding domains and RNA-binding sites and application to structural genomics targets. Nucleic Acids Res. 2011;39:3017–3025. doi: 10.1093/nar/gkq1266. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Shkreta L, Chabot B. The RNA splicing response to DNA damage. Biomolecules. 2015;5:2935–2977. doi: 10.3390/biom5042935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Norman M, Rivers C, Lee Y-B, Idris J, Uney J. The increasing diversity of functions attributed to the SAFB family of. Biochem. J. 2016;473:4271–4288. doi: 10.1042/BCJ20160649. [DOI] [PubMed] [Google Scholar]
- 44.Ghosh P, Sowdhamini R. Genome-wide survey of putative RNA-binding proteins encoded in the human proteome. Mol. Biosyst. 2016;12:532–540. doi: 10.1039/C5MB00638D. [DOI] [PubMed] [Google Scholar]
- 45.Mikula M, Bomsztyk K, Goryca K, Chojnowski K, Ostrowski J. Heterogeneous nuclear ribonucleoprotein (HnRNP) K genome-wide binding survey reveals its role in regulating 3′-end RNA processing and transcription termination at the early growth response 1 (EGR1) gene through XRN2 exonuclease. J. Biol. Chem. 2013;288:24788–24798. doi: 10.1074/jbc.M113.496679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Busch A, Hertel KJ. Evolution of SR protein and hnRNP splicing regulatory factors. Wiley Interdiscip. Rev. RNA. 2012;3:1–12. doi: 10.1002/wrna.100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Shipman KL, Robinson PJ, King BR, Smith R, Nicholson RC. Identification of a family of DNA-binding proteins with homology to RNA splicing factors. Biochem. Cell Biol. Biochim. Biol. Cell. 2006;84:9–19. doi: 10.1139/o05-139. [DOI] [PubMed] [Google Scholar]
- 48.Edelheit O, et al. Truncated beta epithelial sodium channel (ENaC) subunits responsible for multi-system pseudohypoaldosteronism support partial activity of ENaC. J. Steroid Biochem. Mol. Biol. 2010;119:84–88. doi: 10.1016/j.jsbmb.2010.01.002. [DOI] [PubMed] [Google Scholar]
- 49.Zuk O, Hechter E, Sunyaev SR, Lander ES. The mystery of missing heritability: genetic interactions create phantom heritability. Proc. Natl. Acad. Sci. U.S.A. 2012;109:1193–1198. doi: 10.1073/pnas.1119675109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Beekman M, et al. Genome-wide linkage analysis for human longevity: genetics of healthy aging study. Aging Cell. 2013;12:184–193. doi: 10.1111/acel.12039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Hemani G, et al. Inference of the genetic architecture underlying BMI and height with the use of 20,240 sibling pairs. Am. J. Hum. Genet. 2013;93:865–875. doi: 10.1016/j.ajhg.2013.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.McQueen MB, et al. The National Longitudinal Study of Adolescent to Adult Health (Add Health) sibling pairs genome-wide data. Behav. Genet. 2015;45:12–23. doi: 10.1007/s10519-014-9692-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Pollock RM. DNA–protein interactions. In: Ausubel FM, editor. Current Protocols in Molecular Biology. New Work: Wiley; 1997. pp. 12.0.1–12.11. [Google Scholar]
- 54.Century TJ, Fenichel IR, Horowitz SB. The concentrations of water, sodium and potassium in the nucleus and cytoplasm of amphibian oocytes. J. Cell Sci. 1970;7:5–13. doi: 10.1242/jcs.7.1.5. [DOI] [PubMed] [Google Scholar]
- 55.Herold C, Becker T. Genetic association analysis with FAMHAP: a major program update. Bioinforma. Oxf. Engl. 2009;25:134–136. doi: 10.1093/bioinformatics/btn581. [DOI] [PubMed] [Google Scholar]
- 56.Becker T, Knapp M. A powerful strategy to account for multiple testing in the context of haplotype analysis. Am. J. Hum. Genet. 2004;75:561–570. doi: 10.1086/424390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Knapp M, Becker T. Family-based association analysis with tightly linked markers. Hum. Hered. 2003;56:2–9. doi: 10.1159/000073727. [DOI] [PubMed] [Google Scholar]
- 58.Sham PC, Curtis D. Monte Carlo tests for associations between disease and alleles at highly polymorphic loci. Ann. Hum. Genet. 1995;59:97–105. doi: 10.1111/j.1469-1809.1995.tb01608.x. [DOI] [PubMed] [Google Scholar]
- 59.Voilley N, et al. Cloning, chromosomal localization, and physical linkage of the beta and gamma subunits (SCNN1B and SCNN1G) of the human epithelial amiloride-sensitive sodium channel. Genomics. 1995;28:560–565. doi: 10.1006/geno.1995.1188. [DOI] [PubMed] [Google Scholar]
- 60.Saxena A, et al. Gene structure of the human amiloride-sensitive epithelial sodium channel beta subunit. Biochem. Biophys. Res. Commun. 1998;252:208–213. doi: 10.1006/bbrc.1998.9625. [DOI] [PubMed] [Google Scholar]
- 61.Wang M, Herrmann CJ, Simonovic M, Szklarczyk D, von Mering C. Version 4.0 of PaxDb: protein abundance data, integrated across model organisms, tissues, and cell-lines. Proteomics. 2015;15:3163–3168. doi: 10.1002/pmic.201400441. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Supplemental Information is provided with this manuscript. Primary data will be shared with interested parties upon reasonable request.