Abstract
Genome-wide association studies have revealed common genetic variants with small effect sizes associated with diverse lymphoid cancers. Family studies have uncovered rare variants with high effect sizes. However, these variants explain only a portion of the heritability of these cancers. Some of the missing heritability may be attributable to rare variants with small effect sizes. We aim to identify rare germline variants associated with familial lymphoid cancers using exome sequencing. One case per family was selected from 39 lymphoid cancer families based on early onset of disease or rarity of subtype. Control data was from Non-Finnish Europeans in gnomAD exomes (N = 56,885) or ExAC (N = 33,370). Gene and pathway-based burden tests for rare variants were performed using TRAPD. Five putatively pathogenic germline variants were found in four genes: INTU, PEX7, EHHADH, and ASXL1. Pathway-based association tests identified the innate and adaptive immune systems, peroxisomal pathway and olfactory receptor pathway as associated with lymphoid cancers in familial cases. Our results suggest that rare inherited defects in the genes involved in immune system and peroxisomal pathway may predispose individuals to lymphoid cancers.
Introduction
Lymphoid cancers arise at various stages of B- and T-cell differentiation. There are more than 80 subtypes of lymphoid cancer [1] that can be grouped into three types: Hodgkin lymphoma, non-Hodgkin lymphoma, and other lymphoproliferative disorders [2]. Non Hodgkin lymphoid neoplasms are the 7th most common cancer in North America [3]. Lymphoid cancer types and subtypes represent heterogeneous group of diseases with different genetic mechanisms, clinical presentation and histological appearance and ages of onset [4, 5]. Lymphoid cancers are complex disorders caused by various genetic [6, 7] and environmental factors [8, 9].
Genetic factors for lymphoid cancers include common and rare variants [10]. Familial studies mostly detect rare variants with high effect sizes. Family-based sequencing studies have identified rare lymphoid cancer predisposition variants in several genes, including KDR [11], KLHDC8B [12], POT1 [13], ACAN [14], NPAT [15], and others [7]. In addition, significant linkage has been found for chronic lymphocytic leukemia (CLL), at 2q21.2, which contains the chemokine receptor (CXCR4) gene [16].
In contrast, genome wide association studies (GWAS) of cases and controls often identifies common variants with smaller effect sizes. Susceptibility genes that are involved in B-cell development, immune response [17], DNA repair [18] and immune function [19] have been associated with various subtypes of lymphoid cancers. The NHGRI-EBI GWAS catalog [20] contained 654 associations for lymphoid cancers as of October 3, 2022 with the most significant association being for B-cell acute lymphoblastic leukemia with a variant rs10821936-C (p-value: 1X10-106) in ARID5B [21]. The heritability estimate for Hodgkin lymphoma was 28.4% in a Swedish population study [22]. The heritability of non-Hodgkin lymphoma varies by subtype; for instance, for CLL it is 16–34% [17, 23] and for DLBCL it is 16% [24]. Although many susceptibility loci have been identified for lymphoid cancers, a substantial fraction of the disease heritability remains unexplained. Some of this ‘missing heritability’ could be due to rare variants with small effect sizes [25–27].
Rare variant association studies (RVAS) are used to identify low allele frequency variants with small effect sizes. In RVAS, the number of cases with rare damaging variants in each gene or group of genes (e.g., pathway) is compared to that of controls to generate one statistical test result per gene or group of genes, instead of one per variant. Various statistical tests have been developed to detect rare variant associations such as the burden test, the adaptive burden test, the variance-component test, and the omnibus test, all of which have been reviewed previously [28, 29]. Study designs and considerations for performing RVAS have also been described [29–31]. RVAS has been applied in lymphoid cancer cases with a mature B-cell non-Hodgkin lymphoma subtype, and identified a burden of rare germline variants in CHMP6 and GSTA4 [32]. Unlike GWAS, where the genome-wide significance threshold is defined at p = 5×108, there is no standard threshold for RVAS, as the number of tests performed depends on the sequencing platform used, the depth of coverage, and how variants are aggregated in the test [33].
To improve our understanding of the etiology of lymphoid cancers, we performed gene- and pathway-based RVAS to test for an aggregated burden of rare, deleterious variants in gene(s) and pathway(s), in familial lymphoid cancer cases compared to controls from public databases. Test Rare vAriants with Public Data (TRAPD) was used [34], as it allows for the use of controls from public databases such as the large exome sequence databases Genome Aggregation Database (gnomAD) [35] and ExAC [36].
Methods
Participants
The study was approved by the Research Ethics Board of BC Cancer and the University of British Columbia with Simon Fraser University as the harmonized partner board. All participants provided written informed consent.
Cases were derived from the Lymphoid Cancer Families Study (LCFS) [37]. Eligible families had two or more members, living or deceased, with a lymphoid cancer. The type and subtype of the cancers were classified according to the hierarchical classification provided by the International Lymphoma Epidemiology Consortium (InterLymph) Pathology Working Group [2]. Families (N = 39) were prioritized for exome sequencing based on the number of family members affected by lymphoid cancer and the number of germline samples available for sequencing.
Whole exome sequencing
Samples from 39 lymphoid cancer families, including 39 early onset cases and relatives, were part of a batch of 92 samples subject to whole exome sequencing, performed using an Illumina HiSeq2500 instrument in High Output mode with HCS (v2.2.68) at the Genome Sciences Centre (Vancouver, British Columbia, Canada). DNA extraction for 37 early onset cases was from blood; for two cases, saliva samples were used. Briefly, library preparation involved shearing of DNA using Covaris E-Series (Covaris, E210) to an average fragment size of 250 bp and using NEB Paired-End Sample Prep Kit (NEB, E6000B-25B; protocol as per kit version 1.1) for phosphorylation, d-A tailing, adaptor ligation, and PCR enrichment of sheared samples. Capture was performed with Agilent SureSelect v5 + UTR kit capture probes using four libraries per capture, for a total of 23 captures. The protocol was carried out according to the Agilent SureSelect Target Enrichment System for Illumina Multiplexed Sequencing. Amplification was performed using the HiSeq PE Cluster Kit v4 (Illumina, PE-401-4001) and cBot Cluster Station. Sequencing was performed using the HiSeq SBS Kit v4 (Illumina, FC-401-4003) and the Illumina HiSeq2500 System, generating 125 bp reads.
Variant calling
Reads were mapped and variants called at the Bioinformatics core facility at the Genome Sciences Centre. Paired-end reads were mapped against the human reference genome (GrCh37) using the Burrows-Wheeler Aligner (version 0.5.7) [38]. Aligned reads were filtered and sorted using Sambamba (version 0.5.52) [39] and the BAM file was generated using GATK’s (version 4.0.2.1) HaplotypeCaller [40] with the–L parameter to restrict the analysis to exome specific intervals. Variants were jointly called to produce genotypes for all variant positions in all samples, according to GATK’s best practice recommendations [41] to generate a single raw VCF file with 290,549 variants.
Variant quality filtering
The raw VCF file containing single nucleotide variants (SNV) and insertions and deletions (indels) was filtered for variant quality using Variant Quality Score Recalibration (VQSR) with truth sensitivity at 99.0% for both SNVs and indels. Variants that did not pass VQSR were removed. VQSR filtering was performed on a batch of 92 exomes for multiple projects including this study. A subset of VCF file containing only the genotype data for the early onset 39 cases was generated using bcftools (version 1.13) [42]. Workflow for the rare variant association study is summarized in Fig 1. Multiallelic sites were converted into biallelic sites with left-alignment, and normalization of variants were performed by using Bcftools [42]. Sequencing, variant calling, and filtering methods for control set GnomAD and ExAC were previously described [35, 36]. Variants that had a PASS filter status in both cases and controls were retained for RVAS.
Fig 1. Study workflow.
Estimation of ancestry
Input.bed files for ancestry estimation were generated using PLINK (version 1.9) [43, 44]. A principal component analysis (PCA) was conducted using KING (version 2.2.7) [45] with the 1000 Genomes Project dataset as reference. PCA plots were produced using the command—rplot with the e1071 package in R version 3.6.0 (S1 Fig). For gnomAD and ExAC controls, the ancestries were as described [35, 36].
Read depth filtering
Different exon capture methods were used for cases and controls. To reduce potential bias due to differences in coverage, the proportion of individuals covered at a read depth of > 10x was calculated separately for cases and controls. Bases with at least 90% of the samples covered at >10x in both cases and controls were retained for analysis. The coverage files for gnomAD exome and ExAC controls were obtained from the gnomAD consortium [46].
Annotation
Annotation was performed using Snpsift (version 5.0) [47] and Variant Effect Predictor (VEP) (version 103) [48] using the Genome Aggregation Database (gnomAD 2.2.1) [35], ExAC [36], and Combined Annotation Dependent Depletion (CADD) (version 1.6) [49]. European allele frequencies were annotated from gnomAD and ExAC databases. Variants not annotated for allele frequency in these databases were annotated manually with the dbSNPs Allele Frequency Aggregator (ALFA) European allele frequency.
Pre-burden test filtration
Variants in cases and controls were filtered to retain only those with CADD Phred score ≥ 20 and high impact variants such as splice acceptor, splice donor, stop-gain, frameshift, stop lost, start lost from VEP (version 103). Analyses were limited to variants with allele frequency ≤ 1% (rare variants) or ≤ 0.1% (very rare variants). Variant data from cases was filtered to retain only those genotypes where at least one case had an alternate allele.
Burden tests
Burden testing was performed to identify genes and pathways with a higher frequency of rare deleterious variants in cases compared to publicly available European controls from gnomAD (N = 56,885) and ExAC (N = 33,370) using the Test Rare vAriants with Public Data (TRAPD) method. TRAPD utilizes summary statistics for the variants from publicly available control data and allows for the application of both dominant and recessive models. TRAPD generates three files: a SNP file with qualifying rare variants mapped to genes, a count file with tabulated count of rare variants in the genes, and a burden file with p-value for each gene and pathway using a two-sided Fisher’s exact test.
Gene-based analysis included only genes that, after filtering, had one or more variants in at least one case. Therefore, for gene-based burden analysis, the same genes with rare damaging variants present in cases were selected from the controls using the TRAPD count file before carrying out the burden tests. In addition, a subset of these genes was chosen to include genes with two or more variant counts in cases before carrying out burden tests.
Pathway-based burden analysis was performed by aggregating genes into pathways based on three widely-used databases: Reactome [50], KEGG [51], and Biocarta. The molecular signatures database from the Broad institute [52] (http://www.broadinstitute.org/gsea/downloads.jsp) was used to download KEGG and Biocarta pathways. Reactome was downloaded from https://reactome.org/download-data. The Reactome database has low-level and high-level pathway hierarchies; here we used the low-level pathways. Genes with rare variants present in cases were selected from the controls using the TRAPD count file and grouped into pathways. The variant counts were summed to obtain a per-pathway rare variant count for pathway-based burden analysis.
Burden analysis was performed for the dominant model. The p-values generated by TRAPD were corrected for the inflation factor, a measure of population stratification. To measure the inflation factor, lambda 0.5 to 0.95 was calculated which fit a line to the points within 0.5 and 0.95 quantiles, as described previously [53]. Quantile-Quantile (QQ) plots were constructed using R (v3.6.0). Multiple testing corrections were performed based on the number of genes analysed at allele frequencies ≤1% and ≤0.1%. Genes or pathways were considered to be associated with lymphoid cancer if the p-value after correction for the inflation factor and multiple testing was ≤ 0.05. For the genes and pathways with significant associations, variants were visually inspected using the Integrative Genomics Viewer (version 2.4) [54] and excluded if deemed an artifact. Variants are deemed as artifacts if they show strand bias or allele frequency inconsistent with the heterozygous variant, have low base-quality of the alternate allele in the reads or low mapping quality.
Results
Exome sequencing included an early onset case from each lymphoid cancer family (N = 39). The types/subtypes of the 39 cases are shown in Table 1. Estimation of ethnicity resulted in 38 cases of European ancestry that were included in rare variant burden analyses, and one case of South Asian ancestry (S1 Fig) that was excluded from further analysis.
Table 1. Lymphoid cancer families included in exome sequencing and cases chosen for RVAS.
| Family ID | Number of family members with a lymphoid cancer | Chosen Case | |||
|---|---|---|---|---|---|
| Age of onset | Sex | Lymphoid cancer type | Lymphoid cancer subtype | ||
| 10 | 2 | 24 | Female | Hodgkin lymphoma | Nodular sclerosis |
| 12+ | 3 | 35 | Male | Hodgkin lymphoma | Nodular sclerosis |
| 14 | 4 | 52 | Female | non-Hodgkin lymphoma | Mucosa-associated lymphoid tissue lymphoma |
| 19 | 4 | 29 | Male | Hodgkin lymphoma | Nodular sclerosis |
| 32 | 2 | 34 | Male | non-Hodgkin lymphoma | Chronic lymphocytic leukemia |
| 54 | 3 | 48 | Female | Non-Hodgkin lymphoma | Follicular lymphoma |
| 59 | 3 | 73 | Male | Non-Hodgkin lymphoma | Lymphoplasmacytic lymphoma |
| 66 | 3 | 41 | Male | Non-Hodgkin lymphoma | Follicular lymphoma |
| 68 | 2 | 46 | Male | Non-Hodgkin lymphoma | Follicular lymphoma |
| 69 | 3 | 46 | Male | Non-Hodgkin lymphoma | Follicular lymphoma |
| 71 | 3 | 50 | Male | Non-Hodgkin lymphoma | Diffuse large B-cell lymphoma |
| 76 | 4 | 30 | Female | Hodgkin lymphoma | Nodular sclerosis |
| 89 | 4 | 52 | Male | Non-Hodgkin lymphoma | Chronic lymphocytic leukemia |
| 100 | 2 | 62 | Male | Non-Hodgkin lymphoma | Chronic lymphocytic leukemia |
| 108 | 2 | 75 | Female | Non-Hodgkin lymphoma | Chronic lymphocytic leukemia |
| 110 | 2 | 67 | Female | Non-Hodgkin lymphoma | Chronic lymphocytic leukemia |
| 126 | 3 | 35 | Female | Hodgkin lymphoma | Nodular sclerosis |
| 129 | 2 | 9 | Male | Non-Hodgkin lymphoma | Burkitt lymphoma |
| 133+ | 5 | 19 | Male | Hodgkin lymphoma | Nodular lymphocyte predominant Hodgkin lymphoma |
| 134 | 2 | 34 | Male | Non-Hodgkin lymphoma | Diffuse large B-cell lymphoma |
| 154 | 6 | 58 | Female | Lymphoid neoplasm, not other specified | Lymphoid neoplasm, not otherwise specified |
| 157 | 2 | 58 | Male | Non-Hodgkin lymphoma | Chronic lymphocytic leukemia |
| 162 | 2 | 48 | Female | Non-Hodgkin lymphoma | Chronic lymphocytic leukemia |
| 173 | 3 | 29 | Female | Hodgkin lymphoma | Nodular sclerosis |
| 175* | 2 | 40 | Male | Non-Hodgkin lymphoma | Diffuse large B-cell lymphoma |
| 189 | 5 | 57 | Female | Non-Hodgkin lymphoma | Follicular lymphoma |
| 195 | 2 | 46 | Male | Non-Hodgkin lymphoma | Diffuse large B-cell lymphoma |
| 197 | 3 | 47 | Male | Non-Hodgkin lymphoma | Follicular lymphoma |
| 213 | 2 | 21 | Female | Hodgkin lymphoma | Nodular sclerosis |
| 226 | 3 | 49 | Female | Non-Hodgkin lymphoma | Follicular lymphoma |
| 229 | 2 | 45 | Male | Non-Hodgkin lymphoma | Chronic lymphocytic leukemia |
| 238 | 4 | 78 | Female | Non-Hodgkin lymphoma | Follicular lymphoma |
| 255 | 2 | 27 | Male | Non-Hodgkin lymphoma | Primary mediastinal large B-cell lymphoma |
| 1705 | 8 | 36 | Female | Lymphoid neoplasm, not other specified | Lymphoid neoplasm, not other specified |
| 4541 | 3 | 22 | Male | Hodgkin lymphoma | Hodgkin Lymphoma, not other specified |
| 35 | 2 | 50 | Female | Non-Hodgkin lymphoma | Chronic lymphocytic leukemia |
| 113 | 3 | 65 | Female | Non-Hodgkin lymphoma | Plasma cell myeloma |
| 204 | 2 | 58 | Male | Non-Hodgkin lymphoma | Small lymphocytic lymphoma |
| 3474 | 4 | 68 | Female | Lymphoid neoplasm, not other specified | Lymphoid neoplasm, not other specified |
* Excluded after ethnicity analysis.
+ DNA sample was extracted from Saliva.
ExAC (N = 33,370) and GnomAD exomes (N = 56,885) with Non-Finnish European ethnicity were used as controls. Read depth filtering to exclude any bases that did not have read depth 10 or more, in at least 90% of cases and 90% of a control set, resulted in slightly different sets of variants being included in the analyses with GnomAD controls vs. those with ExAC controls. After read depth filtering using gnomAD controls, 102,385 variants remained, whereas 103,427 variants remained when controls were from ExAC. After filtering to retain high impact rare deleterious variants using an allele frequency cut-off of ≤1%, there were 467 rare variants with GnomAD as controls, and 400 rare variants with ExAC as controls, whereas at an allele frequency cut-off of ≤0.1%, there were 294 rare variants with GnomAD as controls and 276 rare variants with ExAC as controls.
Gene-based burden analysis
Burden tests were performed at the gene and pathway levels to identify if there was a higher burden of rare variants in lymphoid cancer cases than in controls from GnomAD exomes and ExAC. First, we analyzed genes with one or more variant counts in cases. There were 404 and 258 genes containing deleterious variants with allele frequencies ≤1% and ≤0.1% analysed when the controls were GnomAD exomes, and 355 and 210 genes at allele frequencies ≤1% and ≤0.1% when the control set was ExAC. There were no significant associations between lymphoid cancer and any individual gene, with either control set or at either allele frequency, after correcting for inflation and for multiple testing. The inflation factors for gene-based analysis for both the controls and allele frequency are summarised in S1 Table.
We then limited analysis to genes with two or more variant counts present in any of the 39 cases. At allele frequency ≤1%, 43 and 39 genes were analysed when the controls were GnomAD exomes and ExAC, respectively. Statistical power for the cases and controls when two or more variants were analysed using genpwr [55] are summarized in S2 Table. Lymphoid cancer associated genes and variants are summarized in Table 2. INTU was the only gene associated with lymphoid cancer after multiple testing correction for 43 genes with GnomAD exomes and 39 genes with ExAC controls. For allele frequency ≤0.1%, 7 and 5 genes were analysed when the controls were GnomAD exomes and ExAC, respectively. Four genes, INTU (p-value 0.005), ASXL1 (p-value 0.009), EHHADH (p-value 0.03) and PEX7 (p-value 0.02), were associated with lymphoid cancer after multiple testing correction for 7 genes when GnomAD exomes were used as controls. Three genes, INTU (p-value 0.004), PEX7 (p-value 0.02), and EHHADH (p-value 0.02) were associated with lymphoid cancers after multiple testing correction for 6 genes with ExAC controls. Data underlying the results of burden testing are summarized in S3 Table.
Table 2. Five rare variants in four genes significantly associated with lymphoid cancers in genes with two or more variant counts.
| Family ID | rsid | Gene | HGVS | AF1 | Conse-quence | Allele frequency ≤1% | Allele frequency ≤0.1% | Case | Family members | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GnomAD controls p-value* | ExAC controls p-value* | GnomAD controls p-value* | ExAC controls p-value* | Diag-nosis | Age of onset | Sex | Number of other family members exome seq’d | Relation-ship | Diag-nosis | Carries variant? | ||||||
| 32 | rs781090244 | EHHADH | NM_001966.4:c.594_595del | 0.00013 | Splice Acceptor | 0.03 | 0.02 | CLL | 34 | Male | 1 | Sister | FOLL | No | ||
| 134 | DLBCL | 34 | Male | 1 | Mother | CLL | No | |||||||||
| 129 | rs1347837845 | INTU | NM_015693.4:c.466C>T | 0 | Stop-gain | 0.04 | 0.05 | 0.005 | 0.004 | BL | 9 | Male | 1 | Sister | NS | No |
| 100 | rs1450337930 | INTU | NM_015693.4:c.581G>A | 0 | Stop-gain | CLL | 62 | Male | 1 | Mother | CLL | Yes | ||||
| 76 | rs1805137 | PEX7 | NM_000288.4:c.875T>A | 0.0007 | Stop-gain | 0.02 | 0.02 | NS | 30 | Female | 3 | Mother | Skin (BCC) | Yes | ||
| Father | LPL | No | ||||||||||||||
| Maternal cousin | NMZL | No | ||||||||||||||
| 129 | BL | 9 | Male | 1 | Sister | NS | Yes | |||||||||
| 59 | rs750318549 | ASXL1 | NM_015338.6:c.1934dup | 0.0005 | Frame-shift | 0.009 | LPL | 73 | Male | 1 | Sister | CLL | No | |||
| 35 | CLL | 50 | Female | 0 | -- | -- | No | |||||||||
HGVS = Human Genome Variation Society. CLL = chronic lymphocytic leukemia, BL = Burkitt lymphoma, NS = Nodular Sclerosis, DLBCL = diffuse large B-cell lymphoma, LPL = Lymphoplasmacytic lymphoma, BCC = Basal cell carcinoma, NMZL = Nodal marginal zone lymphoma, FOLL = Follicular lymphoid cancer.
1 allele frequency from gnomAD, ExAC, 1000genomes or ALFA databases
* p-values shown here are after correcting for the inflation factor and multiple testing, when necessary
Examination of variants of interest in family members
For cases with rare variants in genes associated with lymphoid cancer, we looked at whether any affected relatives also exome sequenced carried the variant or not. INTU has two stop-gain variants of interest: NM_015693.4:c.466C>T (rs1347837845) and NM_015693.4:c.581G>A (rs1450337930), these novel variants were not reported in Clinvar. NM_015693.4:c.466C>T was found in a case from family 129 with Burkitt lymphoma at the age of 9. This participant had one family member (a sister) diagnosed with nodular sclerosis, not otherwise specified, at age 34; the sister does not carry the NM_015693.4:c.466C>T variant. Given that the sibling does not share the NM_015693.4:c.466C>T variant in INTU, it is likely that additional genetic factors might contribute to familial lymphoid cancer in this family. NM_015693.4:c.581G>A was observed in an individual with CLL at the age of 62 from family 100; his/her mother, who was diagnosed with CLL at the age of 71, also carried this variant. As both the parent and offspring with the same lymphoma subtype carry the NM_015693.4:c.581G>A variant, it is plausible that this variant could be involved in susceptibility to lymphoid cancer in this family.
A stop-gain variant NM_000288.4:c.875T>A (rs1805137) in PEX7, which is indicated in Clinvar as pathogenic, was observed in two cases from different families; one case had nodular sclerosis at age 30, while the other had Burkitt lymphoma at age 9. The case with nodular sclerosis had two additional family members affected with lymphoid cancer; a father and a maternal cousin, as well as a mother without lymphoid cancer but with basal cell carcinoma (BCC), who were exome sequenced. The father developed lymphoplasmacytic lymphoma at 61 years, and the maternal cousin developed nodal marginal zone lymphoid cancer at 28 years; they did not carry the NM_000288.4:c.875T>A variant. The mother, who developed BCC at 45, carried the variant. Given that the mother who is not affected with lymphoid cancer had the variant in PEX7, but the affected father and the cousin did not, the NM_000288.4:c.875T>A variant seems unlikely to contribute to lymphoid cancer in this family. The case with Burkitt lymphoma had one additional exome sequenced family member, a sister diagnosed with nodular sclerosis at the age of 34. She carries the NM_000288.4:c.875T>A variant. The case who developed Burkitt lymphoma had variation in both the PEX7 and INTU and developed cancer at the age of 9. The shared NM_000288.4:c.875T>A variant amongst the siblings implies that the gene might contribute to the susceptibility of lymphoid cancer in this family.
A frameshift variant, NM_015338.6:c.1934dup (rs750318549) in ASXL1, was observed in two cases, one with lymphoplasmacytic lymphoma at the age of 73 and one with CLL at the age of 50. This variant is also reported in Clinvar as pathogenic/likely pathogenic. The case with lymphoplasmacytic lymphoma had a sister sequenced who developed CLL at the age of 78 but did not carry the NM_015338.6:c.1934dup variant. For the case with CLL, no additional family members were sequenced.
A splice acceptor deletion NM_001966.4:c.594_595del (rs781090244) in EHHADH was observed in two unrelated familial lymphoid cancer cases, one diagnosed with CLL at the age of 34 and one with DLBCL at the age of 34. This variant has unknown significance in Clinvar. For the case with DLBCL, a family member (mother) was exome sequenced who developed CLL at 72 and did not carry the NM_001966.4:c.594_595del variant. The other case with CLL also had a family member (sister) who was exome sequenced. She developed Follicular Lymphoma at 44 and did not carry the NM_001966.4:c.594_595del variant. Since affected family members of both cases did not have the same variant in ASXL1 or EHHADH, it is plausible that other factors contribute to familial lymphoid cancer in these families.
Pathway-based burden analysis
We obtained 1761, 186, and 292 pathways from the Reactome, KEGG, and Biocarta databases, respectively. Gene and variant counts generated in the gene-based burden analysis were collapsed into pathways. The variant allele frequency thresholds of ≤1% and ≤0.1% were applied while aggregating the variant counts and genes to pathways. P-values were corrected for genomic inflation when necessary. The inflation factors for pathway-based analysis for both the controls and allele frequency are summarised in S1 Table. For variants with an allele frequency of ≤1% with GnomAD exomes as controls, a total of 483 pathways were identified: 333 Reactome, 106 KEGG, and 44 Biocarta pathways. Twelve pathways (10 Reactome and 2 KEGG) showed significant association after corrections (Table 3). With ExAC controls, 308 Reactome, 94 KEGG, and 34 Biocarta pathways were detected, for a total of 436 pathways. Two KEGG pathways were significantly associated with lymphoid cancer after corrections (Table 3).
Table 3. Pathway-based burden test for variants with allele frequency ≤1% and ≤0.1%, with GnomAD exomes and ExAC as controls.
| Pathway database | Pathways | Allele frequency ≤1% | Allele frequency ≤0.1% | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Variant count in cases | Variant count in Controls (GnomAD) | P-value* | Variant count in cases | Variant count in Controls (ExAC) | P-value* | Variant count in cases | Variant count in Controls (GnomAD) | P-value* | Variant count in cases | Variant count in Controls (ExAC) | P-value* | ||
| KEGG | Olfactory transduction | 15 | 4772 | 0.0001 | 14 | 2564 | 0.0003 | 6 | 349 | 8.9E-05 | 6 | 283 | 0.0004 |
| KEGG | Peroxisome | 6 | 609 | 0.0026 | 6 | 316 | 0.001 | 6 | 609 | 0.0022 | 6 | 315 | 0.0007 |
| KEGG | Purine metabolism | 4 | 274 | 0.02 | 4 | 151 | 0.01 | ||||||
| Reactome | Post-translational protein phosphorylation | 3 | 116 | 0.05 | 3 | 71 | 0.04 | ||||||
| Reactome | Regulation of Insulin-like Growth Factor (IGF) transport and uptake by Insulin-like Growth Factor Binding Proteins (IGFBPs) | 3 | 116 | 0.05 | 3 | 71 | 0.04 | ||||||
| Reactome | FCERI mediated Ca+2 mobilization | 4 | 90 | 0.0004 | 4 | 90 | 0.0003 | ||||||
| Reactome | FCERI mediated MAPK activation | 4 | 90 | 0.0004 | 4 | 90 | 0.0003 | ||||||
| Reactome | Antigen activates B Cell Receptor (BCR) leading to generation of second messengers | 4 | 110 | 0.0008 | 4 | 110 | 0.0007 | ||||||
| Reactome | FCGR activation | 3 | 77 | 0.0173 | 3 | 77 | 0.01 | ||||||
| Reactome | FCGR3A-mediated IL10 synthesis | 3 | 77 | 0.0173 | 3 | 77 | 0.01 | ||||||
| Reactome | FCGR3A-mediated phagocytosis | 3 | 77 | 0.0173 | 3 | 77 | 0.01 | ||||||
| Reactome | Regulation of actin dynamics for phagocytic cup formation | 3 | 77 | 0.0173 | 3 | 77 | 0.01 | ||||||
| Reactome | Role of LAT2/NTAL/LAB on calcium mobilization | 3 | 77 | 0.0173 | 3 | 77 | 0.01 | ||||||
| Reactome | Role of phospholipids in phagocytosis | 3 | 77 | 0.0173 | 3 | 77 | 0.01 | ||||||
| Reactome | DAP12 signaling | 2 | 15 | 0.0483 | 2 | 15 | 0.04 | ||||||
| KEGG | Tryptophan metabolism | 4 | 266 | 0.02 | |||||||||
| KEGG | Retinol metabolism | 4 | 275 | 0.02 | |||||||||
| KEGG | Drug metabolism cytochrome p450 | 4 | 310 | 0.04 | |||||||||
To test for associations between lymphoid cancers and rare variants with allele frequency ≤0.1%, 346 pathways were analyzed with GnomAD exomes as controls. These included 231 Reactome, 90 KEGG, and 25 Biocarta pathways. Eighteen pathways (12 Reactome and 6 KEGG) showed a significant association with lymphoid cancer (Table 3). There were 284 pathways, with 197 from the Reactome database, 71 from KEGG, and 16 from Biocarta, when the controls were from ExAC. Five pathways showed significant association with lymphoid cancer, of which four were from Reactome and three were from KEGG (Table 3).
To identify which genes in the lymphoid cancer associated pathways had rare variants, we summarized the count of rare variants aggregated by genes in cases and controls (Table 4). The 18 pathways (6 from KEGG and 12 from Reactome) associated with lymphoid cancers included 34 genes that had rare variant counts. One of the pathways associated with lymphoid cancer is the antigen activated B cell receptor, which was significant with a corrected p-value for multiple testing and inflation factor of 0.0008 and 0.0007 for allele frequency ≤0.1% and ≤1% respectively, with gnomAD controls. This pathway was then collapsed into high-level Reactome pathway hierarchies for the adaptive immune system. The remaining 11 pathways from Reactome were grouped into three high-level pathways. One of these high-level pathways is for the innate immune systems comprising seven low-level hierarchy pathways. Five genes with rare variants, SYK, IGLV3-25, IGLV7-43, BLK, and GRAP2, are included in these adaptive and innate immune system pathways. The five variants, one in each gene, are present in 5 different lymphoid cancer cases. The pathways of olfactory transduction and peroxisome from KEGG were also associated with the lymphoid cancer across both the allele frequencies and control sets. Data underlying these findings are summarized in S4 Table.
Table 4. Genes and variant counts for pathways significantly associated with familial lymphoid cancer cases.
| Genes | GnomAD controls | ExAC controls | Associated Pathways | Higher level hierarchy | Database | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| AF ≤1% | AF ≤0.1% | AF ≤1% | AF ≤0.1% | ||||||||
| Cases | Controls | Cases | Controls | Cases | Controls | Cases | Controls | ||||
| BLK | 1 | 33 | 1 | 33 | Antigen activates B Cell Receptor (BCR) leading to generation of second messengers | Adaptive Immune System | Reactome | ||||
| IGLV3-25 | 1 | 27 | 1 | 27 | |||||||
| IGLV7-43 | 1 | 48 | 1 | 48 | |||||||
| SYK | 1 | 2 | 1 | 2 | |||||||
| GRAP2 | 1 | 13 | 1 | 13 | DAP12 signaling | Innate Immune System | |||||
| SYK | 1 | 2 | 1 | 2 | |||||||
| GRAP2 | 1 | 13 | 1 | 13 | FCERI mediated Ca+2 mobilization | ||||||
| IGLV3-25 | 1 | 27 | 1 | 27 | |||||||
| IGLV7-43 | 1 | 48 | 1 | 48 | |||||||
| SYK | 1 | 2 | 1 | 2 | |||||||
| GRAP2 | 1 | 13 | 1 | 13 | FCERI mediated MAPK activation | ||||||
| IGLV3-25 | 1 | 27 | 1 | 27 | |||||||
| IGLV7-43 | 1 | 48 | 1 | 48 | |||||||
| SYK | 1 | 2 | 1 | 2 | |||||||
| IGLV3-25 | 1 | 27 | 1 | 27 | FCGR activation | ||||||
| IGLV7-43 | 1 | 48 | 1 | 48 | |||||||
| SYK | 1 | 2 | 1 | 2 | |||||||
| IGLV3-25 | 1 | 27 | 1 | 27 | Regulation of actin dynamics for phagocytic cup formation | ||||||
| IGLV7-43 | 1 | 48 | 1 | 48 | |||||||
| SYK | 1 | 2 | 1 | 2 | |||||||
| IGLV3-25 | 1 | 27 | 1 | 27 | Role of LAT2/NTAL/LAB on calcium mobilization | ||||||
| IGLV7-43 | 1 | 48 | 1 | 48 | |||||||
| SYK | 1 | 2 | 1 | 2 | |||||||
| IGLV3-25 | 1 | 27 | 1 | 27 | Role of phospholipids in phagocytosis | ||||||
| IGLV7-43 | 1 | 48 | 1 | 48 | |||||||
| SYK | 1 | 2 | 1 | 2 | |||||||
| IGLV3-25 | 1 | 27 | 1 | 27 | FCGR3A-mediated IL10 synthesis | Leishmania infection | |||||
| IGLV7-43 | 1 | 48 | 1 | 48 | |||||||
| SYK | 1 | 2 | 1 | 2 | |||||||
| IGLV3-25 | 1 | 27 | 1 | 27 | FCGR3A-mediated phagocytosis | ||||||
| IGLV7-43 | 1 | 48 | 1 | 48 | |||||||
| SYK | 1 | 2 | 1 | 2 | |||||||
| AMBN | 1 | 53 | 1 | 24 | Post-translational protein phosphorylation | Metabolism of proteins | |||||
| CP | 1 | 34 | 1 | 22 | |||||||
| PENK | 1 | 29 | 1 | 25 | |||||||
| AMBN | 1 | 53 | 1 | 24 | Regulation of Insulin-like Growth Factor (IGF) transport and uptake by Insulin-like Growth Factor Binding Proteins (IGFBPs) | ||||||
| CP | 1 | 34 | 1 | 22 | |||||||
| PENK | 1 | 29 | 1 | 25 | |||||||
| AOX1 | 1 | 85 | Drug metabolism cytochrome p450 | KEGG | |||||||
| CYP2B6 | 1 | 45 | |||||||||
| FMO3 | 1 | 113 | |||||||||
| GSTK1 | 1 | 67 | |||||||||
| OR1N2 | 1 | 41 | 1 | 41 | 1 | 58 | 1 | 58 | Olfactory transduction | ||
| OR2AG2 | 1 | 61 | 1 | 61 | 1 | 39 | 1 | 39 | |||
| OR52B2 | 1 | 54 | 1 | 54 | 1 | 29 | 1 | 29 | |||
| OR52M1 | 1 | 488 | 1 | 119 | 1 | 287 | 1 | 54 | |||
| OR6V1 | 2 | 245 | 1 | 22 | 1 | 52 | 1 | 52 | |||
| OR6C6 | 1 | 52 | -- | -- | 1 | 51 | 1 | 51 | |||
| OR6C4 | 1 | 138 | 1 | 52 | 1 | 32 | -- | -- | |||
| GUCA1C | 2 | 937 | -- | -- | 2 | 425 | -- | -- | |||
| OR10C1 | 2 | 664 | -- | -- | 2 | 388 | -- | -- | |||
| OR51E1 | 1 | 164 | -- | -- | 1 | 109 | -- | -- | |||
| OR51G1 | 1 | 1458 | -- | -- | 1 | 835 | -- | -- | |||
| OR6C70 | 1 | 470 | -- | -- | 1 | 259 | -- | -- | |||
| CROT | 1 | 317 | 1 | 317 | 1 | 160 | 1 | 159 | Peroxisome | ||
| GSTK1 | 1 | 67 | 1 | 67 | 1 | 51 | 1 | 51 | |||
| EHHADH | 2 | 119 | 1 | 119 | 2 | 58 | 2 | 58 | |||
| PEX7 | 2 | 106 | 1 | 106 | 2 | 47 | 2 | 47 | |||
| AMPD1 | 1 | 52 | 1 | 25 | Purine metabolism | ||||||
| ATIC | 1 | 73 | 1 | 47 | |||||||
| NUDT2 | 1 | 34 | 1 | 14 | |||||||
| PDE11A | 1 | 115 | 1 | 65 | |||||||
| CYP1A1 | 1 | 62 | Retinol metabolism | ||||||||
| CYP2B6 | 1 | 45 | |||||||||
| DGAT2 | 1 | 116 | |||||||||
| RPE65 | 1 | 52 | |||||||||
| AOX1 | 1 | 85 | Tryptophan metabolism | ||||||||
| CYP1A1 | 1 | 62 | |||||||||
| EHHADH | 2 | 119 | |||||||||
Discussion
We performed gene- and pathway-based RVAS using lymphoid cancer cases with a strong family history and early cancer onset. Choosing cases with a strong family history and early onset is expected to enrich for cases with a genetic component to their disease. These families often have members with different lymphoid cancer types and subtypes. We suspect that some genes could predispose to more than one type of lymphoid cancer in such families. Previous rare variant association studies have analysed only non-Hodgkin lymphoid cancers collectively [32]. The genes EHHADH, INTU, PEX7, and ASXL1 were found to have a significantly higher burden of deleterious variants in cases than controls; variants in these genes may contribute to risk of lymphoid cancers in specific individuals. However, the variants of interest segregated with lymphoid cancer in other affected members of the same family, in only two families, a variant in INTU in family 100 and a variant in PEX7 in Family 129.
PEX7 encodes the peroxin 7 protein required for the peroxisomal importation of proteins. To our knowledge, the gene has not been previously implicated in lymphoid neoplasms, but peroxisomal proteins may have a role in the regulation of innate and adaptive immune cells by controlling the synthesis and breakdown of immune bioactive metabolites such as reactive oxygen species and reactive nitrogen species [56, 57]. Drosophila Schneider 2 (S2) macrophage-like cells with PEX7 and PEX5 knocked down by RNA interference fail to perform phagocytosis, suggesting that compromised peroxisome function within macrophages may possibly reduce the immune system’s ability to eliminate cancer cells [58].
INTU is involved in Hedgehog signalling, a pathway activated in various cancers, including basal cell carcinoma, medulloblastoma and hematological malignancies [59, 60].
Variants in EHHADH and AXSL1 did not segregate with lymphoid cancer in the families studied, which implies that the variants do not play a role in all affected members of these families, it does not rule out their making a contribution to cancer predisposition in the original case. EHHADH encodes a peroxisome enzyme that oxidises medium-chain and large-chain fatty dicarboxylic acids and has delta 3, delta 2-enoyl-CoA isomerase activity [61]. It has been suggested that EHHADH affects T-cell function in tumor infiltration, affecting immune surveillance in cancer [62]. The EHHADH variant NM_001966.4:c.594_595del is reported in Clinvar as a variant of unknown significance. ASXL1 plays a role in chromatin remodelling and is a tumor suppressor gene [63]. Several ASXL1 somatic mutations have been described in CLL [64, 65]. The NM_015338.6:c.1934dup (rs750318549) variant observed in ASXL1 is a known mutation found in >50% of myeloid malignancies [66]. Germline variants in ASXL1 must be interpreted with caution, particularly in older individuals, as variants in this gene observed in blood DNA could be of somatic in nature [67, 68]. A candidate gene-based study by Hamadou et al. (2016) identified a c.1205 G>A germline missense substitution in ASXL1 in two sisters, one with large B-cell lymphoma and the other with mucosa-associated lymphoid tissue lymphoma [69]. The c.1205 G>A missense variant is different than the one identified here, but the affected family members in Hamadou et al. (2016) and our study have different lymphoid cancer subtypes; this may imply a role for different ASXL1 variants in cases with different lymphoid cancer subtypes.
Pathway-based burden analysis uses pathways from KEGG, Reactome, and Biocarta. These databases are generated mainly as an independent effort by experts, with each database having specific criteria for annotation and speed of curation; there are redundant pathways amongst the databases which would make the multiple testing correction overly conservative. The Reactome pathways of innate and adaptive immune systems showed association with lymphoid cancers. Previous studies have demonstrated the involvement of variants in immune system related genes in different subtypes of lymphoid cancer [70–72]. These studies, combined with our results, support that genetic variation in immune-related genes plays a role in the development of lymphoid neoplasms. SYK, IGLV3-25, and IGLV7-43 are involved in the innate and adaptive immune pathways defined in Reactome. SYK is a spleen tyrosine kinase activated by binding to the phosphorylated Immunoreceptor tyrosine- based activation motif, triggering a downstream signalling cascade regulated by its tyrosine phosphorylation [73]. This gene plays a role in cell adhesion, innate pathogen recognition, platelet activation, and vascular development [74]. SKY amplification has been identified as a genetic alteration in primary BCR-type DLBCL tumours [75]. IGLV3-25 and IGLV7-43 are immunoglobin light chain variable regions involved in antigen recognition. Cells that express a point mutation that changes amino acid position 110 from glycine to arginine in IGLV3-21 trigger autonomous B-cell receptor (BCR) signalling by BCR-BCR interaction and confer aggressive behaviour in CLL [76]. The pattern of somatic mutation in sporadic CLL for IGLV3-21*04 and R110 is similar to that of familial cases of CLL [77]. BLK, a B-lymphoid tyrosine kinase involved in the adaptive pathway, is a proto-oncogene and a member of the SRC family kinases involved in B-cell development [78]. This gene has been identified in a family of two members affected with Hodgkin lymphoma [79]. The case with the variant in the proto-oncogene BLK also had a variation in AXSL1, a tumour suppressor gene. GRAP2 is a tumour suppressor gene involved in T cell signalling, development and regulate NF-AT activation [80]. GRAP2 has been implicated in extranodal NK/T cell lymphoma, a rare and aggressive subtype of T cell or NK lineage extranodal lymphoma [81]. GARP2 is a specific marker of T lineage differentiation; it was interesting to observe a variant in this gene in a case affected with CLL, suggesting a role of GRAP2 in B cell lymphoma as well. These genes would not have been detected if pathway-based burden analysis had not been performed.
The KEGG pathway that was associated, for both ≤1% and ≤0.1% allele frequencies and both control groups, was olfactory transduction; this pathway had the highest number of genes, 12, with rare variants. Although there is no previously known involvement of olfactory genes in lymphoid cancers, these genes have been studied in other neoplasms such as prostate cancer; for instance the activation of OR51E2 facilitates the cellular transformation of the disease into a more aggressive form [82]. Olfactory gene OR2B6 is expressed in over 80% of breast carcinoma tissues and its expression has been suggested as a potential biomarker in breast carcinoma [83]. It is also not implausible that this pathway could influence the sense of smell of individuals and contribute to avoidance of exposure to some carcinogens, thus affecting cancer risk. Olfactory receptor genes are a large gene family with over 900 genes and pseudogenes [84], which can complicate the mapping of sequenced reads and lead to erroneous genotypes and potentially spurious association. Though all variants of interest in this study were individually verified by examining them in IGV, this may not be sufficient to eliminate this type of error.
This study has three strengths: first, it uses cases with early onset and a strong family history of lymphoid cancers; it thus enriches for cases that are likely to have a stronger genetic component to their disease. Second, it enables investigation of variants of interest in other affected family members. Third, we assess rare variant associations at both the gene- and pathway-level, improving the possibility of making meaningful inferences regarding the etiology of lymphoid cancers, which might not have been possible by analysing individual genes.
This work also has several limitations. The use of public control datasets can give rise to issues due to differences in population structure and sequencing methods, including differences in depth of sequencing and in data processing. Some of these potential issues were limited; however, by restricting analyses to variants with ≥10x coverage in both controls and cases, as suggested by Guo et al. [34].
A second limitation is calculation of the inflation factor lambda. The inflation factor lambda reflects the inflation in the test statistic due to ancestry; not correcting for this factor can lead to false associations. TRAPD calculates the observed and expected -log10(p values) of lambda at the 95th percentile with the highest expected -log10 (p value) for genes with p-value 1.0; however, in our study we only included genes with rare variants present in the cases, making it unfeasible to use the TRAPD inflation factor calculation. Instead, the inflation factor lambda0.5–0.95 was calculated as proposed by Chen et al., 2021 for burden analysis using public datasets as controls; this method excludes inflated value at the extremes of the plot [53]. The inflation factor in some instances was >1, indicating potential confounding by genetic background; this was offset by correcting for the inflation factor by multiplying it with the p-value. Yang et al. (2011) showed substantial inflation in the context of polygenic inheritance, even when data was controlled for population structure and technical artifacts [85]. The inflation >1 observed in lymphoid neoplasms might also be due to polygenic inheritance.
The success of rare variant association analyses depends on various factors, two of which are statistical power and a suitable method for the association study. RVAS suffers from low statistical power due to the rarity of the variants being tested. Attempts to improve statistical power in RVAS are made by increasing the sample sizes or collapsing rare variants into gene annotation categories and biological functions. In this RVAS of lymphoid cancers, we considered genes with two or more variants, as done by others [86, 87], and also carried out pathway-based analyses. These analyses reduce the number of tests, thereby preserving power. For disease-causing genes in lymphoid cancers, we expect that a majority of rare causal variants will increase disease risk. Given that burden analysis assumes that all the variants influence the phenotype in the same direction [29], the selection of this test seems appropriate for our analyses.
Conclusion
RVAS was performed using lymphoid cancer cases with a strong family history and early disease onset. Cases with early–onset and family history are more likely to have a genetic basis than later onset cases. Publicly available data sets with large sample sizes were used as controls to maximize statistical power of burden analysis. Nevertheless, the case sample size was small and therefore statistical power was limited; large studies are necessary to substantiate these findings. We identified putative pathogenic germline variants in four genes, and biological pathways related to immune systems, as possible contributors to lymphoid cancer risk. Stop-gain variant NM_015693.4:c.581G>A in INTU in the Hedgehog signalling pathway and stop-gain variant NM_000288.4:c.875T>A in PEX7 in peroxisomal pathway each segregated with lymphoid cancers in families, suggesting that these variants in these genes may be related to risk of lymphoid cancer.
Supporting information
Thirty-eight cases with European ethnicity were selected for the rare variant association study while one case of non-European ancestry was removed.
(TIFF)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
Acknowledgments
We thank the families and their family members for participating in the Lymphoid Cancer Families study. We thank the Canada’s Michael Smith Genome Sciences Centre (GSC) at BC cancer Sequencing and Bioinformatics platforms for the expert generation of high-quality data. We would also like to thank Martin Krzywinski, Genome Sciences Centre, for his help with the workflow figure design.
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
The Canadian Institutes for Health Research (MOP-130311) and the BC Cancer Foundation Research Sustainment Fund, provided to Dr. Angela Brooks-Wilson, funded this work. PhD student Sneha Ralli was also supported by Graduate Fellowships from Simon Fraser University, and through an NSERC-CREATE grant for Bioinformatics training held jointly by Simon Fraser University and the University of British Columbia.
References
- 1.De Leval L, Jaffe ES. Lymphoma classification. Cancer J (United States). 2020;26(3):176–85. doi: 10.1097/PPO.0000000000000451 [DOI] [PubMed] [Google Scholar]
- 2.Turner JJ, Morton LM, Linet MS, Clarke CA, Kadin ME, Vajdic CM, et al. InterLymph hierarchical classification of lymphoid neoplasms for epidemiologic research based on the WHO classification (2008): Update and future directions. Blood. 2010;116(20):90–8. doi: 10.1182/blood-2010-06-289561 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Ferlay J, Ervik M LFCM, Mery L, Piñeros M, Znaor A, Soerjomataram I, et al. Global cancer observatory: cancer today. [Internet]. Lyon, France: international agency for research on cancer. 2018. [cited 2021 Dec 22]. p. 1–6. Available from: https://gco.iarc.fr/today [Google Scholar]
- 4.Weniger MA, Küppers R. Molecular biology of Hodgkin lymphoma. Leukemia [Internet]. 2021;35(4):968–81. Available from: doi: 10.1038/s41375-021-01204-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Pratap S, Scordino TS. Molecular and cellular genetics of non-Hodgkin lymphoma: Diagnostic and prognostic implications. Exp Mol Pathol [Internet]. 2019;106(November 2018):44–51. Available from: doi: 10.1016/j.yexmp.2018.11.008 [DOI] [PubMed] [Google Scholar]
- 6.Skibola CF, Curry JD, Nieters A. Genetic susceptibility to lymphoma. Haematologica. 2007;92(7):960–9. doi: 10.3324/haematol.11011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Szmyd B, Mlynarski W, Pastorczak A. Genetic predisposition to lymphomas: Overview of rare syndromes and inherited familial variants. Mutat Res—Rev Mutat Res [Internet]. 2021;788(November 2020):108386. Available from: doi: 10.1016/j.mrrev.2021.108386 [DOI] [PubMed] [Google Scholar]
- 8.Royer RH, Koshiol J, Giambarresi TR, Vasquez LG, Pfeiffer RM, McMaster ML. Differential characteristics of Waldenström macroglobulinemia according to patterns of familial aggregation. Blood. 2010;115(22):4464–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Jones SJ, Stroshein S, Williams AM, Liu D, Spinelli JJ, Connors JM, et al. Birth order, sibship size, childhood environment and immune-related disorders, and risk of lymphoma in lymphoid cancer families. Cancer Epidemiol Biomarkers Prev. 2020;29(6):1168–78. doi: 10.1158/1055-9965.EPI-19-1204 [DOI] [PubMed] [Google Scholar]
- 10.Cerhan JR, Slager SL. Familial predisposition and genetic risk factors for lymphoma. Blood. 2015;126(20):2265–73. doi: 10.1182/blood-2015-04-537498 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Rotunno M, McMaster ML, Boland J, Bass S, Zhang X, Burdett L, et al. Whole exome sequencing in families at high risk for hodgkin lymphoma: Identification of a predisposing mutation in the KDR gene. Haematologica. 2016;101(7):853–60. doi: 10.3324/haematol.2015.135475 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Salipante SJ, Mealiffe ME, Wechsler J, Krem MM, Liu Y, Namkoong S, et al. Mutations in a gene encoding a midbody kelch protein in familial and sporadic classical Hodgkin lymphoma lead to binucleated cells. Proc Natl Acad Sci USA. 2009;106(35):14920–5. doi: 10.1073/pnas.0904231106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.McMaster ML, Sun C, Landi MT, Savage SA, Rotunno M, Yang XR, et al. Germline mutations in Protection of Telomeres 1 in two families with Hodgkin lymphoma. Br J Haematol. 2018;181(3):372–7. doi: 10.1111/bjh.15203 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ristolainen H, Kilpivaara O, Kamper P, Taskinen M, Saarinen S, Leppä S, et al. Identification of homozygous deletion in ACAN and other candidate variants in familial classical Hodgkin lymphoma by exome sequencing. Br J Haematol. 2015;170(3):428–31. doi: 10.1111/bjh.13295 [DOI] [PubMed] [Google Scholar]
- 15.Saarinen S, Aavikko M, Aittomäki K, Launonen V, Lehtonen R, Franssila K, et al. Exome sequencing reveals germline NPAT mutation as a candidate risk factor for Hodgkin lymphoma. Blood. 2011;118(3):493–8. doi: 10.1182/blood-2011-03-341560 [DOI] [PubMed] [Google Scholar]
- 16.Sellick GS, Goldin LR, Wild RW, Slager SL, Ressenti L, Strom SS, et al. A high-density SNP genome-wide linkage search of 206 families identifies susceptibility loci for chronic lymphocytic leukemia. Blood. 2007;110(9):3326–33. doi: 10.1182/blood-2007-05-091561 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Law PJ, Berndt SI, Speedy HE, Camp NJ, Sava GP, Skibola CF, et al. Genome-wide association analysis implicates dysregulation of immunity genes in chronic lymphocytic leukaemia. Nat Commun. 2017;8(14175). doi: 10.1038/ncomms14175 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Rendleman J, Antipin Y, Reva B, Adaniel C, Przybylo JA, Dutra-Clarke A, et al. Genetic variation in DNA repair pathways and risk of non-Hodgkin’s lymphoma. PLoS One. 2014;9(7):1–14. doi: 10.1371/journal.pone.0101685 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Cozen W, Li D, Best T, Van Den Berg DJ, Gourraud PA, Cortessis VK, et al. A genome-wide meta-analysis of nodular sclerosing Hodgkin lymphoma identifies risk loci at 6p21.32. Blood. 2012;119(2):469–75. doi: 10.1182/blood-2011-03-343921 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Buniello A, Macarthur JAL, Cerezo M, Harris LW, Hayhurst J, Malangone C, et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 2019;47(D1):D1005–12. doi: 10.1093/nar/gky1120 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Vijayakrishnan J, Qian M, Studd JB, Yang W, Kinnersley B, Law PJ, et al. Identification of four novel associations for B-cell acute lymphoblastic leukaemia risk. Nat Commun [Internet]. 2019;10(5384). Available from: 10.1038/s41467-019-13069-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Shugart YY, Hemminki K, Vaittinen P, Kingman A, Dong C. A genetic study of Hodgkin’s lymphoma: An estimate of heritability and anticipation based on the familial cancer database in Sweden. Hum Genet. 2000;106(5):553–6. doi: 10.1007/s004390000291 [DOI] [PubMed] [Google Scholar]
- 23.Berndt SI, Skibola CF, Joseph V, Camp NJ, Nieters A, Wang Z, et al. Genome-wide association study identifies multiple risk loci for chronic lymphocytic leukemia. Nat Genet. 2013;45(8):868–76. doi: 10.1038/ng.2652 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Cerhan JR, Berndt SI, Vijai J, Ghesquières H, McKay J, Wang SS, et al. Genome-wide association study identifies multiple susceptibility loci for diffuse large B cell lymphoma. Nat Genet. 2014;46(11):1233–8. doi: 10.1038/ng.3105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.McCarthy MI, Abecasis GR, Cardon LR, Goldstein DB, Little J, Ioannidis JPA, et al. Genome-wide association studies for complex traits: Consensus, uncertainty and challenges. Nat Rev Genet. 2008;9(5):356–69. doi: 10.1038/nrg2344 [DOI] [PubMed] [Google Scholar]
- 26.Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, et al. Finding the missing heritability of complex diseases. Nature [Internet]. 2009;461(7265):747–53. Available from: doi: 10.1038/nature08494 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Zuk O, Hechter E, Sunyaev SR, Lander ES. The mystery of missing heritability: Genetic interactions create phantom heritability. Proc Natl Acad Sci USA. 2012;109(4):1193–8. doi: 10.1073/pnas.1119675109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Povysil G, Petrovski S, Hostyk J, Aggarwal V, Allen AS, Goldstein DB. Rare-variant collapsing analyses for complex traits: guidelines and applications. Nat Rev Genet [Internet]. 2019;20(12):747–59. Available from: doi: 10.1038/s41576-019-0177-4 [DOI] [PubMed] [Google Scholar]
- 29.Lee S, Abecasis GR, Boehnke M, Lin X. Rare-variant association analysis: Study designs and statistical tests. Am J Hum Genet. 2014;95(1):5–23. doi: 10.1016/j.ajhg.2014.06.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zuk O, Schaffner SF, Samocha K, Do R, Hechter E, Kathiresan S, et al. Searching for missing heritability: Designing rare variant association studies. Proc Natl Acad Sci U S A. 2014;111(4). doi: 10.1073/pnas.1322563111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Auer PL, Lettre G. Rare variant association studies: Considerations, challenges and opportunities. Genome Med. 2015;7(1):1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Orgueira AM, López MC, Raíndo AP, Arias JÁD, Rodríguez BA, Pérez LB, et al. Detection of rare germline variants in the genomes of patients with b-cell neoplasms. Cancers (Basel). 2021;13(6):1–18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Jiang D, Wang M. Recent developments in statistical methods for GWAS and high-throughput sequencing association studies of complex traits high-throughput sequencing association studies of complex. Biostat Epidemiol [Internet]. 2018;9360(2:1):132–59. Available from: 10.1080/24709360.2018.1529346 [DOI] [Google Scholar]
- 34.Guo MH, Plummer L, Chan YM, Hirschhorn JN, Lippincott MF. Burden Testing of Rare Variants Identified through Exome Sequencing via Publicly Available Control Data. Am J Hum Genet. 2018;103(4):522–34. doi: 10.1016/j.ajhg.2018.08.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. Variation across 141,456 human exomes and genomes reveals the spectrum of loss-of-function intolerance across human protein-coding genes. bioRxiv [Internet]. 2019;531210. Available from: https://www.biorxiv.org/content/10.1101/531210v3 [Google Scholar]
- 36.Lek M, Karczewski KJ, Minikel E V., Samocha KE, Banks E, Fennell T, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536(7616). doi: 10.1038/nature19057 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Jones SJ, Voong J, Thomas R, English A, Schuetz J, Slack GW, et al. Nonrandom occurrence of lymphoid cancer types in 140 families. Leuk Lymphoma. 2017;58(9):2134–43. doi: 10.1080/10428194.2017.1281412 [DOI] [PubMed] [Google Scholar]
- 38.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–60. doi: 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Tarasov A, Vilella AJ, Cuppen E, Nijman IJ, Prins P. Sambamba: fast processing of NGS alignment formats. Bioinformatics. 2015;31(12):2032–2034. doi: 10.1093/bioinformatics/btv098 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Poplin R, Ruano-Rubio V, DePristo MA, Fennell TJ, Carneiro MO, Van der Auwera GA, et al. Scaling accurate genetic variant discovery to tens of thousands of samples. bioRxiv. 2017;201178:1–22. [Google Scholar]
- 41.Germline short variant discovery (SNPs + Indels)–GATK [Internet]. [cited 2021 Dec 23]. Available from: https://gatk.broadinstitute.org/hc/en-us/articles/360035535932-Germline-short-variant-discovery-SNPs-Indels-
- 42.Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10(2):1–4. doi: 10.1093/gigascience/giab008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Chang CC, Chow CC, Tellier LCAM, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: Rising to the challenge of larger and richer datasets. Gigascience. 2015;4(1):1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Chang Christopher, Purcell Shaun. PLINK 1.90 [Internet]. [cited 2021 Dec 22]. Available from: https://www.cog-genomics.org/plink/1.9/ [Google Scholar]
- 45.Manichaikul A, Mychaleckyj JC, Rich SS, Daly K, Sale M, Chen W min. Robust relationship inference in genome-wide association studies. Bioinformatics. 2010;26(22):2867–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581(7809):434–43. doi: 10.1038/s41586-020-2308-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Cingolani P, Patel VM, Coon M, Nguyen T, Land SJ, Ruden DM. Using Drosophila melanogaster as a model for genotoxic chemical mutational studies with a new program, SnpSift. Front Genet. 2012;3(March):1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GRS, Thormann A, et al. The Ensembl Variant Effect Predictor. Genome Biol [Internet]. 2016;17(122):1–14. Available from: doi: 10.1186/s13059-016-0974-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Rentzsch P, Witten D, Cooper GM, Shendure J, Kircher M. CADD: Predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 2019;47(D1):D886–94. doi: 10.1093/nar/gky1016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Croft D, Mundo AF, Haw R, Milacic M, Weiser J, Wu G, et al. The Reactome pathway knowledgebase. Nucleic Acids Res. 2014;42:D472–D477. doi: 10.1093/nar/gkt1102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Kanehisa M, Goto S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000;28(1):27–30. doi: 10.1093/nar/28.1.27 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Liberzon A, Subramanian A, Pinchback R, Thorvaldsdóttir H, Tamayo P, Mesirov JP. Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011;27(12):1739–40. doi: 10.1093/bioinformatics/btr260 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Chen W, Wang S, Tithi SS, Ellison D, Wu G. CoCoRV: a rare variant analysis framework using publicly available genotype summary counts to prioritize germline disease-predisposition genes. bioRxiv [Internet]. 2021; Available from: 10.1101/2021.09.29.462472 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Thorvaldsdóttir H, Robinson JT, Mesirov JP. Integrative Genomics Viewer (IGV): High-performance genomics data visualization and exploration. Brief Bioinform. 2013;14(2):178–92. doi: 10.1093/bib/bbs017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Moore CM, Jacobson SA, Fingerlin TE. Power and Sample Size Calculations for Genetic Association Studies in the Presence of Genetic Model Misspecification. Hum Hered. 2019;84(6):256–71. doi: 10.1159/000508558 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Di Cara F, Andreoletti P, Trompier D, Vejux A, Bülow MH, Sellin J, et al. Peroxisomes in immune response and inflammation. Int J Mol Sci. 2019;20(16):1–29. doi: 10.3390/ijms20163877 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Vijayan V, Srinu T, Karnati S, Garikapati V, Linke M, Kamalyan L, et al. A new immunomodulatory role for peroxisomes in macrophages activated by the TLR4 ligand lipopolysaccharide. J Immunol. 2017;198(6):2414–25. doi: 10.4049/jimmunol.1601596 [DOI] [PubMed] [Google Scholar]
- 58.Di Cara F, Sheshachalam A, Braverman NE, Rachubinski RA, Simmonds AJ. Peroxisome-mediated metabolism is required for immune Response to microbial infection. Immunity [Internet]. 2017;47(1):93–106.e7. Available from: doi: 10.1016/j.immuni.2017.06.016 [DOI] [PubMed] [Google Scholar]
- 59.Yang N, Leung E, Lui C, Eguether T, Jun Yao X, Jones E, et al. INTU is essential for oncogenic Hh signaling through regulating primary cilia formation in basal cell carcinoma. Oncogene. 2017;36(35):4997–5005. doi: 10.1038/onc.2017.117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Jiang J, Hui C chung. Hedgehog signaling in development and cancer. Dev Cell [Internet]. 2008;15(6):801–12. Available from: 10.1016/j.devcel.2008.11.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Houten SM, Denis S, Argmann CA, Jia Y, Ferdinandusse S, Reddy JK, et al. Peroxisomal L-bifunctional enzyme (Ehhadh) is essential for the production of medium-chain dicarboxylic acids. J Lipid Res. 2012;53(7):1296–303. doi: 10.1194/jlr.M024463 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Rogers LM, Wang Z, Mott SL, Dupuy AJ, Weiner GJ. A genetic screen to identify gain- And loss-of-function modifications that enhance T-cell infiltration into tumors. Cancer Immunol Res. 2020;8(9):1206–14. doi: 10.1158/2326-6066.CIR-20-0056 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Abdel-Wahab O, Adli M, Saunders L, Gao J, Shih AH, Pandey S, et al. ASXL1 Mutations Promote Myeloid Transformation Through Inhibition of PRC2-Mediated Gene Repression. Cancer Cell. 2012;22(2):180–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Katoh M. Functional and cancer genomics of ASXL family members. Br J Cancer [Internet]. 2013;109(2):299–306. Available from: doi: 10.1038/bjc.2013.281 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Quesada V, Conde L, Villamor N, Ordóñez GR, Jares P, Bassaganyas L, et al. Exome sequencing identifies recurrent mutations of the splicing factor SF3B1 gene in chronic lymphocytic leukemia. Nat Genet. 2012;44(1):47–52. [DOI] [PubMed] [Google Scholar]
- 66.Abdel-Wahab O, Kilpivaara O, Patel J, Busque L, Levine RL. The most commonly reported variant in ASXL1 (c.1934dupG;p.Gly646TrpfsX12) is not a somatic alteration. Leukemia. 2010;24(9):1656–64. doi: 10.1038/leu.2010.144 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Roberts NJ, Norris AL, Petersen GM, Bondy ML, Brand R, Gallinger S, et al. Whole genome sequencing defines the genetic heterogeneity of familial pancreatic cancer. Cancer Discov. 2016;6(2):166–75. doi: 10.1158/2159-8290.CD-15-0402 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Xie M, Lu C, Wang J, McLellan MD, Johnson KJ, Wendl MC, et al. Age-related cancer mutations associated with clonal hematopoietic expansion. Nat Med. 2014;20(12):1472–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Hamadou WS, Abed RE, Besbes S, Bourdon V, Fabre A, Youssef YB, et al. Familial hematological malignancies: ASXL1 gene investigation. Clin Transl Oncol. 2016;18(4):385–90. doi: 10.1007/s12094-015-1379-7 [DOI] [PubMed] [Google Scholar]
- 70.Purdue MP, Lan Q, Menashe I, Zheng T, Zhang Y, Yeager M, et al. Variation in innate immunity genes and risk of multiple myeloma. Hematol Oncol. 2011;29(1):42–6. doi: 10.1002/hon.954 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Hosgood HD, Purdue MP, Wang SS, Zheng T, Morton LM, Lan Q, et al. A pooled analysis of three studies evaluating genetic variation in innate immunity genes and non-Hodgkin lymphoma risk. Br J Haematol. 2011;152(6):721–6. doi: 10.1111/j.1365-2141.2010.08518.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Cerhan JR, Ansell SM, Fredericksen ZS, Kay NE, Liebow M, Call TG, et al. Genetic variation in 1253 immune and inflammation genes and risk of non-Hodgkin lymphoma. Blood. 2007;110(13):4455–63. doi: 10.1182/blood-2007-05-088682 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Geahlen RL. Syk and pTyr’d: Signaling through the B cell antigen receptor. Biochim Biophys Acta. 2009;1793(7):1115–27. doi: 10.1016/j.bbamcr.2009.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Mócsai A, Ruland J, Tybulewicz VLJ. The SYK tyrosine kinase: A crucial player in diverse biological functions. Nat Rev Immunol. 2010;10(6):387–402. doi: 10.1038/nri2765 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Chen L, Monti S, Juszczynski P, Ouyang J, Chapuy B, Neuberg D, et al. SYK Inhibition Modulates Distinct PI3K/AKT- Dependent Survival Pathways and Cholesterol Biosynthesis in Diffuse Large B Cell Lymphomas. Cancer Cell [Internet]. 2013;23(6):826–38. Available from: doi: 10.1016/j.ccr.2013.05.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Nadeu F, Royo R, Clot G, Duran-Ferrer M, Navarro A, Martín S, et al. IGLV3-21R110 identifies an aggressive biological subtype of chronic lymphocytic leukemia with intermediate epigenetics. Blood. 2021;137(21):2935–46. doi: 10.1182/blood.2020008311 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Armand M, Verrier A, Theves F, Bravetti C, Garff-Tavernier M Le, Choquet S, et al. Prevalence of IGLV3-21R110 among familial CLL: a retrospective study of 45 cases. Blood Adv. 2022;6(12):3632–3635. doi: 10.1182/bloodadvances.2021006658 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Petersen DL, Berthelsen J, Willerslev-Olsen A, Fredholm S, Dabelsteen S, Bonefeld CM, et al. A novel BLK-induced tumor model. Tumor Biol. 2017;39(7):1–8. doi: 10.1177/1010428317714196 [DOI] [PubMed] [Google Scholar]
- 79.Flerlage JE, Myers JR, Maciaszek JL, Oak N, Rashkin SR, Hui Y, et al. Discovery of novel predisposing coding and noncoding variants in familial Hodgkin lymphoma. Blood. 2022; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Liu SKW, Berry DM, McGlade CJ. The role of Gads in hematopoietic cell signalling. Oncogene. 2001;20(44 REV. ISS. 5):6284–90. doi: 10.1038/sj.onc.1204771 [DOI] [PubMed] [Google Scholar]
- 81.Miyata-Takata T, Chuang SS, Takata K, Toji T, Maeda Y, Sato Y, et al. Expression of T-cell receptor signalling pathway components in extranodal NK/T-cell lymphoma. Histopathology. 2018;73(6):1030–8. doi: 10.1111/his.13728 [DOI] [PubMed] [Google Scholar]
- 82.Abaffy T, Bain JR, Muehlbauer MJ, Spasojevic I, Lodha S, Bruguera E, et al. A testosterone metabolite 19-hydroxyandrostenedione induces neuroendocrine trans-differentiation of prostate cancer cells via an ectopic olfactory receptor. Front Oncol. 2018;8(162):1–15. doi: 10.3389/fonc.2018.00162 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Weber L, Maßberg D, Becker C, Altmüller J, Ubrig B, Bonatz G, et al. Olfactory receptors as biomarkers in human breast carcinoma tissues. Front Oncol. 2018;8(33):1–12. doi: 10.3389/fonc.2018.00033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Glusman G, Yanai I, Rubin I, Lancet D. The complete human olfactory subgenome. Genome Res. 2001;11(5):685–702. doi: 10.1101/gr.171001 [DOI] [PubMed] [Google Scholar]
- 85.Yang J, Weedon MN, Purcell S, Lettre G, Estrada K, Willer CJ, et al. Genomic inflation factors under polygenic inheritance. Eur J Hum Genet. 2011;19(7):807–12. doi: 10.1038/ejhg.2011.39 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Zheng Q, Zhang Y, Jiang J, Jia J, Fan F, Gong Y, et al. Exome-Wide Association Study Reveals Several Susceptibility Genes and Pathways Associated With Acute Coronary Syndromes in Han Chinese. Front Genet. 2020;11(April):1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Liu Y, Lusk CM, Cho MH, Silverman EK, Qiao D, Zhang R, et al. Rare Variants in Known Susceptibility Loci and Their Contribution to Risk of Lung Cancer. J Thorac Oncol. 2018;13(10):1483–95. doi: 10.1016/j.jtho.2018.06.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Thirty-eight cases with European ethnicity were selected for the rare variant association study while one case of non-European ancestry was removed.
(TIFF)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.

