Abstract
Objectives
At present, most genetic tests or carrier screening are performed with blood samples, and the known carrier rate of disease-causing variants is also derived from blood. For semen donors, what is really passed on to offspring is the pathogenic variant in their sperm. This study aimed to determine whether pathogenic variants identified in the sperm of young semen donors are also present in their blood, and whether matching results for blood are consistent with results for sperm.
Methods
We included 40 paired sperm and blood samples from 40 qualified semen donors at the Hunan Province Human Sperm Bank of China. All samples underwent exome sequencing (ES) analysis, and the pathogenicity was assessed according to the American College of Medical Genetics (ACMG) guidelines. Scoring for sperm donation matching, which was based on gene scoring and variant scoring, was also used to assess the consistency of sperm and blood genetic test results.
Results
A total of 108 pathogenic (P)/likely pathogenic (LP) variants in 82 genes were identified. The highest carrier had 7 variants, and there was also one donor did not carry any P/LP variant. On average, each donor carried 2.7 P/LP variants. Among all the P/LP variants, missense mutation was the dominant type and most of them were located in exonic regions. Chromosome 1 harboured the largest number of variants and no pathogenic copy number variants (CNV) was identified in semen donors. The P/LP variant of all the 40 semen donors was consistent by comparing sperm and blood. Except for one case that was slightly different, the rest simulated matching results for blood were all consistent with results for sperm.
Conclusions
It is reasonable to choose either blood or sperm for genetic screening in semen donors.
Keywords: Sperm donor, genetic analysis, whole-exome sequencing, sperm, blood
Introduction
The establishment of China’s first human sperm bank by Lu Hui-Lin and Lu Guang-Xiu in 1981 has led to the growth of 29 sperm banks across the country today [1]. Existing regulations mandate rigorous laboratory testing, including karyotype analyses, to exclude genetic disorders in potential donors [2,3]. However, these protocols are insufficient to meet the increasing need for comprehensive genetic screening. Karyotype analyses have limitations in detecting a wide range of hereditary risks, including both autosomal recessive and dominant conditions with complex inheritance patterns [4]. This situation highlights the necessity for improved genetic testing and evaluation protocols for prospective semen donors.
To date, genetic screening for semen donors has focused on specific disorders such as spinal muscular dystrophy, cystic fibrosis, thalassemia, Tay-Sachs disease, and fragile X syndrome, or has been limited to small expanded carrier screening (ECS) panels that cover only 46 disorders [5–8]. Considering the significant carrier rates for these conditions, there is a compelling argument for implementing broader genetic testing panels to enhance donor screening protocols.
Exome sequencing (ES) is a powerful and cost-effective tool for dissecting the genetic basis of diseases and traits that have proved to be intractable to conventional gene-discovery strategies [9]. ES is more effective than ECS in terms of both efficiency and accuracy, achieving a 30–40% rate in detecting pathogenic or likely pathogenic variants [10]. Sequencing of sperm from healthy men has identified clonal mosaic mutations that are different from those in blood and stable over time [11]. However, genetic screening in clinical practice generally is conducted using blood samples. To our knowledge, there is a lack of studies that compare the genetic analysis of blood and sperm samples from the same patients by ES, especially the comparison of pathogenic variants. We aimed to determine whether pathogenic variants identified in the sperm of semen donors are also present in their blood and whether matching results for blood are consistent with results for sperm. To this end, we conducted a prospective study of pathogenic variants in the blood and sperm of semen donors by ES combined with additional panels.
Methods
Study population and participants
From January 1, 2022, to May 1, 2022, 40 qualified semen donors were enrolled at the Hunan Province Human Sperm Bank of China. All donors signed informed consent forms during their first visit to the sperm bank, consenting to the use of their sperm and blood samples or data by the sperm bank for scientific research purposes. During the same period, two women seeking sperm donation for conception were randomly selected to participate in simulated matching tests (primarily using their ES test data), and they also signed informed consent. both sperm donors and patients were deidentified through the use of personal numerical identifiers instead of names. This study was conducted in accordance with the Declaration of Helsinki and was approved by the Institutional Review Board (IRB) of Central South University. The number for the approval was 2021-KT48.
Criteria for semen donor screening in China
Table S1 illustrates the key guidelines and standards for sperm donor screening in China, and donors must meet this criteria [12,13].
Sperm and blood samples
Sperm and blood samples were collected from 40 semen donors who ejaculated into a sterile container after abstaining for 2–7 days. The samples were assessed according to WHO recommendations [14], analysing volume, sperm concentration, morphology and motility based on WHO’s motility grades (PR, progressive sperm; NP, non-progressive sperm; and IM, immotile sperm). Blood samples of 1.5 mL were collected from each donor and stored in 2-mL cryopreservation tubes at −80 °C until needed.
Genetic testing
Peripheral blood and sperm samples were collected from semen donors. The TIANamp Blood DNA Kit (TIANGEN) was used to extract blood genomic DNA (gDNA), and the nucleic acid extraction or purification reagent (Basecare, China) was used to extract sperm DNA. The SureSelect Human ALL Exon V6 (Agilent) kit was used for liquid capture hybridization, elution and library amplification, and magnetic beads purification to obtain the whole Exon library. The final library was sequenced at 150 bp in pair-ends by Novaseq 6000 (Illumina) platform. For SMN1/2 and CYP21A2, which were limited by the ES assay and relatively prevalent in the Chinese population (spinal muscular atrophy, and congenital adrenal hyperplasia).
We additionally added panel assays. CYP21A2 was detected using LR-PCR combined with NGS, and we designed a long fragment primer of specific full-length CYP21A2 fragment for NGS with DA8600 platform. For gene copy number detection of SMN1/2, we used multiplex fluorescent PCR and capillary electrophoresis (ABI 3730XL), and LR-PCR combined with NGS was used to detect and validate 15 common SMN1 mutations in the Chinese population.
Data analysis
The sequencing data were aligned to GRCh37/hg19 reference genome (map reads to reference) by Sentieon® software (San Jose, CA, USA). And then marking duplicates, base quality score recalibration (BQSR) and variant calling were processed. For the VCF files after variant identification, single nucleotide polymorphisms (SNPs) and indels were filtered and screened according to sequence depth and mutation quality. Annovar© (https://annovar.openbioinformatics.org/en/latest/) was used to annotate variants, including the population frequency database 1000 Genomes (https://www.internationalgenome.org/), gnomAD (https://gnomad.broadinstitute.org/), ExAC (https://gnomad.broadinstitute.org/), ESP. Provean© (https://www.jcvi.org/research/provean), Sift© (https://sift.bii.a-star.edu.sg/), PolyPhen-2© (http://genetics.bwh.harvard.edu/pph2/index.shtml), LRT© (https://sites.google.com/site/jpopgen/dbNSFP), MutationTaster© (https://www.mutationtaster.org/), MutationAssessor© (http://mutationassessor.org/r3/), FATHMM© (https://fathmm.biocompute.org.uk/fathmm-xf/), CADD© (https://cadd.gs.washington.edu/), VEST© (http://www.cravat.us/CRAVAT/), MetaSVM© (https://sites.google.com/site/jpopgen/dbNSFP), MetaLR© (https://sites.google.com/site/jpopgen/dbNSFP), M-CAP© (http://bejerano.stanford.edu/mcap/), and Revel© (https://sites.google.com/site/jpopgen/dbNSFP) were used for protein biological function prediction
Synonymous mutations and deep intronic variants were filtered out. Variants were classified according to ACMG guidelines as ‘pathogenic (P)’, ‘likely pathogenic (LP)’, ‘unknown clinical significance’, ‘likely benign’ and ‘benign’ [15]. Variants were also divided into three types: blood-detectable only (BDO), sperm-detectable only (SDO) and blood-sperm shared (BSS) [16]. Data were also used for CNV calling by Sentieon software, and the identified CNVs (>200kb) were filtered and annotated. CNVs with ≥50% reciprocal overlap are classified as the same CNV locus between sperm and blood [17]. CNV pathogenicity was assessed according to the ACMG guidelines [15].
As sperm parameters of 40 sperm donors follow markedly skewed (non-normal) distributions, unadjusted mean and median values, standard deviation (SD) and 5th to 95th percentiles were calculated for each variable. Paired t-test was used for paired samples by SPSS V26.0 software, which obtained a copyright license.
Simulated matching test in artificial insemination by donor
A simulated matching test was conducted using two randomly selected women. The sperm donation matching process primarily involves gene and variant scoring. Gene scoring evaluates several factors: mode of inheritance, gene–disease association degree, age of onset and disease severity. Data for this evaluation are sourced from OMIM, Orphadata, DisGeNET and GenCC databases. These factors are assigned different weights in the scoring process. Variant evaluation follows ACMG guidelines. In the matching process, each woman is paired with 40 potential semen donors. Donors are categorized as ‘recommended’ or ‘not recommended’ based on the presence of pathogenic risk genes. Additionally, a matching score is calculated to facilitate the decision-making process.
Results
Clinical characteristics of semen donors
The clinical characteristics of all qualified semen donors are summarized in Supplementary Table S2. Meanwhile, the average semen volume among donors was 2.8 (2.0–4.8) [Median (5th–95th%)] mL, with mean sperm concentration and progressive motility levels of 63 (60–71) [Median (5th–95th %] million/mL and 52 (50–54) [Median (5th–95th %)], respectively.
Consistent pathogenicity variant in sperm and blood
Figure 1 compares mutation types between sperm and blood of sperm donors, the results of the comparison analysis of sperm and blood samples from each donor revealed a high degree of concordance in the number of pathogenic or likely pathogenic (P/PL) variants (Figure 1(A)). As shown in Table S3 which compares of all the pathogenic and likely pathogenic SNVs between semen and blood, a total of 108 P/LP variants in 82 genes were identified, and the highest carrier had 7 variants, and there was also one donor did not carry any P/LP variant. All P/LP variants are present in both blood and sperm. On average, each donor carried 2.7 P/LP variants. Further, the proportion of mutation genes in sperm donors presented in Table 1, nine donors had P/LP variants (NM_004004.6: c.109G > A, c.235delC) in GJB2 (Deafness, recessive autosomal 1 A), four donors each carried CD36 (Platelet glycoprotein IV deficiency; NM_000072.3: c.329_330del, c.1227_1238del, c.1156C > T) and DUOX2 (Thyroid dyshormonogenesis 6; NM_014080.4: c.1588A > T, c.2654G > A, c.1871delG, c.1588A > T) mutation, and three donors each carried CFTR (Cystic fibrosis; Congenital bilateral absence of vas deferens; NM_000492.4: c.1210-12T[5]/c.1210-34TG[12], c.1210-12T[5]/c.1210-34TG[13]) and LIPH (Hypotrichosis 7; NM_139248.3: c.736T > A, c.686_687insCCTGGCT, c.685_686insGTAGAACCCA) mutation (Table 1). As shown in Supplementary Table S3, the most frequent variant detected in semen donors was GJB2 c.109G > A. The detailed all variants were shown in Supplementary Table S3 and S4.
Figure 1.
Comparison of mutation types between sperm and blood of sperm donors.
(A) Comparison of the number of P/LP variants between sperm and blood of semen donors, the abscissa label is the sample number. BDO: blood-detectable only; SDO: sperm-detectable only; and BSS: blood-sperm shared (B) Comparison of mutation types (nonsense mutation, stoploss, splicing, synonymous/nonsynonymous SNV) between sperm and blood of sperm donors. (C) Indels (frameshift deletion/insertion, non-frameshift deletion/insertion) comparison between sperm and blood of sperm donors.
Table 1.
Genes with the highest proportion of mutations in sperm donors (genes with more than 2 cases).
| Gene | Inheritance | Associated disease | Positive case |
N = 40 Percentage(%) |
|---|---|---|---|---|
| Top 13 gene | ||||
| GJB2 | AR | Deafness, autosomal recessive 1 A | 9 | 22.5 |
| CD36 | AR | Platelet glycoprotein IV deficiency | 4 | 10 |
| DUOX2 | AR | Thyroid dyshormonogenesis 6 | 4 | 10 |
| CFTR | AR | Cystic fibrosis; Congenital bilateral absence of vas deferens | 3 | 7.5 |
| LIPH | AR | Hypotrichosis 7 | 3 | 7.5 |
| ABCA4 | AR | Stargardt disease 1 | 2 | 5 |
| ABCG5 | AR | Sitosterolemia 2 | 2 | 5 |
| AGL | AR | Glycogen storage disease IIIa; Glycogen storage disease IIIb | 2 | 5 |
| AIRE | AR/AD | Autoimmune polyendocrinopathy syndrome, type I, with or without reversible metaphyseal dysplasia | 2 | 5 |
| GALC | AR | Krabbe disease | 2 | 5 |
| HFE | AR | Hemochromatosis 1 | 2 | 5 |
| SERPINB7 | AR | Palmoplantar keratoderma, Nagashima type | 2 | 5 |
| UGT1A1 | AR | Gilbert syndrome | 2 | 5 |
Note: AD, autosomal dominant; AR, autosomal recessive; “+” for consistent and “-” for inconsistent with the results between semen and blood.
Unexpectedly, two semen donors were found to harbor autosomal dominant (AD) mutations, which were consistent in both sperm and blood samples, one donor had a likely pathogenic (LP) mutation in the TPM2 gene (NM_003289.3: c.564-1G > A), potentially causing distal arthrogryposis (MIM #108120) or nemaline myopathy 4 (MIM #609285); and the other with ACTN1 (NM_001102.4: c.1348C > T, LP), which would cause bleeding disorder (platelet-type, MIM #615193). However, these two semen donors did not exhibit any clinical manifestations of the respective diseases.
Analysis revealed that sperm samples exhibited a slightly higher frequency of certain types of non-pathogenic variants compared to blood samples. The detection threshold for variant allele frequency (VAF) in our methodology was 0.05. Variants specific to sperm samples are detailed in Supplementary Table S5. Moreover, Supplementary Figure S1, Supplementary Figure S2 visualizes these variants included benign, likely benign and VUS (variant of uncertain significance). Benign and likely benign variant in the sperm and blood of 40 semen donors showed that SDO variant number was significantly higher than that of BDO in the types of variants which were near splicing site, synonymous SNV, nonsynonymous SNV, non-frameshift deletion, frameshift deletion, frameshift insertion, and non-frameshift insertion. And for variant of uncertain significance, variant number in 3’UTR was more numerous in SDO than that in BDO. However, the number of unique non-pathogenic SNVs in sperm or blood was much less numerous than that of BSS variants (Figures S1 and S2).
Missense mutation was the dominant type
To advance this research, we analysed Different mutation types. As visualized in Figure 1(B) provides comparison of mutation types (nonsense mutation, stoploss, splicing, synonymous/nonsynonymous SNV) between sperm and blood of sperm donors and Figure 1(C) provides Indels (frameshift deletion/insertion, non-frameshift deletion/insertion) comparison between sperm and blood of sperm donors, the sperm and blood results within donor were consistent, with the most missense mutations (43), 30 indels, and 21 nonsense mutations (Figure 1(B)). From the indel point of view, there were 16 frameshift deletions, 10 frameshift insertions, and four non-frameshift deletions (Figure 1(C)).
No pathogenic CNV was identified in semen donors
Table 2 depicts pathogenicity and CNV types in 40 sperm donors. A total of 328 CNVs (ranged from 0.2 Mb to5.8 Mb), were identified in these 40 semen donors; however, none of them was pathogenic. It was found that 164/328 CNVs (82 types each) were present in sperm as well as blood. Of these 82 CNV types, 16 were deletions and 66 were duplications. Of the other 164/328 CNVs, 122 CNVs were BDO, 42 CNVs were SDO (Table 2). Based on Supplementary Table S6 compares of all the non-pathogenic CNVs between semen and blood, the most frequent duplication was dup (chr17:44407503-44627544 (overlapped region)), and this region included the ARL17B, NSFP1 and LRRC37A2. While the most frequent deletion was del (chr2:89975517-90260520 (overlapped region)), and this region included IGKV2D-28 (Supplementary Table S6). CNVs found in blood were most prevalent in Chr17 (36), followed by Chr15 (21), while none were found in Chr12 or ChrY. Among the CNVs detected in sperm, Chr17 had the most (32), followed by Chr16 (13); Chr3, Chr11, Chr12, Chr13, Chr18, and ChrY had none, relevant details listed in Supplementary Table S7represented of chromosome distribution of CNVs.
Table 2.
Pathogenicity and CNV types in sperm donors (n = 40).
| Total | Pathogenicity | Types | |||
|---|---|---|---|---|---|
| Blood specific | 169 | Benign | 113 | Deletion | 13 |
| Duplication | 100 | ||||
| VUS | 56 | Deletion | 32 | ||
| Duplication | 24 | ||||
| Blood consistent with semen | 70 (35 pairs) | Benign | 58 | Deletion | 18 |
| Duplication | 40 | ||||
| VUS | 12 | Deletion | 6 | ||
| Duplication | 6 | ||||
| Semen specific | 89 | Benign | 79 | Deletion | 17 |
| Duplication | 62 | ||||
| VUS | 10 | Deletion | 2 | ||
| Duplication | 8 | ||||
Simulated matching results for blood are consistent with results for sperm
Two women were randomly selected for simulated matching testing: a healthy woman (female 1) without pathogenic variants and another (female 2) who was an autosomal recessive carrier of oculocutaneous albinism (TYR (NM000372.5) c.896G > A p.Arg299His). Exome sequencing (ES) results were matched and scored against sperm and blood samples from 40 donors. For female 1, only three (7.5%) semen donors were not recommended for matching, whereas for female 2, 20 (50%) semen donors were not recommended. For female 1, recommended semen donors had a mean score of 95.5 and a maximum score of 98 (achieved by 10 semen donors). For female 2, recommended semen donors had a mean score of 92.3, with only one donor achieving the maximum score of 98. Matching scores between sperm and blood samples were nearly identical for female 1 across all 40 semen donors. However, a slight discrepancy was observed for female 2 with donor A5502, where the blood sample scored 98 and the sperm sample scored 97. This difference was primarily due to the detection of a heterozygous variant of uncertain significance (VUS) in the SUFU gene (NM016169.4: c.40G > C) in the sperm of A5502 donor, which was absent in his blood sample but present in female 2. Consequently, the sperm sample yielded a marginally lower matching score than the blood sample. Relevant details are documented in Figure 2 presents simulated matching of blood and sperm with two women.
Figure 2.
Simulated matching of blood and sperm with 2 women.
Matching score of female 1/2 with 40 semen donors. The red bar marked was the not recommended donor.
Discussion
Bell et al. conducted the initial carrier screening utilizing NGS, examining 448 severe recessive childhood diseases, revealing an average genomic carrier burden of 2.8 [18]. In 2015, ES was first implemented as a valuable genetic screening method for infertile couples undergoing assisted reproductive technology (ART). Analysis of the clinical dataset, comprising 2,161 samples, revealed that 84% were positive for genetic variants, with an average carrier burden of 2.3 variants per sample [19]. A previous study conducted exome sequencing (ES) on 43 semen donors, revealing that 90.7% tested positive for genetic variants, with an average carrier burden of 2.58 variants per donor. [4]. In our study, we conducted ES on sperm and blood samples obtained from 40 Chinese semen donors, revealing an average of 2.7 variants per donor. Despite the limited sample size, the observed variant count per sample aligns with findings from prior research studies [4]. In China, regulations permit a single semen donor to contribute to pregnancies for up to five women. If a donor carries genetic diseases, this significantly increases the risk of inherited disorders in the offspring of multiple families. Therefore, the screening of genetic disorders in semen donors is required to avoid scattering genetic diseases in the recipient population.
Although multiple facilities have conducted carrier screening using designed panels for different populations, and ACMG (American College of Medical Genetics) [20] and ACOG (American College of Obstetricians and Gynecologists) [21] have provided recommendations and guidelines for carrier screening, based on data from the US and EU populations, Chinese semen donors have been the subject of few carrier screening studies [4]. Detecting carrier frequencies in carrier screening is strongly influenced by the panel selection. The carrier frequencies for 415 genes connected to severe hereditary diseases in six varied ancestries were estimated by Guo and Gregg using an exome sequencing database. Ashkenazi Jewish carriers had up to 62.9% cumulative carrier rates, while East Asian carriers had 32.6% [22]. Meanwhile, data from the Translational Medicine Center at Fudan University Children’s Hospital demonstrated greater concordance with East Asian populations in the Exome Aggregation Consortium (ExAC) database compared to European cohorts [23]. This observed variability in allele frequencies across different ethnic groups underscores the importance of population-specific considerations in genetic screening and counselling practices; hence, genetic screening panels utilized in other nations may not be relevant to China. The ES method is widely utilized for targeted enrichment, mainly for its ability to efficiently sequence all exons in the human genome, making it ideal for studying monogenic disorders. The exome is believed to contain around 85% of mutations that have substantial effects on disease-related traits [24], necessitating only about 2% of the sequencing load in comparison to whole-genome sequencing (WGS). Despite its capability to accurately pinpoint SNVs and small InDels, ES requires extensive sequencing depth and meticulous data interpretation to detect large InDels, CNVs, and genomic rearrangements, aspects that are often not incorporated into the conventional analysis workflows of many projects. At present, ES combined with a panel screening is a potential and suitable option for semen donors for genetic assessment.
As we mainly performed the analysis by ES, the variants found were mainly located in exonic regions and dominated by point mutations. There might also be some specific structural variations that are not detected. In our study, we identified GJB2 variants as the predominant pathogenic mutations among semen donors, surpassing the carrier frequencies reported in previous studies [25,26]. To date, a total of 453 GJB2 variants have been identified, of which more than 380 variants have been associated with deafness. Variants of GJB2 varied by ethnicity and region, and the c.35delG variant was mostly found in European and Middle Eastern populations [27,28]; the p.W24X variant was mainly distributed in India [29]; the c.167delT and p.R143W could be found in Jewish and Ghanaian populations, respectively [30,31].
While the c.235delC variant often appeared in East Asian populations [32], and one semen donor was also found carrying this variant in our study. In particular, 8 semen donors carried the c.109G > A (p.Val37Ile) variant, a higher proportion (8/40, 20%) than we expected, although this variant was known to be relatively common in Asian, especially Southeast Asian populations [33,34]. As GJB2 variants were diverse and some were non-penetrant, and genotypes were associated with the penetrance time of hearing impairment, variants in GJB2 could present as delayed and progressive deafness after birth [34–36]. Analysis revealed that among individuals with homozygous variants at the GJB2 locus (c.109G > A p.Val37Ile), 65% (11/17) experienced congenital hearing loss, while 35% (6/17) developed delayed hearing loss [37]. According to a 2012 epidemiological study, the p.Val37Ile variant, whether compound heterozygous or homozygous, resulted in progressive, delayed hearing loss [38]. Therefore, the high-frequency pathogenic variant carriage of GJB2 in semen donors is a problem that cannot be ignored.
We found that 7.5% of qualified semen donors were carriers of pathogenic variants (NM_000492.4: c.1210-12T [5] /c.1210-34TG [12] , c.1210-12T [5] /c.1210-34TG [13]) of CFTR. CFTR variants was the main pathogenic cause of congenital bilateral absence of the vas deferens (CBAVD) and cystic fibrosis (CF) [39]. 5 T is the most common mild CFTR variant, and the frequency of 5 T in CBAVD patients is approximately 0.25 in a pooled 38 CBAVD independent studies, and this variant is also prevalent in the general population at a frequency of approximately 0.05 [40]. Together with its adjacent TG repeat, poly-T impacts splicing efficiency of exon 10 of CFTR [41]. The combination of the 5 T variant with a severe CFTR mutation on another chromosome is the main cause of CBAVD [42]. Although incomplete penetrance of 5 T variant has also been reported [43,44], and CF is the least common inheritable disease in Asians [45]. The frequency of CFTR variants should be a concern in semen donors in China.
Among 40 semen donors, we found two patients with pathogenic variants of AD disease, corresponding to genes TPM2 and ACTN1, but both had no clinical manifestations. TPM2, associated with muscle contractility and mutations, can lead to muscle-related disorders without immediate clinical manifestations, suggesting variable expressivity. ACTN1 involves platelet functions, with mutations potentially causing mild thrombocytopenia with minimal clinical impact. As TPM2 caused distal arthrogryposis is a heterogeneous category of inherited limb malformation syndromes with substantial clinical and genetic heterogeneity and variable expressivity [46–48]. This might explain why this semen donor had no apparent clinical manifestations. The ACTN1 gene, which encodes the actin-crosslinking protein α-actinin, is situated on chromosome 14q24.1. This gene comprises 21 exons spanning approximately 3.78 kb. α-Actinin 1 plays a crucial role in cytoskeleton organization, serving as an anchor for actin to various intracellular structures. It is predominantly expressed in platelets and megakaryocytes [49,50]. The mutation in ACTN1 is associated with a 50% reduction in platelet counts and a 30% increase in platelet size [51,52]. It has been reported that bleeding in ACTN1-related thrombocytopenia was absent or mild [53]. Thus, semen donors carrying pathogenic variants might not show clinical symptoms or show subclinical manifestations of disease.
Although it was postulated that the externalization of the testes was an evolutionarily conserved mechanism to restrict mutagenesis [54], all 108 pathogenic variants detected were consistently found in blood and sperm within the same patient, suggesting that all those variants might originate in embryonic development and occurred prior to primordial germ cell specification, and were stable throughout life. Nevertheless, low-frequency variants as well as non-pathogenic SNVs that are unique to blood (BDO) or sperm (SDO) were also found during data analysis. Our results showed that there were more non-pathogenic variants in sperm overall than in blood (Figure S1 and Figure S2). However, the non-pathogenic variant number of BDO or SDO was relatively small, when compared with non-pathogenic BSS variants. Previously, blood, saliva and oral samples from four unrelated individuals were compared by WGS. The data quality of different DNA sources was significantly different [55]. The accuracy difference in detecting short insertions and deletions was negligible. However, some saliva and oral samples exhibited a higher false positive rate (FPR). Moreover, CNV detection varied among different sample types from the same individual. Blood samples demonstrated sensitivity up to 25%, while saliva and oral samples with higher bacterial concentrations showed poor sensitivity [55]. Detection of CNV in blood-derived DNA is more sensitive than in unenriched saliva and oral samples, and the magnitude of the effect depends on the size and type of CNV [55]. Scheinfeldt et al. compared the mutation consistency between blood and blood lymphoblastoid cell line (LCL) cells of three generations and found that SNP consistency was good between whole blood DNA and LCL DNA of passage, while CNV consistency may decrease with the increase of passage [56]. In our analysis of sperm and blood samples from semen donors, we observed a higher prevalence of CNVs in blood compared to sperm, despite no significant difference in data quality. Notably, there is a current lack of published reports comparing SNVs and CNVs between sperm and blood samples. Theoretically, due to the increased number of cell divisions undergone by sperm, one would expect a higher frequency of variants in sperm samples. [57,58]. The results of SNV part were indeed the same, but blood had more CNVs. There was also a situation where, based on current sample types and testing methods, we had no way to rule out the effects of mosaicism, such as three sperm mosaicism types [59]. Although bulk sequencing could be used to detect male gonadal mosaicism, only type III mosaic mutations could reach the allele fraction of sperm samples (AF > 0.01) and be detected by this way [59]. Regarding the type III mutations detected in the sperm in some studies, between 33% and 50% are evident in the blood, saliva, or both [16,59]. Considering all the SNVs and CNVs with inconsistencies were non-pathogenic and low proportion mosaicism has a very low recurrence risk in offspring, we believe blood can be used to perform genetic screening on semen donors.
Conclusions
We performed ES and panel analysis in sperm and blood samples from 40 Chinese semen donors, and the results of pathogenic variants were consistent between both samples, and no pathogenic CNV was detected in either sample. The matching results of sperm and blood were also consistent through simulated sperm donation matching tests performed by two women. Therefore, we believe that it is reasonable to choose either blood or sperm for genetic screening in semen donors.
Supplementary Material
Funding Statement
This work was supported by grants from the National Key R&D Program of China (2022YFC2702700), Natural Science Foundation of Hunan Province (2024JJ6725, 2022JJ40657)
Authors’ contributions
C. H., L-M. C., H-C. N., and L-Y. K. were responsible for the study design. C. H., H-L. W., and B-S. L. contributed to sample acquisition, and clinical data collection and analysis. C. H., L-M. C., S.R., and M-L. H., contributed to ES data analysis and data interpretation. C.H., L-M. C., S. R., H-C. N., L-Y. K., L-Q. F., J. D., B. L., and W-B. Z were responsible for providing important intellectual input into the work and preparation, drafting and final approval of the manuscript. All authors approved the final version of the manuscript.
Ethical approval
The present study was approved by the Ethics Committee of Central South University (approval no. 2021-KT48). Signed informed consent to participate in the study was obtained from all participants.
Disclosure statement
L-M. C., S.R., M-L. H., and L-Y. K. were employed by Basecare Medical Device Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Availability of data and materials
The datasets and material used or analysed during the current study are available from the corresponding author on reasonable request.
References
- 1.Huang C, Lei L, Wu HL, et al. Long-term cryostorage of semen in a human sperm bank does not affect clinical outcomes. Fertil Steril. 2019;112(4):663–669 e1. doi: 10.1016/j.fertnstert.2019.06.008. [DOI] [PubMed] [Google Scholar]
- 2.Chinese Ministry of Health . Basic standards and technical specifications for human sperm bank. Chin J Reprod Heal. 2004;15(2):481. [Google Scholar]
- 3.Wang X, Chen M-X, Zhang F, et al. Sperm donors in Shanghai, China: a study of motivations, characteristics, and semen parameters of actual sperm donors. Reprod Develop Med. 2021;5(4):213–219. doi: 10.4103/2096-2924.327879. [DOI] [Google Scholar]
- 4.Huang C, Wu HL, Zhou WJ, et al. Genetic testing of sperm donors at a human sperm bank in China. Front Endocrinol (Lausanne). 2022;13:942447. doi: 10.3389/fendo.2022.942447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Landaburu I, Gonzalvo MC, Clavero A, et al. Genetic testing of sperm donors for cystic fibrosis and spinal muscular atrophy: evaluation of clinical utility. Eur J Obstet Gynecol Reprod Biol. 2013;170(1):183–187. doi: 10.1016/j.ejogrb.2013.06.022. [DOI] [PubMed] [Google Scholar]
- 6.Callum P, Iger J, Ray M, et al. Outcome and experience of implementing spinal muscular atrophy carrier screening on sperm donors. Fertil Steril. 2010;94(5):1912–1914. doi: 10.1016/j.fertnstert.2009.12.071. [DOI] [PubMed] [Google Scholar]
- 7.Urbina MT, Benjamin I, Medina R, et al. Expanded carrier screening in gamete donors of Venezuela. JBRA Assist Reprod. 2017;21(4):356–360. doi: 10.5935/1518-0557.20170062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Payne MR, Skytte AB, Harper JC.. The use of expanded carrier screening of gamete donors. Hum Reprod. 2021;36(6):1702–1710. doi: 10.1093/humrep/deab067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bamshad MJ, Ng SB, Bigham AW, et al. Exome sequencing as a tool for Mendelian disease gene discovery. Nat Rev Genet. 2011;12(11):745–755. doi: 10.1038/nrg3031. [DOI] [PubMed] [Google Scholar]
- 10.Zhang X. Exome sequencing greatly expedites the progressive research of Mendelian diseases. Front Med. 2014;8(1):42–57. doi: 10.1007/s11684-014-0303-9. [DOI] [PubMed] [Google Scholar]
- 11.Yang X, Breuss MW, Xu X, et al. Developmental and temporal characteristics of clonal sperm mosaicism. Cell. 2021;184(18):4772–4783 e15. doi: 10.1016/j.cell.2021.07.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Huang C, Li B, Xu K, et al. Decline in semen quality among 30,636 young Chinese men from 2001 to 2015. Fertil Steril. 2017;107(1):83–88 e2. doi: 10.1016/j.fertnstert.2016.09.035. [DOI] [PubMed] [Google Scholar]
- 13.Huang C, Zhou SF, Gao LD, et al. Risks associated with cryopreserved semen in a human sperm bank during and after the COVID-19 pandemic. Reprod Biomed Online. 2021;42(3):589–594. doi: 10.1016/j.rbmo.2020.11.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.WHO . WHO laboratory manual for the examination and processing of human semen.2010. 5th ed. Geneva, Switzerland: World Health Organization (WHO). [Google Scholar]
- 15.Riggs ER, Andersen EF, Cherry AM, et al. Technical standards for the interpretation and reporting of constitutional copy-number variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics (ACMG) and the Clinical Genome Resource (ClinGen). Genet Med. 2020;22(2):245–257. doi: 10.1038/s41436-019-0686-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Breuss MW, Antaki D, George RD, et al. Autism risk in offspring can be assessed through quantification of male sperm mosaicism. Nat Med. 2020;26(1):143–150. doi: 10.1038/s41591-019-0711-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Bergen SE, Ploner A, Howrigan D, C.N.V.A. Group, C. the Schizophrenia Working Group of the Psychiatric Genomics ., et al. Joint contributions of rare copy number variants and common SNPs to risk for schizophrenia. Am J Psychiatry. 2019;176(1):29–35. doi: 10.1176/appi.ajp.2018.17040467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Bell CJ, Dinwiddie DL, Miller NA, et al. Carrier testing for severe childhood recessive diseases by next-generation sequencing. Sci Transl Med. 2011;3(65):65ra4. doi: 10.1126/scitranslmed.3001756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Martin, J., Yi Y., Alberola T., Rodriguez-Iglesias B., Jimenez-Almazan J., Li Q., Du H., Alama P., Ruiz A., Bosch E., Garrido N., Simon C., Comprehensive carrier genetic test using next-generation deoxyribonucleic acid sequencing in infertile couples wishing to conceive through assisted reproductive technology, Fertil Steril 104(5) (2015). 1286–93. doi: 10.1016/j.fertnstert.2015.07.1166. [DOI] [PubMed] [Google Scholar]
- 20.Richards S, Aziz N, Bale S, A.L.Q.A. Committee ., et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology,. Genet Med. 2015;17(5):405–424. doi: 10.1038/gim.2015.30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Haque IS, Lazarin GA, Kang HP, et al. Modeled fetal risk of genetic diseases identified by expanded carrier screening. JAMA. 2016;316(7):734–742. doi: 10.1001/jama.2016.11139. [DOI] [PubMed] [Google Scholar]
- 22.Guo MH, Gregg AR.. Estimating yields of prenatal carrier screening and implications for design of expanded carrier screening panels. Genet Med. 2019;21(9):1940–1947. doi: 10.1038/s41436-019-0472-7. [DOI] [PubMed] [Google Scholar]
- 23.Ling Y, Wang H, Wu B, et al. Carrier analysis of known disease—causing genes based on WES dalca from single Shanghai pediatric center. Chin J Evid Based Pediatr. 2016;11(3):204–209. [Google Scholar]
- 24.Majewski J, Schwartzentruber J, Lalonde E, et al. What can exome sequencing do for you? J Med Genet. 2011;48(9):580–589. doi: 10.1136/jmedgenet-2011-100223. [DOI] [PubMed] [Google Scholar]
- 25.Gravina LP, Foncuberta ME, Estrada RC, et al. Carrier frequency of the 35delG and A1555G deafness mutations in the Argentinean population. Impact on the newborn hearing screening. Int J Pediatr Otorhinolaryngol. 2007;71(4):639–643. doi: 10.1016/j.ijporl.2006.12.015. [DOI] [PubMed] [Google Scholar]
- 26.Chan DK, Chang KW.. GJB2-associated hearing loss: systematic review of worldwide prevalence, genotype, and auditory phenotype. Laryngoscope. 2014;124(2):E34–53. doi: 10.1002/lary.24332. [DOI] [PubMed] [Google Scholar]
- 27.Mahdieh N, Rabbani B.. Statistical study of 35delG mutation of GJB2 gene: a meta-analysis of carrier frequency. Int J Audiol. 2009;48(6):363–370. doi: 10.1080/14992020802607449. [DOI] [PubMed] [Google Scholar]
- 28.Koohiyan M. Genetics of Hereditary Hearing Loss in the Middle East: A Systematic Review of the Carrier Frequency of the GJB2 Mutation (35delG). Audiol Neurootol. 2019;24(4):161–165. doi: 10.1159/000502201. [DOI] [PubMed] [Google Scholar]
- 29.Kelsell DP, Dunlop J, Stevens HP, et al. Connexin 26 mutations in hereditary non-syndromic sensorineural deafness. Nature. 1997;387(6628):80–83. doi: 10.1038/387080a0. [DOI] [PubMed] [Google Scholar]
- 30.Zelante L, Gasparini P, Estivill X, et al. Connexin26 mutations associated with the most common form of non-syndromic neurosensory autosomal recessive deafness (DFNB1) in Mediterraneans. Hum Mol Genet. 1997;6(9):1605–1609. doi: 10.1093/hmg/6.9.1605. [DOI] [PubMed] [Google Scholar]
- 31.Brobby GW, Müller-Myhsok B, Horstmann RD.. Connexin 26 R143W mutation associated with recessive nonsyndromic sensorineural deafness in Africa. N Engl J Med. 1998;338(8):548–550. doi: 10.1056/NEJM199802193380813. [DOI] [PubMed] [Google Scholar]
- 32.Yao J, Lu Y, Wei Q, et al. A systematic review and meta-analysis of 235delC mutation of GJB2 gene. J Transl Med. 2012;10(1):136. doi: 10.1186/1479-5876-10-136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Shen N, Peng J, Wang X, et al. Association between the p.V37I variant of GJB2 and hearing loss: a pedigree and meta-analysis. Oncotarget. 2017;8(28):46681–46690. doi: 10.18632/oncotarget.17325. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Wu CC, Tsai CH, Hung CC, et al. Newborn genetic screening for hearing impairment: a population-based longitudinal study. Genet Med. 2017;19(1):6–12. doi: 10.1038/gim.2016.66. [DOI] [PubMed] [Google Scholar]
- 35.Minami SB, Mutai H, Nakano A, et al. GJB2-associated hearing loss undetected by hearing screening of newborns. Gene. 2013;532(1):41–45. doi: 10.1016/j.gene.2013.08.094. [DOI] [PubMed] [Google Scholar]
- 36.Norris VW, Arnos KS, Hanks WD, et al. Does universal newborn hearing screening identify all children with GJB2 (Connexin 26) deafness? Penetrance of GJB2 deafness. Ear Hear. 2006;27(6):732–741. doi: 10.1097/01.aud.0000240492.78561.d3. [DOI] [PubMed] [Google Scholar]
- 37.Chai Y, Chen D, Sun L, et al. The homozygous p.V37I variant of GJB2 is associated with diverse hearing phenotypes. Clin Genet. 2015;87(4):350–355. doi: 10.1111/cge.12387. [DOI] [PubMed] [Google Scholar]
- 38.Li L, Lu J, Tao Z, et al. The p.V37I exclusive genotype of GJB2: a genetic risk-indicator of postnatal permanent childhood hearing impairment. PLoS One. 2012;7(5):e36621. doi: 10.1371/journal.pone.0036621. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.McBride JA, Kohn TP, Mazur DJ, et al. Sperm retrieval and intracytoplasmic sperm injection outcomes in men with cystic fibrosis disease versus congenital bilateral absence of the vas deferens. Asian J Androl. 2021;23(2):140–145. doi: 10.4103/aja.aja_48_20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Yu J, Chen Z, Ni Y, et al. CFTR mutations in men with congenital bilateral absence of the vas deferens (CBAVD): a systemic review and meta-analysis. Hum Reprod. 2012;27(1):25–35. doi: 10.1093/humrep/der377. [DOI] [PubMed] [Google Scholar]
- 41.Cuppens H, Lin W, Jaspers M, et al. Polyvariant mutant cystic fibrosis transmembrane conductance regulator genes. The polymorphic (Tg)m locus explains the partial penetrance of the T5 polymorphism as a disease mutation. J Clin Invest. 1998;101(2):487–496. doi: 10.1172/JCI639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Chillón M, Casals T, Mercier B.. Mutations in the cystic fibrosis gene in patients with congenital absence of the vas deferens. N Engl J Med. 1995;332(22):1475–1480. doi: 10.1056/NEJM199506013322204. [DOI] [PubMed] [Google Scholar]
- 43.Zielenski J, Patrizio P, Corey M, et al. CFTR gene variant for patients with congenital absence of vas deferens. Am J Hum Genet. 1995;57(4):958–960. [PMC free article] [PubMed] [Google Scholar]
- 44.Shin D, Gilbert F, Goldstein M, et al. Congenital absence of the vas deferens: incomplete penetrance of cystic fibrosis gene mutations. J Urol. 1997;158(5):1794–1799. discussion 1798-9. doi: 10.1016/s0022-5347(01)64131-4. [DOI] [PubMed] [Google Scholar]
- 45.O’Sullivan BP, Freedman SD.. Cystic fibrosis. Lancet. 2009;373(9678):1891–1904. doi: 10.1016/S0140-6736(09)60327-5. [DOI] [PubMed] [Google Scholar]
- 46.Krakowiak PA, Bohnsack JF, Carey JC, et al. Clinical analysis of a variant of Freeman-Sheldon syndrome (DA2B). Am J Med Genet. 1998;76(1):93–98. doi: . [DOI] [PubMed] [Google Scholar]
- 47.Tajsharghi H, Ohlsson M, Palm L, et al. Myopathies associated with beta-tropomyosin mutations. Neuromuscul Disord. 2012;22(11):923–933. doi: 10.1016/j.nmd.2012.05.018. [DOI] [PubMed] [Google Scholar]
- 48.Sung SS, Brassington AM, Grannatt K, et al. Mutations in genes encoding fast-twitch contractile proteins cause distal arthrogryposis syndromes. Am J Hum Genet. 2003;72(3):681–690. doi: 10.1086/368294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Haudek VJ, Slany A, Gundacker NC, et al. Proteome maps of the main human peripheral blood constituents. J Proteome Res. 2009;8(8):3834–3843. doi: 10.1021/pr801085g. [DOI] [PubMed] [Google Scholar]
- 50.Luo FM, Fan LL, Sheng Y, et al. Case report: exome sequencing identified a novel frameshift mutation of alpha-actin 1 in a Chinese family with macrothrombocytopenia and mild bleeding. Front Pediatr. 2021;9:679279. doi: 10.3389/fped.2021.679279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Kunishima S, Okuno Y, Yoshida K, et al. ACTN1 mutations cause congenital macrothrombocytopenia. Am J Hum Genet. 2013;92(3):431–438. doi: 10.1016/j.ajhg.2013.01.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Yasutomi M, Kunishima S, Okazaki S, et al. ACTN1 rod domain mutation associated with congenital macrothrombocytopenia. Ann Hematol. 2016;95(1):141–144. doi: 10.1007/s00277-015-2517-6. [DOI] [PubMed] [Google Scholar]
- 53.Westbury SK, Shoemark DK, Mumford AD.. ACTN1 variants associated with thrombocytopenia. Platelets. 2017;28(6):625–627. doi: 10.1080/09537104.2017.1356455. [DOI] [PubMed] [Google Scholar]
- 54.Short RV. The testis: the witness of the mating system, the site of mutation and the engine of desire. Acta Paediatr Suppl. 1997;422:3–7. doi: 10.1111/j.1651-2227.1997.tb18336.x. [DOI] [PubMed] [Google Scholar]
- 55.Trost B, Walker S, Haider SA, et al. Impact of DNA source on genetic variant detection from human whole-genome sequencing data. J Med Genet. 2019;56(12):809–817. doi: 10.1136/jmedgenet-2019-106281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Scheinfeldt LB, Hodges K, Pevsner J, et al. Genetic and genomic stability across lymphoblastoid cell line expansions. BMC Res Notes. 2018;11(1):558. doi: 10.1186/s13104-018-3664-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Sohni A, Tan K, Song HW, et al. The neonatal and adult human testis defined at the single-cell level. Cell Rep. 2019;26(6):1501–1517 e4. doi: 10.1016/j.celrep.2019.01.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Schulze W, Rehder U.. Organization and morphogenesis of the human seminiferous epithelium. Cell Tissue Res. 1984;237(3):395–407. doi: 10.1007/BF00228424. [DOI] [PubMed] [Google Scholar]
- 59.Breuss MW, Yang X, Gleeson JG.. Sperm mosaicism: implications for genomic diversity and disease. Trends Genet. 2021;37(10):890–902. doi: 10.1016/j.tig.2021.05.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets and material used or analysed during the current study are available from the corresponding author on reasonable request.


