Skip to main content
BMC Genomics logoLink to BMC Genomics
. 2025 Jul 11;26:656. doi: 10.1186/s12864-025-11744-1

Genome-wide association studies and candidate genes networks affecting reproductive traits using Iranian Holstein sequence data

Narges Maddahi 1, Mostafa Sadeghi 1,, Seyed Reza Miraee Ashtiani 1, Muna Kholghi 1, Ali Jalil Sarghale 1
PMCID: PMC12247366  PMID: 40646471

Abstract

Background

The reproduction process in domestic animals is one of the most important challenges of animal husbandry. Fertility is an important trait that contributes to herd profitability and can be improved by genomic information. One of the best ways to investigate the association between single nucleotide polymorphisms (SNPs) and phenotypic performance is the genome-wide association study (GWAS). The aim of our study was to identify the genomic regions affecting reproductive traits, interval between first and last insemination (IFL), days open (DO), days from calving to first service (DFS), number of services per conception (NSPC), age at first calving (AFC) and age at first insemination (AFI) using SNP chip data in Iranian Holstein cows.

Results

GWAS analysis for all reproductive traits based on the significant-association threshold P < 1 × 10–8, led to the identification of 55 single nucleotide polymorphisms (SNPs) for IFL (n = 3), DFS (n = 0), DO (n = 5), NSPC (n = 5), AFI (n = 33), and AFC (n = 9) traits. Based on the results of gene ontology analysis, 54 different candidate genes for reproductive traits were identified in this study. For IFL, NSPCC, DO, AFC, and AFI traits 4, 8, 11, 10, and 21 candidate genes were identified in the vicinity of significant SNPs, respectively. Key genes with biologically important positions for heifers (ATG7, PTPN5, STAC, GAD2, PLXDC2, KARS1, PRIM2, and ZNF597) and cows (LPL, SERP2, BIRC6, CENPU, PIK3C3, and MYLK3) can be mentioned.

Conclusions

Our results identified 55 marker-trait associations (MTAs) and 54 different candidate genes associated with reproductive traits. As a result, the SNPs and candidate genes discovered in this study can be used in genomic experiments to improve the reproductive performance of Iranian Holstein dairy cows and provide new information about the genetic architecture of these traits.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12864-025-11744-1.

Keywords: Candidate genes, Iranian Holstein cattle, Reproductive traits, Whole-genome sequence

Background

Following the increase and growth of the population in the world, there is a need to provide animal protein for people [1]. On the other hand, due to the limited land and water resources in the world, cattle management is necessary for higher productivity [2]. In this regard, paying attention to reproductive traits can have a great impact on production in dairy cows. Reproductive traits are among the most important traits in dairy cattle breeding [3]. Non-economic reproduction can lead to an increase in the calving interval, which reduces the replacement rate in dairy herds, and then the genetic improvement in the herd decreases, which ultimately results in a decrease in milk production as well as economic loss [4]. Improvement of reproductive traits, in addition to the effect they have on herd reproduction, can affect treatment and veterinary costs, animal welfare, and most importantly, the number of inoculations [3]. Today, reproductive problems are among the main reasons for the lack of profitability in dairy cattle herds [5]. The decline in reproductive efficiency in recent years has been significant in many studies. Research shows that the rate of stillbirth and mortality in cows, especially in dairy races, is increasing [6]. This issue has not only increased concern about the health and comfort of animals but has also caused a lot of economic damage to the dairy cattle industry. Therefore, the importance of fertility should be considered more in breeding programs [7]. The availability of data for the date of insemination allows animal breeders to calculate reproductive traits, e.g., AFI, AFC, IFL, DFS, DO, CI (calving interval), and GL (gestation length). In addition, fertility can be measured using categorical records such as NRR (non-return rate) and NSPC (number of services per conception) [3]. Reproductive traits are less responsive to genetic improvement through selective breeding due to their low heritability [3]. Reproductive traits are influenced by environmental factors and genetic factors. Environmental factors include season, temperature, nutrition, and management, which have a great impact on the length of the reproductive period [8].

During the past years, there have been studies on the identification of quantitative trait loci (QTL) on various traits in dairy cows, which has led to the identification of many QTLs affecting these traits on different chromosomes [9]. However, since measuring reproductive traits is not as easy as measuring productive traits [10], and due to the complexity of the structure of genes in these traits, fewer studies have been conducted in this field. Association mapping, or linkage disequilibrium (LD), is a relatively new and promising method to study quantitative traits. In this technique, the recombination events that have occurred during distant times until today are used for mapping [1114]. In this technique, it is possible to identify quantitative trait control sites that are associated the desired trait or even causal polymorphisms within the gene that are related to two alternative traits [11, 15]. Association mapping is based on the hypothesis of linkage disequilibrium (LD), which can be useful for identifying the association between haplotype blocks and the desired trait [1619]. Many studies using GWAS have been performed on reproductive traits in dairy cows and have identified candidate genes for these traits [3, 2023]. The whole genome scanning study can help to better identify genes and variants related to economic traits, and this information can help our understanding of the mechanisms of the studied traits. Since the goal of breeding is to increase the economic efficiency of the system, it is necessary to investigate the reproductive traits and genes affecting these traits using the GWAS method and find the loci affecting these traits in different Holstein cattle populations. Ultimately, these data can help expand our understanding of the structure of genes, reproductive biology of female cattle and improve genetic predictions within a race in a specific region [24].

Interest in SNP markers has increased in recent years because they are the smallest unit of genetic variation and the basis of most genetic variation between individuals. On the other hand, in recent years, many single nucleotide mutations, or SNPs, have been discovered at the genome level, which has made it possible to prepare a genetic map with high density [25, 26]. These high-density SNP chips have provided an extraordinary throughput for genotyping and genome scanning of economically important traits in dairy cattle [27]. Abdulahi Arpanahi et al. [28] reported that the correlation between SNP effects for heifer and cow fertility traits ranges from 0.29 to 0.41. On the other hand, previous studies showed that the genetic correlation between fertility traits recorded on heifers and cows is less than one [29, 30]. Therefore, we strongly believe that genetic and biological mechanisms regulating reproductive traits in heifers and cows are not the same, and selection programs should be designed separately. The present study is the first to be conducted on the reproductive traits of Iranian Holstein cows based on the whole genome. While the previous studies have been conducted based on low-density chips, which are far less accurate, therefore, in the present study, the GWAS method was used to identify genomic regions affecting reproductive traits in Iranian Holstein heifers and cows. which can provide valuable information to breeders. Because there is a lack of information in this field in the dairy cattle herds of Iran, it is necessary to conduct a more comprehensive study.

Results

Descriptive statistics for reproductive traits are presented in Table 1. In the least squares analysis of the studied traits, the effect of contemporary group and age was statistically significant for all traits (p < 0.05). The results showed that the highest amount of variation was related to the trait IFL (126.99%), and the lowest amount was assigned to the trait AFC (7.19%).

Table 1.

Descriptive statistics of the studied reproductive traits in Iranian Holstein

Traits N Mean Minimum Maximum Range Variance Coeff of Variation Std Error
AFC 192 24.05 20.41 31.44 11.03 2.99 7.19 0.13
AFI 202 14.47 11.70 19.26 7.56 1.50 8.46 0.09
DFS 189 63.71 35.50 136.00 100.50 311.69 27.71 1.28
IFL 189 62.39 0.00 384.50 384.50 6277.70 126.99 5.76
NIPCC 190 2.63 1.00 15.00 14.00 4.02 76.21 0.15
OD 189 127.66 40.50 564.50 524.00 7653.66 68.53 6.36

Genome-wide association studies for reproductive traits in Holstein cows

The results of the GWAS analysis for all reproductive traits were reported based on two significant thresholds: P < 1 × 10–6 and P < 1 × 10–8. For the DFS trait, based on the significance threshold (P < 1 × 10–8), no significant SNP markers were observed, but based on the significance threshold (P < 1 × 10–6), 24 marker-trait associations (MTAs) were identified, which were respectively located on chromosomes BTA1, BTA9, BTA10 (two markers), BTA11, BTA13 (six markers), BTA25 (six markers), and BTA26 (seven markers) (supplementary 1 Fig. 1A). Three significant marker-trait associations (P < 1 × 10–8) in BTA8, BTA20, and BTA24 regions and 15 significant marker-trait associations (P < 1 × 10–6) in BTA4, BTA5, BTA8, and BTA14 regions. nine markers), BTA20 (two markers), and BTA24 for the IFL trait were detected (Fig. 1). For the OD trait, 5 SNP markers based on a significance threshold of P < 1 × 10–8, were observed in the BTA2, BTA6 (two markers), and BTA14 (two markers) regions. In addition, based on the significance threshold of P < 1 × 10–6, 59 MTAs were identified on the chromosomes BTA2 (six markers), BTA5, BTA6 (26 markers), BTA9, BTA14 (11 markers), BTA17 (three markers), BTA18, BTA25 (eight markers), and BTA26 (two markers) (supplementary 1 Fig. 1B). The results showed that five SNP markers had a significant relationship with the NSPC trait based on the significance threshold of P < 1 × 10–8, which were located on the BTA24 (four markers) and BTA28 chromosomes. Based on the significance threshold of P < 1 × 10–6, seven MTAs were identified, which were located at BTA24 (four markers) and BTA28 (three markers) positions (supplementary 2 Fig. 1C). All significant markers for reproductive traits in cows are reported in Supplementary 2.

Fig. 1.

Fig. 1

Manhattan plot of the genome-wide p values of association for interval between first and last insemination trait in cow. The solid line represents the < 1 × 10−6 and < 1 × 10−8 significance threshold

Genome-wide association studies for reproductive traits in Holstein heifers

The results for the AFI trait, the number of 35 SNP markers, were able to cross the significance threshold line of P < 1 × 10–8 in the regions BTA1, BTA2, BTA6, BTA7 (three markers), BTA11 (nine markers), BTA13 (three markers), BTA15 (two markers), BTA18, BTA20 (three markers), BTA22 (nine markers), BTA25, and BTA29 were located (Fig. 2). For the AFC trait, nine SNP markers were identified based on the significance threshold of P < 1 × 10–8 in the BTA18, BTA25, and BTA26 regions (seven markers), respectively (supplementary 1 Fig. 2). Also, based on the significance threshold of P < 1 × 10–6, 401 and 375 significant SNP markers were identified for AFI and AFC traits, respectively. All significant markers for reproductive traits in cows are reported in Supplementary 2.

Fig. 2.

Fig. 2

Manhattan plot of the genome-wide p values of association for age at first insemination trait in cow. The solid line represents the p < 1 × 10−6 and < 1 × 10−8 significance threshold

QTL regions for reproductive traits in Holstein cows

A summary of significant SNPs (P < 1 × 10–8) associated with reproductive traits in cattle that are near the reported QTLs is presented in Table 2. The results showed that in the chromosomes BTA20 and BTA24 in the vicinity of the SNP affecting the IFL trait, there are important QTLs with related traits: milk yield (MY), milk fat yield (FY), milk protein yield (PY), muscularity (MUSC), body weight (BW), interval to first estrus after calving (CALEST), feed conversion ratio (FCR), stillbirth (SB), fertility index (FERIND), heifer pregnancy (HPG), first service conception (FSC), inseminations per conception (CONCEPT), finishing precocity (FPREC), calving ease (CALEASE), foot angle (FANG), pregnancy rate (PREGRATE), and calving index (CALVIND) were identified (Table 2). In the proximity of SNPs related to NSPC traits in chromosomes BTA24 and BTA28, QTLs related to traits: carcass weight (BW), body height (BW), milk protein yield (PY), udder height (UHT), interval to first estrus after calving (CALEST), interval from first to last insemination (INSINT), foot angle (FANG), and calving interval (CALVIND) were observed (Table 2). Further, for the DO trait, QTLs are associated with some traits: milk protein percentage (PP), milk protein yield (PY), body height (BW), milk fat yield (FY), calving ease (CALEASE), foot angle (FANG), interval from first to last insemination (INSINT), gestation length (GLENGTH), and rib thickness (RIBTH) was identified (Table 2).

Table 2.

QTLs located in close distance to the most significant single nucleotide polymorphisms (SNPs) associated with reproductive traits in Holstein cows

Trait SNP name CHR SNP position QTL trait QTL symbol
IFL 20:35332061 20 35332061 Somatic cell score SCS
Sperm motility SPMOT
Milk yield MY
Milk fat yield FY
Milk protein yield PY
Stature STA
Muscularity MUSC
Stayability STAY
24:39426460 24 39426460 Body depth BD
Carcass weight CWT
Body weight BW
Sole ulcer SULC
Milk fat percentage FP
Interval to first estrus after calving CALEST
Bovine tuberculosis susceptibility BTBS
Milk fat yield FY
Feed conversion ratio FCR
Milk protein percentage PP
Average daily gain ADG
Body height HEIGHT
Stillbirth SB
Interval from first to last insemination INSINT
Fertility index FERIND
Milk protein yield PY
Residual feed intake RFI
Milk yield MY
Milk potassium content MK
Feed efficiency FEEDEFF
Heifer pregnancy HPG
Sperm motility SPMOT
First service conception FSC
Stillbirth SB
Body length BL
Milk yield MY
Milk protein yield PY
Inseminations per conception CONCEPT
Conception rate CONCRATE
Finishing precocity FPREC
Calving ease CALEASE
Foot angle FANG
Pregnancy rate PREGRATE
Feed efficiency FEEDEFF
Calving index CALVIND
Antral follicle number AFOLN
Rump width RUMWD
NSPC 24:14252019, 24 14252019, Body weight BW
24:14817979, 14817979, Carcass weight CWT
24:14841115, 14841115, Longissimus muscle area LMA
24:14847278 14847278 Calving ease CALEASE
Conception rate CONCRATE
Udder swelling score USS
Feed conversion ratio FCR
Stillbirth SB
Milk stearic acid content MFA-C18:0
Finishing precocity FPREC
Feet and leg conformation FTLEG
Dressing percentage DRESSPERC
28:43469861 28 43469861 Teat placement—front FTPL
Carcass weight CWT
Body height HEIGHT
Marbling score MARBL
Calving ease CALEASE
Milk protein percentage PP
Stillbirth SB
Body weight BW
Udder height UHT
Metabolic body weight MBW
Scrotal circumference SCRCIR
Conformation score CONF
Milk protein yield PY
Subcutaneous fat thickness SUBFAT
Abnormal flavor intensity ABFLAV
Foot angle FANG
Longissimus muscle area LMA
Interval to first estrus after calving CALEST
Palmitoleic acid content FA-C16:1
Pelvic area PELAR
Interval from first to last insemination INSINT
Heifer pregnancy HPG
Milk palmitoleic acid content MFA-C16:1
Milk C16 index MC16
Sole hemorrhage SHEM
Meat color MCOL
Average daily gain ADG
Udder swelling score USS
Calving interval CALINTV
trans-12-C18:1 fatty acid content FA-C18:1n6
OD 2:15024774 2 15024774 Myristoleic acid conten FA
Immunoglobulin G level SIGG

14:80517355,

14:80519317

14

80517355, 

80519317

Milk protein percentage PP
Milk protein yield PY
Tick resistance TICKR
Body weight BW
Clinical mastitis CM
Somatic cell score SCS
Milk fat percentage FP
Milk fat yield FY
Marbling score MARBL
Gestation length GLENGTH
Interval from first to last insemination INSINT
Dry matter intake DMI
Calving ease CALEASE
Foot angle FANG
Muscularity MUSC
Maintenance efficiency MAINEF
trans-Vaccenic acid content FA-C18:1t

6:15796637,

6:15814726

6

15796637,

15814726

Calving ease CALEASE
Rib thickness RIBTH

QTL regions for reproductive traits in Holstein heifers

The results of significant SNPs (P < 1 × 10–8) associated with reproductive traits in heifers that are near the reported QTLs are presented in Table 3. The results showed that in the BTA26 chromosome, in the vicinity of the SNPs affecting the AFC trait, there are QTLs with the traits: milk fat yield (FY), milk yield (MY), milk protein yield (PY), milk fat percentage (FP), stillbirth (SB), bone quality (BQ), interval from first to last insemination (INSINT) and udder attachment (UA) were observed (Table 3). QTLs associated with traits such as carcass weight (CWT), meat color (MCOL), and dry matter intake (DMI) were also observed in the vicinity of effective SNPs in the BTA18 chromosome. In the vicinity of significant SNPs in the BTA25 chromosome, there are QTLs related to traits: age at puberty (PUBAGE), milk fat yield (FY), milk yield (MY), conception rate (CONCRATE), bone percentage (BONEP), stillbirth (SB), calving ease (CALEASE), and number of embryos (EMBN) was discovered (Table 3). Near significant AFI SNPs in chromosomes BTA1, BTA2, BTA6, BTA7, BTA11, BTA13, BTA15, BTA18, BTA20, BTA22, and BTA25 and positions related to traits: gestation length (GLENGTH), milk protein percentage (PP), meat color (MCOL), longissimus muscle area (LMA), calving ease (CALEASE), number of embryos (EMBN), fertility index (FERIND), body weight (BW), feed conversion ratio (FCR), and mean corpuscular volume (MCV) were detected (Table 3).

Table 3.

QTLs located in close distance to the most significant single nucleotide polymorphisms (SNPs) associated with reproductive traits in Holstein heifers

Trait SNP name CHR SNP position QTL trait QTL symbol
AFC 26:35254329, 26 35254329, Milk fat yield FY
26:35260042, 35260042, Milk yield MY
26:35261296, 35261296, Milk protein yield PY

26:35261978, 

26:35261830,

35261978, Milk fat percentage FP
26:35263013, 35261830, Stillbirth SB
26:35263174 35263013, Bone quality BQ
35263174 Interval from first to last insemination INSINT
Udder attachment UA
Structural soundness SOUND
18:29660489 18 29660489 Carcass weight CWT
Meat color MCOL
Dry matter intake DMI
25:3032618 25 3032618 Age at puberty PUBAGE
Milk fat yield FY
Milk yield MY
Conception rate CONCRATE
Bone percentage BONEP
Stillbirth SB
Calving ease CALEASE
Number of embryos EMBN
AFI 7:32022337, 7 32022337, Gestation length GLENGTH
7:31973977, 31973977,
7:31986904 31986904 Milk protein percentage PP
2:88625030 2 88625030 Meat color MCOL
15:40739864 15 40739864 Udder swelling score USS
Shear force SF
20:9880631, 20 9880631, Longissimus muscle area LMA
20:9897995, 9897995, Calving ease CALEASE
20:9907264 9907264 Immunoglobulin G level SIGG
25:41217739 25 41217739 Residual feed intake RFI
Number of embryos EMBN
13:32439203, 32439203, Fertility index FERIND
13:32442132, 32442132,
13:22165240 22165240 Body weight BW
11:12676017, 11 12,676,017, Finishing precocity FPREC
11:12680526, 12680526,
11:12683949 12683949
18:41119642 18 41119642 Interdigital hyperplasia IDH
Somatic cell score SCS
Residual feed intake RFI
Body weight BW
Milk fat yield FY
Longissimus muscle area LMA
22:55540238, 22 55540238, Milk fat yield FY
22:55540506, 55540506, Milk energy yield EY
22:55540517, 55540517, Milk yield MY
22:55540539,  55540539, Feed conversion ratio FCR
22:55555856 55555856 Mean corpuscular volume MCV

GO for reproductive traits in Holstein cows and heifers

The results of gene ontology analysis identified more than 200 genes for reproductive traits in heifers and Holstein cows, among which 54 important genes related to the desired traits are around (500 Kb higher than SNP and 500 Kb lower than SNP) SNPs correlated with the studied traits were detected (Tables 4 and 5). For the IFL trait, four candidate genes were identified in the vicinity of SNPs 8:66,532,988 (two numbers), 20:35,332,061, and 24:39,426,460, which were effective on the activity of the LPL, CNTNAP3, DAB2, and EPB41L3 genes. Eight candidate genes were detected for the NSPCC trait, which affected the activity of the PIK3C3, ERCC6, PARG, VSTM4, ERCC6, and ARHGAP22 genes. For the DO trait, 11 different candidate genes were identified in the vicinity of 2:15,024,774 (7), 6:15,796,637, 6:15,814,726, 14:80,517,355 (4), and 14:80,519,317 (4), which affect the activity of SERP2, BIRC6, CYP8B1, CENPU, CDK10, MYLK3, ASH1L, SLC10A5, SNX16, IMPA1, and CHMP4C genes (Table 4). The results of gene ontology analysis for the AFC trait led to the identification of 10 candidate genes in the vicinity of SNP, 29:25,831,763, which affect the activity of DST, EEF1A1, NPM1, KARS1, PRIM2, ALDH9A1, MIS18A, ADCY9, ZNF597, and ZNF93 genes. For the AFI trait, 21 different candidate genes were observed in the vicinity of significant SNPs, which affect the activity of some genes such as TMEM86A, UEVLD, CSRP3, PTPN5, TSG101, CRTAM, SLC6A1, GRHL1, HPCAL1, etc. (Table 5).

Table 4.

The candidate or nearest genes to the most significant single nucleotide polymorphisms (SNPs) in significant regions based on P < 1 × 10−8 for reproductive traits in Holstein cows

Trait SNP name SNP position CHR Gene start Gene end Ensembl gene ID Gene name
IFL 8:66532988 66532988 8 66032988 67032988 ENSBTAT00000017086, lipoprotein lipase (LPL),
ENSBTAT00000008454, contactin associated protein-like 3(CNTNAP3)
20:35332061 35332061 20 34832061 35832061 ENSBTAG00000016152 DAB adaptor protein 2(DAB2)
24:39426460 39426460 24 38926460 39926460 ENSBTAG00000019251 erythrocyte membrane protein band 4.1 like 3(EPB41L3)
NSPCC 24:14252019 14252019 24 13752019 14752019 ENSBTAT00000077983, phosphatidylinositol 3-kinase catalytic subunit type 3(PIK3C3)
ENSBTAT00000067289,
ENSBTAT00000037295
24:14817979 14817979 24 14317979 15317979 ENSBTAT00000077983, phosphatidylinositol 3-kinase catalytic subunit type 3(PIK3C3)
ENSBTAT00000067289,
ENSBTAT00000037295
24:14841115 14841115 24 14341115 15341115 ENSBTAT00000077983, phosphatidylinositol 3-kinase catalytic subunit type 3(PIK3C3)
ENSBTAT00000067289,
ENSBTAT00000037295
24:14847278 14847278 24 14347278 15347278 ENSBTAT00000077983, phosphatidylinositol 3-kinase catalytic subunit type 3(PIK3C3)
ENSBTAT00000067289,
ENSBTAT00000037295
28:43469861 43469861 28 42969861 43969861 ENSBTAT00000046144, ERCC excision repair 6, chromatin remodeling factor (ERCC6),
ENSBTAT00000079760, poly (ADP-ribose) glycohydrolase (PARG),
ENSBTAT00000008254, V-set and transmembrane domain containing 4(VSTM4),
ENSBTAT00000083009, ERCC excision repair 6, chromatin remodeling factor (ERCC6),
ENSBTAT00000010363 Rho GTPase activating protein 22 (ARHGAP22),
OD 2:15024774 15024774 2 14524774 15524774 ENSBTAG00000051524, stress associated endoplasmic reticulum protein family member 2(SERP2),
ENSBTAG00000027932, baculoviral IAP repeat containing 6(BIRC6),
ENSBTAG00000034106, cytochrome P450 family 8 subfamily B member 1(CYP8B1),
ENSBTAG00000018864, centromere protein U(CENPU),
ENSBTAG00000033333, cyclin dependent kinase 10(CDK10),
ENSBTAG00000014818, myosin light chain kinase 3(MYLK3),
ENSBTAG00000003954 ASH1 like histone lysine methyltransferase (ASH1L)
6:15796637 15796637 6 15296637 16296637 ENSBTAG00000003954 ASH1 like histone lysine methyltransferase (ASH1L)
6:15814726 15814726 6 15314726 16314726 ENSBTAG00000003954 ASH1 like histone lysine methyltransferase (ASH1L)
14:80517355 80517355 14 80017355 81017355 ENSBTAT00000027342, solute carrier family 10 member 5(SLC10A5),
ENSBTAT00000078298, sorting nexin 16(SNX16),
ENSBTAT00000015548, inositol monophosphatase 1(IMPA1),
ENSBTAT00000006291 charged multivesicular body protein 4C(CHMP4C)
14:80519317 80519317 14 80019317 81019317 ENSBTAT00000027342, solute carrier family 10 member 5(SLC10A5),
ENSBTAT00000078298, sorting nexin 16(SNX16),
ENSBTAT00000015548, inositol monophosphatase 1(IMPA1),
ENSBTAT00000006291 charged multivesicular body protein 4C(CHMP4C)

Table 5.

The candidate or nearest genes to the most significant single nucleotide polymorphisms (SNPs) in significant regions based on P < 1 × 10−8 for reproductive traits in Holstein heifers

Trait SNP name SNP position CHR Gene start Gene end Ensembl gene ID Gene name
AFI 29:25831763 25831763 29 25331763 26331763 ENSBTAG00000048059, transmembrane protein 86A(TMEM86A),

ENSBTAG00000013568,

ENSBTAG00000002488,

ENSBTAG00000011869,

ENSBTAG00000020257,

UEV and lactate/malate dehyrogenase domains (UEVLD), lactate dehydrogenase C(LDHC), cysteine and glycine rich protein 3(CSRP3), protein tyrosine phosphatase non-receptor type 5(PTPN5),
ENSBTAG00000017446, E2F transcription factor 8(E2F8),
ENSBTAG00000013563 tumor susceptibility 101(TSG101)
22:55681344 55681344 22 55181344 56181344 ENSBTAG00000000842, ENSBTAG00000005193 ubiquitin associated and SH3 domain containing B (UBASH3B), cytotoxic and regulatory T cell molecule (CRTAM)
22:55540238 55540238 22 55040238 56040238 ENSBTAG00000005857, ENSBTAG00000035827 solute carrier family 6 member 1(SLC6A1),
autophagy related 7(ATG7)
22:55540506 55540506 22 55040506 56040506 ENSBTAG00000005857, ENSBTAG00000035827 solute carrier family 6 member 1(SLC6A1),
autophagy related 7(ATG7)
22:55540517 55540517 22 55040517 56040517 ENSBTAG00000005857, ENSBTAG00000035827 solute carrier family 6 member 1(SLC6A1),
autophagy related 7(ATG7)
22:55540539 55540539 22 55040539 56040539 ENSBTAG00000005857, ENSBTAG00000035827 solute carrier family 6 member 1(SLC6A1),
autophagy related 7(ATG7)
22:55555856 55555856 22 55055856 56055856 ENSBTAG00000005857, ENSBTAG00000035827 solute carrier family 6 member 1(SLC6A1),
autophagy related 7(ATG7)
15:33112811 33112811 15 32612811 33612811 ENSBTAG00000005857, ENSBTAG00000035827 solute carrier family 6 member 1(SLC6A1),
autophagy related 7(ATG7)
11:92438410 92438410 11 91938410 92938410 ENSBTAG00000001140, ENSBTAG00000001141, isoamyl acetate hydrolyzing esterase 1 (putative)(IAH1), ADAM metallopeptidase domain 17(ADAM17), grainyhead like transcription factor 1(GRHL1),
ENSBTAG00000007485, TATA-box binding protein associated factor, RNA polymerase I subunit B(TAF1B), hippocalcin like 1(HPCAL1), cytidine/uridine monophosphate kinase 2(CMPK2), KLF transcription factor 11(KLF11),
ENSBTAG00000007543, radical S-adenosyl methionine domain containing 2(RSAD2)
ENSBTAG00000004259, ENSBTAG00000019979,
ENSBTAG00000046218,
ENSBTAG00000016061
11:87794544 87794544 11 87294544 88294544 ENSBTAG00000001140, ENSBTAG00000001141, isoamyl acetate hydrolyzing esterase 1 (putative)(IAH1), ADAM metallopeptidase domain 17(ADAM17), grainyhead like transcription factor 1(GRHL1),
ENSBTAG00000007485, TATA-box binding protein associated factor, RNA polymerase I subunit B(TAF1B), hippocalcin like 1(HPCAL1), cytidine/uridine monophosphate kinase 2(CMPK2), KLF transcription factor 11(KLF11),
ENSBTAG00000007543, radical S-adenosyl methionine domain containing 2(RSAD2)
ENSBTAG00000004259, ENSBTAG00000019979,
ENSBTAG00000046218,
ENSBTAG00000016061
22:10534777 10534777 22 10034777 11034777 ENSBTAG00000006735 SH3 and cysteine rich domain (STAC)
22:10539654 10539654 22 10039654 11039654 ENSBTAG00000006735 SH3 and cysteine rich domain (STAC)
22:10547117 10547117 22 10047117 11047117 ENSBTAG00000006735 SH3 and cysteine rich domain (STAC)
13:32439203 32439203 13 31939203 32939203 ENSBTAG00000009475 plexin domain containing 2(PLXDC2)
13:32442132 32442132 13 31942132 32942,132 ENSBTAG00000009475 plexin domain containing 2(PLXDC2)
13:22165240 22165240 13 21665240 22665240 ENSBTAG00000009475 plexin domain containing 2(PLXDC2)
11:12676017 12676017 11 12176017 13176017 ENSBTAG00000009475 plexin domain containing 2(PLXDC2),

Gene networks

The results of gene network analysis for reproductive traits including IFL, DO, NSPC, AFI, and AFC are shown in Figs. 3 and 4. Based on the results of gene network analysis, there were contributions among candidate genes through co-expression, pathways, physical interactions, and shared protein domains for reproductive traits. Most of these contributions were related to physical interactions between genes, which indicated protein–protein interaction, and if two genes showed the same protein–protein interaction, their products were linked. Therefore, the candidate genes identified in our study had significant protein–protein interactions with each other or with related genes.

Fig. 3.

Fig. 3

Gene networks analysis for interval between first and last insemination (A), days open (D), number of services per conception (C) and reproductive traits in Holstein cows (D). Dark circles with and without slash represent candidate genes and associated genes, respectively. Arrows in pink, blue, red and bone color represent co-expression, pathway, physical interactions and shared protein domains, respectively

Fig. 4.

Fig. 4

Gene networks analysis for age at first insemination (A), age at first calving (B) and reproductive traits in Holstein heifers (C). Dark circles with and without slash represent candidate genes and associated genes, respectively. Arrows in pink, blue, red and bone color represent co-expression, pathway, physical interactions and shared protein domains, respectively

Discussion

Identification of genomic regions affecting reproductive traits can provide new opportunities to improve fertility in dairy cows. However, this study introduced genes and biological networks affecting reproductive traits in Holstein cattle [3].

Based on our GWAS results, three SNP markers in BTA8, BTA20, and BTA24 regions with the IFL trait, five markers in BTA2, BTA6 (two markers), and BTA14 (two markers) regions for the DO trait, and five markers on chromosomes BTA24 (four markers) and BTA28 showed a significant relationship with the NSPC trait based on the significance threshold of P < 10–8. The results obtained from this study are consistent with the results of other studies [3, 21, 31, 32]. Chen et al. [31] reported that the SNP markers that had a significant relationship with the IFL trait were in the BTA8, BTA7 and BTA20 regions. Mohammadi et al. [3] reported that several SNP markers located in BTA19 and BTA20 had a significant effect on the IFL trait. Eight QTLs located on BTA2, BTA1, and BTA26 chromosomes were significantly correlated with the DO trait [32]. Most SNPs affecting the NSPC trait were located on chromosomes BTA24, and BTA28 [21]. A marker affecting the age at first insemination (AFI) trait was observed in the cM 25 region of chromosome number two [33]. A SNP in the BTA29 region affecting this trait was also reported in Swiss-bred cattle [31]. In another study, Alves et al. [34] reported significant regions in BTA7 for AFI. In our study, significant SNP markers were observed the AFC trait in the BTA18, BTA25, and BTA26 regions (seven markers). In a recent study, 19 significant SNP markers for AFC were in the BTA13, BTA25 and BTA26 regions [35]. In the last research conducted by Nascimento et al. [36] on Nellore heifers, it was found that significant markers on the AFC trait were mostly located in the BTA18, BTA21, and BTA26 regions.

The results of QTLs identified in this study affecting the IFL trait were consistent with the results of Chen et al. [31] and Mohammadi et al. [3] on Holstein cows. Considering that the structure of the udder in cattle is one of the most important traits in profitability, the more suitable the structure of the udder is, the better the milk production will be, and the probability of udder infection in livestock and the number of somatic cells will decrease. The results of studies have proven the relationship between breast health traits, milk production [37], and reproductive traits [21]. The results of a study showed that five candidate genes with high performance, which were in the form of haplotype blocks, had an effective effect on stillbirth and fertility traits [38]. The results of this study show that the decrease in fertility rate can be affected by the residual properties of the consumed feed [39]. Because the negative energy balance affects the amount of insulin produced and reduces the production of growth hormone, it will eventually lead to a decrease in hepatic IGF-1 synthesis [40]. In this study, around the observed SNP, QTLs were observed that were correlated with milk quantity traits and udder infection, which can indicate that the animal became pregnant later at the peak of lactation. Around the SNP observed in this study, several QTLs affecting multiple births were observed. In the study of Widmer et al. [41] on chromosome 11, many QTLs in the form of haplotype blocks had a significant relationship with fertility and calving traits. In this study, it was also reported that the correlation between calving and fertility traits and the trait of multiple birth was negative and unfavorable. In the vicinity of SNPs related to IFL, DO, and NSPC traits, QTLs were observed in this study, which are related to body weight traits, which can be a proof of the claim that heifers should reach their ideal weight at the time of insemination. That this weight will be different depending on different breeds, for example, in Brahman breed cows, cows with higher weight had more inappropriate fertility [39]. In this study, QTLs related to calving ease were identified. The results of the study show that calving ease in the Brown Swiss breed is affected by body weight, so cows with higher weight have more difficulty calving and calving interval will be higher [42]. Around the SNP associated with the number of services per conception trait, a QTL was also effective on the fertility index trait. Fertility index is the result of multiplying the capture index by the estrus index, and since the inverse capture index is the trait of the NSPC, the relationship between these two traits is not far from expected [22]. The identification of QTLs affecting stillbirth and puberty age in this area shows the important role of the identified markers on the age at first calving trait. In general, and experimentally, heifer body weight, calf size, Keppel's angle, and age at the first insemination can have an effect on the ease of reproduction trait. The results of a study proved the moderate genetic correlation between calving ease traits, calving weight, and fetal survival [43]. In the last study by Nascimento et al. [36] that was conducted on Nellore heifers, it was found that in the vicinity of significant markers on the trait of age at first calving, there were QTLs with traits of milk yield, milk protein yield, milk fat percentage, non-return rate, and stillbirth. Finally, the QTLs identified in the margins of significant SNPs confirm the fact that the age of the first insemination and the age of the first calving can be effective on production traits, reproduction, longevity, and finally on the profitability of the herd in these conditions. It can be said that herd management plays an important role in advancing breeding goals [20]. Also, the identification of the QTLs related to traits such as stillbirth, calving ease, number of embryos, length of pregnancy, percentage of milk protein, fertility index, and body weight in these areas has already been confirmed by other researchers [20, 33, 34].

Four key LPL, CNTNAP3, DAB2, and, EPB41L3 genes were identified for the IFL distance trait. The Lipoprotein lipase (LPL) gene is a key enzyme for fat metabolism and plays an essential role in fat synthesis in adipose tissue and milk. LPL is also expressed in other places, including mammary gland, nervous system, heart, liver, pancreatic islet cells, and lungs [44]. Several key genes were identified for the DO trait. The SERP2 gene plays a role in udder infection. The effect of this gene on camel udder infection has been confirmed. Udder infection is among the things that affect the days open traits [45]. The MDFI gene is a gene that inhibits muscle growth, this gene can also affect growth traits and prevent the proper growth of livestock [46]. The BIRC6 gene causes fetal growth at the beginning of pregnancy. Embryo absorption in early pregnancy increases the number of inseminations per pregnancy [47]. In the BTA2 chromosome, two genes, CYP7A1 and CYP8B1, are involved in the conversion of cholesterol into bile acids, which play an important role in nutrient absorption, digestion, fat, and glucose regulation. These genes also play a role in the positive regulation of cholesterol biosynthesis in heifers and increasing milk production two weeks after giving birth, especially in multiple births [48]. The CENPU gene in BTA2 was effective on embryo survival in dairy cows. This gene has an abundance of 0.08% in French Holstein cows and can be among the genes that cause embryo death in cows [49]. The CDK10 gene plays an important role in fat distinctness. This gene is one of the 185 genes in the CDK family. In this study, another gene from this family (CDK2) in BTA13 had a SNP effect on the age of first insemination [50]. Eight key genes were identified for the NSPC trait, among them, the PIK3C3 gene was observed in BTA24 in the margins of SNPs affecting NSPC trait, which plays a role in growth factors, embryo growth during pregnancy, and protein synthesis. Also, PIK3C3 plays a role in the development of skeletal muscle 21 days after delivery, and this protein plays a role in the transition period and negative energy balance in protein balance. Postpartum energy balance is effective on the number of inseminations per pregnancy [51].

Some key genes were identified for the AFI trait, e.g., in BTA15, the CRTAM gene was observed. This gene is a regulator of T cells, and it has been reported that this gene has a relationship with the immune response in cattle [52]. It seems that the ATG7 gene in BTA22 is effective for muscle and bone growth. The results of a study on Japanese black beef cattle showed that the expression of ATG7 increased during obesity in skeletal muscles [53]. In BTA29, the E2F8 gene is one of the essential genes for the development and proliferation of the cell cycle. This gene is also effective in the growth and development of the cow follicle. On the other hand, this gene affects the growth of the fetus and the development of the uterine system in the first pregnancy [54]. The Tsg101 gene is required for the maintenance of uterine epithelial cells, and its deletion may cause the breakdown of these cells and lead to embryo damage at the beginning of implantation [55]. The GRHL1 gene affects the process of yolk sac formation at the beginning of pregnancy and plays an important role in placenta development [56]. In general, the function of this gene on reproductive traits, including abortion and stillbirth, has been proven [57]. The KLF11 gene in BTA11 is a member of a gene family called KLFs. which affects muscle fat content [58]. Muscle fat content is one of the determinants of meat quality traits in livestock. There are also several different KLFs that regulate cell growth and differentiation in mammals and, like the previous gene, have a great effect on pregnancy traits [58]. In BTA22, the STAC gene (SH3 and cysteine rich domain) plays an important role in the growth and contraction of skeletal muscles. STAC3 expression is observed during the differentiation of bovine satellite cells during weight gain and muscle growth [59]. The MLLT10 gene in BTA22 is associated with fertility traits. It plays a role in the biological processes of mammary gland morphogenesis, uterine growth, morphogenesis and mesoderm formation and blood vessel growth. The expression of this gene may be affected in the reproductive traits of goats under natural or artificial selection [60]. Among other identified genes, we can mention that the PLXDC2 gene in BTA11 is effective on the weight trait. The results of a study conducted on Lori Bakhtiari sheep showed that this gene can be considered a candidate gene for body weight traits after weaning [61]. PHYH is another gene that is effective on fatty acid oxidation and fatty acid metabolism affects successful fertility, which shows that there is a relationship between immune traits and reproduction [62]. In BTA11, the sperm associated antigen 6 (SPAG6) gene, sperm motility, the percentage of sperm with forward progress, is one of the characteristics of semen quality, which is highly related to male fertility potential in farm animals. Low sperm motility disrupts reproductive efficiency. Studies have shown that SPAG6 protein is a positive regulator for testis growth, spermatogenesis, and regulation of sperm movement in chickens [63].

For the AFC trait in BTA13, the DST gene plays an important role in fighting the virus, especially the herpes virus in cattle, and is one of the genes that improves the immune system. The results of a study showed that the DST gene is effective on the survivability of female bulls [64]. The EEF1A1 gene, which encodes one of the most abundant plasma proteins, plays an important role in the translation machine. In a study conducted on dairy cows, it was found that there is a high correlation between body weight gain, muscle growth, and feed residue. It can be concluded that this gene plays a role in tissue and muscle growth, and the presence of this gene next to the SNP associated with the AFC trait confirms the effect of this SNP [65]. In BTA26, the PRIM2 gene has been effective on carcass weight traits. The results of a study conducted on dairy cows showed that this gene was effective on carcass weight, daily weight gain, and maturity weight [66]. The ADCY9 gene influences the improvement of reproductive organs in heifers. In general, the reproductive efficiency of heifers directly depends on the health of the reproductive system, especially the ovaries and uterus. In the last research conducted on the genes of the ADCY family, it was found that this gene group has a great effect on the formation of uterine tissue and ovaries in Xiangxi cattle [67]. The bovine ZNF597 gene is another maternally expressed gene, and its expression is regulated by DNA methylation. It was shown that this gene was effective in the formation of placenta tissue and breast cell structure in dairy cows [68]. However, since cows were selected based on the estimated breeding value of milk yield in this study, QTLs and genes identified around significant SNPs may increase milk or fat production and thus improve reproductive traits in cattle.

Conclusions

The use of GWAS facilitates the combination of genomic data in the genetic evaluation of reproductive traits in Holstein cattle. We have identified several SNPs, significant regions in different BTAs, and a list of locus candidate genes (new and known) that may contribute to variation in reproductive traits in Holstein cattle. The most important QTLs identified included: interval from first to last insemination (INSINT), age at puberty (PUBAGE), calving ease (CALEASE), fertility index (FERIND), and body weight (BW). Key genes with an important biological position for heifers (ATG7, PTPN5, STAC, GAD2, PLXDC2, KARS1, PRIM2, and ZNF597) and cows (LPL, SERP2, BIRC6, CENPU, PIK3C3, and MYLK3) can be mentioned. Which can be used as key genes in future programs. In general, these results can help to better understand the studied traits and develop and improve them.

Materials and methods

Phenotypic data

After receiving permission from two companies affiliated with Ferdous Pars Agriculture and Animal Husbandry Development Holding for sampling and receiving company information from Iran's Livestock Breeding and Improvement Center, information about the cows of these companies was received. It is worth mentioning that the breeding value of the milk production trait for these cows was estimated by that center through the following equation (Eq. 1).

yij=μ+hysi+aij+eij 1

where yij is milk yield (adjusted to 305 days and twice a day milking); μ is the population mean; hysi is the effect of herd-year-season group i; aij is the animal breeding for jth animal and ith herd-year-season group, and eij is the random residual effects.

The cows were sorted according to the breeding value of the milk production trait from large to small, and 210 female cows born between 2012–2013 were selected for this study (150 and 60 cattle, respectively, in herds no. 1 and 2) [69]. These 210 animals were divided into two subpopulations of 105 based on the highest and lowest breeding values of milk yield.

In addition to the mentioned cases, the following cases were also considered during the sampling: during the sampling, the pedigree analysis of the livestock was done using the CFC software [70] and tried livestocks with minimal kinship relationships were selected so that the diversity of livestock in both herds was high [71]. The selected animals had complete pedigrees and records, and it was ensured that the selected animals were not candidate animals for elimination.

In this study, the reproductive traits of Holstein heifers and cows (lactation 1 to 3), collected from 2013 to 2020, were used. The following traits, including AFI, AFC, DFS, DO, IFL, and NSPC, were analyzed. Attributes are presented and defined in Table 6.

Table 6.

Definition of the studied reproductive traits in Iranian Holstein

Traits Definition
Age at first insemination (days) 1st insemination date – birth date
Age at first calving (days) Calving date – birth date
Interval between first and last insemination (days) Last – first insemination date
Days from calving to first service (days) 1st insemination date – previous calving date
Days open (days) Last insemination date – previous calving date
Number of services per conception Number of services / conception

Genotype imputation and quality control (QC)

Hair samples were collected from 210 Iranian Holstein cows in a breeding population (Firdous Pars dairy farm, Isfahan, Iran) and no anesthetic was used in this study. One-hundred forty and 60 animals from herd 1 and 2 were genotyped by the GGP-LD v.4 SNP panel (with 30,108 SNPs) and the Affymetrix Axiom Bovine Array-50 K (with 51,987 SNPs), respectively. To control the quality of genotyping, four criteria were used in the PLINK 2.0 software. In this criterion, animals with more than 5% of missing genotypes were excluded, SNPs with minor allele frequency (MAF) less than 5% were excluded, and SNPs that were not genotyped for more than 5% of animals and chi scores were less than 10–6. Chi-square < 10−6) were excluded from the Hardy–Weinberg equilibrium test. Minimac3 software [72]. was used for imputation to check the accuracy of imputation and identify and remove markers with lower accuracy and stepwise imputation.

First, 210 cows were genotyped based on 30 K and 50 K SNPs, and the target population was named. Then, based on the reference population including 234 cows (the 1000 Bull Genome Project database containing 129, 43, 15, and 47 key progenitors from the global Holstein–Friesian, Fleckoyer, Jersey, and Angus breeds, respectively) with BovineHD beadchip and the whole-genome, imputation was performed [33]. The same quality control was performed on BovineHD beadchip and the sequence data, and a total of 578,505 SNPs and 12,063,146 SNPs were retained for analysis after filtering, respectively. In addition, Eagle (version 2.3) was used to phase genotypes before using Minimac3 for reference and target populations separately [73]. Finally, genotypes assigned with an accuracy of less than 0.30 were excluded, and a total of 6,583,595 SNPs were retained for further analysis [69]. Finally, the remaining 6,583,595 SNPs were used for GWAS analysis.

Genome-wide association studies

Analysis of the relationship between the imputed genotypes and the studied traits was performed using a general linear model (GLM) in EMMAX software [74]. The equation used in GWAS is as follows (Eq. 2):

y=Xb+Zu+e 2

In this model, y is the vector of phenotypic values, X is the matrix of coefficients for fixed factors, including mean, fixed effects, random effects, and SNP effects. b is the vector of fixed effects, Z is the matrix of coefficients of additive genetic random effects, u is the vector of additive genetic random effects, and e is the vector of residual effects. In addition, Varu=σg2K and Vare=σe2I, where K is a genomic kinship matrix calculated using genotype data and then used in the model, and I is an identity matrix.

Significance levels

In this research, we used the following three methods to identify the threshold of significant association in the GWAS study: 1- The threshold of P < 1 × 10–8 suggested by Reed et al. [75] 2- Benferoni correction threshold for the whole genome, which is equivalent to P < 1 × 10–9 (total number of SNPs / P < 0.05). 3- Benferoni correction threshold per chromosome (total number of SNPs per chromosome/P < 0.05) has been proposed as an alternative solution [76]. Because the Benferoni correction for the whole genome does not consider the connection between SNPs, this approach increases conservatism and the false negative rate [77]. Two significant thresholds were considered: P < 1 × 10–6 and P < 1 × 10–8. To draw the Manhattan plot, the qqman package was used in the R 4.3.2 software [78].

Gene annotation

Ensembl annotation of the UMD3.1 genome version (http://www.ensembl.org/biomart/martview) was used to identify candidate genes around (within one megabase) SNPs that passed the threshold of P < 1 × 10–8. DAVID Bioinformatics Resources version 6.7 (http://david.abcc.ncifcrf.gov/) was used for gene ontology analysis. In addition, to identify the QTLs around (within 1 Mb) SNPs that passed the threshold of P < 1 × 10–8, the QTLdb of cattle (https://www.animalgenome.org/cgibin/QTLdb/BT/index) was used. Finally, gene networks were drawn by GeneMANIA (http://genemania.org/).

Supplementary Information

Supplementary Material 1. (583.9KB, docx)

Acknowledgements

We would like to extend thanks to the Ferdows Pars Agricultural Holding Company, and National Animal Breeding Centre of Iran for giving us access to the animals and recording. Finally, we acknowledge the 1000 Bull Genomes Project for making their research data publicly available.

Authors’ contributions

NM performed the experiments and data analysis and wrote the article draft; MS and AJS supervised the project and provided editorial input on the writing. SRMA and MK contributed to data analysis and writing the article draft. All authors discussed the results and contributed to the final manuscript. The author(s) read and approved the final manuscript.

Funding

This research did not receive any specific funding.

Data availability

The datasets generated and analyzed during the current study are available in the Figshare repository [https://doi.org/10.6084/m9.figshare.28604060].

Declarations

Ethics approval and consent to participate

The samples collected from the studied animals were performed in accordance with animal ethics and approved by the Animal Use Committee of the University of Tehran and the National Animal Breeding Centre of Iran. In addition, permission for sampling was obtained from the farmers on site. Also, we have obtained the informed consent of the owner(s) to use the animals in our study in this area from the University of Tehran and the National Animal Breeding Center of Iran.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Henchion M, Moloney AP, Hyland J, Zimmermann J, McCarthy S. Trends for meat, milk and egg consumption for the next decades and the role played by livestock systems in the global production of proteins. Animal. 2021;15: 100287. [DOI] [PubMed] [Google Scholar]
  • 2.Heinke J, Rolinski S, Müller C. Modeling the role of livestock grazing in C and N cycling in grasslands with LPJmL5. 0-grazing. Geoscientific Model Development Discussions. 2022;2022:1–31. [Google Scholar]
  • 3.Mohammadi A, Alijani S, Rafat SA, Abdollahi-Arpanahi R. Single-step genome-wide association study and candidate genes networks affecting reproductive traits in Iranian Holstein cattle. Livest Sci. 2022;262:104971. [Google Scholar]
  • 4.Inchaisri C, Jorritsma R, Vos PL, Van der Weijden GC, Hogeveen H. Economic consequences of reproductive performance in dairy cattle. Theriogenology. 2010;74(5):835–46. [DOI] [PubMed] [Google Scholar]
  • 5.Ahlman T, Berglund B, Rydhmer L, Strandberg E. Culling reasons in organic and conventional dairy herds and genotype by environment interaction for longevity. J Dairy Sci. 2011;94(3):1568–75. [DOI] [PubMed] [Google Scholar]
  • 6.McConnel CS, Lombard JE, Wagner BA, Garry FB. Evaluation of factors associated with increased dairy cow mortality on United States dairy operations. J Dairy Sci. 2008;91(4):1423–32. [DOI] [PubMed] [Google Scholar]
  • 7.Liu Z, Jaitner J, Pasman E, Rensing S, Reinhardt F, Reents R. Genetic evaluation of fertility traits of dairy cattle using a multiple trait model. Interbull Bulletin. 2007;37:134. [DOI] [PubMed] [Google Scholar]
  • 8.Espinoza Villavicencio JL, Ceró Rizo Á, Guerra Iglesias D, Palacios Espinosa A, Domínguez Viveros J, González-Peña Fundora D. Environmental factors and genetic parameters for some reproductive traits in Chacuba cattle. Revista mexicana de ciencias pecuarias. 2015;6(4):431–41. [Google Scholar]
  • 9.Lomba YL. Review on quantitative trait locus (QTLs) for yield and quality of milk, meat, fibre, eggs and their application. Int J Animal Sci. 2020;6(1):285–91. [Google Scholar]
  • 10.Ma L, Cole JB, Da YA, VanRaden PM. Symposium review: Genetics, genome-wide association study, and genetic improvement of dairy fertility traits. J Dairy Sci. 2019;102(4):3735–43. [DOI] [PubMed] [Google Scholar]
  • 11.Alonso-Hearn M, Badia-Bringué G, Canive M. Genome-wide association studies for the identification of cattle susceptible and resilient to paratuberculosis. Front Vet Sci. 2022;9:935133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Ersoz ES, Yu J, Buckler ES. Applications of linkage disequilibrium and association mapping in maize. InMolecular genetic approaches to maize improvement,. Berlin, Heidelberg: Springer, Berlin Heidelberg; 2009. p. 173–95. [Google Scholar]
  • 13.Rabieyan E, Bihamta MR, Esmaeilzadeh Moghaddam M, Mohammadi V, Alipour H. Genome-wide association mapping and genomic prediction for preharvest sprouting resistance, low α-amylase and seed color in Iranian bread wheat. BMC Plant Biol. 2022;22(1):1–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Rabieyan E, Bihamta MR, Moghaddam ME, Mohammadi V, Alipour H. Genome-wide association mapping and genomic prediction of agronomical traits and breeding values in Iranian wheat under rain-fed and well-watered conditions. BMC Genomics. 2022;23(1):831. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Abdi H, Alipour H, Bernousi I, Jafarzadeh J, Rabieyan E. Morpho-colorimetric diversity and genome-wide association study of wheat spike architecture based on image processing under well-watered and rain-fed conditions. J Plant Growth Regul. 2024;5:1–8. [Google Scholar]
  • 16.McKay SD, Schnabel RD, Murdoch BM, Matukumalli LK, Aerts J, Coppieters W, Crews D, Neto ED, Gill CA, Gao C, Mannen H. Whole genome linkage disequilibrium maps in cattle. BMC Genet. 2007;8(1):1–2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Rabieyan E, Darvishzadeh R, Alipour H. Genetic analyses and prediction for lodging-related traits in a diverse Iranian hexaploid wheat collection. Sci Rep. 2024;14(1):275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Rabieyan E, Bihamta MR, Moghaddam ME, Alipour H, Mohammadi V, Azizyan K, Javid S. Analysis of genetic diversity and genome-wide association study for drought tolerance related traits in Iranian bread wheat. BMC Plant Biol. 2023;23(1):431. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Rabieyan E, Bihamta MR, Moghaddam ME, Mohammadi V, Alipour H. Genome-wide association mapping for wheat morphometric seed traits in Iranian landraces and cultivars under rain-fed and well-watered conditions. Sci Rep. 2022;12(1):17839. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Frischknecht M, Bapst B, Seefried FR, Signer-Hasler H, Garrick D, Stricker C, Intergenomics Consortium, Fries R, Russ I, Sölkner J, Bieber A. Genome-wide association studies of fertility and calving traits in Brown Swiss cattle using imputed whole-genome sequences. BMC genomics. 2017;18:1–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Liu A, Lund MS, Wang Y, Guo G, Dong G, Madsen P, Su G. Variance components and correlations of female fertility traits in Chinese Holstein population. J Animal Sci Biotechnol. 2017;8:1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Abdollahi-Arpanahi R, Carvalho MR, Ribeiro ES, Peñagaricano F. Association of lipid-related genes implicated in conceptus elongation with female fertility traits in dairy cattle. J Dairy Sci. 2019;102(11):10020–9. [DOI] [PubMed] [Google Scholar]
  • 23.Stephen MA, Burke CR, Steele N, Pryce JE, Meier S, Amer PR, Phyn CV, Garrick DJ. Genome-wide association study of anogenital distance and its (co) variances with fertility in growing and lactating Holstein-Friesian dairy cattle. J Dairy Sci. 2023;106(11):7846–60. [DOI] [PubMed] [Google Scholar]
  • 24.Otto PI, Guimarães SE, Calus MP, Vandenplas J, Machado MA, Panetto JC, da Silva MV. Single-step genome-wide association studies (GWAS) and post-GWAS analyses to identify genomic regions and candidate genes for milk yield in Brazilian Girolando cattle. J Dairy Sci. 2020;103(11):10347–60. [DOI] [PubMed] [Google Scholar]
  • 25.Higgins MG, Fitzsimons C, McClure MC, McKenna C, Conroy S, Kenny DA, McGee M, Waters SM, Morris DW. GWAS and eQTL analysis identifies a SNP associated with both residual feed intake and GFRA2 expression in beef cattle. Sci Rep. 2018;8(1):14301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Rabieyan E, Darvishzadeh R, Mohammadi R, Gul A, Rasheed A, Akhar FK, Abdi H, Alipour H. Genetic diversity, linkage disequilibrium, and population structure of tetraploid wheat landraces originating from Europe and Asia. BMC Genomics. 2023;24(1):682. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Pierce CF, Speidel SE, Coleman SJ, Enns RM, Bailey DW, Medrano JF, Cánovas A, Meiman PJ, Howery LD, Mandeville WF, Thomas MG. Genome-wide association studies of beef cow terrain-use traits using Bayesian multiple-SNP regression. Livest Sci. 2020;232:103900. [Google Scholar]
  • 28.Abdollahi-Arpanahi R, Gianola D, Peñagaricano F. Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes. Genet Sel Evol. 2020;52:1–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Oltenacu PA, Frick A, Lindhe B. Relationship of fertility to milk yield in Swedish cattle. J Dairy Sci. 1991;74(1):264–8. [DOI] [PubMed] [Google Scholar]
  • 30.Jamrozik J, Fatehi J, Kistemaker GJ, Schaeffer LR. Estimates of genetic parameters for Canadian Holstein female reproduction traits. J Dairy Sci. 2005;88(6):2199–208. [DOI] [PubMed] [Google Scholar]
  • 31.Chen SY, Schenkel FS, Melo AL, Oliveira HR, Pedrosa VB, Araujo AC, Melka MG, Brito LF. Identifying pleiotropic variants and candidate genes for fertility and reproduction traits in Holstein cattle via association studies based on imputed whole-genome sequence genotypes. BMC Genomics. 2022;23(1):1–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Saowaphak P, Duangjinda M, Plaengkaeo S, Suwannasing R, Boonkum W. Genetic correlation and genome-wide association study (GWAS) of the length of productive life, days open, and 305-days milk yield in crossbred Holstein dairy cattle. Genet Mol Res. 2017;16(2):1–18. [DOI] [PubMed] [Google Scholar]
  • 33.Daetwyler HD, Capitan A, Pausch H, Stothard P, Van Binsbergen R, Brøndum RF, Liao X, Djari A, Rodriguez SC, Grohs C, Esquerré D. Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle. Nat Genet. 2014;46(8):858–65. [DOI] [PubMed] [Google Scholar]
  • 34.Alves K, Brito LF, Sargolzaei M, Schenkel FS. Genome-wide association studies for epistatic genetic effects on fertility and reproduction traits in Holstein cattle. J Anim Breed Genet. 2023;140(6):624–37. [DOI] [PubMed] [Google Scholar]
  • 35.Costa RB, Camargo GM, Diaz ID, Irano N, Dias MM, Carvalheiro R, Boligon AA, Baldi F, Oliveira HN, Tonhati H, Albuquerque LG. Genome-wide association study of reproductive traits in Nellore heifers using Bayesian inference. Genet Sel Evol. 2015;47(1):1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Nascimento AV, Romero ÂR, Utsunomiya YT, Utsunomiya AT, Cardoso DF, Neves HH, Carvalheiro R, Garcia JF, Grisolia AB. Genome-wide association study using haplotype alleles for the evaluation of reproductive traits in Nelore cattle. PLoS ONE. 2018;13(8):e0201876. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Zhou C, Li C, Cai W, Liu S, Yin H, Shi S, Zhang Q, Zhang S. Genome-wide association study for milk protein composition traits in a Chinese Holstein population using a single-step approach. Front Genet. 2019;10:72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Fonseca PA, Schenkel FS, Cánovas A. Genome-wide association study using haplotype libraries and repeated-measures model to identify candidate genomic regions for stillbirth in Holstein cattle. J Dairy Sci. 2022;105(2):1314–26. [DOI] [PubMed] [Google Scholar]
  • 39.Olson TA. Reproductive efficiency of cows of different sizes. 42nd Annual Florida Beef Cattle Short Course Proceedings. 1993:103.
  • 40.Fenwick MA, Fitzpatrick R, Kenny DA, Diskin MG, Patton J, Murphy JJ, Wathes DC. Interrelationships between negative energy balance (NEB) and IGF regulation in liver of lactating dairy cows. Domest Anim Endocrinol. 2008;34(1):31–44. [DOI] [PubMed] [Google Scholar]
  • 41.Widmer S, Seefried FR, von Rohr P, Häfliger IM, Spengeler M, Drögemüller C. A major QTL at the LHCGR/FSHR locus for multiple birth in Holstein cattle. Genet Sel Evol. 2021;53(1):57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Guo J, Jorjani H, Carlborg Ö. A genome-wide association study using international breeding-evaluation data identifies major loci affecting production traits and stature in the Brown Swiss cattle breed. BMC Genet. 2012;13(1):1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Junior GO, Schenkel FS, Alcantara L, Houlahan K, Lynch C, Baes CF. Estimated genetic parameters for all genetically evaluated traits in Canadian Holsteins. J Dairy Sci. 2021;104(8):9002–15. [DOI] [PubMed] [Google Scholar]
  • 44.Gu M, Cosenza G, Gaspa G, Iannaccone M, Macciotta NP, Chemello G, Di Stasio L, Pauciullo A. Sequencing of lipoprotein lipase gene in the Mediterranean river buffalo identified novel variants affecting gene expression. J Dairy Sci. 2020;103(7):6374–82. [DOI] [PubMed] [Google Scholar]
  • 45.Ateya A, Safhi FA, El-Emam H, Marawan MA, Fayed H, Kadah A, Mamdouh M, Hizam MM, Al-Ghadi MQ, Abdo M, Fericean L. Combining nucleotide sequence variants and transcript levels of immune and antioxidant markers for selection and improvement of mastitis resistance in dromedary camels. Agriculture. 2023;13(10):1909. [Google Scholar]
  • 46.Hou L, Xu J, Jiao Y, Li H, Pan Z, Duan J, Gu T, Hu C, Wang C. MiR-27b promotes muscle development by inhibiting MDFI expression. Cell Physiol Biochem. 2018;46(6):2271–83. [DOI] [PubMed] [Google Scholar]
  • 47.Salilew-Wondim D, Hölker M, Rings F, Phatsara C, Mohammadi-Sangcheshmeh A, Tholen E, Schellander K, Tesfaye D. Depletion of BIRC6 leads to retarded bovine early embryonic development and blastocyst formation in vitro. Reprod Fertil Dev. 2010;22(3):564–79. [DOI] [PubMed] [Google Scholar]
  • 48.Cheng Z, Little MW, Ferris C, Takeda H, Ingvartsen KL, Crowe MA, Wathes DC, GplusE Consortium. Influence of the concentrate inclusion level in a grass silage–based diet on hepatic transcriptomic profiles in Holstein-Friesian dairy cows in early lactation. J Dairy Sci. 2023;106(8):5805–24. [DOI] [PubMed] [Google Scholar]
  • 49.Hoze C, Escouflaire C, Mesbah-Uddin M, Barbat A, Boussaha M, Deloche MC, Boichard D, Fritz S, Capitan A. A splice site mutation in CENPU is associated with recessive embryonic lethality in Holstein cattle. J Dairy Sci. 2020;103(1):607–12. [DOI] [PubMed] [Google Scholar]
  • 50.Pan C, Lei Z, Wang S, Wang X, Wei D, Cai X, Luoreng Z, Wang L, Ma Y. Genome-wide identification of cyclin-dependent kinase (CDK) genes affecting adipocyte differentiation in cattle. BMC Genomics. 2021;22:1–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Thanh LP, Wichasit N, Li Y, Batistel F, Tartrakoon W, Parys C, Guyader J, Loor JJ. Alterations in skeletal muscle abundance of protein turnover, stress, and antioxidant proteins during the periparturient period in dairy cows fed ethyl-cellulose rumen-protected methionine. J Dairy Sci. 2023;106(7):5127–45. [DOI] [PubMed] [Google Scholar]
  • 52.Ben-Jemaa S, Senczuk G, Ciani E, Ciampolini R, Catillo G, Boussaha M, Pilla F, Portolano B, Mastrangelo S. Genome-wide analysis reveals selection signatures involved in meat traits and local adaptation in semi-feral Maremmana cattle. Front Genet. 2021;12:675569. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Nakanishi T, Tokunaga T, Ishida T, Kobayashi I, Katahama Y, Yano A, Erickson L, Kawahara S. Changes in expression of the autophagy-related genes microtubule-associated protein 1 light chain 3β and autophagy related 7 in skeletal muscle of fattening Japanese Black cattle: a pilot study. Asian Australas J Anim Sci. 2019;32(4):592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Nazar M, Lu X, Abdalla IM, Ullah N, Fan Y, Chen Z, Arbab AA, Mao Y, Yang Z. Genome-wide association study candidate genes on mammary system-related teat-shape conformation traits in chinese holstein cattle. Genes. 2021;12(12):2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Byun H, Kwon S, Wagner KU, Shin H, Lim HJ. Tumor susceptibility gene 101 is required for the maintenance of uterine epithelial cells during embryo implantation. Reprod Biol Endocrinol. 2021;19(1):1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Pawlak M, Walkowska A, Mlącki M, Pistolic J, Wrzesiński T, Benes V, Jane S, Wesoły J, Kompanowska-Jezierska E, Wilanowski T. Consequences of the loss of the Grainyhead-like 1 gene for renal gene expression, regulation of blood pressure and heart rate in a mouse model. Acta Biochim Pol. 2015;62(2):287–96. [DOI] [PubMed] [Google Scholar]
  • 57.Walkowska A, Pawlak M, Jane SM, Kompanowska-Jezierska E, Wilanowski T. Effects of high and low sodium diet on blood pressure and heart rate in mice lacking the functional Grainyhead-like 1 gene. Physiol Res. 2017;66(1):163. [DOI] [PubMed] [Google Scholar]
  • 58.Raza SH, Pant SD, Wani AK, Mohamed HH, Khalifa NE, Almohaimeed HM, Alshanwani AR, Assiri R, Aggad WS, Noreldin AE, Abdelnour SA. Krüppel-like factors family regulation of adipogenic markers genes in bovine cattle adipogenesis. Mol Cell Probes. 2022;65:101850. [DOI] [PubMed] [Google Scholar]
  • 59.Zhang Y, Cong X, Wang A, Jiang H. Identification of the STAC3 gene as a skeletal muscle-specifically expressed gene and a novel regulator of satellite cell differentiation in cattle. J Anim Sci. 2014;92(8):3284–90. [DOI] [PubMed] [Google Scholar]
  • 60.Wang JJ, Li ZD, Zheng LQ, Zhang T, Shen W, Lei CZ. Genome-wide detection of selective signals for fecundity traits in goats (Capra hircus). Gene. 2022;818:146221. [DOI] [PubMed] [Google Scholar]
  • 61.Almasi M, Zamani P, Mirhoseini SZ, Moradi MH. Genome-wide association study for postweaning weight traits in Lori-Bakhtiari sheep. Trop Anim Health Prod. 2021;53:1–8. [DOI] [PubMed] [Google Scholar]
  • 62.Sbardella AP, Watanabe RN, da Costa RM, Bernardes PA, Braga LG, Baldi Rey FS, Lôbo RB, Munari DP. Genome-wide association study provides insights into important genes for reproductive traits in nelore cattle. Animals. 2021;11(5):1386. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Sun Y, Li Y, Shi L, Xue F, Xu H, Meng Q, Liu Y, Zong Y, Ma H, Chen J. Differential proteomics highlights specific testicular proteins associated with chicken sperm motility and fertility potential. Ann Agri Sci. 2023;68(1):36–47. [Google Scholar]
  • 64.Simões MC, Lee J, Djikeng A, Cerqueira GC, Zerlotini A, da Silva-Pereira RA, Dalby AR, LoVerde P, El-Sayed NM, Oliveira G. Identification of Schistosoma mansoni microRNAs. BMC Genomics. 2011;12:1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Cardoso TF, Bruscadin JJ, Afonso J, Petrini J, Andrade BG, de Oliveira PS, Malheiros JM, Rocha MI, Zerlotini A, Ferraz JB, Mourão GB. EEF1A1 transcription cofactor gene polymorphism is associated with muscle gene expression and residual feed intake in Nelore cattle. Mamm Genome. 2022;33(4):619–28. [DOI] [PubMed] [Google Scholar]
  • 66.Wang T, Niu Q, Zhang T, Zheng X, Li H, Gao X, Chen Y, Gao H, Zhang L, Liu GE, Li J. Cis-eQTL analysis and functional validation of candidate genes for carcass yield traits in beef cattle. Int J Mol Sci. 2022;23(23):15055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Song W, Li J, Sun A, Long Y, Luo Y, Lei H, Zhang B, Li C, Li X, Yi K. Transcriptomic analysis of differentially expressed genes between uterus and ovary in Xiangxi cattle. Pakistan Journal of Agricultural Sciences. 2022;59(5).
  • 68.Gu S, Li J, Chen W, Li D, Zhang C, Li R, Xu D, Zhang C, Li S. ZNF597 is a maternally expressed imprinted gene in the Holstein breed. Theriogenology. 2020;143:133–8. [DOI] [PubMed] [Google Scholar]
  • 69.Jalil Sarghale A, Moradi Shahrebabak M, Moradi Shahrebabak H, Nejati Javaremi A, Saatchi M, Khansefid M, Miar Y. Genome-wide association studies for methane emission and ruminal volatile fatty acids using Holstein cattle sequence data. BMC Genet. 2020;21(1):1–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Sargolzaei M, Schenkel FS, Jansen GB, Schaeffer LR. Extent of linkage disequilibrium in Holstein cattle in North America. J Dairy Sci. 2008;91(5):2106–17. [DOI] [PubMed] [Google Scholar]
  • 71.Abdollahi Arpanahi R, Razkabir M, Sayadnejad M, Eghbal A. Determination of the number of test day records is required to replace lactation model with random regression model? Iranian Journal of animal Science. 2017;48(3):391–8. [Google Scholar]
  • 72.Fuchsberger C, Abecasis GR, Hinds DA. minimac2: faster genotype imputation. Bioinformatics. 2014;31(5):782–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Loh PR, Danecek P, Palamara PF, Fuchsberger C, Reshef YA, Finucane HK, Schoenherr S, Forer L, McCarthy S, Abecasis GR, Durbin R. Reference-based phasing using the Haplotype Reference Consortium panel. Nature Genet. 2016;48(11):1443–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Kang HM, Sul JH, Service SK, Zaitlen NA, Kong SY, Freimer NB, Sabatti C, Eskin E. Variance component model to account for sample structure in genome-wide association studies. Nat Genet. 2010;42(4):348–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Reed E, Nunez S, Kulp D, Qian J, Reilly MP, Foulkes AS. A guide to genome-wide association analysis and post-analytic interrogation. Stat Med. 2015;34(28):3769–92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Sahana G, Guldbrandtsen B, Bendixen C, Lund MS. Genome-wide association mapping for female fertility traits in Danish and Swedish Holstein cattle. Anim Genet. 2010;41(6):579–88. [DOI] [PubMed] [Google Scholar]
  • 77.Han B, Kang HM, Eskin E. Rapid and accurate multiple testing correction and power estimation for millions of correlated markers. PLoS Genet. 2009;5(4):e1000456. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Turner SD. qqman: an R package for visualizing GWAS results using QQ and manhattan plots. Biorxiv. 2014:005165.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 1. (583.9KB, docx)

Data Availability Statement

The datasets generated and analyzed during the current study are available in the Figshare repository [https://doi.org/10.6084/m9.figshare.28604060].


Articles from BMC Genomics are provided here courtesy of BMC

RESOURCES