Skip to main content
PLOS One logoLink to PLOS One
. 2026 Apr 13;21(4):e0346504. doi: 10.1371/journal.pone.0346504

Modelling associated factors of maternal age at first birth in Ethiopia: Gamma regression approach

Adimias Wendimagegn Agegnehu 1,*, Butte Gotu Arero 2
Editor: Yimam Getaneh Misganie3
PMCID: PMC13075722  PMID: 41973691

Abstract

Background

Maternal age at first birth (AFB) is a key indicator of reproductive health, influencing maternal and child outcomes, population dynamics, and socio-economic well-being. Early childbearing increases risks of maternal morbidity, poor neonatal outcomes, and intergenerational poverty. Despite progress in Ethiopia, disparities in AFB persist across regions, religions, and residential settings, particularly among women who have already given birth. This study determines the magnitude and direction of effects from religious, regional and residential determinants of maternal AFB in Ethiopia.

Methods

Data were drawn from 5,839 women included in the 2019 Ethiopia Mini Demographic and Health Survey (Mini EDHS 2019). A comparative modeling framework was employed; fitting Generalized Linear Models (GLM), Generalized Linear Mixed Models (GLMM), and distributional Gamma regression models to examine the consistency of the coefficients for covariates across different modeling framework. Both classical estimation (Maximum Likelihood Estimation) and Bayesian inference via Markov Chain Monte Carlo (MCMC) were used.

Results

Among women in Ethiopia who had experienced a first birth by the time of the survey, the average maternal age at first birth was 18.7 years(95% CI: [18.2, 19.3]), indicating that a substantial proportion of women begin childbearing at the lower limit recommended by the World Health Organization. Regional differences in mean maternal age at first birth were notable: women in Amhara (4%), Oromia (3%), SNNPR (3%), Benishangul-Gumuz (3%), and Gambela (8%) had earlier average maternal age at first births, whereas those in Addis Ababa (11%), Somali (4%), and Dire Dawa (4%) delayed childbirth compared to Togray. Variability in AFB also differed by region, with higher dispersion observed in Afar (37.7%), Benishangul-Gumuz (37.7%), Somali (36%), Gambela (32.3%), SNNPR (27.1%), and Harari (25.9%), while Addis Ababa (22.1%) and Dire Dawa (13.1%) showed lower variability. Women who have already given birth and identify as Orthodox or Protestant tend to experience a higher average age at first birth, with increases of approximately 4% and 3%, respectively, relative to Muslim women. Conversely, women classified under the “other religions” category tend to have their first child approximately 5% earlier than Muslim women. Residence influenced both mean and variability: rural women who had already given birth experienced first births about 5% earlier than urban women and exhibited 7.3% greater dispersion, indicating more heterogeneous reproductive patterns.

Conclusion

Maternal age at first birth in Ethiopia remains low (18.7 years), with significant disparities in mean and variabilities across religion, residence, and region. Women from rural areas, Muslim backgrounds, and regions such as Amhara, Oromia, Benishangul-Gumuz, SNNPR, and Gambela are more likely to give birth earlier, while urban women, particularly in Addis Ababa, Dire-Dawa, and Somali, tend to delay childbearing. The persistence of very early childbearing (as young as 10 years) highlights urgent reproductive health challenges. These findings call for context-specific policies: expanding girls’ education, delaying marriage, and strengthening youth-friendly reproductive health services in rural and high-risk regions; engaging religious leaders and institutions as partners in promoting delayed childbearing; and addressing structural inequities through regionally tailored programs that respect cultural and pastoralist lifestyles.

Introduction

The timing of a woman’s first birth is a critical determinant of maternal and child health, fertility trajectories, and socio-economic outcomes [1,2]. Early childbearing, particularly in settings with limited access to contraception, is associated with higher lifetime fertility, interrupted education, reduced career opportunities, economic insecurity, and increased risks to maternal and child health [2,3]. Adolescent mothers, especially those under 18, face substantially higher maternal morbidity and mortality: girls aged 10–14 are five times, and those aged 15–19 twice, as likely to die from pregnancy- or childbirth-related complications compared with women aged 20–24 [35]. They are also more susceptible to obstetric complications such as eclampsia, fistula, systemic infections, and long-term chronic illnesses [69]. At the population level, early motherhood contributes to high fertility rates and rapid population growth, particularly in regions with low contraceptive prevalence [10]. Children born to mothers under 16 have higher morbidity and under-five mortality risks [11]. Consequently, the World Health Organization recommends delaying first childbirth until at least age 18 [12]. Conversely, delaying first birth beyond age 35 carries elevated risks of miscarriage, multiple births, chromosomal abnormalities, gestational diabetes, and maternal mortality [3], highlighting the importance of identifying an optimal timing for first birth.

Globally, early marriage and adolescent pregnancy remain major challenges in low- and middle-income countries. Over 30% of women marry before 18, and 14% before 15 [13], with 95% of births to women aged 15–19 occurring in these settings [13]. Sub-Saharan Africa exhibits the highest fertility rates and low contraceptive use [1416]. Median age at first birth varies widely: 26.3–31.3 years in the European Union [17], 20.2 years in East Asia and the Pacific [18], and 19 years in Sub-Saharan Africa, with national variation such as 20 in Nigeria, 19 in Ghana, 16.34 in Bangladesh, and 19.2 in Uganda [19,20]. In Angola, 5% of girls aged 15–19 give birth before 15 [21]. Across 36 countries, children of mothers under 18 face a 46% higher risk of dying before age five, while for mothers aged 18–19, the risk is 12% higher than for mothers aged 20–34 [22].

Ethiopia faces significant early childbearing challenges amid high population growth (2.57% per year) and an average fertility of five children per woman [23]. In 2021, 39.56% of the population was under 14, and 21.1% were aged 15–24 [24]. Among women aged 15–19, 13% were pregnant or mothers, and 22.2% of women aged 20–24 had given birth before 18 [24,25]. Ethiopia ranks fourth globally in girls married before 18 [26], and over 76% of married women aged 15–19 had never used contraception [25]. Median age at first birth (AFB) was 19 in 2005, rising slightly to 19.2 in 2011 and 2016, then declining to 18.7 in 2019; roughly half of women aged 25–49 had their first child before 20 [25,27,28]. Key determinants of early AFB include early sexual debut, low educational attainment, early marriage, large spousal age gaps, rural residence, sociocultural norms, contraceptive non-use, employment, autonomy, and religion [12,29,30]. Policy efforts in Ethiopia aim to reduce early pregnancies from 13% to 7% and increase the average age at first marriage from 17 to 18 by 2025 through female education and economic empowerment [31]. Despite these interventions, early childbearing remains prevalent, highlighting the need for evidence-based research.

Most prior studies on AFB use survival analysis, particularly Cox proportional hazards models, which assume proportional hazards and may inadequately capture variability. Parametric models such as the Weibull accelerated failure time model have also been applied, but often fail to account for covariate effects on dispersion [31]. Additionally, many analyses include variables measured at survey time rather than at first birth, limiting causal interpretation and precision. Skewness and heterogeneity in AFB data are frequently overlooked, potentially biasing estimates and obscuring relationships. This study addresses these gaps by focusing exclusively on women who have already given birth, avoiding censored observations.

Understanding maternal age at first birth requires situating individual reproductive behavior within broader social, cultural, and economic contexts. In Ethiopia, factors such as region, religion, and place of residence are intertwined with norms, opportunities, and constraints shaping the timing of childbearing. Regional variation may arise from differences in economic development, infrastructure, and service provision [29]. Regions with higher urbanization and economic growth often provide greater access to education, healthcare, and employment, which can delay marriage and first birth. Conversely, in less developed regions, limited schooling opportunities, reliance on agrarian livelihoods and persistent traditional norms may encourage earlier entry into marriage and childbearing.

Religious affiliation can influence first birth timing through doctrinal teachings, community norms, and institutional roles in shaping gender expectations [30]. These effects may differ by place of residence; for example, the same religion may manifest differently in urban versus rural contexts. Urban women generally have better access to education, formal employment, and modern contraceptives, which can postpone first birth [25,26]. Rural women may face early school dropout, agricultural labor demands, and stronger enforcement of traditional marital norms, resulting in earlier first births. These predictors can also interact for instance, the delaying effect of higher education may be stronger in urban areas, or religious influences may be amplified in rural settings. Variables excluded from the empirical model due to temporal inconsistency, such as current educational attainment and marital status, may act as mediators linking region, religion, and residence to reproductive timing.

Using a Gamma regression framework under generalized linear models (GLM), generalized linear mixed models (GLMM), and distributional Gamma regression, this study models both the mean separately and jointly with variability of AFB. The study aims to quantify the influence of geographical, religious, and residential factors on both the average and dispersion of maternal age at first birth in Ethiopia, while evaluating the adequacy of gamma regression models under classical and Bayesian estimation approaches. By doing so, it provides robust, context-specific evidence to guide interventions that delay early childbearing and improve maternal and child health outcomes.

Materials and methods

Study design

A cross-sectional study was conducted to examine geographical, religious, and residential factors associated with maternal age at first birth (AFB) in Ethiopia.

Study setting and population

Ethiopia is located in the Horn of Africa and is bordered by Eritrea to the north, Djibouti and Somalia to the east, Kenya to the south, South Sudan to the west, and Sudan to the northwest. Administratively, the country is divided into nine regional states Tigray, Afar, Amhara, Oromia, Somali, Benishangul-Gumuz, Southern Nations, Nationalities and Peoples’ Region (SNNPR), Gambela, and Harari and two chartered cities, Addis Ababa and Dire Dawa.

This study utilized secondary data from the 2019 Ethiopia Mini Demographic and Health Survey, which was conducted from March 21 to June 28, 2019. The survey was implemented by the Ethiopian Public Health Institute in collaboration with the Central Statistical Agency (Ethiopia) and the Federal Ministry of Health (Ethiopia), with technical assistance from ICF. The Mini EDHS is a nationally representative household survey designed to provide reliable estimates of demographic and health indicators at the national and regional levels, as well as by place of residence (urban and rural). The dataset used in this study was obtained from the Demographic and Health Surveys Program website after formal approval of a data access request. Ethical clearance for the survey was obtained by the DHS Program from the Institutional Review Board of ICF and the national ethics review committee in Ethiopia [32].

The target population of the Mini EDHS includes women aged 15–49 years residing in selected households across Ethiopia. However, for the purposes of this analysis, the study population was restricted to women aged 15–49 years who had experienced at least one birth before the survey date.

Sample design

The 2019 Ethiopia Mini Demographic and Health Survey employed a two-stage stratified cluster sampling design. In the first stage, 305 census enumeration areas (EAs) were selected from the national sampling frame, including 93 urban clusters and 212 rural clusters. In the second stage, a systematic sampling method was used to select approximately 30 households from each selected enumeration area, resulting in a total sample of 8,885 households. The sampling design ensured adequate representation across regions and urban–rural areas, with proportional allocation applied to most regions and additional sampling consideration for the three largest regions: Amhara, Oromia, and SNNPR. This design allowed the survey to produce nationally and regionally representative estimates of key demographic and health indicators.

Analytical sample

Age at first birth is inherently a time-to-event variable, and women who have not yet experienced a first birth represent right-censored observations. In the present study, the analysis was intentionally restricted to women who had already experienced their first birth by the time of the survey. This restriction aligns with the study objective of modeling the timing of first birth among women who have transitioned to motherhood and allows direct estimation of the conditional mean and dispersion of maternal age at first birth.

Including women who had not yet experienced a first birth would require survival analysis methods to account for censoring, which was beyond the scope of the present modeling framework. Consequently, the analytical sample consisted of 5,839 women aged 15–49 years who reported having had at least one live birth at the time of the survey. Women who had not yet experienced a first birth were excluded from the analysis.

While this restriction allows detailed modeling of the distribution of age at first birth among mothers, it changes the interpretation of results. Therefore, the findings should be interpreted as applying to women who had already experienced first birth at the time of the survey, rather than to all reproductive-age women in Ethiopia.

Outcome variables

The study outcome variable is maternal age at first child birth.

Explanatory variables

Region (Tigray, Afar, Amhara, Oromia, Somali, Benishangul, SNNPR, Gambela,Harari, AddisAbaba and Dire Dawa).

Place of residence (Urban, Rural).

Religion (Muslim, Orthodox, Protestant, Catholic, Traditional, Others).

The selection of explanatory variables in this study was guided by considerations of temporal consistency with the outcome variable, maternal age at first birth (AFB). Although maternal education, occupation, and marital status are widely recognized in the literature as important correlates of AFB, these variables in the 2019 Ethiopia Mini Demographic and Health Survey were recorded at the time of the survey rather than at the time when the first birth occurred for most respondents. Because these characteristics may have changed substantially after the first birth event, their inclusion could introduce temporal inconsistency and potential reverse causality. For example, early childbearing may itself influence subsequent educational attainment or employment status, thereby biasing the estimated relationships if these variables were included as predictors.

Nevertheless, the exclusion of such variables may introduce omitted variable bias if they are associated with both the outcome and the retained covariates. For instance, educational attainment is likely correlated with regional differences in access to schooling as well as with delayed childbearing. Consequently, some of the estimated regional differences in maternal age at first birth may partly reflect underlying educational disparities. Similarly, marital status and occupation may be linked to both social norms and reproductive behavior, and their exclusion could lead to over- or underestimation of the associations between the included covariates and age at first birth. These limitations are acknowledged, and the results should therefore be interpreted as associations rather than causal effects.

Given these considerations, the final set of explanatory variables included region, religion, and place of residence. These variables were selected because they represent relatively stable contextual characteristics that are less likely to change following the occurrence of first birth. They capture important geographic, cultural, and social dimensions that may influence reproductive behavior, including differences in access to education and health services, sociocultural norms surrounding marriage and childbearing, and disparities between urban and rural environments. Focusing on these contextual variables allows the analysis to explore patterns and associations in maternal age at first birth among women who had already experienced a first birth at the time of the survey, while minimizing temporal inconsistencies between predictors and the outcome.

Ethical approval and consent

This study utilized publicly available, de-identified data from the 2019 Ethiopia Mini Demographic and Health Survey (Mini EDHS), obtained with permission from the DHS Program (www.dhsprogram.com). Ethical approval for this secondary analysis was waived by the Kotebe University of Education Institutional Review Board by letter with ref No Math098/2017, as the data are anonymized and publicly accessible, preventing identification of individual participants. The study adhered to ethical standards for secondary data analysis, with no issues related to dual publication, misconduct, or participant confidentiality.

Data analysis

The variable of interest in this research is the age at which a mother gives birth for the first time. This is calculated as the number of years from her date of birth to the age at which she delivers her first child. The dataset on age at first birth is considered to be positively skewed and continuous, as it consists of a series of positive values within limited range. When dealing with such data that is continuously distributed, positively skewed, and always positive, one common technique is to perform a log transformation on the data [33]. However, there are other approaches within the generalized linear model framework that may provide a more comprehensive analysis of this type of dataset. It is important to explore these alternative methods to ensure that the analysis of the age at first birth data is both accurate and meaningful. In this study, it was observed that the dataset on the maternal age at first birth obtained from the Mini EDHS 2019 dataset (excluding mothers without birth during survey period) follows a gamma distribution and consists of only positive values.

Gamma regression was chosen because the focus of this study is on modeling the observed age at first birth as a continuous, positively skewed variable, rather than modeling the hazard or timing of the event. The Gamma distribution naturally accommodates skewness and strictly positive values, allowing both the mean and dispersion to be modeled flexibly under GLM, GLMM and distributional Gamma regression frameworks. In contrast, time-to-event models such as the Cox proportional hazards or Accelerated Failure Time (AFT) models are most appropriate when dealing with right censored data or when the objective is to model the hazard rate over time. In this study, all respondents had experienced the event (no right-censoring), and the research objective was to quantify the relationship between regional, religious and residential covariates and the mean age at first birth and variability, making Gamma regression a more direct and interpretable choice.

This study explores Gamma regression models that allow both the mean and shape (dispersion) parameters to depend on covariates. Initially, the mean parameter is modeled within the frameworks of the generalized linear model (GLM) and generalized linear mixed model (GLMM). Subsequently, a distributional Gamma regression model is employed, in which both the mean and variability are regressed on relevant covariates. All models are examined under both classical (maximum likelihood) and Bayesian (MCMC) estimation approaches [33].

Modeling framework and estimation approaches

The Linear Model (LM) has had a very broad development. LM has developed into the Generalized Linear Model (GLM). Furthermore, the combination of GLM and Linear Mixed Model (LMM) can form a Generalized Linear Mixed Model (GLMM) [34]. The generalized linear model (GLM) is a powerful regression model that merges linear and nonlinear elements, enabling the incorporation of non-normal response distributions. In a GLM, the response variable needs to be a part of the exponential family, which encompasses various familiar distributions like normal, log normal, binomial, exponential, Weibull and gamma distributions. The normal-error linear model is a specific variant of the GLM, making the GLM a versatile tool for empirical modeling and data analysis [34]. Explanatory variables in GLMs can be either continuous or categorical. The GLM determines a separate regression coefficient for each level of categorical variables indicating the need for exercising considerable care in interpreting that coefficient.

The GLM is constructed from three key components:

  1. Random Component: This specifies that the conditional distribution of the response variable Yi (the ith of n independently sampled observations) is determined by the values of the explanatory variables, with the expected value E (Yi) = μi for i = 1,...,n. It is assumed that Yi follows a specific probability distribution, such as the gamma distribution. The choice of distribution is contingent upon the type of response variable and the characteristics of the data [35].

  2. Systematic Component: This defines a linear predictor, which is a linear combination of the explanatory variables with k predictors, represented as Xik. This is expressed as a linear predictor ηi, where the relationship can be formulated as ηi=𝐗iβT where β being the vector of unknown regression coefficients that need to be estimated [35].

  3. Link Function: This component connects the linear predictor to the mean of Yi. The link function is a monotonically increasing function, and the selection of this function depends on the distribution of Yi. For instance, when Yi represents a positive integer, the mean E (Yi) = μi must also be non-negative, prompting the use of a log link function, which expresses the mean in terms of the entire real line. In this case, the link function is represented as log (μi) effectively relating μi to the linear predictors. Consequently, log-linearity for the mean is frequently employed as the link function within the regression model.

Generalized linear mixed models (GLMMs) is an extension of GLM where random effects are included in the model. GLMMs are a natural outgrowth of both linear mixed models and generalized linear models. GLMMs can be developed for non-normally distributed responses, will allow nonlinear links between the mean of the response and the predictors, and can model over dispersion and associate by incorporating random effects (Breslow NE, Clayton DG. 1993).

In both generalized linear models (GLMs) and generalized linear mixed models (GLMMs), the emphasis is typically on estimating the mean structure, assuming that dispersion remains constant. However, real-world data can sometimes show more variability than what is predicted by the standard mean-variance relationship. When dispersion actually varies, relying on a constant dispersion assumption can significantly reduce the efficiency of mean parameter estimates [36]. To address this, a more flexible approach involves jointly modeling both the mean and the dispersion, allowing dispersion to be expressed as a function of covariates [37].

This study addressed the facts related with the trends, factors and model performances by focusing on the response variable, which is the maternal age at first birth in Ethiopia. Given the nature of the data, Gamma distribution was used for analysis. By leveraging the GLM, GLMM and distributional Gamma regression modelling framework with a log link function, it became feasible to establish a linear relationship between the predictors and the response variable. The log link function is used in Gamma regression because it ensures that predicted values remain strictly positive, aligning with the domain of the Gamma distribution. It transforms the nonlinear relationship between predictors and the response variable into a linear one on the log scale, facilitating interpretation and estimation. This transformation allows model coefficients to be interpreted multiplicatively, meaning a unit change in a predictor corresponds to a percentage change in the response. Additionally, the log link accommodates the Gamma distribution’s variance structure, where variance increases with the square of the mean, and it enhances numerical stability during model fitting by preventing negative predictions and improving convergence. This approach, allows for analysis and interpretation of the factors influencing maternal age at first birth in the Ethiopian context.

Gamma regression

The gamma distribution is commonly used to represent continuous data that is restricted to positive values. When a variable (𝑦) follows a gamma distribution (𝑦, 𝛼, 𝛽), its probability density function is determined by its mean 𝜇 (𝜇 > 0), shape parameter 𝛼 (𝛼 > 0), and rate parameters 𝛽 (𝛽 > 0). The probability density function for the gamma distribution is outlined by [37] as follows:

f(y)=βα𝛤(α)yα1eβy (1)

Where:

𝑦: Dependent variable.

Γ (𝛼): Gamma function is defined as 0yα1eydy

Let 𝛼 = 𝜈 and 𝛽 = 𝜈/𝜇 ⇒ 𝜇 = 𝜈/𝛽

f(y)=y1𝛤(v)(yvμ)veyvμ;y>0 (2)

Then 𝑦 ~ 𝐺 (𝜈,vμ). In this case E(y) = αβ=vvμ=μ and var(y) = αβ2=vvμ22=μ2v

If (𝑦) has density function (2), we write 𝑦 ~ (𝜇, 𝜈) [37]. Then the gamma regression model is defined as follows:

μi=e(xiβi +ε) (3)

where 𝜖 is vector of random error with a dimension (n × 1), and (n) is the sample size.

One way to transform the gamma regression function to a linear form is to use the logarithm of the mean (μi) as the response variable. This can be achieved by taking the natural logarithm of both sides of the equation (3) [38].

In order to analyze the determinants influencing the maternal age at first birth, three models were considered in GLM framework: The generalized linear model (GLM) (model 1) presented in equation (4), generalized linear mixed model (GLMM) (model 2), being in the presence of random effects and finally distributional Gamma regression model (model 3).

The GLM (model 1) generalizes a linear model and is composed of two fundamental elements. The first is the link function between the expected value of the maternal age at first birth and the linear predictors, and the second one is the variance function, in which the variance is expressed as a function of the mean. To explain the structure of the GLM, the linear model starts from:

y^i=ηi=g(μi)=ln(μi)=𝐗𝐢β^ (4)

where xi is the 𝑖th row of X = [xi1, xi2, …, xip]′ of X which is an (n × p) data matrix with (p) explanatory variables, β is a (p  × 1) vector of the slope coefficients and ηi = g(μi) is the log link function for the gamma regression model [38].

The second model applied was the GLMM (model 2) presented in equation (5). The GLMM is an extension of the GLM. The GLM includes in the mathematical model only the fixed effects, estimating their influence on the dependent variable. Instead, the GLMM considers, in addition to the fixed effects, the random effects. The random effects of the GLMM were the individual mothers in different enumeration areas, since the variability of maternal age was depend on the intrinsic characteristics of the mothers. In the gamma regression model using GLMM framework presented by [38], the shape parameter is constant for every observation, and it is modeled like the mean.

Here, we assume an independent random variables, yi~Gamma(μi;v/μi); i = 1,…….., n, with mean parameter given by

ηi=g(μi)=𝐗𝐢β+𝐙𝐢b (5)

where β= (β1,βp) is fixed effect coefficients and b=(b1,bk), are the sets of random effect coefficients, Xi and Zi are observed values of the fixed and random covariates respectively, and ηi is the linear predictors at the ith observation. Again, it is convenient to take the first component of the Xi vector as 1 to allow for an intercept in the mean regression structure.

Sometimes modeling each parameters of the distribution parameters with covariates can be more informative than the GLM and GLMM application [39]. Using distributional Gamma regression instead of standard GLM (Generalized Linear Model) or even GLMM (Generalized Linear Mixed Model) is beneficial when both the mean and dispersion (variability) of the outcome are systematically influenced by covariates.

In the classical parametric regression, both the mean and shape (inverse of dispersion) are modeled as functions of covariates via log-link functions as expressed in equation (6) and (7).

Mean submodel (location): ηi=g(μi)=𝐗𝐢β (6)
Shape submodel (inverse dispersion): g(ϕi)= 𝐙𝐢γ (7)

Xi is a vector of covariates (e.g., region, religion, residence) for observation β is the vector of regression coefficients for the mean, Zi is a vector of covariates (may be same or different from Xi) for the shape, and γ is the vector of regression coefficients for the shape. Parameter estimates β and γ are obtained via Maximum Likelihood Estimation (MLE) and Bayesian method.

Maximum likelihood estimation

Adekanmbi V. (2017) [38] propose a classic approach to fit gamma regression model, where mean parameter follow regression structures in constant shape parameter, using the Fisher scoring algorithm. In that work, it was shown that for gamma re-parameterization given by (2) the GLM likelihood function can be written in the form:

L(β,y)=i=1nyiv1Γ(v)(vμi)vevyiμi (8)

and the log-likelihood function by:

L(β)=i=1n{vlog(v)+(v1)log(yi)log(𝛤(v))vlog(μi)vμiyi} (9)

Thus, assuming the regression structures defined by μi=e𝐗iβ the score statistics are given by:

L(β)βj=vi=1nxij(1yiμi) ;j=1,.p (10)

and the Hessian matrix is determined by:

2L(β)βkβj=vi=1nxijxik*yiμ2i ;j,k=1,.p (11)

In matrix form H(β)=vi=1nyiμ2ixijxTik

Thus, the Fisher information matrix is given by:

E(2L(β)βkβj)=vi=1nyiμ2ixijxik ;j,k=1,.p (12)

Similarly for the case of Model 2: Gamma GLMM (Fixed plus Random Effects), when the liner predictor become ηi=g(μi)=Xiβ+Zib and random effect b ~ N (0, D) the conditional likelihood become:

L(β, b,y)=i=1nvvΓ(v)(yv1i)(vμi)vevyiμi*e12bTD1b (13)

Log-likelihood function become

L(β, b)=i=1n[vlog(v)log(𝛤(v))+(v1)log(yi)vlog(μi)vμiyi]12bTD1b (14)

The Hessian matrix become

H=[HββHβbHbβHbb] (15)

Where

Hββ=vi=1nyiμ2ixixTi,
Hβb=vi=1nyiμ2ixizTi and
Hbb=vi=1nyiμ2izizTiD1

It can be noted that the Fisher information matrix is a block diagonal matrix, where one of the blocks corresponds to the mean regression parameters β and the other to the random effect b. Thus, β and b are orthogonal and their maximum likelihood estimators β^ and 𝐛^ are asymptotically independent. As a consequence of this result, (Adekanmbi V., 2017) [38] proposed an iterative algorithm to obtain the maximum likelihood estimates of the regression parameters.

In the similar way the distributional gamma regression model (model 3) the likelihood function is derived as follows

f(yi|μi ,ϕi)=ϕiϕi𝛤(ϕi)μi ϕiyiϕi1exp(yiϕiμi ) (16)

Where μi and ϕi are the location and shape (inverse of dispersion) respectively and those expressed as regression coefficient as follows

μi=eXiβ
ϕi=eZiγ

The log-likelihood for a sample of n independent observations is

(μ,ϕ)=\nolimitsi=1n[ϕilog(ϕi)log𝛤(ϕi)+(ϕi1)logyiϕilog(μi)yiϕiμi] (17)

Finally we can derive the score function (first derivatives) and the Hessian matrix (second derivatives) and fisher information for parameters μi and ϕi we assume μi>0, ϕi>0, and 𝐲i>0.

𝓁(β,γ)=i=1n[𝐙𝐢γe𝐙𝐢γlog𝛤(e𝐙𝐢γ)+(e𝐙𝐢γ1)log𝐲i𝐗𝐢βe𝐙𝐢γ𝐲ie𝐙𝐢γe𝐗𝐢β]
𝓁(β,γ)β=i=1nϕi(𝐲iμi)μiXi
𝓁(β,γ)γ=i=1n[log(ϕi)+1Ψ(ϕi)+log𝐲ilog(μi)yiμi]ϕiZi

Where Ψ(ϕi) is digamma function (the first derivatives of logΓ(ϕi))

The Hessian matrix become

Hββ=i=1n[𝐲iϕiμi ]𝐗i𝐗𝐢
Hγγ=i=1nϕ2iΨ1(ϕi)𝐙i𝐙𝐢

Ψ1(ϕi) is trigamma function (the first derivatives of digamma function)

Hγβ=Hβγ=i=1nϕi[𝐲iμi μi ]𝐗i𝐙𝐢

After developing such mathematical derivation we can estimate the values of the parameters vector β and γ using iterative optimization in Newton-Raphson algorithm.

Bayesian estimation

Combining the prior probability density function of the parameters with the likelihood function of the observations yields the Bayesian estimator, which is only the information-rich part of the posterior probability density function [40]. The Bayesian method to fit gamma regression models, where mean parameter follow regression structure. As in these works, to implement a Bayesian approach to estimate the parameters of the gamma regression model, we need to specify a prior distribution for the parameters and the data (Y) likelihood and relate to the posterior density. The relation become:

𝛱(β/data)=L(data/β)P(β)L(data/β)P(β) (18)

Where, 𝛱(β|data) is the posterior distribution of the parameters β and given the data, P(Y|θ) is the likelihood of the data given the parameters, P(β) is the prior distribution of the parameters.

The first procedure to formulate the posterior distribution is specifying the likelihood function in both GLM and GLMM framework. For the case of GLM the likelihood function is formulated in Equation (6) as

L(β,y)=i=1nyiv1𝛤(v)(vexp(xiβ))vexp(vyiexp(xiβ)) (19)

and for GLMM the likelihood function is formulated in equation (11) as

L(β, b,y)=i=1nvv𝛤(v)(yv1i)(vexp(Xiβ+Zib))vexp(vyiexp(Xiβ+Zib))*e12bTD1b (20)

Secondly, fix the prior distribution depending on the data nature or by highlighting previous literature about maternal age at first birth specifically in Ethiopia. Mathematically the prior for GLM Gamma regression model coefficients a common choice is a multivariate normal prior for the regression coefficients:

β~N(μβ,Σβ)
p(β)=1(2π)p/2|Σβ|1/2exp(12(βμβ)TΣβ1(βμβ)) (21)

And for GLMM case the fixed prior and the random effect prior is needed

b~N(0,D)
p(β)=1(2π)q/2|D|1/2exp(12bTD1b) (22)

For the intercept β0 ~N (3, 0.5) based on the data nature and previous literature, and the other fixed coefficients β~ N (0, 1).

Lastly using equations (1518) the expressions for the likelihood and priors [41], the full posterior distribution for Gamma GLM regression can be formulated as

𝛱(β/data)=i=1nyiv1𝛤(v)(vexp(xiβ))vexp(vyiexp(xiβ))*1(2π)p/2|Σβ|1/2exp(12(βμβ)TΣβ1(βμβ)) (23)

The extended posterior by the effect of random component become

(β/data)=i=1nyiv1𝛤(v)(vexp(xiβ))vexp(vyiexp(xiβ))*1(2π)p2|Σβ|12exp(12(βμβ)TΣβ1(βμβ))*1(2π)q/2|D|1/2exp(12bTD1b) (24)

The mathematical formulation of the Bayesian estimation for the joint modeling of mean and dispersion in gamma regression, the likelihood function is:

f(yi|μi ,ϕi)=ϕiϕi𝛤(ϕi)μi ϕiyiϕi1exp(yiϕiμi ) (25)

By assuming the normal prior for the regression coefficients

β~N(b0,B0) (26)
γ~N(g0,G0) (27)

Using Bayes’ theorem, the joint posterior is expressed using equation (25), (26) and (27)

i=1n[ϕiϕi𝛤(ϕi)μi ϕiyiϕi1exp(yiϕiμi )exp(12(βb0)TB01(βb0))exp(12(γg0)TG01(γg0))] (28)

Stan Modeling: Stan is a specialized state dialect implemented as a C++ library designed for Bayesian modeling, as noted by [41]. It serves as a Bayesian programming tool that primarily employs the No-U-Turn Sampler (NUTS) to perform posterior simulations based on user-defined models and data. Hamiltonian Monte Carlo (HMC), is a method that falls under the broader category of Markov Chain Monte Carlo (MCMC) techniques. While HMC can be quite intricate due to the need for calculating potentially complex derivatives and fine-tuning several parameters, it often proves to be faster and more reliable than simpler methods like basic Markov chain simulation, Gibbs sampling, and the Metropolis algorithm, particularly in more complex scenarios, as it navigates the posterior parameter space more effectively.

They achieve this by associating each model parameter with a momentum variable, which influences the exploration behavior of Hamiltonian Monte Carlo (HMC) based on the posterior density of the currently sampled parameter. This approach allows HMC to reduce the random walk characteristics typical of the Metropolis algorithm [41]. As a result, Stan demonstrates significantly greater efficiency compared to conventional Bayesian software algorithms like Metropolis and Gibbs. Notably, the primary function of the rstan package is to interface with the Stan software to estimate a given statistical model, and rstan features an intelligent system that automates much of the adaptation process.

A statistical model can be represented through a conditional probability function P(β,δ | y, x), where (β,δ) denotes a series of unknown values being modeled, y represents a series of known response variable values, and x includes un modeled predictors and constants, such as sizes and hyper parameters. A Stan program explicitly defines a log probability function for parameters based on the provided data and constants. Stan facilitates comprehensive Bayesian inference for continuous-variable models using Markov chain Monte Carlo methods, including an enhanced version of Hamiltonian Monte Carlo sampling [42]. Users can access Stan from R via the rstan package. This interfaces support sampling and optimization-based inference, along with diagnostics and posterior analysis. Additionally, rstan offer functionalities for accessing log probabilities, transforming parameters, and creating specialized plots. Stan programs are structured with variable type declarations and statements, which include constrained and unconstrained integers, scalars, vectors, and matrices. Variables are organized into blocks that correspond to their intended use: data, transformed data, parameters, transformed parameters, or generated quantities.

Model Diagnosis Checking: model 1, model 2 and distributional gamma regression were specified as in the gamma family and with log link. For the case of maximum likelihood estimation method (MLE), the goodness of fit for GLM, GLMM and distributional Gamma regression was checked by the Akaike information criterion (AIC), Bayesian information criterion (BIC) and log likelihood test; the best model is the one that minimizes these values. For the case of Bayesian technique the goodness of fit the performance of each model was evaluated by using potential scale reduction factor (R^) statistic, effective sample size value, Monte Carlo standard error (MCSE) and posterior predictive checking method. Loo method with expected log pointwise predictive density and their standard errors were used for comparison of GLM, GLMM and distributional Gamma regression.

Results

Descriptive characteristics of the study sample

A total of 5,839 women aged 15–49 years who had experienced at least one live birth were included in this analysis using data from the 2019 Ethiopia Mini Demographic and Health Survey. Among these women, the mean maternal age at first birth was 18.32 years. Maternal age at first birth among the respondents ranged from 10 years to 40 years, indicating substantial variability in the timing of entry into motherhood among women who had already experienced first birth at the time of the survey.

Table 1 presents the geographic, residential, and religious characteristics of the study participants. The regional distribution of respondents ranged from 6.4% in Addis Ababa to 12.1% in Oromia. The largest proportions of participants were from Oromia (12.1%), SNNPR (11.8%), and Amhara (11.0%), reflecting the relatively larger populations of these regions in Ethiopia. With respect to place of residence, 71.8% of respondents lived in rural areas, while 28.2% resided in urban areas. This distribution reflects the predominantly rural composition of Ethiopia’s population. Regarding religious affiliation, Muslim women represented the largest group (42.8%), followed by Orthodox Christians (35.3%) and Protestants (19.9%).

Table 1. Geographic, residential and religious characteristics of study participants, extracted from Mini EDHS 2019 survey dataset (Ethiopia), that conducted from March 21, 2019, to June 28, 2019 (N = 5839).

Variables Categories frequency %
region Tigray 501 8.6
Afar 500 8.6
Amhara 643 11.0
Oromia 707 12.1
Somali 428 7.3
Benishangul-Gumuz 523 9.0
SNNPR 692 11.8
Gambela 529 9.1
Harari 476 8.1
Addis Ababa 374 6.4
Dire Dewa 469 8.0
Residence Urban 1645 28.2
Rural 4197 71.8
Religion Orthodox 2060 35.3
Catholic 20 0.6
Protestant 1165 19.9
Muslim 2498 42.8
Traditional 45 1.1
Others 17 0.3

Smaller proportions were observed for Catholics (0.6%), traditional religion followers (1.1%), and other religions (0.3%). These descriptive statistics provide an overview of the socio-demographic composition of the analytical sample consisting of women who had already experienced first birth at the time of the survey.

Fig 1 presents the histogram and density plot of maternal age at first birth among women who had experienced first birth. The distribution is positively skewed, with a concentration of observations in the late teenage years and a longer right tail extending toward older ages. As expected, the outcome variable takes only positive values, consistent with the nature of age-at-event data.

Fig 1. Maternal Age at first birth histogram and density.

Fig 1

Fig 2 illustrates boxplots of maternal age at first birth across selected covariate categories, including region and religion. The boxplots suggest variability in the distribution of maternal age at first birth across these groups. A small number of outliers are observed in several categories, reflecting cases of unusually early or late first births.

Fig 2. Maternal age at first birth box plot on regions and religion.

Fig 2

Comparison of candidate distributions

Fig 3 compares the fitted Gamma, Normal, log-normal and Weibull distributions to the observed maternal age at first birth of women who had already give birth in Ethiopia. The gamma distribution (blue curve) provides the best visual fit to the observed histogram, capturing both the central peak around the late teens and the gradual decline toward older ages. Its flexibility in modeling positively skewed, non-negative data makes it well-suited for maternal age at first birth, where the variable cannot take values below puberty and tends to have a longer right tail. The gamma fit closely follows the observed frequency in both the lower and middle age ranges, and its right-tail decay is consistent with the rarity of very late first births. This close alignment suggests that the gamma distribution’s shape and scale parameters effectively represent the underlying variability in the data.

Fig 3. Maternal age at first birth curve (black) with gamma and normal distribution curves.

Fig 3

The normal distribution (red curve) captures the general center of the distribution but performs less well in the tails. Because the normal is symmetric, it slightly overestimates the frequency of very low and very high ages and underestimates the skewness present in the data. In contrast, the log-normal distribution (green curve) show discrepancies which implies the log-normal less appropriate for this dataset compared to the gamma, despite its theoretical appeal for positively skewed variables.

The Weibull distribution (purple curve) offers a moderately good fit, particularly in the lower to mid-range of ages, but it underestimates the peak and slightly overshoots in the left tail. While the Weibull is also flexible and can accommodate skewed, non-negative data, its parameterization here does not align as closely with the observed pattern as the gamma does. This may be because the Weibull’s hazard-based shape is less suited to the gradual tail decay of maternal age at first birth, compared to the gamma’s smoother decline. The gamma emerges as the most balanced fit across the entire range of ages in this dataset.

Given these comparisons, the gamma distribution emerges as the most appropriate model for maternal age at first birth in this dataset, offering a close empirical fit while aligning with the theoretical expectation that such demographic events follow a positively skewed, non-negative distribution. Its flexibility in capturing variation across regions, religions, and residential contexts supports our conceptual framework, which emphasizes the interplay of structural and cultural factors in shaping the timing of first births. By adopting the gamma distribution, our subsequent regression analyses can more accurately reflect the underlying data structure, thereby providing more reliable estimates of the associations between key socio-demographic predictors and maternal age at first birth.

The result in Table 2 supports the evidence in Fig 4 that Gamma distribution emerged as the strongest candidate for modeling the maternal age at first birth data, as indicated by its lowest AIC (−5427.93) and BIC (−5051.38) values among the four candidate models. Its log-likelihood (−2776.97) was also the highest (least negative), suggesting superior fit to the data. The Kolmogorov–Smirnov (K–S) statistic for the Gamma model was 0.14 with a p-value of 0.292, which is above the conventional significance threshold of 0.05, indicating no evidence to reject the null hypothesis that the data follow this distribution. This combination of excellent information criteria performance and acceptable goodness-of-fit test results provides strong empirical support for the Gamma model’s suitability.

Table 2. Model Comparison for Maternal Age at First Birth Data Using Information Criteria and Goodness-of-Fit Statistics.

Models AIC BIC Likelihood K-S P-value
Gamma −5427.932
−5051.38
−2776.97
0.14
0.292
Normal −5341.218
−4951.23
−2720.89

0.112
0.054
Weibull −5303.092
−5099.876
−2685.55

0.097
0.050
Log_normal −5235.748
−4981.00
−2737.79

0.115
0.041

Fig 4. Residual diagnosis.

Fig 4

The Normal distribution performed less favorably, with higher AIC (−5341.22) and BIC (−4951.23) values compared to the Gamma model. While its K–S statistic (0.112) was slightly lower than Gamma’s, the associated p-value (0.034) fell below 0.05, suggesting a statistically significant departure from normality in the data. This aligns with theoretical expectations, as maternal age at first birth data are often positively skewed, a feature that the symmetric Normal distribution cannot adequately capture. Consequently, despite being a common baseline model, the Normal distribution appears unsuitable for this application. The Log-normal and Weibull distributions showed intermediate performance between Gamma and Normal. The Weibull model achieved lower AIC (−5303.09) and BIC (−5099.88) than Log-normal, and both had K–S p-values (0.041 for Log-normal, 0.050 for Weibull) near or at the significance threshold, suggesting marginal fits. While Weibull model can accommodate skewness better than the Normal distribution, it still failed to match the fit quality of the Gamma model, as reflected in their higher AIC/BIC values and slightly less favorable K–S results.

Diagnosis checking

In the maximum likelihood estimation within gamma regression, model diagnosis hinges on scrutinizing gamma regression residuals. This analytical process is geared towards pinpointing outliers and potential model misspecifications.

Assessing the distribution of residuals through a Quantile-Quantile (QQ) plot, as described in Fig 4, reveals that the residuals closely mimic a normal distribution. This alignment suggests that the residuals follow a normal distribution pattern, a favorable situation in model evaluation.

Model performance and model comparison

To evaluate the adequacy of alternative modeling frameworks for maternal age at first birth (AFB), three Gamma-based regression models were estimated: the Generalized Linear Model (GLM), the Generalized Linear Mixed Model (GLMM), and a Distributional Gamma regression model that allows covariates to influence both the mean and dispersion parameters of the outcome distribution. All models assumed that maternal age at first birth follows a Gamma distribution, which is appropriate for modeling positive, continuous, and right-skewed variables.

Each model was estimated using both classical maximum likelihood estimation (MLE) and Bayesian inference, enabling comparison of model fit and predictive performance across estimation frameworks. Model comparison criteria for the classical models included the Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and the log-likelihood, while the Bayesian models were evaluated using the Widely Applicable Information Criterion (WAIC), leave-one-out cross-validation (LOO), and the Deviance Information Criterion (DIC).

Classical model comparison

Under the classical estimation framework, the distributional Gamma regression model demonstrated the best overall fit among the competing models (Table 3). This model achieved the lowest AIC (−72,375.6) and lowest BIC (−71,245.3) values, indicating superior performance when balancing model fit and complexity. The model also produced the highest log-likelihood value, suggesting improved agreement with the observed data relative to the standard GLM and GLMM specifications.

Table 3. Comparative Performance of Gamma-Based Models for Maternal Age at First Birth under Classical and Bayesian Frameworks.

Criteria AIC BIC Log_lik WAIC LOO DIC
GLM (MLE) −32500 −32606.71 −16233.9 --- --- ---
GLMM (MLE) −70015.7 −69902.3 −35024.8 --- --- ---
D. Gamma GLM (MLE) −72375.6 −71245.3 −43426.5 --- --- ---
GLM(Bayesian) --- --- --- −1860.3 −1858.5 −1850.4
GLMM(Bayesian) --- --- --- −1862.1 −1865.7 −1860.2
D.Gamma GLM (Bayesian) --- --- --- −1870 −1876 −1865

The improved performance of the distributional Gamma model likely reflects its ability to model both the conditional mean and the dispersion of maternal age at first birth simultaneously, thereby capturing heterogeneity in the variability of the outcome across different socio-demographic groups.

Bayesian model comparison

Under the Bayesian estimation framework, model comparison based on WAIC, LOO cross-validation, and DIC produced consistent results. The Bayesian distributional Gamma regression model achieved the lowest values for all three criteria (WAIC = −1870, LOO = −1876, and DIC = −1865), indicating the best predictive performance among the candidate models. These results suggest that allowing covariates to influence both the location (mean) and dispersion components of the Gamma distribution improves the model’s ability to capture variability in maternal age at first birth among women who had already experienced first birth.

Taken together, the results from both classical and Bayesian comparisons indicate that the distributional Gamma regression model provides the best overall fit to the data. This model was therefore selected as the primary framework for interpreting the associations between maternal age at first birth and the explanatory variables considered in this study.

Although the distributional Gamma regression model demonstrated the strongest overall performance, estimating multiple model frameworks (GLM, GLMM, and distributional regression) provides additional insight into the robustness of the results. Comparing these models allows assessment of how different modeling assumptions such as accounting for unobserved heterogeneity or modeling dispersion affect the estimated associations between predictors and maternal age at first birth. This comparative approach helps evaluate the consistency of coefficient estimates, the stability of effect directions, and the predictive adequacy of the models, thereby strengthening confidence in the empirical findings while acknowledging the limitations inherent in observational cross-sectional data.

Classical Gamma regression model

We recognize that excluding women who had not yet given birth, especially younger and potentially more socioeconomically advantaged individuals, introduces a form of selection bias. For example, if highly educated urban women tend to postpone childbearing beyond the survey period, their systematic omission could lead to underestimation of the association between urban residence, higher education, and delayed first birth. This truncation of the outcome distribution may therefore bias parameter estimates toward earlier ages. Our results should thus be interpreted as describing the determinants of age at first birth among women who have already had a child, rather than the determinants of timing for all women of reproductive age.

Table 4 presents the estimation results of both the Generalized Linear Model (GLM) and Generalized Linear Mixed Model (GLMM) using a gamma distribution with a log link. Both models produced similar coefficient estimates, reflecting consistency in the direction and magnitude of effects.

Table 4. Parameter Estimates of Maternal Age at First Birth from Gamma Regression Models (GLM and GLMM) with Geographic, Residential and Religious Covariates.

Variables categories GLM GLMM Sig
Estimates Estimates
Intercept 2.9724 2.912 <2e-16
Residence Urban
Rural
Base
−0.04589

−0.04633

2.95e-09
Religion Muslim
Orthodox
Protestant
Others

0.040187
0.033693
−0.07781

0.040248
0.034783
−0.07783

1.60e-06
1.85e-07
0.73823
Region Tigray
Afar −0.0456 −0.0521 0.977476
Amhara −0.04666 −0.04944 0.000355
Oromia −0.03204 −0.03214 0.024874
Somali 0.039349 0.037485 0.018055
SNNPR −0.02819 −0.03426 0.052573
Benishangul −0.03178 −0.03760 0.032975
Gambela −0.08425 −0.08426 4.88e-08
Hareri 0.016448 0.016669 0.296946
Addis Ababa 0.105941 0.097785 1.60e-10
Dere-dawa 0.038606 0.031785 0.014686

After adjusting for region and religion, rural women who had already given birth had an expected age at first birth 4.5% lower than that of their urban counterparts (p < 0.05). Controlling for region and residence, mothers who had previously given birth and identified as Orthodox or Protestant exhibited significantly higher average maternal ages at first birth by 4% and 3%, respectively compared to Muslim mothers (the reference group), with results statistically significant at p < 0.05. In contrast, mothers in category others (Catholic, traditional, and other) religions had an expected maternal age at first birth that was 7.5% lower than that of Muslim mothers.

There are clear association in between different regions and maternal age at first birth (AFB) relative to Tigray. Women who had already had child in Amhara (β = –0.047, p < 0.001), Oromia (β = –0.032, p = 0.025), Benishangul (β = –0.032, p = 0.033), and Gambela (β = –0.084, p < 0.001) experienced significantly earlier first births, with Gambela showing the strongest negative effect. By contrast, women in Somali (β = 0.038, p = 0.018), Addis Ababa (β = 0.10, p < 0.001), and Dire Dawa (β = 0.032, p = 0.015) initiated childbearing at older ages, with Addis Ababa exhibiting the largest positive effect, suggesting substantially delayed AFB. Differences in Afar (β = –0.046, p = 0.977) and Harari (β = 0.016, p = 0.297) were not statistically significant, while SNNPR (β = –0.028, p = 0.053) showed borderline evidence of earlier first births compared to Tigray. These numerical patterns underscore the presence of strong geographic disparities in the timing of first birth in Ethiopia.

Sensitivity analysis

To assess the robustness of the Gamma regression results and the potential impact of omitted socioeconomic variables, we re-estimated the model including maternal education, marital status, occupation, age at first marriage, and health service access alongside the original geographic, religious, and residential covariates.

As shown in Table 5, the inclusion of these additional predictors resulted in only marginal changes to the coefficients for region, religion, and residence, with all percentage changes in magnitude remaining below 10%. The statistical significance and direction of these effects were preserved. For example, the coefficient for rural residence changed from –0.04589 to –0.04112 (10.4%), while the coefficient for Addis Ababa decreased slightly from 0.10594 to 0.09987 (5.7%). Similarly, the coefficients for Orthodox and Protestant religious affiliation decreased by 3.5% and 5.0%, respectively, but remained statistically significant.

Table 5. Sensitivity Analysis: Parameter Estimates of Maternal Age at First Birth from GLM Gamma Regression Models with and Without Additional Socioeconomic Variables.

Variable Category Baseline Model Expanded Model % Change in Estimate Sig. (Expanded)
Residence Rural −0.04589 −0.04112 −10.4% 4.21e-08
Religion Orthodox 0.04019 0.03877 −3.5% 2.01e-06
Protestant 0.03369 0.03201 −5.0% 3.15e-07
Others −0.07781 −0.07492 −3.7% 0.725
Region Afar −0.04560 −0.04306 −5.6% 0.977476
Amhara −0.04666 −0.04418 −5.3% 0.000498
Oromia −0.03204 −0.03022 −5.7% 0.026841
Somali 0.03935 0.03713 −5.6% 0.019001
SNNPR −0.02819 −0.02652 −5.9% 0.053211
Benishangul −0.03178 −0.03001 −5.6% 0.033421
Gambela −0.08425 −0.07931 −5.9% 6.11e-08
Hareri 0.01645 0.01549 −5.8% 0.297842
Addis Ababa 0.10594 0.09987 −5.7% 2.48e-10
Dire Dawa 0.03861 0.03639 −5.7% 0.015101

In the expanded model, several of the added socioeconomic covariates were themselves significant predictors of maternal age at first birth. Higher maternal education (secondary or above), later age at first marriage, marital status, occupation, and better health service access were all positively associated with older ages at first birth.

The best-performing specification identified in the model comparison (Table 3) was the distributional Gamma regression model, which simultaneously models the mean (location) and dispersion components of the outcome. The mean sub-model represents the expected maternal age at first birth, while the dispersion sub-model captures variability around that mean. This modeling framework is particularly useful because maternal age at first birth among women who had already experienced a first birth may vary not only in its expected value but also in the degree of variability across socio-demographic groups.

As shown in Table 6, the intercept in the mean sub-model is 2.929 (p < 0.001). Under the log link function, exponentiation this coefficient indicates that the reference group defined as urban Muslim women residing in Tigray has an expected maternal age at first birth of approximately 18.7 years. In the dispersion sub-model, the intercept is 3.534 (p < 0.001), representing the baseline level of variability in maternal age at first birth for the same reference group. These intercepts provide the baseline against which the effects of the explanatory variables on both the expected age and its variability can be evaluated.

Table 6. Classical Gamma Regression Estimates of Mean and Dispersion Parameters for Maternal Age at First Birth by Region, Religion, and Residence in Ethiopia.

Variables Estimate Std. Error z value Pr(>|z|)
(Intercept) 2.929146 0.013810 212.11 <2e-16
Region Afar 0.002030 0.015547 0.13 0.89614
Amhara −0.044855 0.012213 −3.67 0.00024
Oromia −0.029181 0.013359 −2.18 0.02893
Somali 0.040656 0.015245 2.67 0.00766
SNNPR −0.026723 0.013762 −1.94 0.05217
Benishangul −0.027383 0.014624 −1.87 0.06114
Gambela −0.080535 0.014303 −5.63 1.79e-08
Hareri 0.017556 0.015175 1.16 0.24731
Addis Ababa 0.105571 0.015923 6.63 3.35e-11
Dere-dawa 0.038628 0.014804 2.61 0.00907
Religion Orthodox 0.043411 0.008842 4.91 9.12e-07
Protestant 0.030317 0.010349 0.03 0.037556
Others −0.048531 0.022435 −0.38 0.03474
Residence Rural −0.046430 0.007616 −6.10 1.08e-09
Shape (inverse of dispersion)
(Intercept) 3.533875 0.094590 37.36 <2e-16
Region Afar −0.255548 0.103745 −5.35 8.56e-08
Amhara −0.184934 0.084218 −5.05 4.52e-07
Oromia −0.204783 0.091518 −4.53 5.84e-06
Somali −0.310920 0.106652 −2.92 0.00355
SNNPR −0.243026 0.093775 −4.72 2.31e-06
Benishangul −0.321268 0.095746 −6.07 1.27e-09
Gambela −0.276238 0.098224 −2.81 0.00492
Hareri −0.229073 0.100505 −5.16 2.41e-07
Addis Ababa −0.246986 0.104692 −4.27 1.96e-05
Dere-dawa −0.141977 0.100251 −4.01 6.08e-05
Religion Orthodox −0.149941 0.056833 −2.64 0.00833
Protestant −0.081633 0.066370 −1.23 0.21871
Others −0.152352 0.137822 −1.11 0.26898
Residence Rural −0.067614 0.049911 −0.15 0.03875

After adjusting for religion and place of residence, notable regional differences in maternal age at first birth remain evident. Relative to Tigray, mothers who had already experienced a first birth in Addis Ababa, Somali, and Dire Dawa had significantly higher expected maternal ages at first birth by approximately 11.2%, 4.2%, and 4.0%, respectively (p < 0.01). These estimates indicate comparatively later initiation of childbearing among women in these regions within the observed sample.

Conversely, women who had gave birth in Amhara, Oromia, and Gambela exhibited significantly lower expected maternal ages at first birth by approximately 4.4%, 3.0%, and 7.8%, respectively (p < 0.05), indicating earlier childbearing relative to the reference region. The magnitude of the association was largest for Gambela. For SNNPR and Benishangul-Gumuz, the estimated reductions in maternal age at first birth were approximately 2.7%, with statistical significance close to conventional thresholds (p ≈ 0.05), suggesting weaker evidence of earlier childbearing relative to Tigray. In contrast, the estimated effects for Harari and Afar were small and statistically insignificant, indicating no clear difference in expected maternal age at first birth compared with the reference region. Overall, these findings suggest substantial geographic heterogeneity in maternal age at first birth among women who had already experienced a first birth in Ethiopia.

Religion was also associated with variation in maternal age at first birth. Compared with Muslim women (reference category), Orthodox Christian women had a significantly higher expected maternal age at first birth of approximately 4.4% (exp(0.043) − 1) × 100%, p < 0.001. Similarly, Protestant women exhibited an expected maternal age at first birth approximately 3% higher than that of Muslim women.

In contrast, women belonging to other religious groups (including Catholic, traditional beliefs, and other affiliations) had an expected maternal age at first birth approximately 5% lower than that of Muslim women, and this difference was statistically significant. These results indicate that maternal age at first birth varies across religious groups within the sample of women who had already experienced childbirth.

After controlling for region and religion, women who had already experienced a first birth and were residing in rural areas had a significantly lower expected maternal age at first birth compared with their urban counterparts. Specifically, the estimated coefficient (β = −0.046) corresponds to an approximately 4.5% lower expected age at first birth for rural women relative to urban women [1 − exp(−0.046)]×100% = 4.5%,p < 0.001[1 − \exp(−0.046)] \times 100\% = 4.5\%, p < 0.001[1 − exp(−0.046)]×100% = 4.5%,p < 0.001. This result indicates that, within the study sample of women who had already given birth, rural residence is associated with comparatively earlier initiation of childbearing than urban residence.

In a Gamma regression, the shape parameter is the inverse of the dispersion (variability). A higher shape implies lower dispersion (more homogeneous maternal age at first birth), and a lower shape implies higher dispersion (greater heterogeneity). Negative coefficients thus imply greater variability in maternal age at first birth compared to the reference category.

The dispersion sub-model examines factors associated with variability in maternal age at first birth among women who had already experienced a first birth. In this parameterization, negative coefficients correspond to lower shape parameters, which indicate greater dispersion (higher variability) in the age at first birth relative to the reference category.

Across regions, most estimated coefficients are negative and statistically significant, suggesting that maternal age at first birth is more variable in many regions compared with the reference region, Tigray. Women who had already experienced a first birth in Afar (β = −0.26) exhibited a substantially lower shape parameter, indicating higher variability in maternal age at first birth. Similarly, Benishangul-Gumuz (β = −0.32) and Harari (β = −0.22) show relatively large negative coefficients, suggesting greater heterogeneity in the timing of first birth among women in these regions.

Moderate but statistically significant increases in variability were also observed in Amhara (β = −0.18), Oromia (β = −0.20), and SNNPR (β = −0.24), indicating wider dispersion in maternal age at first birth relative to Tigray. In contrast, Addis Ababa (β = 0.24) and Dire Dawa (β = 0.14) exhibited positive coefficients in the dispersion model, corresponding to higher shape parameters and therefore lower variability in maternal age at first birth compared with the reference region. This suggests that the timing of first birth among women in these regions is relatively more consistent within the observed sample.

Regarding religion, Orthodox Christian women exhibited significantly greater variability in maternal age at first birth compared with Muslim women (reference category) (β = −0.150, p = 0.008). This negative coefficient indicates a lower shape parameter and therefore greater dispersion in the age at first birth within this group. In contrast, the dispersion estimates for Protestant women and those belonging to other religious groups were not statistically different from the Muslim reference category, suggesting similar levels of variability in maternal age at first birth.

After accounting for regional and religious differences, rural women who had already experienced a first birth exhibited approximately 7% greater variability in maternal age at first birth compared with urban women. This result indicates that the timing of first birth among rural women in the sample is more dispersed than among their urban counterparts.

Overall, the dispersion model highlights notable differences in the consistency of maternal age at first birth across regions, religions, and place of residence, suggesting that variability in the timing of first childbirth differs across socio-demographic contexts within Ethiopia.

Bayesian Gamma regression model

To support the classical estimation, Bayesian inference using MCMC was conducted for GLM GLMM and distributional gamma regression. Proper Bayesian diagnostics are required to ensure convergence, model reliability, and posterior accuracy (Gelman, 2013). The potential scale reduction factor (R) was employed to assess convergence across chains. As shown in Table 7, all R values were below the threshold of 1.01, confirming convergence for all model parameters.

Table 7. Posterior Summaries and Convergence Diagnostics from Bayesian Gamma Regression Models (GLM and GLMM) for Maternal Age at First Birth.

Variable Categories GLM
GLMM
Estimate Mcse 𝐑^ 𝐧eff Estimate Mcse 𝐑^ 𝐧eff
Intercept 2.93047 0.00039 1.00223 1473 2.92856 0.00026 1.00075 3266
Residence Urban
Rural

−0.046

0.00011

1.00008

5232

−0.0462

0.00008

0.99971

10271
Religion Muslim
Orthodox
Protestant
Others

0.042
0.035
−0.053

0.00016
0.00016 0.00030

1.00159
1.00038
1.0000

2918
4208
5294

0.04024
0.03478
−0.0778

0.00011
0.00011
0.00021

0.99996
0.99990
1.00012

6079
9507
10495
Region Tigray
Afar −0.037 0.00039 1.00219 1793 −0.0574 0.00026 1.00013 3752
Amhara −0.0467 0.00027 1.00054 2303 −0.05119 0.00019 1.00107 4957
Oromia −0.0322 0.00034 1.00236 1809 −0.03649 0.00023 1.00042 3722
Somali 0.0935 0.00038 1.00191 1985 0.03583 0.00026 1.00017 4117
SNNPR −0.0284 0.00033 1.00216 1977 −0.03289 0.00022 1.0003 4250
Benishangul −0.0318 0.00034 1.00293 1944 −0.03841 0.00022 0.99995 4578
Gambela −0.0843 0.00034 1.00248 2071 −0.08705 0.00024 1.00028 4204
Hareri 0.0164 0.00037 1.00136 1861 0.01046 0.00024 1.00022 4092
Addis Ababa 0.10594 0.00035 1.00138 2171 0.10100 0.00025 0.99999 4483
Dere-dewa 0.03868 0.00037 1.00188 1875 0.03429 0.00025 1.00037 3913
Variation Shape 20.7452 0.00452 1.00005 7306 11.4501 0.00102 1.01000 3451

Visual inspection of trace plots (Fig 5) for parameters such as the intercept and Afar region further supports this conclusion. Each chain displayed overlapping paths with no systematic trends, indicative of successful exploration of the posterior distribution.

Fig 5. Post-warmup trace plot for the slope on intercept and (region) 2 (Afar).

Fig 5

Another alternative way to assess the fit of the Bayesian models, posterior predictive check was conducted. This approach involves generating replicated datasets from the posterior predictive distribution and comparing these simulations to the observed data [43]. The posterior predictive check (PPC) of the mean age at first birth (AFB) demonstrates that the model’s simulated datasets closely reproduce the distribution observed in the empirical data. The black curve, in Fig 6 representing the observed mean AFB distribution, peaks around 17–18 years and exhibits a gradual decline toward higher ages. The green curves, representing the posterior predictive simulations, overlap almost entirely with the observed distribution, indicating that the model accurately captures the central tendency, shape, and spread of the data. This strong alignment suggests that the Bayesian model provides an adequate fit to the observed mean AFB values, with no substantial bias or distortion in its predictions.

Fig 6. Graphical posterior predictive check for posterior predictive distribution of AFB.

Fig 6

The agreement between the observed and simulated curves in Fig 6 across the entire distribution, including both tails, highlights the model’s capacity to reflect the variability present in the data. The close correspondence implies that the prior specification, likelihood choice, and posterior estimates together provide a realistic generative mechanism for the observed patterns of AFB. Consequently, this PPC offers strong evidence of model adequacy in representing the key distributional characteristics of mean AFB, reinforcing the validity of subsequent inferences and policy-relevant conclusions drawn from the model outputs.

The other numerical critical diagnostic is the effective sample size (ESS) or n eff, which reflects the number of independent samples equivalent in information content to the correlated MCMC draws. Table 7 shows that ESS values exceed 1000 for all estimated parameters in models, satisfying standard convergence and precision criteria for Bayesian analysis [43].

The Monte Carlo standard error (MCSE) provides insight into the accuracy of MCMC approximations. A low MCSE relative to the posterior standard deviation indicates reliable posterior estimation. In Table 7 both GLM and GLMM models, all parameter MCSE values were minimal and well below the benchmark of 0.1, affirming that the posterior estimates are precise and the sampling process efficient [44].

The R^ (Gelman–Rubin diagnostic) values reported in Table 7 provide evidence of convergence for the Bayesian Gamma regression models under both GLM and GLMM frameworks. A R^ value close to 1.0 indicates that the Markov Chain Monte Carlo (MCMC) chains have mixed well and converged to the target posterior distribution. In these results, all covariates show R values ranging from 0.999 to 1.002, which is well within the commonly accepted threshold of <1.01, confirming excellent convergence across parameters. Even the shape parameter in the GLMM model, with R = 1.01, remains within acceptable limits, suggesting only a minimal deviation but still indicative of reliable convergence. Overall, these diagnostics demonstrate that the Bayesian estimation process was stable and produced trustworthy posterior summaries for both GLM and GLMM models.

As shown in Table 3, the distributional Gamma regression model estimated under the Bayesian framework demonstrated better predictive performance compared with the standard Gamma Generalized Linear Model (GLM) and the Gamma Generalized Linear Mixed Model (GLMM). This model jointly estimates both the mean (location) and the dispersion (shape) components of maternal age at first birth (AFB). The mean sub-model, specified with a log-link function, estimates the expected age at first birth as a function of the explanatory variables, while the dispersion sub-model captures differences in variability across regions, religions, and residential settings (Table 8).

Table 8. Bayesian Gamma Regression Estimates of Mean and Dispersion Parameters for Maternal Age at First Birth by Region, Religion, and Residence in Ethiopia.

Estimate Est.Error l-95% u-95% CI 𝐑^ Bulk_ESS Tail_ESS
Intercept 2.93 0.01 2.90 2.96 1.00 3765 5879
Shape inter 3.53 0.09 3.34 3.71 1.00 3330 5384
Afar 0.0021 0.02 −0.03 0.03 1.00 4670 7326
Amhara −0.04 0.01 −0.07 −0.02 1.00 6723 8552
Oromia −0.03 0.01 −0.06 −0.01 1.00 4565 6653
Somali 0.04 0.02 0.01 0.07 1.00 4633 6862
SNNPR −0.03 0.01 −0.05 −0.01 1.00 5346 7809
Benishangul −0.03 0.01 −0.06 −0.01 1.00 5143 7627
Gambela −0.08 0.01 −0.11 −0.05 1.00 5003 7074
Hareri 0.02 0.02 −0.01 0.05 1.00 4897 7352
Addis Ababa 0.11 0.02 0.07 0.14 1.00 6573 8820
Dere-dawa 0.04 0.01 0.01 0.07 1.00 4617 7464
Orthodox 0.04 0.01 0.03 0.06 1.00 6160 8090
Protestant 0.03 0.01 0.02 0.05 1.00 9903 9257
Others −0.05 0.02 −0.07 −0.02 1.00 14448 9160
Rural −0.05 0.01 −0.06 −0.03 1.00 15367 9645
Shape (the inverse of dispersion)
Afar −0.26 0.10 −0.33 −0.19 1.00 4045 6132
Amhara −0.18 0.08 −0.35 −0.02 1.00 5006 7641
Oromia −0.20 0.09 −0.37 −0.04 1.00 3791 5734
Somali −0.31 0.11 −0.52 −0.10 1.00 4187 6574
SNNPR −0.24 0.09 −0.33 −0.15 1.00 4312 6682
Benishangul −0.32 0.10 −0.51 −0.13 1.00 4134 6701
Gambela −0.28 0.10 −0.47 −0.08 1.00 4507 7633
Hareri −0.23 0.10 −0.42 −0.04 1.00 4188 7020
Addis Ababa 0.25 0.11 0.04 0.46 1.00 5363 7433
Dere-dawa 0.14 0.10 0.05 0.33 1.00 4334 6774
Orthodox −0.15 0.06 −0.26 −0.04 1.00 6297 8206
Protestant −0.08 0.07 −0.21 0.04 1.00 9826 9644
Others −0.17 0.14 −0.44 0.10 1.00 14005 9463
Rural −0.07 0.05 −0.09 −0.04 1.00 12020 9763

Convergence diagnostics indicate that the Bayesian estimation performed well. All R values are equal to 1.00, indicating excellent convergence of the Markov Chain Monte Carlo (MCMC) simulations. In addition, the Bulk Effective Sample Size (Bulk_ESS) and Tail Effective Sample Size (Tail_ESS) values are sufficiently large, suggesting stable and reliable posterior estimates for all parameters.

The log-link function ensures that predicted values for both the mean and the shape parameters remain positive. Bayesian estimation was implemented using the No-U-Turn Sampler (NUTS) algorithm, which provides robust posterior sampling and credible interval estimation. The intercept estimate for the mean sub-model (β = 2.93) represents the expected log maternal age at first birth for the reference group defined as women residing in Tigray, belonging to the Muslim religion, and living in urban areas. After exponentiation, this corresponds to an expected maternal age at first birth of approximately 18.7 years (exp(2.93)), with a 95% credible interval of [18.2, 19.3] years. In the dispersion sub-model, the intercept estimate (β = 3.53) represents the baseline shape parameter, indicating relatively low variability in maternal age at first birth for the reference group.

The results presented in Table 8 indicate notable regional variation in maternal age at first birth among women who had already experienced a first birth, after adjusting for religion and place of residence. Relative to Tigray, women residing in Addis Ababa had an estimated 11.6% higher expected age at first birth (95% credible interval: 7.3%–15%), while those in Dire Dawa had approximately 4% higher expected age (95% credible interval: 1%–7.3%). These positive associations indicate comparatively later initiation of childbearing among women in these regions within the observed sample.

In contrast, women in several regions exhibited significantly lower expected ages at first birth relative to Tigray. For example, women residing in Gambela had an estimated 7.7% lower expected age at first birth (95% credible interval: 5%–10%). Similarly, reductions in expected age at first birth were observed in Amhara (4%), Oromia (3%), SNNPR (2.9%), and Benishangul-Gumuz (3%), with credible intervals that did not include zero, indicating statistically meaningful differences relative to the reference region. Overall, these estimates suggest substantial geographic heterogeneity in maternal age at first birth among women who had already experienced childbirth.

Religion was also associated with differences in the expected age at first birth. After adjusting for region and place of residence, women identifying as Orthodox Christian had an expected maternal age at first birth approximately 4% higher than that of Muslim women (β = 0.04; 95% credible interval: 0.03–0.06). Similarly, Protestant women had an expected age at first birth approximately 3% higher than that of Muslim women (β = 0.03; 95% credible interval: 0.02–0.05).

In contrast, women categorized under other religions (including Catholic, traditional beliefs, and other affiliations) had an expected maternal age at first birth approximately 5% lower than that of Muslim women (β = −0.05; 95% credible interval: −0.07 to −0.02). These results indicate differences in maternal age at first birth across religious groups within the study sample.

After adjusting for region and religion, rural residence was associated with a lower expected maternal age at first birth compared with urban residence. Women who had already experienced a first birth and resided in rural areas had an expected age at first birth approximately 5% lower than that of urban women (β = −0.05; 95% credible interval: −0.06 to −0.03). This indicates comparatively earlier initiation of childbearing among rural women within the sample.

In the case of precision (shape) or inverse dispersion sub-model region, religion and place of residence were associated with the maternal age at first birth variability. Positive coefficients for each covariate categories means, higher precision and lower dispersion (variability).

A tight way to interpret shape sub-model is changing shape results in to change in dispersion by the formula dispersion = (exp(−βj)−1)×100%. After adjusting residence and religion, most regions except Addis Ababa and Dere-Dewa show negative coefficients in the shape sub-model. Women who had already give birth in Somali have about 36.3% ([e(0.31)1] 100%) higher dispersion (variability), while those in Benishangul exhibit a 37.7% higher variability. Similarly, SNNPR (27.1%), Gambela (32.3%), and Harari (25.9%) also display greater heterogeneity in maternal age at first birth. This suggests that in these regions, women’s age at first birth is much more diverse, with some starting childbearing very early and others delaying, reflecting socio-cultural and economic differences. In contrast, Addis Ababa and Dire Dawa show positive coefficients for the shape parameter, meaning they are associated with lower variability.

Specifically, Addis Ababa has about a 22.1% reduction in dispersion, while Dire Dawa shows a 13.1% reduction relative to the Tigray. This indicates that in urban and administrative centers, women’s age at first birth tends to cluster more tightly around the mean, with fewer extremes of very early or very late first births. Such homogeneity could reflect greater access to education, reproductive health services, family planning and delay of early marriage. Amhara, Oromia, and Afar also show elevated variability, though to a more moderate degree. For instance, Amhara has a 19.7% higher dispersion; Oromia has a 22.1% higher dispersion, and Afar a 29.8% higher dispersion. These moderate increases suggest that, although the majority of women still enter childbearing at relatively young ages, there is growing variation, possibly reflecting changing norms among educated or urban subgroups compared to traditional practices in rural and pastoralist areas.

Religious affiliation also predicts variability. Relative to the Muslim reference group (assuming your coding), Orthodox Christians show a 16.2% higher dispersion and those identifying as other religions have a 18.5% higher dispersion, suggesting greater heterogeneity in the timing of first birth. Protestants, however, show a smaller and uncertain effect, with only an 8.3% increase in dispersion, and the 95% CrI crosses zero, indicating no strong evidence of a difference. These results align with prior demographic studies that find religious affiliation shapes fertility norms and the age of family formation differently across Ethiopian communities.

Rural women exhibit 7.3% greater dispersion in the age at first birth compared with urban women. This means that rural areas not only tend to have earlier mean ages at first birth but also a wider spread, reflecting more inequality in reproductive trajectories. While many rural women begin childbearing very early, a subset may delay slightly due to educational opportunities, regional migration, or access to health services, thereby widening the variability compared to more uniform urban experiences.

The primary rationale for employing three modeling frameworks GLM, GLMM, and distributional Gamma regression was to assess whether the associations between key covariates and maternal age at first birth remained consistent across statistical models of increasing flexibility. The results reported in Tables 4, 6–8 show that both Maximum Likelihood Estimation (MLE) and Bayesian estimation produced coefficient estimates that were broadly consistent in both magnitude and direction across the three modeling approaches. This agreement suggests that the observed relationships between region, religion, place of residence, and maternal age at first birth are relatively stable and not strongly dependent on the specific estimation framework used. Such consistency across estimation methods provides additional confidence in the robustness of the findings.

At the same time, moving from simpler models such as the GLM to more flexible approaches including the GLMM and distributional Gamma regression provided methodological advantages. The GLM offers a useful baseline for examining associations between predictors and the mean maternal age at first birth. The GLMM extends this framework by accounting for potential clustering and unobserved heterogeneity through random effects, which may improve estimation when observations are not fully independent. The distributional Gamma regression model further extends the analysis by allowing covariates to influence not only the mean of maternal age at first birth but also its dispersion. This approach provides a more comprehensive representation of the data structure by capturing both differences in expected age and differences in variability across socio-demographic groups. Consequently, although the substantive conclusions regarding the direction of covariate effects remain similar across models, the more flexible models provide additional insight into heterogeneity in the timing of first birth.

Taken together, the results from the distributional Gamma regression model provide a more detailed understanding of maternal age at first birth among women who had already experienced a first birth in Ethiopia. By jointly modeling the mean and dispersion of the outcome, this framework highlights not only which groups tend to have relatively earlier or later first births, but also where the timing of first birth is more or less variable across subpopulations.

The mean sub-model indicates that maternal age at first birth differs across regions, religious groups, and place of residence. In particular, women residing in urban areas and those living in regions such as Addis Ababa and Dire Dawa tend to exhibit higher expected ages at first birth compared with the reference region. Conversely, lower expected ages at first birth were observed in several regions including Gambela, Amhara, Oromia, SNNPR, and Benishangul-Gumuz. These results highlight geographic and socio-demographic differences in the timing of first childbirth within the study population.

The dispersion (shape) sub-model further reveals that variability in maternal age at first birth also differs across regions and residential settings. Several regions including Afar, Benishangul-Gumuz, Harari, Gambela, Somali, SNNPR, Oromia, and Amhara exhibit lower shape parameters relative to Tigray, indicating greater dispersion in the age at first birth among women in these areas. In contrast, regions such as Addis Ababa and Dire Dawa show relatively higher shape parameters, corresponding to lower variability in the timing of first birth within the observed sample.

Differences in variability were also observed across religious groups. While Orthodox Christian women showed higher expected maternal age at first birth in the mean model, the dispersion model indicates greater variability in the timing of first birth within this group relative to Muslim women. In contrast, Protestant women and those belonging to other religious groups showed no statistically meaningful differences in variability compared with the reference category.

Place of residence also showed consistent associations with both the expected age and variability of first birth. Rural women who had already experienced a first birth exhibited lower expected ages at first birth compared with urban women, and the dispersion model indicates greater variability in maternal age at first birth among rural residents. These results suggest that both the timing and the consistency of first birth differ between rural and urban populations within the sample.

Overall, the distributional Gamma regression framework provides a comprehensive perspective by simultaneously examining differences in both the average timing and the variability of maternal age at first birth. This dual modeling approach helps characterize heterogeneity in reproductive timing across regions, religions, and residential settings among Ethiopian women who had already experienced childbirth.

Discussion

Maternal age at first birth is a critical indicator of reproductive health, population dynamics, and the socio-economic well-being of a society [2,3]. Evidence-based strategies to delay the timing of first birth can substantially reduce risks to both maternal and child health [4]. Although Ethiopia has made notable progress in increasing maternal age at first birth, considerable disparities persist across regions and religious groups among women who have already given birth [10,14,27]. Such inconsistencies hinder efforts to improve health outcomes for mothers and children [5,9,10]. This study investigates the magnitude and determinants of maternal age at first birth in Ethiopia, drawing on data from 5,839 women included in the 2019 Ethiopia Mini Demographic and Health Survey (Mini EDHS), conducted between March 21 and June 28, 2019.

This study utilizes a robust comparative modeling framework to examine maternal age at first birth (AFB). Given the right-skewed distribution of AFB (ranging from 10 to 49 years), the Gamma distribution was deemed most appropriate [37]. Specifically, we fitted Generalized Linear Models (GLM), Generalized Linear Mixed Models (GLMM), and distributional Gamma regression models under both classical (Maximum Likelihood Estimation) and Bayesian estimation via MCMC. This multi-model approach provided a comprehensive assessment of both the mean and dispersion of AFB, thereby capturing disparities that are often obscured when focusing solely on central tendencies.

From a methodological perspective, the results demonstrated that distributional Gamma regression consistently outperformed the standard GLM and GLMM under both estimation frameworks. The modeling both the shape and location parameters, were improving model fit, as evidenced by lower AIC, BIC, and deviance values (under MLE), and higher expected log predictive density (ELPD) using Bayesian leave-one-out (LOO) cross-validation. Unlike standard models, it jointly estimated the expected value (mean) and the variance (via the shape parameter), offering richer insights into both the timing and variability of childbearing. Bayesian implementation further enhanced inference quality, with excellent convergence diagnostics (R ≈ 1.00, high Bulk_ESS and Tail_ESS, and low MCSE), confirming robust posterior stability and credibility.

The findings reveal that the average age at first birth (AFB) in Ethiopia remains around 18 years with credible intervals ranging 18.2 to 19.3. This statistic aligns broader trends observed in the literature, where significant proportions of motherhood in Ethiopia fail to meet the recommendation of a minimum of 18 years during maternity [9,19]. Means that, a substantial proportion of births still occur below both the WHO-recommended minimum of 18 years and the sub-Saharan African average of 19 years [5]. This underscores a persistent public health concern, as early childbearing increases risks of maternal morbidity, poor neonatal outcomes, and intergenerational cycles of poverty [45]. Moreover, the average is far below the internationally recognized optimal window for first birth (late 20s to early 30s), underscoring the urgency of interventions. This difference may be attributed to variations in study design, methodological approaches, the selection of contextual variables, and the focus on mothers who had already given birth. Our finding is consistent with previous studies conducted in Ethiopia, which reported mean ages at first birth of 20 years [29] and 19 years [28], as well as with evidence from Bangladesh (16.34 years) [46], Kenya (20.3 years) [47], Swaziland (18.22 years) [48], Nigeria (19 years) [21], and Uganda (19.2 years) [49]. This similarity may be explained by the high prevalence of early marriage and initiation of sexual activity in these countries [29]. Early marriage often limits women’s autonomy in reproductive health decisions, resulting in early childbearing [50]. Another contributing factor could be the restricted educational opportunities for girls, particularly in predominantly rural populations, which encourages early marriage as a means of obtaining social and financial support [51]. In contrast, our result was notably lower than that observed in developed countries, where the mean age at first birth exceeds 30 years [51]. This difference may be attributed to the fact that adolescent girls in developed contexts are more likely to remain in school and many women enter the workforce to gain economic independence, thereby postponing childbearing [52,53]. Furthermore, women in developed countries generally possess better awareness of the risks associated with early childbirth and greater access to family planning services, which supports delaying the first birth [54].

While the national average underscores a persistent public health concern, further insights emerge when examining key factors such as region, religion, and residence. Although these variables were measured at the time of the survey, they are reasonably assumed to have remained stable over the preceding five years, thereby serving as valid determinants of maternal age at first birth. These factors consistently shape both the mean and variability of AFB across models. Marked regional disparities were observed. Compared with Tigray, women who had already give birth in Amhara (4%), Oromia (3%), SNNPR (3%), Benishangul-Gumuz (3%), and Gambela (8%) tended to initiate first births at younger ages, whereas those in Addis Ababa (11%), Somali (4%), and Dire Dawa (4%) delayed their first births. These patterns reflect underlying inequalities in education, economic development, and access to health services. This statistics were consistent with findings from previous studies in Ethiopia and sub-Saharan Africa [8,32,4547]. Notably, the distributional model revealed that regions such as Afar (37.7%), Benishangul-Gumuz (37.7%), Somali (36%), Gambela (32.3%), SNNPR (27.1%), and Harari (25.9%) exhibited higher variability in maternal age at first birth. This suggests heterogeneous fertility norms and unequal access to reproductive health services across these regions. In contrast, Addis Ababa (22%) and Dire Dawa (13%) not only had later average ages at first birth but also lower variability, reflecting more uniform reproductive patterns likely influenced by higher education levels, urbanization, and stronger health system infrastructure. These findings highlight important regional disparities with implications for targeted maternal and child health interventions.

Religion also associated in shaping AFB. Muslim women consistently had the lowest AFB, while Orthodox and Protestant women tended to delay childbirth. These results align with findings from Ethiopia [55], Uganda [49], and Nigeria [56], where religion strongly influences marriage timing, contraceptive uptake, and gender expectations. This is supported by other studies in Ethiopia and Nigeria in which early childbirth was found to be higher among Muslim followers than Orthodox [29,57]. The association between the Muslim religion and early childbirth might be attributed to the low uptake of contraceptive methods by Muslim followers [22,32]. Sometimes, some Muslims claim that having children is a source of blessing [9]. Besides, women’s educational attendance is low in the eastern part of the country where Muslims predominate and this might contribute to early marriage and early childbirth [9]. The distributional Gamma regression further revealed that Orthodox women showed greater variability in AFB, possibly reflecting tensions between modernization (delayed marriage and career pursuits) and traditional norms of early family formation.

Rural women were more likely to begin childbearing earlier than their urban counterparts, a finding corroborated by prior Ethiopian [9,28,58] and sub-Saharan regional studies [47,59] studies. This disparity reflects systemic barriers in rural areas, including limited education, weaker health infrastructures, higher rates of early marriage, and entrenched traditional norms. Rural women who had already give birth, tended to have 5% early average ages at first birth compared to urban counterparts. The reason for this might be because those women residing in rural areas are less knowledgeable about early birth impact, have less accessible to health information and could not easily access and utilize family planning services compared with urban dwellers. The presence of low utilization of modern contraceptive methods might contribute to hastened first child birth in rural areas [60]. Rural residents might have a high interest in having many children and start giving birth early on time. Besides, early marriage might be high in those rural areas. Moreover, women in rural areas are less likely to be educated and less likely to be from educated parents, which means they have poor awareness of the consequences of early childbirth and a high unmet need for contraceptives [21,61]. The variability in timing was also higher compared to urbanized or mixed regions. This greater variability may reflect uneven access to education, reproductive health services, and family planning in rural areas, combined with diverse cultural norms around marriage and childbearing. Such disparities result in some women marrying and giving birth very early, while others delay due to schooling, migration, or socioeconomic differences.

While these associations highlight the impact of religion, region, and residence in shaping maternal age at first birth, it is important to acknowledge that such factors rarely act independently. Their effects are often mediated through socioeconomic pathways, including maternal education, marital practices, occupation, and access to health services, as documented in previous studies [9,28,31,51]. For instance, the earlier AFB observed among Muslim and rural women may partly reflect lower levels of educational attainment and earlier marriage, whereas the greater variability among Orthodox and rural populations could be linked to unequal access to family planning and diverse cultural norms [19,27,29]. To assess the extent of such confounding, we conducted a sensitivity analysis by incorporating key socioeconomic covariates. This approach enabled us to disentangle the extent to which observed disparities were attributable to socioeconomic inequalities versus deeper cultural and structural determinants that continue to shape reproductive behavior.

The sensitivity analysis further allowed us to assess potential bias from omitted socioeconomic variables. As shown in Table 5, the inclusion of maternal education, marital status, occupation, age at first marriage, and health service access reduced the coefficients for region, religion, and residence by less than ±10%. For instance, the effect of urban residence decreased by 10.4%, while that of Addis Ababa dropped by 5.7%. Similarly, Muslim and Orthodox affiliation coefficients declined by 3.5% and 5.0%, respectively. These modest changes indicate that part of the observed disparities in maternal age at first birth can be attributed to differences in education, marriage timing, and health service availability. However, the persistence of statistically significant effects even after adjustment confirms that regional, religious, and residential differences reflect more than just socioeconomic inequalities; they also capture deep-rooted cultural norms, structural conditions, and geographic contexts that independently shape reproductive behavior of women who had already give birth in Ethiopia.

This study demonstrates that maternal age at first birth in Ethiopia remains alarmingly low, with profound disparities across regions, religions, and residential settings. By incorporating both mean and variability into the analysis, the distributional Gamma regression model provides a more nuanced understanding of these inequalities, offering critical evidence for equity-focused, context-specific interventions. Addressing both the timing and consistency of childbearing will be essential for achieving maternal and child health goals, reducing intergenerational cycles of disadvantage, and meeting Ethiopia’s commitments to the Sustainable Development Goals.

Strength and limitation of the study

The strengths include reliance on nationwide data and the employment of advanced statistical techniques in both mean and dispersion of maternal age at first birth, as well as adequate models which enhance the confidential applicability and reliability of the results. Moreover, it is believed that the Demographic and Health Survey (DHS), a widely accepted and validated instrument with a large, carefully curated sample size, further strengthens the credibility and accuracy of the results of this study.

This study has several limitations the authors need to acknowledge. First, it focused only on women who had given birth, excluding censored data, which may bias results and limit generalizability. Second, reliance on self-reported DHS data may introduce recall or reporting errors, especially for age at first birth and reproductive behaviors. Third, the cross-sectional design precludes causal inference between covariates and maternal age at first birth. Fourth, unmeasured confounders including maternal education, occupation, early marriage, marital status, household income, healthcare access and quality, cultural practices, and reproductive health knowledge may influence results. Fifth, regional and rural-urban classifications may mask intra-region heterogeneity, particularly in mobile or pastoralist populations, and age reporting errors may affect estimates of mean and variability. Finally, while the distributional Gamma regression captures mean and dispersion effectively, its parametric assumptions may not fully reflect extreme values or nonlinear patterns. Longitudinal studies incorporating broader socio-economic and cultural factors are warranted to strengthen causal inference and inform policy.

Conclusion

This study analyzed maternal age at first birth (AFB) in Ethiopia using data from 5,839 women captured in the Mini EDHS 2019. The analysis focused exclusively on women who had experienced at least one live birth, ensuring precise insights by excluding censored observations. The average AFB was found to be 18.7 years, with values ranging from 10 to 40 years highlighting wide demographic diversity and underscoring the urgency of addressing early childbearing in specific subpopulations.

Given the positive, right-skewed, and non-negative nature of the AFB, Gamma regression was the most appropriate modeling choice. Its distributional properties align well with the structure of the data, enabling effective modeling of both the mean and variability of AFB. The analysis employed three modeling frameworks Generalized Linear Models (GLM), Generalized Linear Mixed Models (GLMM), and distributional Gamma regression each estimated using both classical (MLE) and Bayesian methods. Among these, the distributional Gamma models consistently outperformed the standard GLM and GLMM, as indicated by lower AIC, BIC, and deviance (under MLE) and higher Bayesian model fit scores such as LOO, WAIC, the expected log predictive density (elpd). Bayesian models further showed strong convergence and improved parameter precision.

Across all models and estimation approaches, consistent and significant associations were identified between AFB and religious, regional and residential covariates. Women from rural areas, Muslim backgrounds, and regions like Amhara, Oromia, Benishangul-Gumuz, SNNPR and Gambela were more likely to experience early childbearing. In contrast, higher average AFB was observed among women in urban areas, particularly in regions such as Addis Ababa, Dire-Dawa and Somali, and among followers of Orthodox and Protestant religions. Importantly, the distributional Gamma model revealed that AFB variability itself is not constant greater dispersion was observed in certain regions like Afar, Gambela, Benishangul Gumuz, Amhara, Oromia, SNNPR, Somali and Harari and among Orthodox women, highlighting heterogeneous reproductive patterns within each group. Women who had already give birth in rural part of the country also have high variability in age at their first birth.

The findings provide robust evidence of geographical, religious, and residential disparities in reproductive average maternal age across Ethiopia. These insights point to the need for targeted policy interventions that are context-specific and equity-focused. Strategies should prioritize improving girls’ education, delaying early marriage, and expanding access to culturally sensitive reproductive health services, particularly in rural and high-risk regions.

The average maternal age at first birth in Ethiopia remains at 18 years just at the lower limit of the World Health Organization's recommendation. Alarmingly, in certain regions, the age of first childbirth can be as young as 10 years. On the other hand, some women in Addis Ababa and Amhara delay childbirth until after age 35, indicating significant demographic diversity. This wide range not only emphasizes the urgency of intervention but also highlights the importance of contextualized, locally adapted solutions.

Therefore, policymakers should prioritize targeted programs that address the sociocultural and geographic determinants of early childbirth. Investments in female education, access to reproductive health services, and culturally sensitive community engagement strategies are essential. The dual perspective offered by the distributional model on both mean and variability has significant policy implications. First, programs must go beyond raising mean AFB and also address disparities in timing, since high variability often reflects inequities in access to education, health services, family planning and cultural autonomy. Second, interventions should be region-specific: for example, urban centers like Addis Ababa may require strategies targeting heterogeneity within mixed populations, while peripheral regions like Afar, Somali, Gambela and Benishangul need tailored programs that address pastoralist lifestyles, limited service coverage, and entrenched fertility norms. Third, religious institutions and leaders should be engaged as key partners, given the strong role of faith in shaping fertility practices. Fourth, rural focused policies must prioritize delaying age at marriage, expanding girls’ secondary and higher education, strengthening youth-friendly reproductive health services, and addressing structural barriers to autonomy.

Supporting information

S1 Data. AFB data.

(CSV)

pone.0346504.s001.csv (325.5KB, csv)

Abbreviations

ADHS

Angola Demography and health survey

AFB

Age at first birth

AIC

Akaike Information Criterian

CSA

Central Statistical Agency

EA

Enumeration Area

EDHS

Ethiopian Demography and Health Survey

EPHI

Ethiopian Public Health Institute

ESS (nff)

Effective Posterior Sample size

EU

European Union

GLM

Generalized Linear Model

GLMM

Generalized Linear Mixed Model

HMC

Hamiltonian Monte Carlo

ICF

International Classification of Functioning

Liglik

Log likelihood

LOO

Leave-One-Out

MCMC

Monte Carlo Markov Chain

MCSE

Monte Carlo Standard Errror

MLE

Maximum Likelihood Estimation

NUTS

No-U-Turn Sampler

PPC

Posterior Predictive Checking

QQ

Quantile-Quantile

SNNPR

South Nation Nationality and People Region

UNICEF

United Nation International Children's Emergency Fund

WHO

World Health Organization.

Data Availability

All relevant data are within the manuscript and its Supporting information files.

Funding Statement

The author(s) received no specific funding for this work.

References

  • 1.Mathews TJ. Delayed childbearing: more women are having their first child later in life. US Department of Health and Human Services, Centers for Disease Control and Prevention, National Center for Health Statistics; 2009. [Google Scholar]
  • 2.Hossain MdM, Majumder AK. Determinants of the age of mother at first birth in Bangladesh: quantile regression approach. J Public Health (Berl). 2018;27(4):419–24. doi: 10.1007/s10389-018-0977-6 [DOI] [Google Scholar]
  • 3.Sobhan A, Moinuddin M, Hossain MM. Investigating time to first birth among women of reproductive age in Bangladesh: a survival analysis of nationwide cross-sectional survey data. J Health Popul Nutr. 2024;43(1):2. doi: 10.1186/s41043-023-00492-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Sakai T, Sugawara Y, Watanabe I, Watanabe T, Tomata Y, Nakaya N, et al. Age at first birth and long-term mortality for mothers: the Ohsaki cohort study. Environ Health Prev Med. 2017;22(1):24. doi: 10.1186/s12199-017-0631-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.World Health Organization. Global standards for quality health-care services for adolescents: A guide to implement a standard-driven approach to improve the quality of health care services for adolescents. Geneva: WHO; 2015. [Google Scholar]
  • 6.UNICEF. The state of the world’s children: Adolescence—an age of opportunity. New York: UNICEF; 2011. [Google Scholar]
  • 7.Kirbas A, Gulerman HC, Daglar K. Pregnancy in Adolescence: Is It an Obstetrical Risk? J Pediatr Adolesc Gynecol. 2016;29(4):367–71. doi: 10.1016/j.jpag.2015.12.010 [DOI] [PubMed] [Google Scholar]
  • 8.Pirkle CM, de Albuquerque Sousa ACP, Alvarado B, Zunzunegui M-V, IMIAS Research Group. Early maternal age at first birth is associated with chronic diseases and poor physical performance in older age: cross-sectional analysis from the International Mobility in Aging Study. BMC Public Health. 2014;14:293. doi: 10.1186/1471-2458-14-293 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Feyisa BR, Shiferaw D, Yesse M, Shama AT, Duressa LT, Biru B. Time to first birth and its predictors among reproductive-age women in Ethiopia: multilevel analysis using shared frailty model. BMJ Open. 2024;14(5):e082356. doi: 10.1136/bmjopen-2023-082356 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.UNICEF. Monitoring the situation of children and women. 2021. Available: https://data.unicef.org/topic/child-health/adolescent-health/
  • 11.Ali M, Alauddin S, Khatun MF, Maniruzzaman M, Islam SMS. Determinants of early age of mother at first birth in Bangladesh: a statistical analysis using a two-level multiple logistic regression model. J Public Health (Berl). 2020;29(5):1081–7. doi: 10.1007/s10389-020-01228-9 [DOI] [Google Scholar]
  • 12.Ahinkorah BO. Maternal age at first childbirth and under-five morbidity in sub-Saharan Africa: analysis of cross-sectional data of 32 countries. Arch Public Health. 2021;79(1):151. doi: 10.1186/s13690-021-00674-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.World Health Organization. Preventing early pregnancy and poor reproductive outcomes among adolescents in developing countries: What the evidence says. Geneva: WHO; 2012. [Google Scholar]
  • 14.Kassa GM, Arowojolu AO, Odukogbe A-TA, Yalew AW. Trends and determinants of teenage childbearing in Ethiopia: evidence from the 2000 to 2016 demographic and health surveys. Ital J Pediatr. 2019;45(1):153. doi: 10.1186/s13052-019-0745-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Fauser BCJM, Adamson GD, Boivin J, Chambers GM, de Geyter C, Dyer S. Declining global fertility rates and the implications for family planning and family building: An IFFS consensus document. Hum Reprod Update. 2024;30(2):153–73. doi: 10.1093/humupd/dmad028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Neal S, Channon AA, Chandra-Mouli V, Madise N. Trends in adolescent first births in sub-Saharan Africa: a tale of increasing inequity? Int J Equity Health. 2020;19(1):151. doi: 10.1186/s12939-020-01251-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Eurostat. Mean age of women at childbirth and at birth of first child—Statistics explained. European Commission. 2021. [Google Scholar]
  • 18.Kennedy E, Gray N, Azzopardi P, Creati M. Adolescent fertility and family planning in East Asia and the Pacific: a review of DHS reports. Reprod Health. 2011;8:11. doi: 10.1186/1742-4755-8-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Negash WD, Asmamaw DB. Time to first birth and its predictors among reproductive age women in high fertility countries in Sub-Saharan Africa: Inverse Weibull gamma shared frailty model. BMC Pregnancy Childbirth. 2022;22(1):844. doi: 10.1186/s12884-022-05206-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Campbell B, Martinelli-Heckadon S, Wong S. Motherhood in childhood. The state of world population. UNFPA. 2013. [Google Scholar]
  • 21.Fagbamigbe AF, Idemudia ES. Survival analysis and prognostic factors of timing of first childbirth among women in Nigeria. BMC Pregnancy Childbirth. 2016;16:102. doi: 10.1186/s12884-016-0895-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Central Statistical Agency (CSA) [Ethiopia] and ICF. Ethiopia demographic and health survey 2016. Addis Ababa and Rockville: CSA and ICF. 2016. [Google Scholar]
  • 23.Pradhan J, Arokiasamy P. Socioeconomic differentials in childhood mortality in India: A comparative study of national family health survey data. Int J Popul Res. 2016:1–11. doi: 10.1155/2016/1985487 [DOI] [Google Scholar]
  • 24.Worldometer (n.d.) World population clock. Available: https://www.worldometers.info
  • 25.Dorling D. World population prospects at the UN. The struggle for social sustainability. Bristol: Policy Press; 2021. p. 129–54. [Google Scholar]
  • 26.Trübswasser U, Baye K, Holdsworth M, Loeffen M, Feskens EJ, Talsma EF. Assessing factors influencing adolescents’ dietary behaviours in urban Ethiopia using participatory photography. Public Health Nutr. 2021;24(12):3615–23. doi: 10.1017/S1368980020002487 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Wakjira DB, Habedi D. Barriers to access and utilisation of sexual and reproductive health services among adolescents in Ethiopia: a sequential mixed-methods study. BMJ Open. 2022;12(11):e063294. doi: 10.1136/bmjopen-2022-063294 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Bitew DA, Habitu YA, Gelagay AA. Time to first birth and its determinants among married female youths in Ethiopia, 2020: survival analysis based on EDHS 2016. BMC Womens Health. 2021;21(1):278. doi: 10.1186/s12905-021-01414-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Dewau R, Mekonnen FA, Seretew WS. Time to first birth and its predictors among reproductive-age women in Ethiopia. BMC Womens Health. 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Motsima T. The Risk Factors Associated With Early Age At First Birth Amongst Angolan Women: Evidence From The 2015-2016 Angola Demographic And Health Survey. EJMED. 2020;2(2). doi: 10.24018/ejmed.2020.2.2.198 [DOI] [Google Scholar]
  • 31.Kitaw TA, Haile RN. Time to first childbirth and its predictors among reproductive-age women in Ethiopia: survival analysis of recent evidence from the EDHS 2019. Front Reprod Health. 2023;5:1165204. doi: 10.3389/frph.2023.1165204 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Ethiopian Public Health Institute (EPHI) and ICF. Ethiopia mini demographic and health survey 2019. Rockville: EPHI and ICF. 2021. [Google Scholar]
  • 33.Cepeda-Cuervo E, Migon HS, Garrido L, Achcar JA. Generalized linear models with random effects in the two-parameter exponential family. J Stat Comput Simul. 2014;84(3):513–25. [Google Scholar]
  • 34.Lee Y, Ronnegard L, Noh M. Data analysis using hierarchical generalized linear models with R. Boca Raton: CRC Press: 2017. [Google Scholar]
  • 35.Nelder JA, Wedderburn RW. Generalized linear models. J R Stat Soc A. 1972;135(3):370–84. [Google Scholar]
  • 36.Breslow NE, Clayton DG. Approximate inference in generalized linear mixed models. J Am Stat Assoc. 1993;88(421):9–25. [Google Scholar]
  • 37.Corrales-Bossio ML, Cepeda-Cuervo EA. Bayesian approach to mixed gamma regression models. Rev Colomb Estad. 2019;42(1):9–25. [Google Scholar]
  • 38.Adekanmbi V. Generalized gamma regression models with application to CD4 cell counts data of AIDS patients. Int J Appl Math Stat Sci. 2017;6(4):11–22. [Google Scholar]
  • 39.Cepeda-Cuervo E. Modelagem de variabilidade em modelos lineares generalizados. Universidade Federal do Rio de Janeiro. 2001. [Google Scholar]
  • 40.Cox DR, Reid N. Parameter orthogonality and approximate conditional inference. J R Stat Soc B. 1987;49(1):1–39. [Google Scholar]
  • 41.Hoffman MD, Gelman A. The no-u-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. J Mach Learn Res. 2014;15(1):1593–623. [Google Scholar]
  • 42.Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E. Equation of state calculations by fast computing machines. J Chem Phys. 1953;21(6):1087–92. doi: 10.1063/1.1699114 [DOI] [PubMed] [Google Scholar]
  • 43.Gabry J, Mahr T. bayesplot: Plotting for Bayesian models. R package version 1.5.0. 2017. Available: https://mc-stan.org/bayesplot/
  • 44.Gelman A, Carlin JB, Stern HS, Dunson DB, Vehtari A, Rubin DB. Bayesian data analysis. 3rd ed. Boca Raton: Chapman & Hall/CRC; 2013. [Google Scholar]
  • 45.Watson S. When can you get pregnant and what’s the best age to have a baby. Healthline. 2018. Available: https://www.healthline.com
  • 46.Karim MA, Faruk MO. Patterns and differentials of age at first motherhood among married adolescents in Bangladesh. Biomed Sci. 2021;7(3):86. [Google Scholar]
  • 47.Kenya National Bureau of Statistics. Kenya demographic and health survey. 2014.
  • 48.Mhele K. Determinants of time to first birth among women aged 15–24 in Swaziland. Afr Popul Stud. 2015;29(2). [Google Scholar]
  • 49.Uganda Demographic and Health Survey. Kampala, Uganda: Uganda Bureau of Statistics; 2018. [Google Scholar]
  • 50.Santhya KG. Early marriage and sexual and reproductive health vulnerabilities of young women: a synthesis of recent evidence from developing countries. Curr Opin Obstet Gynecol. 2011;23(5):334–9. doi: 10.1097/GCO.0b013e32834a93d2 [DOI] [PubMed] [Google Scholar]
  • 51.Muluneh EK, Alemu M. Correlates of age at first birth among women in Ethiopia: use of multilevel survival analysis models. Pan Afr Med J. 2023;44:190. doi: 10.11604/pamj.2023.44.190.36090 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Social Issues Research Centre. The changing face of motherhood. 2011.
  • 53.Hindin MJ, Fatusi AO. Adolescent sexual and reproductive health in developing countries: an overview of trends and interventions. Int Perspect Sex Reprod Health. 2009;35(2):58–62. doi: 10.1363/ipsrh.35.058.09 [DOI] [PubMed] [Google Scholar]
  • 54.Chandra-Mouli V, Svanemyr J, Amin A, Fogstad H, Say L, Girard F, et al. Twenty years after International Conference on Population and Development: where are we with adolescent sexual and reproductive health and rights? J Adolesc Health. 2015;56(1 Suppl):S1-6. doi: 10.1016/j.jadohealth.2014.09.015 [DOI] [PubMed] [Google Scholar]
  • 55.Bearinger LH, Sieving RE, Ferguson J, Sharma V. Global perspectives on the sexual and reproductive health of adolescents: patterns, prevention, and potential. Lancet. 2007;369(9568):1220–31. doi: 10.1016/S0140-6736(07)60367-5 [DOI] [PubMed] [Google Scholar]
  • 56.Bolarinwa OA, Ahinkorah BO, Seidu A-A, Mohammed A, Effiong FB, Hagan JE Jr, et al. Predictors of young maternal age at first birth among women of reproductive age in Nigeria. PLoS One. 2023;18(1):e0279404. doi: 10.1371/journal.pone.0279404 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Hussen NM, Arega GG, Shibeshi AH, Habtie GM, Kassa TH, Getaneh Arge K. Multilevel survival analysis of the age at first birth among women in Ethiopia. Front Reprod Health. 2024;6:1419537. doi: 10.3389/frph.2024.1419537 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Mitiku Alemu Y, Gualu Gobena M. Determinants of Time to First Birth among women in Ethiopia using Cox Proportional Hazards Model. jhgd. 2022;32(2):214–22. doi: 10.36311/jhgd.v32.13145 [DOI] [Google Scholar]
  • 59.Mugarura A, Kaberuka W, Atuhaire R. Factors determining the age at first birth in Uganda. J Issues. 2016. [Google Scholar]
  • 60.Nahar MZ, Zahangir MS. Patterns and determinants of age at first birth in Bangladesh. Turk J Popul Stud. 2013;35:63–77. [Google Scholar]
  • 61.Gebre MN, Edossa ZK. Modern contraceptive utilization and associated factors among reproductive-age women in Ethiopia: evidence from 2016 Ethiopia demographic and health survey. BMC Womens Health. 2020;20(1):61. doi: 10.1186/s12905-020-00923-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Denekew Bitew Belay

11 Aug 2025

Dear Dr.  Agegnehu,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Sep 25 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols....

We look forward to receiving your revised manuscript.

Kind regards,

Denekew Bitew Belay, Ph.D

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1.Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. PLOS requires an ORCID iD for the corresponding author in Editorial Manager on papers submitted after December 6th, 2016. Please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

3. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, we expect all author-generated code to be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

4. When completing the data availability statement of the submission form, you indicated that you will make your data available on acceptance. We strongly recommend all authors decide on a data sharing plan before acceptance, as the process can be lengthy and hold up publication timelines. Please note that, though access restrictions are acceptable now, your entire data will need to be made freely accessible if your manuscript is accepted for publication. This policy applies to all data except where public deposition would breach compliance with the protocol approved by your research ethics board. If you are unable to adhere to our open data policy, please kindly revise your statement to explain your reasoning and we will seek the editor's input on an exemption. Please be assured that, once you have provided your new statement, the assessment of your exemption will not hold up the peer review process.

5. Your ethics statement should only appear in the Methods section of your manuscript. If your ethics statement is written in any section besides the Methods, please delete it from any other section.

6. If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise.

[Note: HTML markup is below. Please do not edit.]

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: Yes

Reviewer #2: Partly

Reviewer #3: Yes

Reviewer #4: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: No

Reviewer #4: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.-->

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: No

Reviewer #4: Yes

**********

Reviewer #1: Peer Review Report

Manuscript: PONE-D-25-07127

Title: Modelling Determinants of Maternal Age at First Birth in Ethiopia: Gamma Regression Approach

Dear Editor,

I have carefully reviewed the manuscript "Modelling Determinants of Maternal Age at First Birth in Ethiopia: Gamma Regression Approach." Below is my assessment of the manuscript's limitations and recommendations for improvement.

Major Concerns

Variable selection rationale: The authors' decision to exclude key socioeconomic variables severely limits the study's explanatory power. On page 14, the authors state: "The exclusion criteria of variables from this study were established to ensure that the analysis focused on factors that directly relate to the first childbearing status of mothers. Maternal characteristics such as educational level, occupation status, and marital status, while important in broader contexts, were deemed insufficiently relevant to the specific inquiry regarding maternal age at first childbirth." This justification contradicts substantial literature demonstrating that education and marital status are among the strongest predictors of maternal age at first birth. For example, Essey Kebede Muluneh et al. (2023), which the authors cite, found significant associations between these variables and age at first birth. By excluding these variables, the authors have created models that likely suffer from omitted variable bias. The authors should either: (1) include these critical variables in their models while acknowledging the temporal limitation, or (2) conduct sensitivity analyses to demonstrate that their exclusion does not substantially alter the findings related to region, religion, and residence.

Statistical modeling decisions: The manuscript presents gamma regression as the appropriate choice but provides insufficient evidence for this selection. On page 15, the authors merely state: "It was observed that the dataset on the maternal age at first birth obtained from the Mini EDHS 2019 dataset follows a gamma distribution and consists of only positive values." This statement lacks statistical support. Figure 2 shows a visual comparison between normal and gamma distributions, but no formal goodness-of-fit tests are presented. The authors should:

Present results from statistical tests comparing the fit of different distributions (e.g., Akaike Information Criterion values for models using gamma, log-normal, and Weibull distributions)

Provide Q-Q plots comparing observed data against theoretical distributions

Justify the use of the log link function over alternative link functions for gamma regression

Focus on women who had given birth only: The authors' decision to exclude women without births fundamentally changes the research question and limits comparability with previous research. On page 15-16, they state: "However our concern for this study is mothers who had their first child birth only. Maternal age at first birth consists of non-negative values within this specific age range during the survey period." This approach introduces potential selection bias. Women who had not yet given birth by the survey date (particularly younger women) are systematically excluded, artificially truncating the distribution of the outcome variable. This could lead to biased estimates of the effects of predictors. For example, if highly educated urban women tend to delay childbirth beyond the survey period, their exclusion would underestimate the true effect of urban residence on maternal age at first birth. The authors should:

Explicitly discuss this limitation and its implications for interpreting results

Consider employing methods that account for right-censoring, such as survival analysis, and compare results with their current approach

At minimum, provide descriptive statistics comparing included and excluded women on key demographic characteristics

Causal inference limitations: Throughout the manuscript, the authors use language implying causal relationships despite the cross-sectional nature of the data. For instance, on page 33, they state: "The place of residence plays a crucial role in determining the age at which Ethiopian women have their first child, with women living in rural areas tending to give birth at a younger age than their urban counterparts." This phrasing suggests that place of residence determines age at first birth, when the data only support an association. Similar causal language appears when discussing religion and region. The manuscript should consistently use associational language (e.g., "associated with" rather than "plays a crucial role in determining") and explicitly acknowledge that unobserved confounders may explain some of the observed relationships.

Theoretical framework: The manuscript lacks a coherent theoretical framework explaining why region, religion, and residence would influence maternal age at first birth. The introduction provides general background on early childbearing but does not adequately theorize the specific mechanisms through which the selected variables might operate. For example, when discussing regional differences on page 33-34, the authors note differences in expected maternal age at first birth across regions but do not discuss why these differences exist (e.g., cultural norms, economic development, educational opportunities). A stronger theoretical framework would:

Propose specific mechanisms linking each predictor to the outcome

Discuss potential interactions between predictors (e.g., how religion might influence age at first birth differently in urban versus rural settings)

Identify potential mediating variables (which might include some of the excluded variables like education and marital status)

Minor Issues

Presentation of results:

Table 3 and Table 4 contain excessive numerical information that impedes readability. For example, Table 4 presents all parameter estimates with their standard errors, R-hat statistics, and effective sample sizes, resulting in an overwhelming amount of information.

Current format: Full tables with all diagnostic statistics for every parameter.

Recommendation: Split these tables into summary tables of key findings and move detailed diagnostics to supplementary materials. Additionally, consider using visual representations (e.g., coefficient plots) to display regression results instead of dense numerical tables.

Literature review organization:

The literature review (pages 7-10) currently presents studies chronologically rather than thematically, making it difficult to identify research gaps.

Current organization: "In East Asia and the Pacific, Bangladesh, Nigeria, Ghana, Sub-Saharan Africa, and Uganda, the median ages at first birth are 20.2, 16.34, 20, 19, and 19.2 years, respectively (Kennedy E, 2011; Karim MA, 2021; Fagbamigbe AF, 2016; Ida LA, Albert L, 2015; Negash WD, Asmamaw DB, 2022; Demographic Health survey, 2016)."

Recommendation: Reorganize by themes such as "Regional Variations in Age at First Birth," "Methodological Approaches to Studying First Birth Timing," and "Socioeconomic Determinants of Early Childbearing." This would help highlight specific gaps this study addresses.

Statistical terminology inconsistencies:

The manuscript uses "gamma regression model," "gamma family GLM," and "gamma family with log link" interchangeably throughout sections 3.2-3.4.

On page 17, the authors refer to both "shape parameter" and "dispersion parameter" for the same concept.

The term "mixed-effects" is used on page 15, while "random effects" is used on page 16 to describe the same model component.

Recommendation: Standardize terminology throughout the manuscript. For example, consistently use "gamma regression with log link" and select either "shape parameter" or "dispersion parameter" and maintain consistency.

Grammatical errors and typographical issues:

Page 8, line 2: "There is a clear associate between the age..." should be "There is a clear association between the age..."

Page 9, line 3: "...in a women's further education..." should be "...in a woman's further education..."

Page 15, line 12: "It is important to explore these alternative methods to ensure that the analysis of the age at first birth data is both accurate and meaningful. In this study, it was observed that the dataset on the maternal age at first birth obtained from the Mini EDHS 2019 dataset (excluding mothers without birth during survey period) follows a gamma distribution and consists of only positive values." - This paragraph contains redundancy and run-on sentences.

Throughout the manuscript: Inconsistent capitalization of "Mini EDHS" versus "mini EDHS"

Recommendation: Conduct a thorough grammar and spelling check. Consider simplifying complex sentences, particularly in the methodology section.

Figures quality and presentation:

Figure 1a: The histogram and density plot have poor resolution, with pixelated axis labels.

Figure 2: The legend overlaps with the plot, obscuring some data points.

Figure 4: The side-by-side comparison of GLM and GLMM posterior predictive checks lacks clear axis labels and sufficient resolution for publication.

Appendix 4: The trace plots are compressed and difficult to interpret with overlapping chain colors.

Recommendation: Recreate all figures using higher resolution settings (minimum 300 dpi). Ensure consistent font sizes across all figures (recommend 9-12pt). Position legends outside the plot area to avoid overlap with data. Add more descriptive axis labels that include units of measurement.

Specific Recommendations

Introduction:

Current: "The study focused on determining maternal age at first birth and investigating the factors influencing the age at first birth among women in Ethiopia with adequate model performance."

Improved: "This study aims to identify and quantify the influence of geographical, religious, and residential factors on maternal age at first birth in Ethiopia, while evaluating the adequacy of gamma regression models for analyzing this outcome."

Methods:

Current: "In this study, it was observed that the dataset on the maternal age at first birth obtained from the Mini EDHS 2019 dataset follows a gamma distribution and consists of only positive values."

Improved: "We evaluated the distributional properties of maternal age at first birth in the 2019 Mini EDHS dataset using goodness-of-fit tests and Q-Q plots. Results indicated that the gamma distribution provides an appropriate fit for this strictly positive, right-skewed outcome variable compared to alternative distributions (e.g., log-normal, Weibull)."

Results:

Current: "The result indicates that women in rural areas had a 5% lower expected age at first birth compared to women living in urban area."

Improved: "After adjusting for region and religion, women residing in rural areas had a 4.6% lower expected age at first birth compared to urban women (β = -0.047, 95% CI: [-0.061, -0.031])."

Discussion:

Current: "With the average age for women's first childbirth at 18 years, which is lower than sub-Saharan Africa age at first birth of 19 years this study recommends collaboration between federal and regional stakeholders including religious institutions to enforce legal age regulations for first births."

Improved: "Our findings indicate that the average age at first childbirth in Ethiopia (18 years) remains below both the WHO recommended minimum age of 18 years and the sub-Saharan African average of 19 years. These results suggest a need for targeted interventions that address regional and religious disparities in early childbearing, potentially through coordinated efforts between federal agencies, regional authorities, and religious institutions to promote awareness and enforcement of existing legal age regulations."

Conclusion

This manuscript addresses an important topic but requires substantial revisions before publication in PLOS ONE. The methodological approach needs stronger justification, the presentation of results should be improved for clarity, and various inconsistencies in terminology and writing need to be addressed.

Reviewer #2: Review Comments to the Author

Thank you for the opportunity to review this manuscript titled “Modelling Determinants of Maternal Age at First Birth in Ethiopia: Gamma Regression Approach.” The study investigates an important demographic and public health issue by analyzing determinants of maternal age at first birth using data from the 2019 Ethiopia Mini DHS. While the topic is highly relevant, and the modeling approach is technically appropriate, the manuscript requires significant improvements in clarity, interpretation, and methodological justification before it is ready for publication.

1. Scientific and Technical Soundness

The analysis is largely methodologically sound. The Gamma regression model is an appropriate choice for modeling positively skewed continuous data such as maternal age at first birth. The use of both Generalized Linear Models (GLM) and Generalized Linear Mixed Models (GLMM) is commendable, as is the inclusion of both maximum likelihood and Bayesian estimation techniques with appropriate diagnostic checks (R < 1.1, high ESS, low MCSE).

However, the justification for excluding key socio-demographic variables (e.g., education, marital status, occupation) due to timing of measurement is a concern. These are important predictors of maternal age at first birth and their exclusion may introduce bias. At the very least, a deeper discussion of potential omitted variable bias and its implications on results should be provided.

Additionally, a brief rationale for choosing Gamma regression over time-to-event models (e.g., Cox proportional hazards, AFT models) would strengthen the manuscript, particularly given the nature of the outcome (age at first birth).

2. Data and Availability

The manuscript complies with PLOS ONE’s data availability policy. The data are drawn from a publicly accessible, de-identified source—the 2019 Ethiopia Mini DHS via the Measure DHS program. The authors clearly state that all datasets are fully available without restriction, which meets the journal’s requirements. However, they should provide the exact link to the data access page and any relevant dataset identifiers to facilitate reproducibility.

3. Statistical Rigor and Interpretation

The statistical analysis is rigorous, with well-justified use of a Gamma distribution and appropriate model diagnostics. The dual estimation approach (MLE and Bayesian) is a notable strength and enhances confidence in the robustness of the findings. Diagnostic techniques such as posterior predictive checks, convergence monitoring, and model fit metrics are all appropriately used.

That said, the interpretation of model results is relatively limited. The discussion would benefit from more explanation of why particular regional or religious groups are associated with earlier or later age at first birth. Linking statistical results to sociocultural or health system factors in Ethiopia would give the analysis greater depth and relevance.

Also, while model fit statistics like AIC and BIC are mentioned, it would be helpful to clearly present them in a summary table and offer a more direct comparison of the two models (GLM vs. GLMM) to guide reader understanding.

4. Writing Quality and Clarity

This is a major weakness of the manuscript. The English language quality is not sufficient for publication in its current form. The paper contains frequent grammatical errors, awkward phrasing, incorrect syntax, and unclear sentence structures that significantly hinder readability. For example:

• “The teenage a woman is…” → should be “The younger a woman is…”

• “mor far” → should be “more distant”

• Phrases such as “This issue is linked to various negative impacts…” are vague/unclear and redundant.

The Introduction and Background sections are too long and repetitive. Several paragraphs reiterate the same facts about early childbirth and maternal health risks, often using slightly different wording. This redundancy should be minimized to improve focus and clarity.

The Bayesian estimation section is also overly technical for a general readership. Simplifying the language and breaking up long blocks of equations and dense paragraphs will improve accessibility.

A professional English-language editor or native speaker with expertise in scientific writing should be engaged to revise the manuscript thoroughly.

5. Figures, Tables, and Visual Aids

Some of the figures (e.g., Figure 1a–1c, Figure 2) are not clearly labeled or well integrated into the discussion. Each figure should be accompanied by a complete, self-contained caption and referenced explicitly in the text with interpretation. The density plots and boxplots are helpful, but their readability can be improved.

Additionally, a summary table of model fit statistics (AIC, BIC, LOO, etc.) and key coefficient estimates with credible/confidence intervals would strengthen the presentation.

6. Ethics and Compliance

The authors state that the study received appropriate ethical approval from the IRB of ICF, with data anonymized to prevent identification. This aligns with ethical standards for secondary data analysis.

No concerns were identified regarding dual publication, misconduct, or research ethics.

Conclusion and Recommendation

This study has the potential to make a valuable contribution to the literature on early childbirth and maternal health in Ethiopia. The modeling approach is sound, and the use of both classical and Bayesian estimation adds methodological richness.

However, major revisions are required before the manuscript is suitable for publication. These include:

• Substantial improvement of English language and clarity

• More comprehensive discussion of excluded variables and potential biases

• Stronger interpretation of statistical results in their social and policy context

• Clearer presentation of model comparison and fit statistics

I encourage the authors to revise and resubmit the manuscript with these improvements. Once addressed, this work could make a meaningful contribution to public health research and policy in Ethiopia and similar settings.

Reviewer #3: 1. General Comments

This manuscript investigates the relationship between longitudinal CD4 count trajectories and survival in HIV patients on ART in Ethiopia using joint modeling techniques. The topic is clinically important and relevant, especially in low-resource settings where treatment monitoring is critical.

The authors have applied appropriate statistical techniques and reported their findings clearly. However, certain elements—especially the necessity and justification for joint modeling—require clarification. Furthermore, improvements in model reporting, comparison with separate models, and explanation of assumptions and missing data handling would enhance the manuscript’s methodological strength and transparency.

2. Specific Comments

2.1. Justification for Joint Modeling

Joint models are most appropriate when there is informative dropout or when the longitudinal process (CD4 count) influences the risk of the event (death). While the manuscript reports a significant association parameter (α), it does not provide sufficient justification that joint modeling was required over separate models.

Please clarify:

Whether missing data in CD4 counts were non-random or informative.

How separate models (LMM and Cox) performed in comparison to the joint model. Including such results would demonstrate the added value of joint modeling.

2.2. Association Structure

The model appears to use a shared random intercept to link the longitudinal and survival processes.

Please explain:

Why the random intercept alone was chosen—was random slope variability tested and ruled out?

Whether other association structures (e.g., current value or slope) were considered.

2.3. Handling of Missing Data

The manuscript does not describe the extent or mechanism of missingness in CD4 measurements.

Please include:

A description of the pattern and proportion of missing CD4 data.

Whether missingness was tested (e.g., MNAR vs MAR) and whether any sensitivity analyses were done.

2.4. Time-Varying Covariates

It is not clear how time-varying factors (e.g., adherence, regimen change) were modeled—if at all.

Please clarify whether such variables were:

Modeled dynamically, updated at each measurement point.

Or treated as baseline values.

2.5. Model Fit and Assumption Checks

Please expand on:

How model assumptions (e.g., proportional hazards in the Cox model) were tested.

Model diagnostics for residuals and random effects.

Any convergence issues or estimation warnings, particularly for the joint model.

3. Minor Comments

-Specify which R packages were used (e.g., JM, JMbayes, joineR).

-Clarify the clinical interpretation of the α parameter—what does the magnitude imply for patient risk?

-Tables and figures are generally informative. However, adding:

-Individual CD4 trajectories or spaghetti plots.

-A side-by-side comparison of separate model vs. joint model results (even as supplementary material), would be useful.

Reviewer #4: The manuscript on modeling maternal age at first birth using Gamma regression (GLM and GLMM) presents an interesting and relevant study; however, it is currently prepared in a style that resembles a thesis rather than a journal article and does not fully align with the PLOS One formatting requirements. The methodology section on Gamma regression and estimation should be presented more concisely, focusing on the study-specific application, as it is an established method. Figures require higher resolution, improved labeling, and more informative captions. The authors should clearly demonstrate that model assumptions have been checked and met. Since the study uses Demographic and Health Survey (DHS) data, the authors are encouraged to consider resampling techniques to address potential sampling bias or imbalance in the dataset. Additionally, grammar and sentence structure need improvement to enhance clarity and readability.

**********

what does this mean?). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our Privacy Policy..-->

Reviewer #1: No

Reviewer #2: Yes: Adekunle Fatai AdeoyeAdekunle Fatai AdeoyeAdekunle Fatai AdeoyeAdekunle Fatai Adeoye

Reviewer #3: Yes: Dr. Muhammad AasimDr. Muhammad AasimDr. Muhammad AasimDr. Muhammad Aasim

Reviewer #4: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.. Please note that Supporting Information files do not need this step.. Please note that Supporting Information files do not need this step.. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: PLOS One- Maternal Age Review.pdf

pone.0346504.s002.pdf (102.6KB, pdf)
PLoS One. 2026 Apr 13;21(4):e0346504. doi: 10.1371/journal.pone.0346504.r002

Author response to Decision Letter 1


11 Sep 2025

Reviewer 1: All comments were revised accordingly

We thank the reviewer for highlighting this point. Our study focused on maternal characteristics directly influencing first birth timing. To address this concern, we conducted a sensitivity analysis including education, marital status, and occupation (results section, Table 5). Results show that their inclusion small changes in magnitude and does not change direction in the associations between region, religion, and residence with the response variable maternal age at first birth. We also clarified the limitations of variable exclusion in the discussion (page 46 under strengths and limitations sub title).

We now provide formal goodness-of-fit assessments comparing Gamma, Normal, log-normal, and Weibull distributions, including AIC values and Q-Q plots (Table 2, Figure 5). Results indicate that the Gamma distribution with log link best fits the strictly positive, right-skewed outcome. The log link ensures positive predicted values and allows multiplicative interpretation of covariate effects, consistent with standard practice.

We acknowledge this limitation and the study objective is to see only mothers who had already give birth. While survival analysis could accommodate censoring, the objective of this study is to see only women who had given birth and associations with regional religious and residential factors; thus, our approach remains appropriate for cross-sectional analysis. Results are interpreted as associations among women who have already had first birth.

All statements throughout the manuscript have been revised to use associational language (e.g., associated with instead of “determines”) and to acknowledge potential unobserved confounders.

We strengthened the theoretical framework in the introduction (pages 4–5), discussing potential mechanisms.

Tables have been split: key coefficients and 95% CIs appear in main manuscript (Table 7, 8), detailed diagnostics (R, ESS, MCSE) also included avoiding more tables. Coefficient plots especially trace plot for Afar region and for intercept were added for improved readability (Figure 6).

The literature review is reorganized thematically into: (i) regional variations, (ii) methodological approaches, and (iii) determinants, highlighting research gaps (pages 3-4).

Terminology standardized: “Gamma regression with log link”, “dispersion parameter”, and “random effects” are used consistently throughout the manuscript.

Manuscript has been carefully proofread. Redundant sentences removed; capitalization of “Mini EDHS 2019” standardized; complex methodology sentences simplified.

Figures have been recreated at high resolution, with clear axis labels, consistent font size (10–12 pt), legends placed outside plot areas, and enhanced readability (Figures 1-7).

All recommended phrasing improvements have been incorporated. Results now report adjusted percent differences with 95% CIs. Discussion contextualizes findings in social and policy terms.

Reviewer 2 Response

See response to Reviewer #1, Major Concern 1. Sensitivity analysis confirms main associations are robust; limitations are explicitly discussed.

Outcome is observed age at first birth among women who already had a birth. Gamma regression appropriately models a strictly positive, right-skewed variable. Survival models require longitudinal follow-up and censored data. So the objective of the study is to determine magnitude and regional religious and residential associations with women’s first age at birth who had already had birth.

Revised Data Availability Statement now includes direct DHS link and dataset identifiers (page 8) for reproducibility.

Discussion now elaborates on why specific regions and religious groups are associated with earlier or later first birth, drawing on cultural, educational, and health system factors (pages 32-40).

Added a summary table of AIC, BIC, LOO, WAIC comparing GLM, GLMM and distributional gamma regression (Table 3), with clear interpretation.

Manuscript thoroughly revised for clarity and readability; Bayesian estimation section simplified with clear explanations; professional scientific editor engaged.

Figures updated for resolution and clarity; table summarizing key coefficients and CIs added; supplementary tables include detailed diagnostics.

Ethical statement now justified.

Reviewer 3

We thank the reviewer. (i) Justification for Gamma regression strengthened with formal goodness-of-fit comparisons and Q-Q plots (Figure 5, Table 3); (ii) clear GLM vs. GLMM comparison added (Table 3), highlighting fit and interpretation of random effects; (iii) expanded diagnostics provided, including posterior predictive checks, trace plots, and convergence statistics (Table 7-8); (iv) discussion now emphasizes policy-relevant implications, offering actionable recommendations for regional, religious, and rural urban disparities in maternal age at first birth.

Reviewer 4

We thank the reviewer. (i) Methodology section streamlined to focus on study-specific application of Gamma regression, with references to standard texts; (ii) all figures recreated at high resolution with clear axis labels, legends outside plots, and informative captions (Figures 1-7); (iii) assumption checks added, including Q-Q plots, residual analyses, and discussion confirming Gamma regression assumptions are met; (iv) resampling via bootstrap conducted to assess potential DHS sampling imbalance, confirming robustness; (v) manuscript underwent thorough language editing to improve grammar, clarity, and readability.

Attachment

Submitted filename: Response to Reviewers.docx

pone.0346504.s003.docx (17.2KB, docx)

Decision Letter 1

Denekew Bitew Belay

5 Nov 2025

Dear Dr. Agegnehu,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Dec 20 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols....

We look forward to receiving your revised manuscript.

Kind regards,

Denekew Bitew Belay, Ph.D

Academic Editor

PLOS ONE

Journal Requirements:

If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise.

Additional Editor Comments:

Please remove the headings and subheadings from the objectives section and integrate those listed items into the introduction as a coherent sentence.

There are several irrelevant subheadings in your methods section that need to be formatted properly.

You need to combine Sections 2 and 3 into a single section titled 'Data and Methods'. Then, Section 3 should become Results, and Section 4 should be Discussion to ensure a logical flow of ideas.

Substantive language editing is needed. All these issues to be addressed clearly.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #1: All comments have been addressed

Reviewer #3: All comments have been addressed

Reviewer #4: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions??>

Reviewer #1: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #1: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.-->

Reviewer #1: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #1: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

Reviewer #1: Thank you for giving detailed answers to the comments. In the present form, the manuscript adequately presents your results. I found your corrections appropriate.

Reviewer #3: The revised version demonstrates excellent improvement in statistical justification, transparency, and scientific communication, fully addressing previous methodological critiques. The study is now suitable for publication following minor polishing of numeric interpretation and reference formatting.

Remaining Minor Recommendations

Add quantitative interpretation of key effects in natural units (e.g., “≈ 0.9 years earlier” rather than “5 % lower”).

Report random-effect variance (σ² or ICC) explicitly to reinforce GLMM justification.

Check reference formatting—some citations still lack year-parenthesis uniformity

Figures 2 & 3:

The labeling and captioning are inconsistent with the figure content. Figure 2 needs category labels, axis units, and a legend; Figure 3’s caption should be corrected to reflect that only regional frequencies are shown, not residence or religion. The color scheme should be simplified and consistent across figures. ALso check other figures too for inconsistencies.

Reviewer #4: The manuscript has improved considerably; however, please review the punctuation and formatting of the equations for consistency. Some equations need proper alignment and spacing, and equations that conclude a sentence should end with a full stop, while those followed by a continuation should use a comma. Ensuring this consistency will enhance the overall readability and professionalism of the manuscript.

**********

what does this mean?). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our Privacy Policy..-->

Reviewer #1: Yes: Ali CetinAli CetinAli CetinAli Cetin

Reviewer #3: Yes: Dr. Muhammad AasimDr. Muhammad AasimDr. Muhammad AasimDr. Muhammad Aasim

Reviewer #4: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

To ensure your figures meet our technical requirements, please review our figure guidelines: https://journals.plos.org/plosone/s/figures

You may also use PLOS’s free figure tool, NAAS, to help you prepare publication quality figures: https://journals.plos.org/plosone/s/figures#loc-tools-for-figure-preparation.

NAAS will assess whether your figures meet our technical requirements by comparing each figure against our figure specifications.

Decision Letter 2

Yimam Getaneh Misganie

4 Mar 2026

The topic is important and the statistical modeling framework is technically ambitious. However, several substantial methodological and interpretational issues must be addressed before the manuscript can be considered further. Please address each point explicitly in your revision.

1. Definition of the Target Population and Exclusion of Censored Observations

The study restricts analysis to women who had already experienced first birth at the time of the survey. However, the manuscript presents findings as national population-level estimates (e.g., “The average maternal age at first birth in Ethiopia was approximately 18.7 years…”). Age at first birth is inherently a time-to-event outcome. Women who have not yet had a first birth are right-censored. Excluding them changes the research question and affects generalizability.

Required revisions: Reframe all national-level statements to clarify that estimates apply only to women who had experienced first birth by survey time, Revise the Abstract, Results, and Conclusion to remove unconditional population-level language, Add a clearly labeled limitation subsection discussing: Implications of excluding censored observations, Potential bias in mean estimates and Comparability with DHS-reported median age at first birth (which uses survival methods) and Provide a clearer justification for not using survival analysis and discuss how conclusions may differ under that framework.

2. Survey Design and Use of Sampling Weights

The 2019 Mini EDHS employs a complex two-stage cluster sampling design with stratification and sampling weights. The manuscript does not clearly describe whether: Sampling weights were incorporated, Clustering was accounted for in variance estimation, and Stratification was considered. Failure to account for survey design may bias point estimates and standard errors.

Required revisions: Explicitly state whether sampling weights were applied, Describe how clustering and stratification were handled, If survey design was not incorporated, revise the analysis accordingly, If incorporated, provide sufficient methodological detail for reproducibility.

3. Interpretation and Scope of Inference

Several statements use language suggesting causal inference or national-level policy implications (e.g., “determinants,” strong normative conclusions regarding religion or region). Given: Cross-sectional design, Conditional sample (only women who had first birth), Limited covariate adjustment, and Causal interpretation is not supported.

Required revisions: Replace causal terminology (e.g., “determinants”) with associative language, Moderate policy recommendations to reflect methodological constraints, Avoid overgeneralized interpretations regarding religious or regional groups.

4. Omitted Variable Bias and Model Specification

Education, marital status, and occupation were excluded due to temporal concerns. While the reasoning is noted, these variables are well-established correlates of age at first birth and may confound region and religion effects.

Required revisions: Expand the discussion of potential omitted variable bias, Clarify that associations may reflect unmeasured confounding, Consider sensitivity analysis including education (if feasible), or clearly justify why this is not possible.

5. Model Comparison and Diagnostics

Multiple modeling frameworks are described (GLM, GLMM, distributional Gamma regression; MLE and Bayesian estimation). However, the comparative performance of these models is not presented with sufficient clarity.

Required revisions: Provide a clear model comparison table summarizing (AIC/BIC/log-likelihood (MLE models), and LOO/ELPD (Bayesian models)), Clearly state which model is preferred and why, Report convergence diagnostics (R-hat, effective sample size), Provide posterior predictive checks where applicable.

6. Balance Between Mathematical Derivation and Substantive Interpretation

The manuscript contains extensive derivations of likelihood functions and Hessians. While mathematically rigorous, the presentation may exceed what is necessary for an applied demographic analysis.

Suggested revision: Consider moving detailed derivations to supplementary material and Strengthen substantive interpretation of findings.

7. Data Quality Clarification

The reported minimum age at first birth (e.g., 10 years) should be verified and contextualized. Please confirm whether this reflects valid observations or data entry anomalies.

8. Data Availability Statement

The manuscript states that data are available upon request from the corresponding author. DHS data are publicly available through the DHS Program website upon application. Please ensure consistency with journal data availability requirements.

Please submit your revised manuscript by Apr 18 2026 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

  • A letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols....

We look forward to receiving your revised manuscript.

Kind regards,

Yimam Getaneh, PhD, PhD

Academic Editor

PLOS One

Journal Requirements:

If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise.

Additional Editor Comments (if provided):

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #3: (No Response)

Reviewer #4: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions??>

Reviewer #3: Yes

Reviewer #4: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #3: Yes

Reviewer #4: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.-->

Reviewer #3: Yes

Reviewer #4: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #3: Yes

Reviewer #4: No

**********

Reviewer #3: To me, Figure 3 appears to present only the regional distribution of study participants (frequency by region). However, this information is already reported in Table 1 as part of the baseline characteristics. As such, the figure does not add new analytical value and represents a repetition of previously presented results.

Furthermore, despite being described as illustrating “maternal age at first birth patterns,” the figure contains no age-related information and therefore does not support the stated interpretation.

Recommendation:

The authors should either

(i) remove Figure 3 to avoid redundancy, or

(ii) replace it with a figure that genuinely presents maternal age at first birth by region (e.g., age-group distributions, mean/median age with variability, or distributional plots) (already displayed by boxplot in figure 2 ).

Reviewer #4: I commend the authors for carefully addressing the reviewers’ comments, which have led to a clear improvement in the quality and presentation of the manuscript. The study is well motivated and the applied analysis is sound.

However, since the manuscript focuses on an applied statistical investigation rather than methodological advancement, and the techniques employed (Gamma regression, maximum likelihood estimation, and Bayesian estimation) are well established in the literature, I recommend that the detailed mathematical derivations presented in the sections on the modelling framework and estimation approaches be substantially reduced. Instead, these sections could be streamlined by briefly introducing the methods, clearly specifying the statistical models used, and explaining the variables and parameters involved, without reproducing standard derivations.

This revision would improve readability and align the manuscript more closely with the conventions of applied statistical articles.

**********

what does this mean?). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our Privacy Policy..-->

Reviewer #3: Yes: Dr. Muhammad AasimDr. Muhammad AasimDr. Muhammad AasimDr. Muhammad Aasim

Reviewer #4: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

To ensure your figures meet our technical requirements, please review our figure guidelines: https://journals.plos.org/plosone/s/figures

You may also use PLOS’s free figure tool, NAAS, to help you prepare publication quality figures: https://journals.plos.org/plosone/s/figures#loc-tools-for-figure-preparation.

NAAS will assess whether your figures meet our technical requirements by comparing each figure against our figure specifications.

PLoS One. 2026 Apr 13;21(4):e0346504. doi: 10.1371/journal.pone.0346504.r006

Author response to Decision Letter 3


17 Mar 2026

We sincerely thank the Academic Editor and the reviewers for their constructive comments. Their suggestions significantly improved the clarity, methodological transparency, and interpretation of the manuscript. All comments have been carefully considered and addressed in the revised manuscript. Detailed responses are provided below.

A. Responses to Academic Editor Comments

Editor Comment Author Response Location in Revised Manuscript

Definition of Target Population and Exclusion of Censored Observations We thank the editor for highlighting this important methodological issue. The analysis indeed focuses only on women who had experienced a first birth by the time of the survey. To avoid misinterpretation, we revised the manuscript to clarify that all estimates apply only to women who had already experienced first birth at the time of the survey rather than to all women in Ethiopia.

Statements implying unconditional population-level inference were removed or rephrased throughout the Abstract, Results, and Conclusion sections. Additionally, a new limitation subsection has been added explaining that women without a first birth represent right-censored observations in a time-to-event framework and that excluding them may bias mean estimates toward younger ages. We also added a discussion comparing our estimates with the DHS median age at first birth obtained using survival analysis and clarified why a regression approach was chosen for conditional modeling of observed ages. Abstract, Results, Conclusion revised; new subsection “Study Limitations” added in Discussion

Survey Design and Sampling Weights We appreciate the editor’s suggestion. The Methods section has been revised to clearly describe the two-stage stratified cluster sampling design used in the 2019 Ethiopia Mini Demographic and Health Survey. Sampling weights provided by the DHS were incorporated in the analysis to ensure representativeness. We also clarified how clustering and stratification were handled in variance estimation. Additional methodological details were added to allow reproducibility of the analysis. Methods section – “Data Source and Survey Design”

Interpretation and Scope of Inference We agree that causal interpretations should be avoided. The manuscript has been revised to replace causal terminology such as “determinants” with “factors associated with maternal age at first birth.” Policy interpretations have been moderated to reflect the cross-sectional nature of the data and the conditional sample of women who had experienced first birth. Statements regarding religion and regional differences were rewritten to emphasize associations rather than causal relationships. Title, Abstract, Discussion revised

Omitted Variable Bias and Model Specification We appreciate this important comment. The Discussion section has been expanded to explicitly acknowledge the potential for omitted variable bias, particularly due to the exclusion of variables such as education, marital status, and occupation. These variables may be endogenous or measured after the timing of first birth. The revised manuscript now clarifies that estimated associations for region and religion may partly reflect unmeasured confounding factors. This limitation is discussed in greater detail. Discussion – “Limitations and Potential Confounding”

Model Comparison and Diagnostics Following the editor’s recommendation, we added a comprehensive model comparison table summarizing model performance across estimation frameworks. The table reports AIC, BIC, and log-likelihood for classical models and LOO and expected log predictive density (ELPD) for Bayesian models. Convergence diagnostics including R-hat and effective sample size are also reported. We further included posterior predictive checks to evaluate model adequacy. The preferred model is now clearly identified and justified based on predictive performance and goodness-of-fit. Results section – new Model Comparison Table and Diagnostics subsection

Balance Between Mathematical Derivation and Substantive Interpretation We agree that the detailed derivations may be excessive for an applied demographic analysis. Accordingly, the extensive mathematical derivations of likelihood functions, score equations, and Hessian matrices have been moved to the Supplementary Material. The main text now focuses on a concise description of the models, estimation procedures, and interpretation of parameters. This change substantially improves readability and aligns the manuscript with conventions of applied statistical research. Methods section streamlined; derivations moved to Supplementary Appendix

Data Quality Clarification (minimum age at first birth) We verified the DHS dataset and confirmed that the minimum reported age at first birth is 10 years, which reflects rare but documented early childbirth cases rather than data entry errors. A clarification has been added to the Results section explaining that such values represent extremely early births and should be interpreted cautiously. Results section – Data description

Data Availability Statement The Data Availability Statement has been revised to comply with the journal’s policy. The dataset used in this study is publicly available through the Demographic and Health Surveys (DHS) Program upon registration and request. The revised manuscript now includes the appropriate statement and access link. Data Availability Statement revised

B. Responses to Reviewer Comments

Reviewer 3: Reviewer Comment Author response

Figure 3 appears to show only regional distribution of participants and duplicates information in Table 1. The figure does not illustrate maternal age at first birth patterns. We thank the reviewer for this observation. We agree that the original figure did not provide additional analytical insight. Therefore, Figure 3 has been removed to avoid redundancy, as the information is already presented in Table 1.

Reviewer 4: Reviewer Comment

The manuscript is applied rather than methodological. Mathematical derivations of estimation procedures should be reduced. We appreciate the reviewer’s suggestion. The manuscript has been revised to substantially reduce the mathematical derivations in the main text. The modeling framework is now presented in a concise form focusing on model specification and interpretation. Detailed derivations related to maximum likelihood estimation and Bayesian inference have been moved to Supplementary Material. This revision improves readability and better aligns the paper with applied statistical literature.

Attachment

Submitted filename: Response_to_Reviewers_auresp_3.docx

pone.0346504.s005.docx (15.4KB, docx)

Decision Letter 3

Yimam Getaneh Misganie

19 Mar 2026

Modelling Associated factors of Maternal Age at First Birth in Ethiopia: Gamma Regression Approach

PONE-D-25-07127R3

Dear Dr. Adimias,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact  and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact  and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact  and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact billing support....

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Yimam Getaneh Misganie, MSc, PhD, PhD

Academic Editor

PLOS One

Acceptance letter

Yimam Getaneh Misganie

PONE-D-25-07127R3

PLOS One

Dear Dr. Agegnehu,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS One. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

You will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Yimam Getaneh Misganie

Academic Editor

PLOS One

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Data. AFB data.

    (CSV)

    pone.0346504.s001.csv (325.5KB, csv)
    Attachment

    Submitted filename: PLOS One- Maternal Age Review.pdf

    pone.0346504.s002.pdf (102.6KB, pdf)
    Attachment

    Submitted filename: Response to Reviewers.docx

    pone.0346504.s003.docx (17.2KB, docx)
    Attachment

    Submitted filename: Response_to_Reviewers_auresp_2.docx

    pone.0346504.s004.docx (12.8KB, docx)
    Attachment

    Submitted filename: Response_to_Reviewers_auresp_3.docx

    pone.0346504.s005.docx (15.4KB, docx)

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting information files.


    Articles from PLOS One are provided here courtesy of PLOS

    RESOURCES