Abstract
This study develops a nearly unbiased estimator of the ratio of the contemporary effective mother size to the census size in a population, as a proxy of the ratio of contemporary effective size (or effective breeding size) to census size (Ne/N or Nb/N). The proposed estimator is based on both known mother–offspring (MO) and maternal‐sibling (MS) relationships observed within the same cohort, in which sampled individuals in the cohort probably share MO relationships with sampled mothers. The rationale is that the frequency of MO and MS pairs contains information regarding the contemporary effective mother size and the (mature) census size, respectively. Therefore, the estimator can be obtained only from genetic data. We also evaluate the performance of the estimator by running an individual‐based model. The results of this study provide the following: (a) parameter range for satisfying the unbiasedness, and (b) guidance for sample sizes to ensure the required accuracy and precision, especially when the order of the ratio is available. Furthermore, the results demonstrate the usefulness of a sibship assignment method for genetic monitoring, providing insights for interpreting environmental and/or anthropological factors fluctuating Ne/N (or Nb/N), especially in the context of conservation biology and wildlife management.
Keywords: close‐kin mark‐recapture, effective population size, overdispersed reproduction, sibship assignment
We developed a nearly unbiased estimator of the ratio of the contemporary effective mother size to the census size in a population, as a proxy of Ne/N. The proposed estimator is based on a known mother–offspring (MO) relationship and on a known maternal‐sibling (MS) relationship found within the same cohort, in which sampled individuals in the cohort potentially share MO relationships with sampled mothers.
![]()
1. INTRODUCTION
The estimation of the ratio of the contemporary effective population size to the census size (Ne/N) has attracted much research attention for providing information about a current population, especially in the context of conservation biology and wildlife management (Frankham, Bradshaw, & Brook, 2014; Palstra & Fraser, 2012). Small Ne/N demonstrates large variance in reproductive success (Akita, 2020; Wang, Santiago, & Caballero, 2016; Waples, 2016), resulting from the variance of reproductive potential (e.g., the big old fat fecund female fish hypothesis; Hixon, Johnson, & Sogard, 2014) or from the situation in which only some families successfully reproduce (referred to as the “Sweepstakes reproductive success” hypothesis, Hedgecock & Pudovkin, 2011). Moreover, if Ne/N is invariant across years, then Ne may behave like an index of N, and vice versa (Luikart, Ryman, Tallmon, Schwartz, & Allendorf, 2010). However, if Ne/N fluctuates across years, the trends can clarify the interpretation of environmental and/or anthropological factors, causing the variance of reproductive potential, family‐correlated survivorship, or fluctuating population dynamics. Besides, low precision and/or large bias for estimating Ne/N may lead to a wrong interpretation of the population (Tallmon, Waples, Gregovich, & Schwartz, 2012).
The estimation of Ne/N has been performed by utilizing the estimated values of contemporary effective population size (Ne) and census size (N), unless complete pedigree data and/or full census data are available. Additionally, there are numerous methods for estimating Ne from genetic markers (Wang et al., 2016, and the references contained therein). There are also numerous methods for estimating N, such as a mark‐recapture method or population dynamics modeling with survey data (e.g., Kéry & Schaub 2011; Methot & Wetzel, 2013; Quinn & Deriso, 1999; Seber, 2002). It is known that there are large variations in both estimators; thus, their combination (i.e., the estimator of Ne/N) also shows large variation (Marandel et al., 2018; Palstra & Fraser, 2012). There is currently a little theoretical foundation for the estimator of Ne/N, indicating no guidance for a sample size to ensure the required accuracy and precision.
Close‐kin mark‐recapture (CKMR) is a recently developed method for estimating N that utilizes the information about kinship in a sample. This was possible owing to the recent advances in genetic methods for kinship determination (Bravington, Grewe, & Davies, 2016; Bravington, Skaug, & Anderson, 2016; Hillary et al., 2018; Skaug, 2017), although similar methods have been proposed in the beginning of the 21st century (Nielsen, Mattila, Clapham, & Palsbøll, 2001; Pearse, Eckerman, Janzen, & Avise, 2001; Skaug, 2001). Besides, the rationale is that the presence of a kinship pair in the sample is analogous to the recapture of a marked individual in mark‐recapture. Kinship pairs in the sample are less likely to be observed in larger populations; thus, the number of kinship pairs may reflect N. While the original CKMR is designed to estimate adult abundance (i.e., N), the monitoring data for CKMR also produce the estimator of Ne by detecting half‐sibling (HS) pairs within the same cohort (Akita, 2020). This kinship‐oriented estimation of Ne was presented in the context of the sibship assignment method (Wang, 2009) and is expected to provide a much more accurate estimator as kinship determination becomes more accurate.
In this study, we propose a new method for estimating the ratio of contemporary effective mother size to the census size ( ) in a population, as a proxy of . Assuming that kinships are genetically detected without any error, this method is based on the numbers of maternal‐sibling (MS) and mother–offspring (MO) pairs in a sample. Sampling is completed at a single breeding time; sampling offspring within the same cohort and mothers probably shares MO relationship with sampled offspring. Our model provides a nearly unbiased estimator of that explicitly incorporates two types of overdispersed reproduction (i.e., parental and nonparental variations; Akita, 2020), making it possible to target a species that shows iteroparity (i.e., multiple reproductive cycles during the lifetime) and/or sweepstakes reproductive success. This estimator applying an iteroparous species corresponds to the estimator of the ratio of contemporary effective breeding mother size to the census size, . First, we explain the modeling assumption and sampling scheme. Then, we analytically determine (nearly) the unbiased estimators of , , and , which are based on the numbers of MS and/or MO pairs. Finally, by running an individual‐based model, we investigate the performance of the estimator and provide a guide for a sample size. It is noteworthy that our modeling framework can be applied to diverse animal species. However, the description of the model focuses on fish species, which are presently the best candidate target of our proposed method.
2. THEORY
In this section, we present the theoretical foundation for estimating . The estimator is based on previous studies that provide the estimator of (Akita, 2020) and (Akita, 2018) The main contribution of this paper is formulation of the estimator of , presented in Equation 9. The main symbols used in the current paper are summarized in Table 1.
Table 1.
The list of mathematical symbols employed in the main text
| nM | Sample number of mother | |
| nO | Sample number of offspring | |
| N m | Number of mothers in the population when sampled offspring are born | |
| N e,m | Effective number of mothers in the population | |
| N b,m | Effective breeding number of mothers in the population | |
|
|
Overdispersion parameter under negative binomial reproduction | |
|
|
Expected number of surviving offspring of mother at sampling | |
|
|
Frequency of for all mothers. | |
|
|
Combined effect of deviation from the Poisson ( ) | |
|
|
Number of surviving offspring born to mother | |
|
|
Number of mother–offspring pairs observed in samples | |
|
|
Number of maternal‐sibling pairs observed in samples | |
|
|
Probability that a randomly selected pair (mother and offspring) shares a mother–offspring relationship | |
|
|
Probability that a randomly selected pair (two offspring) shares a maternal‐sibling relationship | |
|
|
Bias of |
2.1. Hypothetical population
We suppose that there is a hypothetical population comprising mothers and there is also no population subdivision or spatial structure. In this study, a mature female is called a mother even if she does not produce offspring. For mathematical tractability, we assume that only one spawning ground exists in which the mothers remain for the entire spawning season. Following (Akita, 2020), our modeling framework employs two types of overdispersed reproduction: parental and nonparental variations. Thus, the former indicates a variation caused by the mother's covariates, such as age, weight, and residence time on the spawning ground, while the latter indicates a variation caused by nonrandom stochastic events during a series of reproductive episodes, which are independent of the mother's covariates, such as family‐correlated survivorship or the mating behavior effects (e.g., competition for males/females and correlation between mating opportunities of the mother and the number of her offspring). Figure 1 illustrates a schematic diagram of the effects of parental and nonparental variations exemplified by age‐dependent reproduction and family‐correlated survival on kinship relationships in a population. Detailed definitions of parental and nonparental variations are stated in (Akita, 2020). Appendix 1 provides the theoretical foundation of both parental and nonparental variations in reproduction.
Figure 1.

Example of relationships between mothers and their offspring number. The open, gray, and filled circles represent mothers, their eggs, and their offspring, respectively. The area of an open circle indicates the degree of reproductive potential of each mother (i.e., ). The dotted and thin arrows denote mother–egg and egg–offspring relationships, respectively. The symbol × denotes a failure to survive at sampling. Sampled individuals are denoted with squares
2.2. Sampling
To obtain the estimator of , we utilize the number of MS and MO pairs observed in a sample. For the mathematical tractability, only one reproductive season is targeted for sampling. Thus, the sampling method does not affect our modeling framework whether it is invasive or noninvasive. In the population, mothers are randomly sampled immediately after the end of the reproductive season. Additionally, in the population, offspring are also randomly sampled. The numbers of MS and MO pairs are determined by pairwise comparison of all the sample individuals ( and comparisons, respectively). As depicted in Figure 1, six offspring and three mothers are sampled and two MS and three MO pairs are observed. In our modeling framework, if an MS pair also shares a paternal‐sibling (PS) relationship, we count it as an MS pair and ignore the existing full‐sibling (FS) relationship.
2.3. Linking Ne/N m to kinship relationships
In this subsection, we provide the theoretical foundation for understanding how is associated with kinship relationships in a population, based on work presented in previous studies. The rationale is that the observed number of MS and MO pairs has information about N e,m and N m, respectively, as noted below.
First, we consider the relationship between the number of MS pairs and . (Akita, 2020) defined the contemporary effective mother size as follows:
| (1) |
where denotes the probability that two offspring share an MS relationship with an arbitrary mother. This definition is related to the inbreeding effective population size (Nordborg & Krone, 2002; Wang, 2009). When sampling from a single cohort in a population with overlapping generations, the effective mother size in our definition corresponds to the effective breeding mother size per breeding‐time unit (e.g., year), which produces the single cohort and is denoted by (Waples, 1991). Hereafter, the description of the model focuses on species with discrete generations; thus, we use to denote the contemporary effective mother size, although is the appropriate notation in the left‐hand side of Equation 1 when the target species is iteroparous with overlapping generations, as exemplified in Appendix 2.
Given the total mother number and the degree of overdisepered reproduction in the population, (Akita, 2020) derived , which is approximately given by
| (2) |
where indicates combined effect of both parental and nonparental variations and satisfies . Without either parental or nonparental variation (i.e., , see details in Appendix 3), converges to , corresponding to the Poisson variance in reproduction for all mothers. The mathematical description of is briefly summarized in Appendix 3.
Next, we consider the relationship between the number of MO pairs and . It is natural to consider that the probability of a randomly sampled mother and her offspring sharing an MO relationship (denoted by ) can be associated with the total mother number, given by
| (3) |
It is noteworthy that is independent of the distribution of the offspring number (Akita, 2018).
Together with Equations 1 and 3, we finally obtain the ratio of the effective mother size to census size as follows:
| (4) |
indicating that is associated with kinship relationships (i.e., MS and MO) in a population. In other words, when and is estimated from observed MS and MO pairs, respectively, the ratio can also be estimated. Meanwhile, (Akita, 2020) obtained an alternative formulation of the ratio using Equations 1 and 2:
| (5) |
where is assumed for approximation. This theoretical connection indicates that estimating corresponds to estimating .
2.4. Estimator of N e,m/N
This subsection proposes the estimator of as follows:
| (6) |
A “hat” denotes the estimator of a variable in this study. The requisite condition that satisfies Equation 6 is independent of and . This property will be shown later in this subsection. Akita (2020) derived the nearly unbiased estimator of , which is given by
| (7) |
where denotes the observed number of MS pairs in a sample. This estimator works well unless is very small, which is based on the idea that the observation of approximately provides a linear estimator of .
Next, we consider to estimate by estimating . By definition of , we can set its estimator by , where denotes the observed number of MO pairs in a sample. Thus, using Equation 3, the estimator can be determined as follows:
| (8) |
Equation 8 provides a linear estimator of , thus this estimator also works well. Meanwhile, the inverse of the right‐hand side in Equation 8 is a standard (moment) estimator of in the context of CKMR (Bravington, Skaug, et al., 2016).
Finally, substituting (Equation 7) and (Equation 8) into Equation 6, we obtain the estimator of as follows:
| (9) |
Let both and be given. We numerically confirmed that there is no correlation between and (results are not shown). To intuitively explain this independency, we consider three mothers ( ) and their offspring, and assume that and . When the three offspring born to the first mother are sampled (i.e., ), the expected number of MO relationship is one ( ). Meanwhile, when an offspring is sampled from each mother's offspring (i.e., ), the expected number of MO relationship is also one ( ). Therefore, we conclude that both and are independent, and is expected to work well (see details in the Section 3).
The bias of is defined by , which is approximately given by (see Appendix 5 for the derivation).
| (10) |
It is noteworthy that is downwardly biased, especially when is very small. However, this bias may be ignored for a wide range of parameters (see details in the Section 3). Theoretically, asymptotically converges to zero as increases, making a nearly unbiased estimator. Moreover, as demonstrated in the Section 3, it is observed that an extremely skewed reproduction breaks down the unbiasedness (e.g., in the case that c = 20 and 100 in the results).
2.5. Individual‐based model
We developed an individual‐based model that tracks kinship relationships to evaluate the estimator's performance (e.g., ). The population structure was assumed to be identical to that in the development of the estimators. In addition, the population comprised mothers and their offspring, and it was assumed to follow a Poisson or negative binomial reproduction (the degree of skewness due to the nonparental variation is controlled by a parameter ; see Appendix 2 for details). The expected number of the surviving offspring of a mother (denoted by ) followed the density distribution (which is involved to the parental variation; see Appendix 2 for details). We calculated overdispersion parameter ( ) from and , as well as confirmed numerically that the value of gives the same statistics of the estimators even if the combination of and differs (results are not shown). Therefore, each offspring retained the mother's ID, making it possible to trace an MS and MO relationship.
Let a parameter set ( , , , , and parameters that determine ) be given. We simulated a population history in which mothers generated offspring; this process was repeated 100 times. The sampling process for each history was repeated 10,000 times, acquiring 1,000,000 data points that were utilized to construct the distribution of the estimators for each parameter set. Furthermore, true value of was calculated from and (Equations 1 and 2).
3. RESULTS
We numerically evaluated the performance of for the case in which the number of mothers, , and the combined effect of deviation from the Poisson, , were unknown. We changed the parameter values for ( and ) and (1, 10, 20, and 100). In addition, based on a given parameter set ( and ), we mainly addressed the number of samples ( and ) required to obtain adequate accuracy and precision. In this study, we evaluated the performance of for specific ranges of the sample sizes (50–200 when , and 200–1,000 when ). Meanwhile, other estimators (i.e., and ) are also evaluated and provided in Supporting Information.
First, we evaluated the accuracy of estimators based on their relative bias calculated by applying the individual‐based model, which is defined as follows: “averaged estimator ‐ true value/true value.” For a given combination of and , the value of the relative bias of is represented on a violin plot for limiting cases where the sample number of mothers and offspring is same (i.e., ), as depicted in Figure 2. Meanwhile, detailed results of the relative bias are represented on a heatmap as a function of and (see Figure S1 in Supporting Information). For most of the investigated parameter sets, we observed that their relative bias is less than 10%. As expected, the relative bias is not affected by since is exactly an unbiased estimator of (see Equation A11 in Appendix 4 and also Figure S3 in Supporting Information). Meanwhile, is downwardly biased when is relatively small to true (e.g., see Figure S2 for in Supporting Information), as presented in (Akita, 2020); thus, is downwardly biased. Contrary to the theoretical prediction for the direction of the bias (Equation 10), relatively strong overdispersion results in an upwardly bias for when is relatively small and is relatively large (e.g., and in Figure 2a). This inconsistency may be caused by the breakdown of the approximation for deriving (equation S14 in Akita, 2020). Thus, as described in Equation 2, extremely large results in a large variance of offspring number, generating a situation in which the behavior of random variable far deviates from the binomial distribution.
Figure 2.

Violin plots showing the distribution of relative bias in our estimator (Equation 9) for various values of and sample size. Filled circles represent the mean values. Sample sizes for mothers and offspring are identical (i.e., ) and indicated in the legend. For the demonstration purposes, the distribution is truncated, although the mean values are calculated including the truncated values. (a) , (b)
Next, we evaluated the precision of estimators based on their coefficient of variation. As demonstrated in Figure S4 in Supporting Information, the value of the coefficient of variation of is also represented on a heatmap as a function of and ; meanwhile, the violin plot (Figure 2) visually provides the degree of precision. For the investigated parameter sets, the degree of the coefficient of variation strongly depends on the sample sizes. As shown in Figures S5 and S6 in Supporting Information, the dependency results from the combined effects of variation of both and . As increases, it is noteworthy that the parameter space of sample sizes demonstrating large variation of (e.g., ) expands; however, when is small (e.g., ), relatively small results in large variation of because of a relatively large .
4. DISCUSSION
We theoretically developed a nearly unbiased estimator of the ratio of contemporary effective mother size to the census size ( ) in a population (Equation 9). The proposed estimator is based on known MO relationship and MS relationships observed within the same cohort, in which sampled individuals in the cohort probably share MO relationships with sampled mothers (Figure 1). The performance of the estimator (accuracy and precision) was quantitatively evaluated by running an individual‐based model (Figure 2; see also Figures S1–S6). Meanwhile, the bias is analytically provided (Equation 10). Our modeling framework utilizes two types of reproductive variations (Akita, 2020): variance of the average offspring number per mother (parental variation, denoted by ), and variance of the offspring number across mothers with the same reproductive potential (nonparental variation, denoted by ). Additionally, these two effects result in a skewed distribution of offspring number and are summarized into one parameter ( ) in the framework. Thus, our estimator can be calculated from sample sizes of mother and offspring ( and , respectively) and the observed numbers of MS and MO pairs ( and , respectively), and it does not require other parameters. The rationale for this is the following: (a) the frequency of MS and MO pairs contains information about and Nm, respectively; (b) the estimators of and are independently determined based on a pedigree structure in the population and sample sizes, generating the estimator of by multiplying both estimators ( and ). In this study, although is considered as a proxy of , our theoretical results can easily be extended to the estimator of the ratio of contemporary effective father size to the census size if fathers are also sampled. The comparison of both ratios could clarify the underlying processes that differentiate between the sexes in the context of reproductive ecology.
The novelty of this study is that can be obtained only from the genetic data, and there are numerous advantages in using the proposed estimator instead of separately estimating (via genetical method) and (via nongenetical method). First, sampling and analyzing designs have become substantially simplified. Requirements for the proposed estimator are sampling of mothers and (potentially) their offspring in an appropriate time, and the extraction of their DNA that satisfies an adequate number of markers for kinship detection. In addition, both MO and MS pairs can be detected by a applying unified framework of genetic analyzes (there are many algorithms to detect kinship pairs from single nucleotide polymorphisms (SNPs) or short tandem repeats (STRs)), although an MS pair involves many more DNA markers (e.g., several thousands of SNPs are required for detection) than an MO pair (e.g., several hundreds of SNPs are required for detection). Second, our theoretical results guide sample sizes (nM and nO) to ensure the required accuracy and precision, especially if the order of the number of effective mothers is approximately known. This is due to the simple formulation of the estimator determined only by the observed values (Equation 9). Third, the proposed estimator directly reflects the amounts of and at the same timing (i.e., immediately after the end of the reproductive season), leading to a clear interpretation of the results, especially for genetic monitoring. For example, when the strong cohort is added to the spawning population in the beginning of the year, the estimator of Ne without reflecting this addition may results in an inappropriate estimation of (details of the temporal scale relevant to estimated Ne for each method were discussed in Wang et al., 2016).
Our modeling framework is presented by combining the context of the sibship assignment method (for estimating ) and the CKMR method (for estimating ), which defines a kinship‐oriented estimation of effective/census population size. Improvements to these methods directly contribute to the estimation of . Furthermore, the original theory of the sibship assignment method requires HS and FS pairs but does not require a distinction between the MS and PS pairs (Wang, 2009). This is a significant advantage due to the difficulty of the distinction from genetic data. However, the limitation of using MS or PS pair enables us to employ a nearly unbiased estimator of for particular sex (Akita, 2020), which greatly improves the accuracy of the estimation of the in this study and thus that of (see figure E3 in Appendix 5). It is noteworthy that the estimator of is given by
| (11) |
where and denotes the sample size of the parent and the observed number of parent–offspring (PO) pairs in a sample, respectively (Bravington, Skaug, et al., 2016). The development of the unbiased estimator of without a distinction between MS and PS pairs that could provide an unbiased estimator of coupled with Equation 11, is a study for the future. Furthermore, using cross‐cohort HS pairs, the CKMR method also provides the estimator of (Bravington, Skaug, et al., 2016) that does not require the sampling of the parent, which probably provides the estimator of only from unmatured samples. This perspective of the study will also be conducted in the future.
Finally, we note the advantage of partitioning variance in reproductive success into two components. As denoted in Equation A8 in Appendix 3, the combined effect of parental and nonparental variations is given by
| (12) |
Meanwhile, as denoted in Equation 5, the estimator of N e,m/Nm provides the information for the left‐hand side of Equation 12. Thus, when we obtain parental variation information from the life‐history table or from other species, the estimator of N e,m/Nm can also estimate the degree of nonparental variation. This procedure provides an insight into sweepstake reproduction or family‐correlated sampling of offspring (i.e., nonrandom sampling), although the theoretical formulation and its evaluation remain a task for future research. Alternatively, Waples et al. (2018) developed a genetical method for estimating and via estimating the degree of nonparental variation from fecundity data in southern bluefin tuna.
CONFLICT OF INTEREST
The author declares no conflict of interest.
AUTHOR CONTRIBUTION
Tetsuya Akita: Conceptualization (lead); Formal analysis (lead); Funding acquisition (lead); Investigation (lead); Methodology (lead); Project administration (lead); Resources (lead); Software (lead); Supervision (lead); Validation (lead); Visualization (lead); Writing‐original draft (lead); Writing‐review & editing (lead).
Supporting information
Figures S1‐6
ACKNOWLEDGMENTS
The author thanks R. Nakamichi for fruitful discussions. The author also thanks Robin Waples and an anonymous reviewer for constructive feedback that substantially improved the quality of this manuscript. This work was supported by JSPS KAKENHI Grant Number 19K06862.
APPENDIX 1.
Distribution of the number of surviving offspring per mother and its variance
Let denote the total number of surviving offspring from a mother ( ) at time of sampling. Following (Akita, 2020) and giving the expected number of the surviving offspring per mother at time of sampling, (>0), is assumed to follow a negative binomial distribution through a conventional parametrization:
| (A1) |
where (>0) denotes the overdispersion parameter describing the degree of nonparental variation. In the current work, is assumed to be constant across mothers, whereas the expected number of surviving offspring ( ) varies across mothers. The mean and variance of this distribution are denoted by and , respectively. In the limit of infinite , this distribution becomes a Poisson distribution, which is given by . We assume to be independent and identically distributed with the density function , producing a parental variation. The shape of the density function is often complex, but may be described by information from the population, for example, the mother's weight composition in the population. The specific form of is given in Appendix 2 and is used for running an individual‐based model.
The variance of the offspring number can be given by
| (A2) |
APPENDIX 2.
Probability density function and its moment of offspring number per mother
As stated in the main text, our modeling framework does not require the specific form of the density function of offspring number per mother (denoted by ); it only requires the ratio of the second moment to the squared first moment ( ) instead. However, the specific form is required for evaluating the theoretical results (i.e., calculating the moment or running the individual‐based model). Herein, we model an age‐structured fish population that serves as a representative example, demonstrating both parental and nonparental variations. The following contents are almost the same as those of (Akita, 2020) except for the parameter values that produce the setting c = 20 and 100.
Suppose that the mean fecundity of a mother depends on her age. Let denote the mean fecundity, which is a function of age (denoted by ). The moment can be defined as , where is the frequency of mature mothers at a given age, and a max denotes the maximum age. Thus, we can numerically obtain the moment by applying and .
For marine species with a type‐III survivorship curve, it is generally assumed that individual fecundity is proportional to weight. By utilizing the von Bertalanffy growth equation for body weight, is explicitly defined as a function of age as follows:
| (A3) |
where , , and are conventionally used parameters in the von Bertalanffy equation, and they denote the growth rate, the adjuster of the equation for the initial size of the animal, and the allometric growth parameter, respectively. To obtain a specific value of , a coefficient value of 10 multiplied by the right‐hand side of Equation A3 was used when running the individual‐based model.
The frequency of mature mothers at a given age can be given as the following:
| (A4) |
satisfying , where and denote the frequency and maturity at a given age, respectively. Although is affected by historical population dynamics and age‐dependent survival, for simplicity, the mortality rate is assumed to be constant (i.e., age independent):
| (A5) |
where S denotes a survival probability. The maturity at age (Q(a)) is assumed to be a knife‐edge function, which is given by
| (A6) |
where a mat denotes the mature age.
To calculate , the required parameter set is . In this study, for the purpose of representation, we fixed the values of several parameters as follows: , , , and . In addition, we selected parameter value (defined in Equation A8 in the main text) to be 1, 10, 20, and 100 for comparison with the results in the main text that are derived from the parameter set , , , and , respectively.
Finally, we provide specific forms of ; thus, when and are obtained, is given by
| (A7) |
APPENDIX 3.
Combined effect of parental and nonparental variations
Following (Akita, 2020), combined effect of parental and nonparental variations can be partitioned into the two components, such as:
| (A8) |
The first and second parentheses give the effect of nonparental variation and parental variation, respectively. When is constant across mothers, equals and converges to ; meanwhile, , converges to . Without either parental or nonparental variation, converges to 1 and converges to . In the main text, we refer to “overdispersion” as the skewed distribution of the offspring number resulting from this combined effect.
Using Equations A2 and A8, the variance of the offspring number can be expressed as
| (A9) |
suggesting that the variance substantially increases with .
APPENDIX 4.
Derivation of the bias of
For calculation of the bias of , we require an expectation of the estimator given by
| (A10) |
As stated in the main text, both and are independent. Thus, the first term in the right‐hand side of Equation A10 can be ignored. The expectation of is given by
| (A11) |
From the first to the second line of Equation A11, we applied the relationship and Equation 3. Equation A11 indicates that is the unbiased estimator. The expectation of is given by
| (A12) |
which is illustrated in Appendix 4 of (Akita, 2020). Together with these relationships, we can obtain the bias of described in Equation 10.
APPENDIX 5.
Individual‐based model with application of Wang's estimator
Our modeling framework estimates the ratio of contemporary effective mother size to census size (denoted by ), as a proxy for the ratio of contemporary effective size to census size (denoted by ). To compare the performance of our method with that of other methods for directly estimating , we propose the estimator of , given by
| (A13) |
and indicate the sample numbers of parents and offspring, respectively, and , , and indicate the observed number of half‐sibling, full‐sibling, and parent–offspring pairs, respectively. is based on (Wang, 2009) assuming random mating and is defined in Equation 11 in the main text.
We evaluated the estimator's performance on data simulated by running an individual‐based model under the Wright–Fisher process for a diploid population. In the current comparison, we did not consider the case of overdispersion (i.e., ). Sex ratio was fixed to 0.5 in both whole and sampled populations. Each parent retained the ID of its offspring, making it possible to trace HS, FS, and PO relationships. Given a parameter set ( , , and ), we backwardly simulated a population history in which mother–offspring and father–offspring relationships were randomly specified from offspring; this process was repeated 10,000 times, acquiring 10,000 data points that were used to construct the distribution of for each parameter set. If neither HS nor FS pairs were found in a sample, we did not include that trial when constructing the distribution. For ease of comparison, we used the same sample size and population structure for both methods.
Figure A1 illustrates the distribution of the relative bias of for limiting cases where parent and offspring sample numbers are identical (i.e., ). Our results indicate that the estimator is upwardly biased, particularly when sample size is small. Alternatively, our estimator shows a broader region such that unbiasedness is approximately achieved. For example, when and in N = 2,000 (i.e., Nm = 1,000), the bias of is clearly observed (Figure E1a); meanwhile, does not produce a bias in these conditions (Figure 2a).
Figure A1.

Violin plots showing the distribution of relative bias of (denoted in Equation A13) for various sample sizes. Filled circles represent the mean values. The sample numbers of parents and offspring are identical (i.e., ) and indicated in the legend. For demonstration purposes, the distribution is truncated, although the mean values are calculated including the truncated values. (a) , (b)
Akita T. Nearly unbiased estimator of contemporary Ne/N based on kinship relationships. Ecol Evol. 2020;10:10343–10352. 10.1002/ece3.6421
DATA AVAILABILITY STATEMENT
No datasets were generated or analyzed in this study.
REFERENCES
- Akita, T. (2018). Statistical test for detecting overdispersion in offspring number based on kinship information. Population Ecology, 60(4), 297–308. 10.1007/s10144-018-0629-2 [DOI] [Google Scholar]
- Akita, T. (2020). Nearly unbiased estimator of contemporary effective mother size using within‐cohort maternal sibling pairs incorporating parental and nonparental reproductive variations. Heredity, 124, 299–312. 10.1038/s41437-019-0271-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bravington, M. V. , Grewe, P. M. , & Davies, C. R. (2016). Absolute abundance of southern bluefin tuna estimated by close‐kin mark‐recapture. Nature Communications, 7, 13162 10.1038/ncomms13162 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bravington, M. V. , Skaug, H. J. , & Anderson, E. C. (2016). Close‐Kin Mark‐Recapture Close‐kin mark‐recapture. Statistical Science, 312, 259–274. [Google Scholar]
- Frankham, R. , Bradshaw, C. J. , & Brook, B. W. (2014). Genetics in conservation management: Revised recommendations for the 50/500 rules, Red List criteria and population viability analyses. Biological Conservation, 170, 56–63. 10.1016/j.biocon.2013.12.036 [DOI] [Google Scholar]
- Hedgecock, D. , & Pudovkin, A. I. (2011). Sweepstakes reproductive success in highly fecund marine fish and shellfish: A review and commentary. Bulletin of Marine Science, 87(4), 971–1002. 10.5343/bms.2010.1051 [DOI] [Google Scholar]
- Hillary, R. M. , Bravington, M. V. , Patterson, T. A. , Grewe, P. , Bradford, R. , Feutry, P. , … Bruce, B. D. (2018). Genetic relatedness reveals total population size of white sharks in eastern Australia and New Zealand. Scientific Reports, 8(1), 2661 10.1038/s41598-018-20593-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hixon, M. A. , Johnson, D. W. , & Sogard, S. M. (2014). BOFFFFs: On the importance of conserving old‐growth age structure in fishery populations. ICES Journal of Marine Science, 718, 2171–2185. 10.1093/icesjms/fst200 [DOI] [Google Scholar]
- Kéry, M. , & Schaub, M. (2011). Bayesian population analysis using WinBUGS: A hierarchical perspective. Academic Press. [Google Scholar]
- Luikart, G. , Ryman, N. , Tallmon, D. A. , Schwartz, M. K. , & Allendorf, F. W. (2010). Estimation of census and effective population sizes: The increasing usefulness of DNA‐based approaches. Conservation Genetics, 112, 355–373. [Google Scholar]
- Marandel, F. , Lorance, P. , Berthelé, O. , Trenkel, V. M. , Waples, R. S. , & Lamy, J. B. (2018). Estimating effective population size of large marine populations, is it feasible? Fish and Fisheries, 201, 189–198. 10.1111/faf.12338 [DOI] [Google Scholar]
- Methot, R. D. , & Wetzel, C. R. (2013). Stock synthesis: A biological and statistical framework for fish stock assessment and fishery management. Fisheries Research, 142, 86–99. [Google Scholar]
- Nielsen, R. , Mattila, D. K. , Clapham, P. J. , & Palsbøll, P. J. (2001). Statistical approaches to paternity analysis in natural populations and applications to the North Atlantic humpback whale. Genetics, 157(4), 1673–1682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nordborg, M. , & Krone, S. M. 2002. Separation of time scales and convergence to the coalescent in structured populations Modern Developments in Theoretical Population Genetics: The Legacy of Gustave Malécot (pp. 194–232). Oxford University Press, Oxford. [Google Scholar]
- Palstra, F. P. , & Fraser, D. J. (2012). Effective/census population size ratio estimation: A compendium and appraisal. Ecology and Evolution, 29, 2357–2365. 10.1002/ece3.329 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pearse, D. , Eckerman, C. , Janzen, F. , & Avise, J. (2001). A genetic analogue of mark‐recapture'methods for estimating population size: An approach based on molecular parentage assessments. Molecular Ecology, 10(11), 2711–2718. [DOI] [PubMed] [Google Scholar]
- Quinn, T. J. , & Deriso, R. B. (1999). Quantitative fish dynamics Quantitative fish dynamics. Oxford University Press. [Google Scholar]
- Seber, G. A. F. (2002). The estimation of animal abundance and related parameters. London: Griffin. [Google Scholar]
- Skaug, H. J. (2001). Allele‐sharing methods for estimation of population size. Biometrics, 573, 750–756. [DOI] [PubMed] [Google Scholar]
- Skaug, H. J. (2017). The parent–offspring probability when sampling age‐structured populations. Theoretical Population Biology, 118, 20–26. [DOI] [PubMed] [Google Scholar]
- Tallmon, D. A. , Waples, R. S. , Gregovich, D. , & Schwartz, M. K. (2012). Detecting population recovery using gametic disequilibrium‐based effective population size estimates. Conservation Genetics Resources, 44, 987–989. 10.1007/s12686-012-9689-3 [DOI] [Google Scholar]
- Wang, J. (2009). A new method for estimating effective population sizes from a single sample of multilocus genotypes. Molecular Ecology, 18(10), 2148–2164. [DOI] [PubMed] [Google Scholar]
- Wang, J. , Santiago, E. , & Caballero, A. (2016). Prediction and estimation of effective population size. Heredity (Edinb), 117(4), 193–206. 10.1038/hdy.2016.43 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waples, R. S. (1991). Genetic methods for estimating the effective size of cetacean populations. Report of the International Whaling Commission (Special Issue 13) Report of the international whaling commission (special issue 13) (279–300).
- Waples, R. S. (2016). Tiny estimates of the N e/N ratio in marine fishes: Are they real? Tiny estimates of the N e/N ratio in marine fishes: Are they real? Journal of Fish Biology, 89(6), 2479–2504. 10.1111/jfb.13143 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figures S1‐6
Data Availability Statement
No datasets were generated or analyzed in this study.
