Skip to main content
Genes logoLink to Genes
. 2022 Dec 23;14(1):40. doi: 10.3390/genes14010040

Evaluating DNA Mixtures with Contributors from Different Populations Using Probabilistic Genotyping

Maarten Kruijver 1,*, Hannah Kelly 1, Jo-Anne Bright 1, John Buckleton 1,2
PMCID: PMC9858364  PMID: 36672780

Abstract

It is common practice to evaluate DNA profiling evidence with likelihood ratios using allele frequency estimates from a relevant population. When multiple populations may be relevant, a choice has to be made. For two-person mixtures without dropout, it has been reported that conservative estimates can be obtained by using the Person of Interest’s population with a θ value of 3%. More accurate estimates can be obtained by explicitly modelling different populations. One option is to present a minimum likelihood ratio across populations; another is to present a stratified likelihood ratio that incorporates a weighted average of likelihoods across multiple populations. For high template single source profiles, any difference between the methods is immaterial as far as conclusions are concerned. We revisit this issue in the context of potentially low-level and mixed samples where the contributors may originate from different populations and study likelihood ratio behaviour. We first present a method for evaluating DNA profiling evidence using probabilistic genotyping when the contributors may originate from different ethnic groups. In this method, likelihoods are weighted across a prior distribution that assigns sample donors to ethnic groups. The prior distribution can be constrained such that all sample donors are from the same ethnic group, or all permutations can be considered. A simulation study is used to determine the effect of either assumption on the likelihood ratio. The likelihood ratios are also compared to the minimum likelihood ratio across populations. We demonstrate that the common practise of taking a minimum likelihood ratio across populations is not always conservative when FST=0. Population stratification methods may also be non-conservative in some cases. When FST>0 is used in the likelihood ratio calculations, as is recommended, all compared approaches become conservative on average to varying degrees.

Keywords: probabilistic genotyping, DNA mixtures, population stratification, likelihood ratio

1. Introduction

DNA mixtures are routinely used to identify the donors of biological samples obtained from crime scenes [1,2,3]. If a Person of Interest (POI) is identified, a likelihood ratio (LR) statistic can be used to evaluate the evidence. Typically, when a POI is compared to a mixture, the standard hypothesis pair

H1:POIandN1unrelatedpersonscontributedtothemixture,H2:Nunrelatedpersonscontributedtothemixture, (1)

is used to evaluate support for the POI having contributed to the mixture. A population genetic model is needed to assign a likelihood ratio for a pair of hypotheses. It is common practice among users of probabilistic genotyping software to employ the Balding-Nichols model [4] with a parameter FST>0 (also called θ) chosen sufficiently large to obtain conservative estimates of the evidential weight. For FST=0 the Balding-Nichols model reduces to a population genetic model assuming Hardy Weinberg Equilibrium. If Linkage Equilibrium is also assumed, this model is also known as the Product Rule model [5]. Either way, allele frequencies obtained from a population study have to be used to assign allele probabilities in the likelihood ratio method. It is well known that a population study based on a representative sample from the relevant population needs to be used in the estimate of allele probabilities to avoid overstating the rarity of the observed variant alleles and consequently reporting an anti-conservative assessment of the weight of evidence [6]. Because match probabilities for unrelated persons not involved in a case for high quality profiles obtained using modern multiplexes are infinitesimally small regardless of the specific statistical evaluation method or population database [7], the problem of assigning the weight of evidence correctly may appear less important than before. On the other hand, the increased sensitivity of modern DNA profiling technology has dramatically increased the number of low-level and mixed samples that are routinely analysed which reinforces the need for accurate assessment of the weight of evidence because these samples are less informative [8]. Moreover, the use of expanded marker sets has increased the potential for population differentiation based on STRs [9] adding to the importance of using representative population databases and unbiased statistical methods.

In many jurisdictions, there are multiple populations that are potentially relevant when evaluating DNA evidence. For instance, the United States National Institute of Standards and Technology (NIST) laboratory has published allele frequency estimates for four population groups (African American, Caucasian, Hispanic and Asian) [10]. A more comprehensive set of estimates by the United States Federal Bureau of Investigation (FBI) [11] includes African Americans, Caucasians, Southwest Hispanics, Bahamians, Jamaicans, Trinidadians, Filipinos, and Chamorros. Depending on the geographical location and other case circumstances, more than one population may be relevant when evaluating the evidence in a particular case. For criminal casework, a pragmatic solution to the problem of choosing a relevant population could be to evaluate the likelihood ratio several times using different population parameters and reporting the minimum likelihood ratio statistic across the populations as a conservative estimate of the weight of evidence. This is established practice in some forensic laboratories. Further, this approach is one of the two options recommended by the SWGDAM Ad Hoc Working Group on Genotyping Results Reported as Likelihood Ratios [12] and has been studied in the context of relationship testing [13] but not in the context of DNA mixture interpretation. Note that this approach does not take into account the possibility that different donors originate from different populations and may not be conservative if this is the case. Alternatively, the working group states: “a likelihood ratio that combines information from the different population groups (e.g., weighted averages of the likelihoods) may be used in lieu of the single lowest value”. The working group agrees with the DNA Commission of the International Society for Forensic Genetics (ISFG), who write that “methods for stratifying the multiple evidential weights that have been obtained using the separate ethnic databases into a single value” are a “very elegant solution” [14]. In practice, such a method is likely to be a population stratified likelihood ratio that, in one way or another, models the different populations with some prior probabilities in the likelihood calculations. The prior probabilities can be based on census proportions or perhaps on case-specific background information. We are aware of this practice being adopted in some labs in Australasia and the United States of America.

Despite the apparent consensus that the use of population stratified likelihood ratios would be beneficial, little is published about how such likelihood ratios should be computed exactly and their behaviour with regards to different modelling choices appears to be poorly understood. For single source profiles, it has been shown that a population stratified likelihood ratio can be obtained from a weighted harmonic mean of the likelihood ratios for individual populations [15]. Specifically, if πe determines the probability that the donor of a sample originates from population e and LRe is calculated for population e, then a stratified likelihood ratio can be computed as:

1LR=e1LReπe.

When the DNA profile is mixed, a population stratified likelihood ratio can in general not be obtained from the separate likelihood ratios assuming different populations [15]. A further complication is that the different contributors to a mixture may originate from different populations. This can be taken into account in the likelihood ratio calculations. As far as we are aware, there are no published studies that evaluate the different modelling choices that can be made. For the evaluation of mixed profiles, the STRmixTM software implements a population stratified likelihood ratio that uses weighted averages of the likelihoods across population groups in the numerator and the denominator:

LRSTRmix=eπeP(O|H1,Gp,e)eπeP(O|H2,Gp,e)=eπesP(O|s)P(s|H1,Gp,e)eπesP(O|s)P(s|H2,Gp,e), (2)

where O is the observed mixture, e is a population, s denotes an ordered genotype set, P(O|s) is the weight of a genotype set and Gp is the genotype of a POI. Thus, the numerator and denominator likelihoods are computed in each population and weighted by their prior probabilities. When FST>0, a sub-population correction is applied taking into account the POI’s alleles for each population. The behaviour of this likelihood ratio has not been studied before. We make one initial observation. From (2), it can be seen that the numerator is fixed (across populations) as long as the sample is single source and there are no drop-in alleles so that the numerator equals the weight of the genotype in the deconvolution. In [15], it was shown the likelihood ratio simplifies to the harmonic mean in this case.

The DBLRTM software [16] version 1.2 implements a likelihood ratio framework for the evaluation of propositions involving one or more samples where contributors may be related according to a pedigree [17]. The comparison of a POI to a mixture can be formulated within this framework. Our intention is to develop an updated version of the software that is able to flexibly model population stratification in this general framework so that the population of each founder in the pedigree can be modelled separately. As a first step towards this development, we have implemented population stratification in a probabilistic way for pedigrees comprising founders only. The population from which each person originates is modelled using a prior distribution. The joint prior distribution of the populations for all persons can be constrained such that all sample donors are from the same ethnic group, or all permutations of populations can be considered.

The aim of the current work is to employ a simulation study to investigate the effect of different modelling choices when computing population stratified likelihood ratios. Specifically, we introduce and study two approaches within the proposed DBLRTM framework. In the first approach, which we call simple stratification, each person is assumed to originate from the same population. In the second approach, which we call full stratification, the population of all persons is considered to be independent a priori. Hence, all permutations of assignments of persons to populations are considered. These two approaches are compared to the method that is implemented in the STRmixTM software (2) and the pragmatic option of considering a minimum across population-specific likelihood ratios.

2. Methods

2.1. Likelihood Ratio Framework

The likelihood ratio framework presented in [17] is briefly summarised here. We have observed M1 evidentiary samples which we denote O1,,OM. Sample Oi is assumed to have been interpreted using probabilistic genotyping software assuming Ni1 contributors so that the weights of genotype sets are determined at every locus. We seek to evaluate a likelihood ratio for two hypotheses H1 and H2 about the contributions of persons to samples. Formally, a hypothesis Hj comprises a pedigree Pj and a set of links Lj from persons to samples. A link from a person to a sample may be specific to a contributor position or may span multiple contributor positions if the sample is mixed. In the current work, we will restrict the pedigrees to contain unrelated persons only.

We extend the framework by introducing a random variable E=(E1,,Em) that models the populations from which the m (unrelated) persons in a pedigree originate. When modelling r1 populations, the random variables Ei take values in e1,,er. In general, we need to specify a prior probability distribution for E. We consider two options. The first option is to restrict all persons to originate from the same population and to set P(E1=e,E2=e,,Em=e)=πe. We will refer to this first option as simple stratified. A second option, which we full refer to as fully stratified, is to consider all permutations of populations and assume independence of population between persons. That is, P(E1=e1,E2=e2,,Em=em)=πe1πe2πem. These two parametrisation of the prior distribution of E allow for a parsimonious model that can be informed by the prior population proportions. Although a more general specification is possible, it is unlikely that the prior probabilities can be meaningfully assigned in practice.

A hypothesis Hi as just defined is composite in two senses. First, the links from persons to samples may involve multiple contributor positions as explained in [17]. For instance, a person may be assumed to contribute to a two-person mixture in either contributor position (major or minor). To numerically evaluate the likelihood of a composite hypothesis, it first needs to be broken down into simple hypotheses. We employ the commonly applied assumption that each assignment of links to specific contributor positions has equal prior probability. Secondly, the hypothesis is composite in the sense that it is paired with a prior probability distribution of assignments of persons to populations.

Specifically, we partition a composite hypothesis Hi into simple hypotheses Hij with prior probabilities αij such that jαij=1. Such a simple hypothesis comprises a pedigree, a set of links from persons to samples at specific contributor positions and the population from which each person originates. The likelihood of Hi may then be evaluated as:

P(O1,,OM|Hi)=jαijP(O1,,OM|Hij)=jαijg1,,gmP(g1,,gm|Pij,eij)k=1MP(Ok|g1,,gm,Lij).

This is a generalisation incorporating population assignments for an expression derived previously ([17], Section 2.1). The outer sum considers all simple hypotheses, now incorporating population assignments and links from persons to samples at specific contributor positions. The inner sum is as defined before in [17]. That is, all possible genotypes are considered for pedigree members and the probability of the genotypes are computed using a pedigree algorithm. The product k=1MP(Ok|g1,,gm,Lij) is computed from the sample deconvolutions.

2.2. Likelihood Ratio for Comparison of a POI to a Single Sample

We will derive simple expressions for the likelihood ratio calculations outlined in the special case that a POI is compared to a single mixture. This will facilitate a comparison of the different approaches. The starting point in the stratified likelihood ratio calculation in STRmixTM (2) is the probability of the observed mixture profile conditional on the genotype of a POI. The DBLRTM approach, on the other hand, includes the probability of all genotypes in both the numerator and denominator likelihoods. Below, we derive expressions for the DBLRTM likelihood ratio for the specific case of the comparison of a POI to a two-person mixture to facilitate comparison with the STRmixTM approach.

2.2.1. Simple Stratification

The simple stratification method considers all persons to be from the same population, regardless of whether a person is a mixture contributor or the POI. The likelihood of a hypothesis Hj (j=1,2) can be written as a sum where each term considers a population:

P(O|Hj,Gp)=eP(O|Hj,Gp,e)P(e|Hj,Gp)=eP(O|Hj,Gp,e)P(e|Gp).

The term P(O|Hj,Gp,e) is the likelihood of the observed mixture assuming population e and conditional on the POI’s genotype (Gp). The term P(e|Gp) is the posterior probability that all persons are from population e after observing the POI’s genotype. We denote this probability as pe|Gp:=P(e|Gp) and compute it by applying Bayes’ rule:

pe|Gp=πeP(Gp|e)eπeP(Gp|e).

Hence, the simple stratified likelihood ratio is evaluated as:

LRsimple=P(O|H1,Gp)P(O|H2,Gp)=epe|GpP(O|H1,Gp,e)epe|GpP(O|H2,Gp,e)=epe|GpsP(O|s)P(s|H1,Gp,e)epe|GpsP(O|s)P(s|H2,Gp,e)

Comparing this expression to (2), it is seen that the prior probabilities in (2) are replaced by posterior probabilities. These posterior probabilities appear as a result of the assumption of perfect dependence of the populations of all relevant persons: if the POI is from a certain population, then all persons are assumed to be from this population. Hence, conditional on the POI’s observed genotype (Gp), the posterior probability that all persons are from population e is equal to the posterior probability that the POI is from population e. As will be demonstrated in example calculations below, if no sub-population correction is applied, i.e., FST=0, then the denominator can be simplified because the genotypes of the mixture donors and the POI are independent.

2.2.2. Full Stratification

The fully stratified likelihood considers all permutations of population assignments. If we write the likelihoods conditional on the POI’s genotype, then it is helpful to first decompose H1 into sub-hypotheses H11,H12,,H1N which state that the POI is donor 1 up to N respectively, where N is the number of contributors used in the deconvolution and then consider the population assignments.

LRfully-stratified=P(O|H1,Gp)P(O|H2,Gp)=1N(P(O|H11,Gp)+P(O|H12,Gp))++P(O|H1N,Gp)P(O|H2,Gp)

Each term in the numerator requires a sum over all permutations of population assignments

P(O|H1i,Gp)=e1,e2,,eNP(e1,e2,,eN|H1i,Gp)P(O|H1i,Gp,e1,e2,,eN)=e1,e2,,eN(pei|Gpjiπej)P(O|H1i,Gp,e1,e2,,eN)

In sub-hypothesis H1i, the POI is contributor i to the mixture, so the population of contributor i is the same as the POI’s. Each of the terms P(O|H1i,Gp,e1,e2,,eN) is computed by explicit summation over all genotype combinations. If FST>0 then the Balding-Nichols approach is used for allele probabilities with each set of donors from the same population. The denominator of the likelihood ratio is also obtained by explicit summation over all population assignments.

P(O|H2,Gp)=e1,e2,,eN,ePP(e1,e2,,eN,ep|H2,Gp)P(O|H2,Gp,e1,e2,,eN,eP)=e1,e2,,eN,eP(πe1πe2πeNpep|GpsP(O|s)P(s|H2,Gp,e1,e2,,eN,ep))

If FST=0 this expression can be simplified as will be shown in the example calculations below.

2.2.3. Example Calculations

To illustrate the differences between the likelihood ratio methods, we construct a one-locus example and demonstrate the explicit calculation of the likelihood ratios. Figure 1 shows a mock electropherogram for one locus of a two-person mixture; note that stutter peaks are not included for simplicity. Table 1 shows the corresponding deconvolution, i.e., the genotype sets and their corresponding weights. We assume there are two populations which we will label A and B. Table 2 shows (fictitious) allele frequencies in the two populations. We compare a POI with genotype (13/14) to the mixture and evaluate likelihood ratios for the hypothesis pair (1). For simplicity we use FST=0.

Figure 1.

Figure 1

Constructed two-person mixture at a single locus.

Table 1.

Mixture deconvolution for the electropherogram shown in Figure 1.

Genotype Combination (s) Weight (ws)
(13/14, 11/11) 0.3
(11/14, 11/13) 0.27
(11/13, 11/14) 0.23
(11/11, 13/14) 0.2
Table 2.

Allele frequencies used in likelihood ratio calculation for the electropherogram shown in Figure 1.

Frequency in Frequency in
Allele (a) Population A (faA) Population B (faB)
11 0.6 0.5
13 0.3 0.3
14 0.1 0.2

Population specific likelihood ratios. Using allele frequencies from population A only, we write LRA=PA(O|H1,Gp)/PA(O|H2,Gp) where the subscript A emphasises the population that is used in the calculations. Because this is a two-person mixture and we evaluate the sub-source likelihood ratio, we consider two sub-hypotheses H11 and H12 stating respectively that the POI is the first or the second contributor. These sub-hypotheses are assumed to have equal prior probability. In this example, the POI genotype is (13/14). Hence, PA(O|H1,Gp)=12(PA(O|H11,Gp)+PA(O|H12,Gp)). There is only a single matching genotype combination, so we obtain PA(O|H11,Gp)=(f11A)2×0.3 and PA(O|H12,Gp)=(f11A)2×0.2 which yields PA(O|H1,Gp)=(f11A)2×0.25. To evaluate PA(O|H2,Gp), we sum over the genotype combinations:

PA(O|H2,Gp)=sPA(s|H2,Gp)P(O|H2,Gp,s)=sPA(s|H2)ws=2f13Af14A(f11A)2×0.3+4f11Af14Af11Af13A×0.27+4f11Af13Af11Af144×0.23+2(f11A)2f13Af14A×0.2=3(f11A)2f13Af14A

We obtain the likelihood ratio in population A as LRA=112f13Af14A2.78 using frequencies in Table 2. Analogously, we obtain LRB=112f13Bf14B1.39.

Simple stratification. To evaluate the likelihood ratio using simple stratification we need the posterior probabilities that the POI’s genotype originates from population A or B. The probability of the POI’s genotype in population A is PA(Gp)=2f13Af14A=0.06 and the probability of the POI’s genotype in population B is PB(Gp)=2f13Bf14B=0.12. Hence, the posterior probability that the POI’s genotype originates from population A equals pA|Gp=120.06120.06+120.12=13; and pB|Gp=23. Next, we need to evaluate

LRsimple=P(O|H1,Gp)P(O|H2,Gp)=epe|GpP(O|H1,Gp,e)epe|GpP(O|H2,Gp,e)=13PA(O|H1,Gp)+23PB(O|H1,Gp)13PA(O|H2)+23PB(O|H2).

Previous calculations gave PA(O|H1,Gp)=14(f11A)2, PB(O|H1,Gp)=14(f11B)2, PA(O|H2)=3(f11A)2f13Af14A and PB(O|H2)=3(f11B)2f13Bf14B. We obtain

LRsimple=112(f11A)2+16(f11B)2(f11A)2f13Af14A+2(f11B)2f13Bf14b=10756121.76.

Full stratification. The full stratification method considers all permutations of population assignments. Under H1, this means that the populations of the two mixture donors (E1,E2) can take the values (A,A), (A,B), (B,A) or (B,B). Under H2, the populations of the two mixture donors and the POI (E,E2,Ep) can take eight values. We first decompose the sub-source hypothesis H1 into H11 and H12 and then consider different population assignments in the calculation.

LRfully-stratified=P(O|H1,Gp)P(O|H2,Gp)=12(P(O|H11,Gp)+P(O|H12,Gp))P(O|H2,Gp)

Each term is computed separately.

P(O|H11,Gp)=e1,e2P(E1=e1,E2=e2|H11,Gp)P(O|H11,Gp,e1,e2)=e1,e2pe1|Gpπe2P(O|H11,Gp,e1,e2)

Recall that pA|Gp=13 and pB|Gp=23. Hence,

P(O|H11,Gp)=1312PAA(O|H11,Gp)+1312PAB(O|H11,Gp)+2312PBA(O|H11,Gp)+2312PBB(O|H11,Gp)=160.3(f11A)2+160.3(f11B)2+130.3(f11A)2+130.3(f11B)2=0.15(f11A)2+(f11B)2

Similar calculations show that P(O|H12,Gp)=0.1(f11A)2+(f11B)2. To compute P(O|H2,Gp), we do not need to explicitly sum over the eight possible population assignments because the genotypes of the mixture donors and the POI are independent given FST=0. Hence,

P(O|H2,Gp)=P(O|H2)=πAπAPAA(O|H2)+πAπBPAB(O|H2)+πBπAPBA(O|H2)+πBπBPBB(O|H2)=143(f11A)2f13Af14A+14(0.3×2×f13Af14A(f11B)2+0.27×4f11Af14Af11Bf13B+0.23×4×f11Af13Af11Bf14B+0.2×2×(f11A)2f13Bf14B)+14(0.3×2×f13Bf14B(f11A)2+0.27×4f11Bf14Bf11Af13A+0.23×4×f11Bf13Bf11Af14A+0.2×2×(f11B)2f13Af14A)+143(f11B)2f13Bf14B=3218000=0.040125

Combining the above, we arrive at LRfully-stratified=0.125(0.62+0.52)0.0401251.90.

STRmixTM stratification. The population stratification method implemented in STRmixTM considers a weighted average of the likelihoods in each population. For our example, we obtain

LRSTRmix=eπeP(O|H1,Gp,e)eπeP(O|H2,Gp,e)=πAPA(O|H1,Gp)+πBPB(O|H1,Gp)πAPA(O|H2)+πBPB(O|H2)=120.25(f11A)2+120.25(f11B)2123(f11A)2f13Af14A+123(f11B)2f13Bf14B1.97

2.3. Simulation Study

In total, 250 two-person DNA mixtures were simulated for a GlobalFilerTM kit using the simDNAmixtures [18,19] package for R. The template parameter of the major contributor was kept fixed at 500 rfu, while the minor contributor had a template parameter sampled uniformly between 25 (a third of the analytical threshold) and 125 rfu. The low template values for the minor contributor ensure that a large proportion of the samples exhibit severe dropout. The genotypes of the two contributors were sampled according to NIST allele frequencies [10], one from the African American sample and the other from the Caucasian sample. It was randomised which one was the major and which one was the minor contributor. All samples were interpreted using STRmixTM version 2.9.

The goal of the simulation study was to compare likelihood ratios obtained using different ways of population stratification, and no population stratification, with the likelihood ratios computed using ground truth known populations. Likelihood ratios were assigned for the two known contributors to each of the 250 samples (500 likelihood ratios). Table 3 provides an overview of the permutations of parameters and implementations (DBLRTM and STRmixTM) that were used in the calculations. The effect of not using population stratification was investigated first by evaluating likelihood ratios using African American allele frequencies only and Caucasian allele frequencies only using both DBLRTM and STRmixTM. Next, the different ways of population stratification were applied. All likelihood ratio calculations were first done with FST=0 and then repeated with FST=0.01. The simulations were not repeated, i.e., the mixture donors were simulated according to a population genetic model with FST=0 in both cases.

Table 3.

Overview of likelihood ratios calculated for comparison of each of the 500 true donors to the 250 simulated two-person mixtures.

Population Implementation FST
African American only DBLRTM, STRmixTM 0, 0.01
Caucausian only DBLRTM, STRmixTM 0, 0.01
Simple stratified DBLRTM 0, 0.01
Fully stratified DBLRTM 0, 0.01
STRmixTM stratified STRmixTM 0, 0.01
Ground truth (mixed) DBLRTM 0, 0.01

The methods for population stratification discussed here are not restricted to mixtures of two contributors only. For example, if a likelihood ratio is evaluated involving a mixture of three contributors considering populations A and B, then eight permutations of population assignments are possible for the three mixture contributors. We have not involved three-person or higher order mixtures in the simulation study to keep the results concise.

2.4. Contributors from the Same Population

Although the main point of the current work is to investigate how the different likelihood ratio methods perform when the mixture contributors originate from different populations, we also perform one experiment where the contributors originate from the same population. Specifically, we repeat the simulation study described above for 250 mixed profiles where the contributors are simulated from the same populations.

3. Results

3.1. The Case of FST=0

We first discuss the effects of using the various methods listed in Table 3 for the case FST=0. Figure 2 compares the weight of evidence (WoE or log10LR) for the true donors in the simulated mixtures obtained using each of the compared methods with the WoE obtained using the ground truth populations. Specifically, the vertical axis shows the difference between the WoE obtained using a particular method and the WoE obtained using the ground truth populations while the latter is shown on the horizontal axis. A positive difference means that the method is anti-conservative. First, we note that using a single population which does not correspond to the POI’s population may lead to an inflated WoE when the two mixture donors are from different populations. The top-left panel shows that for Caucasian POIs the WoE is up to six bans higher (i.e., the LR is up to six orders of magnitude higher) when African American allele frequencies are used than when the ground truth of mixed populations is used. Similarly, the top-right panel shows that for African American POIs the WoE is up to 9 bans higher when Caucasian allele frequencies are used in comparison to the ground truth populations. In both cases, however, using only the population of the POI is on average conservative. For African American POIs the median WoE difference when using only the African American population is −0.38; the median WoE difference for Caucasian POIs is −0.34. This can also be seen from the plots in which the trendlines sit slightly below zero.

Figure 2.

Figure 2

WoE difference the compared methods and the WoE evaluated the ground truth populations with FST=0 for the 500 donors to the 250 simulated two-person mixtures where the contributors originate from different populations.

The three methods for stratified likelihood ratios are compared next. The simple stratification method implemented in DBLRTM assumes that all mixture donors are either from the one or from the other population which does not correspond to the ground truth. For some POIs this leads to an inflated WoE with the largest WoE difference being about two bans which may be a meaningful difference in some circumstances. The median WoE difference for the simple stratification method is 0.30 with a standard deviation of 0.57. The full stratification method implemented in DBLRTM does take into account the possibility that mixture donors originate from different populations. The median WoE difference for the full stratification method is 0.03 with a standard deviation of 0.32. Hence, the full stratification method is more accurate and less biased than the simple stratification method in these simulations. The STRmixTM stratification method (bottom-right panel) employs a weighted average of likelihoods in the numerator and denominator of the likelihood ratio. This method is on average slightly anti-conservative with a median WoE difference of 0.31 and a standard deviation of 0.82. For all simulated POIs the WoE difference was less than four bans.

Finally, we discuss the results for the approach of taking the minimum WoE across the two populations. Although this leads to the most conservative WoE across the two populations that are considered, the calculations do not take into account the possibility that different donors originate from different populations. Therefore the minimum WoE across populations can be larger than the WoE obtained using ground truth populations and may be anti-conservative. The bottom-left panel of Figure 2 shows the WoE difference for this method. The trendlines sit slightly below the horizontal axis which indicates that this method is on average conservative. The median WoE difference is 0.42 with a standard deviation of 0.52. For 13% of the simulated POIs this method is anti-conservative with respect to the WoE obtained using the ground truth populations (WoE difference greater than zero).

3.2. The Case of FST=0.01

All likelihood ratio calculations comparing the POIs to the simulated mixtures were repeated with FST=0.01. We compare the results to the ones presented for FST=0 in the previous section. Figure 3 is the equivalent of Figure 2 with FST=0.01 instead of FST=0. The two top panes in Figure 3 show that using a single population in the likelihood ratio calculation that is not the POI’s population inflates the WoE for most POIs in comparison to the WoE evaluated using the ground truth populations. The extent to which the WoE is inflated is smaller than when FST=0 is used in the likelihood ratio calculations. Using only the POI’s population is conservative, as was the case with FST=0. Specifically, the WoE difference for African American POIs has a median of 0.61 and a standard deviation of 0.53; the WoE difference for Caucasian POIs has a median of −0.53 with a standard deviation of 0.52.

Figure 3.

Figure 3

WoE difference between compared methods and the WoE evaluated the ground truth populations with FST=0.01 for the 500 donors to the 250 simulated two-person mixtures with contributors from different populations.

The three stratification methods are affected differently by the use of a sub-population correction. Notably, the trendlines for the WoE differences for the simple stratification and the STRmixTM stratification are lower when FST=0.01 is used compared to when FST=0 is used, while the WoE difference for the full stratification method is shifted less. Using FST=0.01, the simple stratification method has a median WoE difference of 0.53 with a standard deviation of 0.57. The full stratification method is close to unbiased with a median WoE difference of 0.05 and a smaller standard deviation of 0.32. For the STRmixTM stratification method the median WoE difference is 0.07 and the standard deviation is 0.72. The fraction of POIs for which the WoE difference is positive (i.e., the method is anti-conservative) is 12.4% for the simple stratification method, 15.4% for the full stratification method and 46.6% for the STRmixTM stratification method. Although these fractions are non-negligible, most of the positive WoE differences are small in magnitude. The fraction of POIs for which the WoE difference exceeds one ban is 0.6% for the simple stratification method, 0 for the full stratification method and 6.6% for the STRmixTM stratification method.

3.3. Contributors from the Same Population

Part of the simulation study was repeated with both of the contributors in each of 250 simulated mixtures now originating from the same population. Figure 4 shows the likelihood ratios obtained for the 500 contributors when compared to the simulated mixtures using the six different likelihood ratio approaches. As before, the top panels show that using the wrong allele frequencies is non-conservative for almost all POIs. Interestingly, the full stratification method is slightly non-conservative in comparison to the ground truth population. For 94.6% of the POIs, the WoE difference is greater than 0 and the median difference is 0.30 bans (close to a factor of 2 in likelihood ratios). The factor of 2 appears because both under H1 and H2 the single large likelihood is the one with all persons coming from the same population, while under H1 there are four permutations in total compared to eight under H2. Alternatively, this can be seen as follows. Recall from the example calculations that we may write

LRfully-stratified=P(O|H1,Gp)P(O|H2,Gp)=12(P(O|H11,Gp)+P(O|H12,Gp))P(O|H2,Gp)

Figure 4.

Figure 4

WoE difference between the compared methods and the WoE evaluated using the ground truth population with FST=0.01 for the 500 donors to 250 simulated two-person mixtures with contributors from the same population.

Assuming two populations (A,B) are considered and the only large likelihoods are obtained for E1=E2=A, we obtain

P(O|H11,Gp)=P(E1=A,E2=A|H11,Gp)PAA(O|H11,Gp)+P(E1=A,E2=B|H11,Gp)PAB(O|H11,Gp)+P(E1=B,E2=A|H11,Gp)PBA(O|H11,Gp)+P(E1=B,E2=B|H11,Gp)PBB(O|H11,Gp)=pA|GpπAPAA(O|H11,Gp)+pA|GpπBPAB(O|H11,Gp)+pB|GpπAPBA(O|H11,Gp)+pB|GpπBPBB(O|H11,Gp)πAPAA(O|H11,Gp)

In the same way we obtain P(O|H12,Gp)πAPAA(O|H12,Gp) and P(O|H2,Gp)πAπAPAA(O|H2,Gp). Combining these, we obtain an expression for the fully stratified likelihood ratio LRfully-stratifiedπA12(PAA(O|H11,Gp)+PAA(O|H12,Gp))πAπAPAA(O|H2,Gp)=LRAπA. In our example, we have πA=12 so we get LRfully-stratified2LRA. The simple stratification method gives likelihood ratios that are for most POIs very close to the likelihood ratio computed with the ground truth population. The minimum WoE across populations is for most POIs equal to the WoE evaluated using the ground truth population and is otherwise slightly conservative. Finally, the STRmixTM stratification method yields results that are most of the time very close to the ground truth population likelihood ratios.

4. Discussion

Despite an apparent consensus in the literature [12,14] that likelihood ratios incorporating population stratification are an elegant solution, little has been published about how such likelihood ratios should be evaluated exactly and how stratification methods compare to other methods. The goal of the current work was to partially address the lack of understanding in this area by proposing explicit methods and evaluating those using a simulation study. Related to this problem, several articles have previously described the effects of population database choice on kinship likelihood ratios [13,20]. Although these publications show that there are meaningful differences depending on which population database is used, another recent publication demonstrates that a universal (worldwide) database can be a good choice in the context of missing persons identification [21]. In the context of random match probabilities for full single source DNA profiles, Ref. [22] presented results that support the dismissal of racial databases and the adoption of a race-neutral database. The aforementioned articles do not discuss the context of probabilistic genotyping. The behaviour of likelihood ratio methods is especially important in this context, because more sensitive multiplexes increase the number of low-level and mixed samples that are analysed. The weight of evidence for such samples may be low and sensitive to modelling assumptions.

We have proposed two new likelihood ratio methods for population stratification. The first method, called simple stratification, is based on the assumption that all persons in the evaluation are from the same population. The second method, called full stratification, considers all permutations of assignment of persons to populations. The behaviour of both methods was investigated using a simulation study and compared to a method that is currently implemented in the STRmixTM software. In the absence of sub-population correction (i.e., when FST=0), the findings indicate that the full stratification method is most accurate when the mixture contributors originate from different populations. The standard deviation of the WoE difference between this method and the ground truth is also the smallest among the compared methods. The simple stratification method is on average slightly conservative, however the standard deviation of the WoE difference between the simple stratification method and the ground truth is comparatively large (0.57 versus 0.32 for the full stratification method). This means that the simple stratification method is more often slightly non-conservative. The highest WoE differences observed for the simple stratification method are about 2 bans. The stratification method implemented in STRmixTM is on average slightly anti-conservative with the median WoE difference being 0.31 bans. The standard deviation of 0.82 is higher than the standard deviation obtained for the other two stratification methods. The highest WoE differences were close to four bans. Besides the stratification methods, the simulation study also investigated the possibility of taking the minimum WoE across the two populations that were considered. This is, on average, a conservative approach. However, it is important to note that this approach does not take into account that the mixture donors originate from different populations and is therefore not always conservative.

When FST=0.01 was used in the likelihood ratio calculations, all approaches behaved more conservatively in the sense that lower WoEs were obtained. The WoE evaluated using the ground truth populations is affected relatively weakly by changes in FST because the sub-population correction only applies within a population and not across populations. This explains why using the POI’s population, or a minimum across populations, is more conservative in the simulations where FST>0.

The current work was limited in scope. We have restricted the simulation study to involve two-person mixtures only to keep the volume of results manageable. Another limitation of the work is that we have assumed that a person originates from a single population. A natural extension would be to consider that persons may have two parents from different populations.

Finally, we mention an implementation detail relevant to software implementation of the models discussed in the current work. As discussed previously [17], DNA mixture calculations typically consider a virtual so-called Q allele that represents any allele not observed in a sample. When a POI is compared to a mixture and one of the POI’s alleles is not observed in the profile, it is (almost) irrelevant for the evidential value calculation which allele this is exactly depending on the particular model that is used. Population stratification methods introduce a complication, because the POI’s alleles that are not shared with the mixture may be informative of the population from which the POI originates and may affect the likelihood. Moreover, alleles at loci that are not shared between the POI and the mixture may also affect the likelihood.

5. Conclusions

The current work establishes that the common practise of taking a minimum WoE across populations is not always conservative when FST=0. Population stratification methods may also be non-conservative in some cases. When FST>0 is used in the likelihood ratio calculations, as is recommended, all compared approaches become conservative on average to varying degrees.

Author Contributions

Conceptualization, M.K.; methodology, M.K., H.K., J.-A.B. and J.B.; software, M.K.; validation, H.K.; formal analysis, M.K., H.K., J.-A.B. and J.B.; writing—original draft preparation, M.K, H.K., J.-A.B. and J.B.; writing—review and editing, M.K., H.K., J.-A.B. and J.B. All authors have read and agreed to the published version of the manuscript.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Funding Statement

This work was supported in part by grant NIJ 2020-DQ-BX-0022 from the US National Institute of Justice. Points of view in this document are those of the authors and do not necessarily represent the official position or policies of their organizations.

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

  • 1.Meester R., Slooten K. Probability and Forensic Evidence: Theory, Philosophy, and Applications. Cambridge University Press; Cambridge, UK: 2021. [Google Scholar]
  • 2.Buckleton J.S., Bright J.A., Taylor D. Forensic DNA Evidence Interpretation. CRC Press; Boca Raton, FL, USA: 2018. [Google Scholar]
  • 3.Balding D.J., Steele C.D. Weight-of-Evidence for Forensic DNA Profiles. John Wiley & Sons; Hoboken, NJ, USA: 2015. [Google Scholar]
  • 4.Balding D.J., Nichols R.A. DNA profile match probability calculation: How to allow for population stratification, relatedness, database selection and single bands. Forensic Sci. Int. 1994;64:125–140. doi: 10.1016/0379-0738(94)90222-4. [DOI] [PubMed] [Google Scholar]
  • 5.Balding D.J., Bishop M., Cannings C. Handbook of Statistical Genetics. John Wiley & Sons; Hoboken, NJ, USA: 2008. [Google Scholar]
  • 6.National Research Council . The Evaluation of Forensic DNA Evidence. National Research Council; Washington, DC, USA: 1996. [Google Scholar]
  • 7.Gill P., Haned H., Bleka O., Hansson O., Dørum G., Egeland T. Genotyping and interpretation of STR-DNA: Low-template, mixtures and database matches—Twenty years of research and development. Forensic Sci. Int. Genet. 2015;18:100–117. doi: 10.1016/j.fsigen.2015.03.014. [DOI] [PubMed] [Google Scholar]
  • 8.Butler J.M. The future of forensic DNA analysis. Philos. Trans. R. Soc. B Biol. Sci. 2015;370:20140252. doi: 10.1098/rstb.2014.0252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Devesse L. Ph.D. Thesis. King’s College London; London, UK: 2022. Characterisation and Differentiation of five UK Populations Using Massively Parallel Sequencing of Forensic STRs. [Google Scholar]
  • 10.Steffen C.R., Coble M.D., Gettings K.B., Vallone P.M. Corrigendum to ‘US population data for 29 autosomal STR loci’[Forensic Sci. Int. Genet. 7 (2013) e82–e83] Forensic Sci. Int. Genet. 2017;31:e36–e40. doi: 10.1016/j.fsigen.2017.08.011. [DOI] [PubMed] [Google Scholar]
  • 11.Budowle B., Moretti T.R., Baumstark A.L., Defenbaugh D.A., Keys K.M. Population data on the thirteen CODIS core short tandem repeat loci in African Americans, US Caucasians, Hispanics, Bahamians, Jamaicans, and Trinidadians. J. Forensic Sci. 1999;44:1277–1286. doi: 10.1111/1556-4029.12806. Erratum in J. Forensic Sci. 2015, 60, 1114–1116. [DOI] [PubMed] [Google Scholar]
  • 12.SWGDAM Ad Hoc Working Group on Genotyping Results Reported as Likelihood Ratios. Recommendations of the SWGDAM Ad Hoc Working Group on Genotyping Results Reported as Likelihood Ratios. 2018. [(accessed on 12 October 2022)]. Available online: https://www.swgdam.org/_files/ugd/4344b0_dd5221694d1448588dcd0937738c9e46.pdf.
  • 13.Ge J., Budowle B. Kinship index variations among populations and thresholds for familial searching. PLoS ONE. 2012;7:e37474. doi: 10.1371/journal.pone.0037474. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Gill P., Hicks T., Butler J.M., Connolly E., Gusmão L., Kokshoorn B., Morling N., van Oorschot R.A., Parson W., Prinz M., et al. DNA commission of the International society for forensic genetics: Assessing the value of forensic biological evidence-Guidelines highlighting the importance of propositions: Part I: Evaluation of DNA profiling comparisons given (sub-) source propositions. Forensic Sci. Int. Genet. 2018;36:189–202. doi: 10.1016/j.fsigen.2018.07.003. [DOI] [PubMed] [Google Scholar]
  • 15.Triggs C., Harbison S., Buckleton J. The calculation of DNA match probabilities in mixed race populations. Sci. Justice J. Forensic Sci. Soc. 2000;40:33–38. doi: 10.1016/S1355-0306(00)71931-9. [DOI] [PubMed] [Google Scholar]
  • 16.Kelly H., Kerr Z., Cheng K., Kruijver M., Bright J.A. Developmental validation of a software implementation of a flexible framework for the assignment of likelihood ratios for forensic investigations. Forensic Sci. Int. Rep. 2021;4:100231. doi: 10.1016/j.fsir.2021.100231. [DOI] [Google Scholar]
  • 17.Kruijver M., Taylor D., Bright J.A. Evaluating DNA evidence possibly involving multiple (mixed) samples, common donors and related contributors. Forensic Sci. Int. Genet. 2021;54:102532. doi: 10.1016/j.fsigen.2021.102532. [DOI] [PubMed] [Google Scholar]
  • 18.Kruijver M. simDNAmixtures: Simulate Forensic DNA Mixtures. R Package Version 0.2. 2022. [(accessed on 12 November 2022)]. Available online: https://linkinghub.elsevier.com/retrieve/pii/S1872497321000703.
  • 19.Kruijver M., Bright J.A. A tool for simulating single source and mixed DNA profiles. Forensic Sci. Int. Genet. 2022;60:102746. doi: 10.1016/j.fsigen.2022.102746. [DOI] [PubMed] [Google Scholar]
  • 20.Toscanini U., Salas A., García-Magariños M., Gusmão L., Raimondi E. Population stratification in Argentina strongly influences likelihood ratio estimates in paternity testing as revealed by a simulation-based approach. Int. J. Leg. Med. 2010;124:63–69. doi: 10.1007/s00414-009-0359-2. [DOI] [PubMed] [Google Scholar]
  • 21.Laurent F.X., Fischer A., Oldt R.F., Kanthaswamy S., Buckleton J.S., Hitchin S. Streamlining the decision-making process for international DNA kinship matching using Worldwide allele frequencies and tailored cutoff log10LR thresholds. Forensic Sci. Int. Genet. 2022;57:102634. doi: 10.1016/j.fsigen.2021.102634. [DOI] [PubMed] [Google Scholar]
  • 22.Oldt R.F., Kanthaswamy S. Expanded CODIS STR allele frequencies–Evidence for the irrelevance of race-based DNA databases. Leg. Med. 2020;42:101642. doi: 10.1016/j.legalmed.2019.101642. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Not applicable.


Articles from Genes are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES