Skip to main content
Microbiology Spectrum logoLink to Microbiology Spectrum
. 2024 Jul 23;12(9):e03946-23. doi: 10.1128/spectrum.03946-23

Maximum likelihood estimators for colony-forming units

K Michael Martini 1,2,, Satya Spandana Boddu 1, Ilya Nemenman 1,2,3, Nic M Vega 1,2,3,
Editor: Stephan Schmitz-Esser4
PMCID: PMC11371269  PMID: 39041814

ABSTRACT

Measuring the abundance of microbes in a sample is a common procedure with a long history, but best practices are not well-conserved across microbiological fields. Serial dilution methods are commonly used to dilute bacterial cultures to produce countable numbers of colonies, and from these counts, to infer bacterial concentrations measured in colony-forming units (CFUs). The most common methods to generate data for CFU point estimates involve plating bacteria on (or in) a solid growth medium and counting their resulting colonies or counting the number of tubes at a given dilution that have growth. Traditionally, these types of data have been analyzed separately using different analytic methods. Here, we build a direct correspondence between these approaches, which allows one to extend the use of the most probable number method from the liquid tubes experiments, for which it was developed, to the growth plates by viewing colony-sized patches of a plate as equivalent to individual tubes. We also discuss how to combine measurements taken at different dilutions, and we review several ways of analyzing colony counts, including the Poisson and truncated Poisson methods. We test all point estimate methods computationally using simulated data. For all methods, we discuss their relevant error bounds, assumptions, strengths, and weaknesses. We provide an online calculator for these estimators.

Estimation of the number of microbes in a sample is an important problem with a long history. Yet common practices, such as combining results from different measurements, remain sub-optimal. We provide a comparison of methods for estimating abundance of microbes and detail a mapping between different methods, which allows to extend their range of applicability. This mapping enables higher precision estimates of colony-forming units (CFUs) using the same data already collected for traditional CFU estimation methods. Furthermore, we provide recommendations for how to combine measurements of colony counts taken across dilutions, correcting several misconceptions in the literature.

KEYWORDS: CFU, colony count estimation, bacterial counts, MPN, dilution plating, maximum likelihood estimator, dilution experiments, most probable number

INTRODUCTION

Extrapolation of viable microbial counts from suspensions of live cells is a longstanding—and surprisingly complicated—problem, which here we will explore computationally. The fundamental problem is simple: there exists a sample with some unknown concentration of live microorganisms, which an experimentalist wants to measure. For simplicity, we will assume that the sample in question was taken as, or has been resuspended into, some initial volume. That initial volume will be serially diluted (usually in a 10-fold series), and fixed-volume aliquots (or sub-samples) of the resulting suspensions will be cultured. If these aliquots are plated in or on agar, the resulting data will be in the form of colony counts (colony-forming units, CFUs). Alternately, multiple aliquots may be taken from a single dilution and used to seed a number of wells or tubes of liquid culture, or a number of plates. Then, the number of volumes showing growth when seeded from a particular dilution, as a fraction of the total number of volumes inoculated, can be used to calculate the most probable number (MPN) of live agents in the initial volume (1, 2).

Best practice for this apparently simple and ubiquitous scenario has been the subject of debate for over a century (1, 316). CFU measurement is inherently subject to errors due to the stochastic nature of counts. This type of error is the primary concern in research settings, where microbes are grown in culture prior to quantification. For reasons that are not clear, the substantial literature on microbial quantification in food and environmental samples is not well known in research laboratories, and the practices and estimators used vary considerably from lab to lab.

Counting error is related to the density and number of colonies. At one extreme, when the sample is too concentrated, the number of resulting colonies will be too numerous to count (TNTC; sometimes “too many to count,” TMTC). At these high concentrations, colonies merge, breaking the assumption that each microbe corresponds to one colony (6, 17). At the other extreme, when the sample is very diluted, the number of colony initiating bacteria in the sample is subject to small-number statistical (sampling) fluctuations, resulting in high relative error (ratio of the standard deviation to the mean) (18, 19). Crowding error is widely acknowledged, and TMTC colonies or plates are generally simply excluded from analysis [although there are uses for these data, see, for example, reference (20)]. However, understanding of small number errors is less common in wet-lab microbiology. Finally, experimental errors, such as inaccuracies in pipetting, can emerge and compound over the steps of a serial dilution. However, the latter source of error is expected to be negligible for equipment calibrated to usual standards, and technical replication further reduces effects of this variation (21).

Thus, the problem at hand is how can CFU density best be estimated from plate counts, given the error produced by sampling fluctuations, colony crowding, and (to a lesser degree) pipetting? These errors will contribute differently to different experimental designs. For a single sample with initial volume V0 which is represented by one count of colonies nk at one dilution dk (because only one dilution was measured, or because only one spot or one plate in a series was countable), statistical error of counts [presumably but not necessarily (22, 23) Poisson] is inevitable, and pipetting error will contribute but may not be significant. For a single sample represented by more than one count of colonies (representing counts at different dilutions within a single dilution series, and/or technical replicates where one sample was measured multiple times), the same errors apply, but pipetting bias may not be constant across measurements (for example, one failing O-ring on a multichannel pipette can lead to bias in a single column of a 96-well plate). If technical errors are reasonably small, these measurements should conform to expectations for Poisson sampling error (24).

It is critical to note that for multiple samples of the same type measured in parallel (including biological replicates), we can no longer expect variation across samples to reflect a Poisson-distributed sampling error. Variation across samples will be biological (or otherwise inherent) and demographic (accumulating over time) in addition to that due to sampling. This was the basis for the Nobel Prize-winning experiments of Luria and Delbrück, who used the distribution of fluctuations to distinguish Darwinian vs Lamarckian evolution (25). This is also frequently the case in environmental samples, where different samples from the same source (e. g., water samples from different parts of the same lake) will produce measurements that have super-Poisson variation (aka, over-dispersed). In such cases, a substantial part of the variation is “real” due to inhomogeneities in the source (26) and counts may be better represented by a lognormal or negative binomial distribution (22, 23) than by the Poisson (26).

These considerations are substantial. Sample inhomogeneity is known to be a major source of variation between measurements in environmental and food samples (22, 27). Even apparently “well-mixed” samples, such as water samples or liquid cultures in a laboratory, can be very inhomogeneous especially if microbes have a tendency to clump (19). However, such considerations are case-specific, and dealing with these is beyond the scope of the present work. The larger problem of microbiological estimation from varied and inhomogeneous samples has been dealt with extensively elsewhere, for example, references (18, 22, 28).

Here, we will focus on point estimation of CFU density within an individual, homogeneous sample, which may be represented by a single set of measurements or by technical replicates, in which one sample is measured multiple times. The main objective of this paper is to propose methods for accurately estimating CFUs, while taking into account the effects of crowding and sampling fluctuations, without losing valuable data from counts. Drawing from previous research (1719, 29), we present simple analytical formulas that can be used to combine counts from different dilutions and to obtain precise CFU point estimates along with accurate error bars. First, we examine existing point estimate methods in the literature, assessing their strengths and weaknesses including pick-the-best, Poisson, and Poisson with a cutoff. We discuss the “Poisson with a cutoff” method, which clarifies the impact of crowding on CFU density estimation and demonstrates how to minimize the effects of sampling error by combining measurements of “uncrowded” counts. We explore the mathematical equivalence between different regimes of the common CFU estimators. Next, we introduce a crowding-explicit model to demonstrate the relationship between canonical plate-based counts and the most probable number method for presence/absence of growth in liquid media. This is achieved by considering colony-sized patches of the plate as equivalent to individual tubes. This extension of the domain of applicability of MPN to colony counts will potentially increase the accuracy of a whole class of experiments with no additional experimental costs. Finally, we computationally evaluate and compare all the point estimators we discussed for their bias and standard errors. If experimentalists require estimates beyond point estimation such as full distributions or need to include experimental uncertainties in the counting process, more detailed methods are available in the literature (19). However, if an experimentalist just requires a point estimate of the CFU density, we provide practical recommendations for experimentalists on how to select appropriate dilution and replication schemes and how to combine data from multiple observations. We also have provided a calculator for these estimators available on Hugging Face spaces, named CFUestimator (30).

RESULTS

A brief history of counts

CFUs are a proxy for the concentration of microbes within a sample. A standard experimental procedure for estimating CFUs consists of serially diluting homogeneous samples in a sterile aqueous buffer, then plating aliquots of these dilutions on growth-supporting agar and later counting the resulting colonies. If an appropriate dilution has been reached, each microbe will form an independent colony that is countable by eye. We assume for simplicity that all plates or tubes used for growth have the same ability to support growth of the organism(s) being studied, and that the sample is sufficiently homogenized to ensure that microbes are free in solution and not adhered to one another or to a substrate (4). (In practice, these assumptions should be tested before choosing an estimator (18, 26); we discuss one such test in the next section.).

The simplest way of estimating CFUs is to multiply the number of colonies by the reciprocal of the dilution factor to find the concentration of colony-forming microbes in the original suspension (1, 5, 29). For example, say, there is a single sample represented by one countable 10 cm plate in a dilution series, where we observe 100 distinct colonies after plating 100 µL of a 1:100 dilution (dilution 2 in a 10-fold series) from the original sample. In this case, following this simple procedure, we would obtain

CFUVolume=countsFracOriginalVolume=1000.1mL0.01=100103=105CFU/mL. (1)

This is exactly equivalent to multiplying the number of counts by a volume correction factor [1/(size of aliquot in mL)] and multiplying by the base of the dilution series raised to the power of the number of dilution steps

CFUVolume=10.1mL100colonies102=100103=105CFU/mL. (2)

This simple calculation follows from a more general Poisson model, explained below. This method works reasonably well under ideal conditions: all samples should be represented by a single count of colonies, and each count should be large enough to minimize small-number sampling fluctuations, and yet small enough to avoid crowding on the plate. When any of these conditions are not met, accurate estimation of CFU density becomes more complicated.

There is a broad literature of methods proposing to ensure that estimates of CFU density are “good.” A good estimator should be accurate. Formally, this means that such estimators should have the true value of the CFU density as their expected value. In other words, they must be unbiased. Good estimators must also be precise, so that variance in the estimate is small and samples are repeatable. Therefore, an ideal solution to this problem should provide an estimator that is provably unbiased and with a minimal variance. The solution to this problem is well known in statistics: if we can assume that data follow a specific probability distribution, then the maximum likelihood estimator (MLE) for that distribution will have these properties (18). While this is formally true only for very large samples, MLEs generally perform well even for relatively small samples. Furthermore, an ideal method should be straightforward to use in the hands of researchers without advanced mathematics background. Unfortunately, many of the available methods fail one or the other of these requirements, being either simple to use, but statistically sub-optimal, or mathematically correct, but incomprehensible to many bench scientists.

Straightforward-to-use methods focus largely on designing protocols that avoid data in error-prone extremes. For example, the FDA recommends (31) that the best dilution range for coliform bacteria results in 25 to 250 colonies per 10 cm spread plate, with the ideal count closest to 250. Restriction on the high end limits counting errors due to growth limitation by nutrient depletion as well as outright merging of colonies, which would bias the number of counts downward. Conversely, restriction on the lower end limits the sampling error associated with small numbers of counts. Specifically, under the assumption that counts represent random draws from a given sample and are, therefore, Poisson distributed, the error scales as the square root of the number of counts. Thus, for small counts, the error becomes an unacceptably large fraction of the mean. Within the example above, our dilution 2 count of 100 colonies should have a standard deviation (SD) of 100=10, giving a coefficient of variation (CV) of 10%. At dilution 3, we might obtain 10 counts, with an SD of 103.16 and a CV of 31.6%.

From here, the simplest approach that is often used in practice is to choose only the plate or spot that has the “best” count in the acceptable range, and to estimate CFU density based on that single count. Often, only the dilution at the high end of the countable range is used since it has the smallest sampling fluctuations; all other measurements are discarded (29). We call this the “pick-the-best” method for later reference. If counts in the acceptable range can be consistently achieved, this method is straightforward and reasonably accurate. However, discarding data is rarely advisable, and over- and under-crowded measurements can, in fact, be used to improve CFU estimates.

Simplest “good” estimator: Poisson

One simple and reasonably accurate model for calculating CFUs assumes that the number of colonies are Poisson distributed, with variation due to sampling. That is, for a particular dilution, the mean colony count for that dilution is the same as the variance. By this model, the most likely estimator for the density of microbes is simply the ratio of the total number of colonies counted from all plates divided by the total amount of liquid used from the original sample in all plates (see Supplementary Information). If there is only one countable measurement for a given sample, this simplifies to “pick-the-best.”

The Poisson model implicitly assumes that the original sample is well mixed and each microbe plated will result in its own separate and countable colony. It further assumes that experimental volume is spread uniformly across the agar surface, resulting in cells being randomly distributed, independent of the locations of where other cells landed. Formally, these assumptions mean that there is a uniform and well-mixed population density r of microbes per unit volume in an initial volume of liquid V. The liquid is diluted by a factor dk = Vk / V, where Vk is the volume of the liquid from the original sample used on the plate or the spot k. Plating will result in nk colonies, where nk is Poisson distributed with the parameter λ = r dkV = rVk. That is, the average number of colonies per experiment is r dkV with variance r dkV. Using these assumptions, the MLE of the density of microbes rmle and its standard error are

rmle =knkVkdk=knkkVk,σ=rmle knk. (3)

In other words, the best estimator for the concentration, rmle, is the total number of colonies divided by the total amount of the original volume of liquid used. However, as noted earlier, this ignores crowding and counting errors. In practice, this method should be avoided unless all measurements are from well-dispersed, uncrowded plates, as crowding effects can make a large difference in the estimator, resulting in under-estimating the microbial density as colonies merge and are under-counted.

If technical replicates exist (multiple measurements of the same sample; four or more such measurements are recommended), it is straightforward to test whether the data adhere to a Poisson distribution using the following test, known as the dispersion index test. If there are j measurements of a given sample, with average number of counts N- and a variance of counts sN2, then the index of dispersion D2 is

D2=(j1)sN2N, (4)

which will be distributed as χ2 with j − 1 degrees of freedom (18, 32). If D2 is greater than the upper 1 − α quantile of that distribution, where α is the needed significant P-value, then we reject the null hypothesis that these replicates are drawn from the same Poisson distribution. This can indicate technical problems that are introducing an excess (or insufficiency) of variation, possibly by biasing replicates differently from one another (e.g., the failing O-ring example above), or biases due to a too lenient cutoff for countability.

Combining data: common bad estimators

The primary reason for the “pick-the-best” approach is that it eliminates confusion over how to combine multiple measurements for a given sample, particularly when counts belong to more than one dilution. First, notice that combining measurements from technical replicates that are taken at the same dilution is straightforward. For example, let us assume an original 200 µL volume V contains r = 3 · 108 CFU. We can create simulated serial dilutions from this original volume by assuming that each pipetting step (10-fold dilutions and plating onto agar) is a binomial sampling event (19) that comes with experimental noise. In one such simulation, triplicate plating 100 µL aliquots results in counts n6 = (162, 141, 148), all from the sixth 10-fold dilution. The fraction of the original volume plated in each case is V6 = 0.5 ⋅ 10−6 = 5 ⋅ 10−7. These numbers can be combined via the Poisson method shown in the previous section to estimate CFU density in V,

CFU=162+141+148510-7+510-7+510-7=162+141+1483510-7=3.007108. (5)

Alternately, counts taken from the same dilution can be averaged across technical replicates, then adjusted by the volume plated and the dilution read (33)

CFU=2162+141+1483106=2162+141+148310-6=162+141+14830.510-6=2.007108. (6)

Clearly, these two most common approaches are algebraically identical.

In contrast, combining counts across different dilutions is less straightforward. In fact, some commonly used methods for combining measurements are statistically inadmissible. For example, if there are multiple measurements in the countable range, the USDA FSIS recommends (33) that researchers calculate the estimated CFU for each dilution separately using the average colony count across technical replicates at a given dilution and then average the results of the separate dilutions. If the two estimates are more than a factor of 2 apart, the researcher is instructed to instead only use the counts from the higher-density plates. This commonly used method incorrectly combines the data using a simple average, thus increasing the variance of the estimated CFU density. Indeed, continuing the example above, let us suppose that, on the plates corresponding to the seventh 10-fold dilution from these three technical replicates, we observe (18, 22, 25) colonies. The Poisson estimator gives us

CFU=162+141+148+13+17+2035·10-7+35·10-8=3.036·108. (7)

The USDA averaging method gives

CFU= 12(2162+141+1483106+213+17+203107)                          =12(3.06108+3.33108)=3.2108. (8)

On these data, averaging was substantially less precise, with an error of 7% as compared with the Poisson method’s error of 1% (recall that the true density in this example is 3.0 · 108 CFU per 200 µL). The USDA method improperly averages across dilutions, amplifying fluctuations associated with small colony number counts, whereas the simple Poisson model properly combines measurements across dilutions by effectively re-weighting small counts by the small volumes with which they are associated. In a later section, we demonstrate that averaging across dilutions will, as a rule, increase the variance of CFU estimates.

Too few and too many

Furthermore, there is the problem of what to do with zero counts. These data are inevitably limited by some threshold of detection (TOD), representing the smallest CFU density at which counts can be detected. This “left-censoring” is a well-known issue (3436) with many proposed work-arounds, including but not limited to substituting zeros with a small value (which may be the average of the undetectable range, a maximum likelihood inferred value, or some other small number), reporting zeros as “below TOD” or “<1” rather than as a value, and pretending they did not happen (not generally recommended; although if zeros are rare, it will not make much difference) (34, 36). Sometimes, a threshold of quantification representing the lowest “acceptable” (sufficiently precise) count is used along with or instead of TOD (29), with values below this threshold omitted from analysis.

The “best” approach to zero-contaminated count data depends on what else is in the data and what the data will be used to do. We assume here that the goal is to obtain a point estimate of CFU density in the original sample (as opposed to, for example, determining the probability that this density is in excess of some threshold). If a sample is represented by zero and non-zero measurements, the Poisson model explicitly allows zero counts to be incorporated as outcomes of the random sampling process. For example, if a hypothetical V = 200 µL sample contains 5 · 107 CFU, one simulation of serial dilution and plating in triplicate with 100 µL per plate produces dilution 6 counts of (31, 26, 20 ) and dilution 7 counts of (4, 0, 0). Using just the dilution 6 counts, we estimate

CFU=31+26+203(510-7)=5.13107±0.59107. (9)

If we use the lower dilution as well, we obtain

CFU=31+26+20+4+0+03(5107)+3(5108)=4.91107±0.55107. (10)

In this case, incorporating data from zeros (in the form of additional volume that was plated but contained no counts) improved precision. Alternately, when zeros are common because the density in the original sample is close to the TOD, non-zero counts are useful for making a distinction between samples where no organisms are detectable (and density might be zero) and those where the density of organisms cannot be zero. Although the actual density cannot be estimated accurately or precisely from very low counts, the distinction between “<TOD” and “>1” for a given sample is important (36).

At the other end of the range, researchers must deal with crowding and set thresholds for “too many to count.” Defining an optimal range for “countable” data is not always straightforward, and this determination is very important to ensure that CFU estimates are accurate. Since the sampling-based standard error of counts scales as 1knk, the number of colonies counted nk should be as large as reasonably possible.

However, there are consequences for pushing this too far. As cell density in the aliquot increases, counts will be biased downward due to merging of colonies and colony stunting or failure to grow. These data are then “right-censored,” with an upper limit past which the number of counts observed does not increase in proportion to an increase in the density in the original sample. Densities above this point result in “crowded” samples, with counts that are lower than the true number of CFUs. Furthermore, as the number of colonies per plate or spot increases, data collection becomes more time-consuming or requires expensive robots for automatic counting; it is common for researchers to minimize effort on plates near the top of the “acceptable” range by dividing plates into sections, counting colonies in one section, and multiplying this count by the number of sections to get an estimated final count for the whole plate. While this approach is sufficient for a rough estimate of CFU density, it introduces additional sampling variation due to both reduction in counts and imperfect division of plates, and it does not remove bias due to crowding. We will demonstrate the consequences later in this paper.

Previous works (17) have modeled crowding using shifted Poisson distributions. In these models, the variance of estimates from crowded data would be the same as if there was no crowding and the mean would be shifted down due to colonies merging together. However, this is a priori unlikely to be true. As we will show below, if colonies are crowded, both the mean and the variance will be shifted relative to the pure Poisson (uncrowded) distribution. The reason for this is that the variance of the large colony counts is shifted downward due to a “ceiling” effect—there is an upper bound to the total number of colonies, which limits upward fluctuations. In other words, the use of a shifted Poisson distribution is a reasonable approximation, but the variance must also be modified.

Better estimators: Poisson with cutoff, aka what is countable, exactly?

The main problem with the I Poisson model is that it does not account for counting errors due to crowding. The simplest way to take account of the crowding is to assume that there is a threshold of colonies, M, below which crowding is negligible, which in practice will often be smaller than the largest number of counts we are willing to attempt (TMTC). It is convenient to assume that this threshold M is the same as some TMTC threshold (e.g., the commonly referenced 250–300 colonies per plate for coliforms). However, this threshold will depend on colony size and morphology, which will vary across different bacteria and will change for a given bacterial strain depending on the media used, concentration of agar in a given batch of plates, incubation time and temperature, humidity, etc. To be certain of the validity of a chosen threshold within a standardized protocol, it is advisable to plate a finer-than-normal dilution series (two- to fivefold, to ensure multiple dilutions with readable counts) and check that dispersion within and across different “countable” dilutions for a given sample is consistent with Poisson. For densities where crowding affects counts, the data will become “right-censored” and show a decrease in variation as compared with expectations for Poisson-distributed samples.

Once the threshold M is determined, we can then segment our data into two parts: plates/spots with counts above the threshold where crowding is important, and plates/spots with counts below this threshold for which crowding is not important. If we have identified our cutoffs well, for our theoretical homogeneous sample, the Poisson estimator is correct for all measurements k where the number of colonies counted nkM. The calculation is, therefore, exactly the same as for the I Poisson estimator, with the caveat that only measurements nkM are used. Here, the indicator function I (nk < M) is 1 when nk < M, and 0 otherwise. Similarly, I (nk > M) is 1 when nk > M, and 0 otherwise. Due to its balance between simplicity and accuracy, this method is the easiest to use in practice:

rmle =kI(nkM)nkkI(nkM)Vk,σ=rmlekI(nkM)nk. (11)

If we want to incorporate data from measurements above this threshold M, the calculation becomes slightly more complicated. This method was first introduced in the literature as “averaging TNTC counts” (18, 20). Using “crowded” measurements as if they were uncrowded will bias the naive Poisson estimator downward, resulting in under-estimation of CFU density (Fig. 2). However, we can use the number of plates/spots that were above the crowding threshold M, along with the colony counts from plates/spots below this threshold at the same dilution, to estimate CFUs. This will be applicable when plate counts at a given dilution are toward the high end of the countable range, such that some technical replicates fall below this threshold and others above it by chance. To estimate the CFU density in the original sample r, the following equation should be solved numerically (see SI for the derivation):

kI(nkN)(nkrdkV)+kI(nk>N) dk dkV(dkrV)NedkrV0dkrVtNetdt =0. (12)

The first term is equivalent to the simple Poisson model and uses the counts from uncrowded samples directly, whereas the second term reflects the probability of counts being above the threshold M. Inference of r can be done in Excel using SOLVER or using numerical solvers in R, Python, MATLAB, etc. An equivalent model is shown in references (18, 20).

This model properly accounts for two error sources in counts from an individual sample: the sampling fluctuations and the crowding effect. The simple Poisson, using only counts from uncrowded plates, gives a good estimate for the CFU counts and properly combines multiple measurements at different dilution factors. The more sophisticated form of the model has greater precision, but the greater computational effort may or may not be worth it to an investigator depending on the effect size and the structure of the experiment. In the next section, we present an alternate estimator based on the most probable number approach, which we argue provides a better tradeoff between effort and estimator performance when incorporating data from crowded samples.

Crowding and the most probable number

For the final model, we consider the effects of crowding in space. To account for crowding, we will divide each plate into NAplateAcolony regions, each approximately the size of a full colony. We make the assumptions that if more than one microbe lands in one of these regions, the colonies that form from these cells will grow together and be counted as one colony. For each region, the number of cells landing in that region will be Poisson distributed with parameter λ=dkrVN.

These assumptions are equivalent to that of quantal-based methods for microbial quantification, such as the commonly used MPN method. In the MPN assay, a known quantity (volume of original sample) is introduced into each of a series of N replicate tubes, and the dilution of the original sample is adjusted to find a region where some of the tubes contain viable growth and some do not. The results of this assay are therefore, for each dilution volume Vk from the original sample, out of the Nk tubes inoculated, a number nk that is positive for growth.

A direct mapping to tube-based assays is possible if space on a plate (or within a spot) is considered as a set of colony-sized bins. Each of the N colony-sized regions on a plate or within a spot corresponds to one tube. The presence of colonies in a particular region corresponds to when a tube has growth. Hence, a plate that is divided into N regions can be thought of as N tubes being tested in parallel, cf. Fig. 1.

Fig 1.

Fig 1

Visual equivalence between plate and tube-based assays. The left panel is a cartoon of a typical plate containing colonies, where the growing colonies are shown as dark disks. In the middle panel, the plate is divided into N (here 16) approximately colony-sized regions. If a region contains one or more colony centers (black dots), this region can be mapped to a positive (dark) tube as shown in the right panel. Similarly, regions containing no colony centers are mapped to negative (light) tubes. This demonstrates that plating is equivalent to a massive parallel version of a tube-based assay with NAplateAcolony. Furthermore, it demonstrates that the MPN method can be used for plate data.

Therefore, the probability of nk successes in N colony-sized regions on the agar surface can be described using a crowding-explicit model based on the binomial distribution. Assuming that the cells in the original sample are well mixed, the probability of zero cells landing in a particular region is (from the Poisson) p0=e-dkrV/N and the probability that at least one cell lands in that region is therefore p§amp;gt;=1-e-dkrV/N. Assuming that the original sample is well mixed, each region is independent of all other regions in our crowding model, so that

p(nk)=(Nnk)p>nkp0Nnk=(Nnk)(1edkrVN)nkedkrV(Nnk). (13)

We can maximize this probability to find the MLE CFU density, rmle (see the SI for the full derivation). We can accomplish this by numerically solving the following equation for r:

kdknkN(1erdkV/N)=kdk. (14)

This expression for r is the same as that of the MPN estimator (31, 37, 38). In the SI, we show that, in the limit where concentrations and colony counts are low, this model simplifies to the Poisson model. Outside the “uncrowded” regime, the mean and the variance of data from the crowding model are not the same as in the Poisson. Therefore, the two approaches are not equal to each other, though both are depressed due to the “ceiling” effect described earlier. In the SI, we also find that the error associated with the maximum likelihood estimator rmle of the MPN method can be minimized at an optimal dilution factor, which falls into the crowded regime.

The MPN procedure can generate biased estimates of the original sample density, and the precision and accuracy of results depend strongly on the number of tubes used (18). The bias on the maximum likelihood estimator results in an over-estimate of 20%–25% with five tubes, which is reduced to a few percent with 50 tubes (see SI). By back of the envelope calculation, an average 10 cm plate (inside diameter 86 mm, surface area 58 cm2) can fit a maximum of approximately 5,000 medium-sized (1 mm outside diameter) “tubes,” whereas a single grid square on a 10 × 10 cm square plate (typically gridded 6 × 6) can fit 200 of these colony-sized spaces. All of these are well above the threshold where the bias in this estimator (39) makes much difference in the value. (Note that this refers to the number of colony-sized spaces available and is independent of the number of colonies observed.) This also means that the standard error of the estimator will, in theory, be minimized at a plating density that is much higher than the threshold for “uncrowded” plates and, in fact, is well into a range of densities where a minority of colonies will be distinct. Fortunately, the standard error is still well behaved over a broad space in fraction of regions occupied (SI), meaning that plate counts into the “uncrowded” range will still produce good estimates with this method. In fact, this produces a result equivalent to that of the Poisson method in the fully uncrowded regime. However, the MPN method is most useful as plating densities encroach into the crowded regime, allowing precise and accurate estimation of CFU density from plates that would provide severely biased estimates using a naive Poisson model.

Utility of the models

Here, we demonstrate the relative utility of each model for estimation of CFU density from simulated data. First, we can use the crowding-explicit binomial sampling model described in the previous section to estimate bias due to crowding, and to demonstrate the importance of choosing an appropriate cutoff M, below which plates are considered to be uncrowded and countable. To do so, we solve the crowded binomial model in equation 13 for dV with respect to the average number of colonies ⟨n⟩ and the number of colony-sized regions on a plate N. Doing so, we find dV=-Nrlog(1-nN). We can substitute this into the Poisson estimator and find

rp=ndV=nNrln(1nN)=rnNln(1nN). (15)

Let us define the ratio of expected colony number to the number of colony-sized regions as f=nN. This ratio represents the amount of crowding, where a value of 1 is the maximum crowding and a value close to zero is in the uncrowded regime. Expressing the previous expression in terms of the crowding, we see

rpr=fln(1f). (16)

This ratio indicates how close the estimated CFU concentration is to the true concentration. A ratio of 1 tells us that we have an unbiased estimator, whereas a ratio of less than 1 tells us we are under-estimating the CFU density. We plot this expression in Fig. 2 to show how the simple Poisson estimator under-estimates the actual concentration as a function of crowding, f. After a crowding value of f = 0.2, the Poisson estimator starts to be significantly biased, undershooting the true value by about 10%. This has implications for the value used in the Poisson model with a cutoff. The cutoff should be chosen such that the bias is not greater than the experimenters targeted precision. For example, if a bias must be less than 10%, then a cutoff of about 20% of the total plate capacity should be used. In the case of a 10 cm plate with an estimated 5,000 1 mm diameter colony-sized regions, this corresponds to a cutoff of M = 1,000, whereas the more typical cutoff of M = 300 provides an essentially unbiased estimate (bias 3%), but this results in a large statistical fluctuation of 5.8%. In the case of a 6 by 6 mm grid on a 10 cm by 10 cm plate, there are roughly 200 grid regions in a plate. Thus, an M = 40 would be appropriate to achieve bias less than 10%, and a threshold of M = 12 colonies is required to reduce bias to 3% for colonies of this size. At these thresholds, the statistical error would be 15.8%.

Fig 2.

Fig 2

Bias and under-estimation of the true bacterial concentration as a function of crowding, illustrated using the Poisson estimator. We illustrate this by plotting the ratio of the estimated concentration (with the error bands denoting ±1 SEM at N = 5,000) to the true concentration. Here, crowding is measured by the ratio of the average number of colonies to the maximum number of colonies that can fit within a plate f=nN. At low crowding values (encompassing the conventionally recommended 25–250 colonies per plate), the naive estimator has low bias, but large uncertainty. At a crowding value of 0.2 (∼1,000 colonies on a 10 cm plate—an ambitious task, and not recommended), the naive Poisson estimator under-estimates the true concentration by about 10%, and many-fold under-estimation is possible as crowding approaches 1.

To compare the performance of the different estimators discussed here, we simulated 1,000 experiments and applied each of our estimators to the resulting data. Data for each experiment was modeled using the binomial crowding model with r = 105, V = 0.2, N = 5,000, and dilution values (0.1, 0.1, 0.01, 0.01, 0.001, 0.001). This corresponds to two replicates for each dilution in a 10-fold dilution experiment. An example set of colony counts corresponding to these dilutions is (1,705, 1,629, 196, 181, 21, 21). The first two dilutions are in the over-crowded regime and the last two dilutions are in the dilute uncrowded regime. The traditional methods (“pick-the-best,” averaging, segment averaging) and Poisson with a cutoff will discard the first two counts as too many to count, while the other methods will use their numeric values. The resulting distributions are plotted in Fig. 3.

Fig 3.

Fig 3

The probability distributions of estimated CFU concentrations from different estimators generated from 1,000 independent numerical experiments with dilutions 0.1, 0.1, 0.01, 0.01, 0.001, 0.001, r = 100,000, V = 0.2, N = 5,000. Here, the segmented-plate average (one-quarter of the plate is counted), naive Poisson, “pick the best,” traditional average, Poisson with cutoff, and MPN methods are compared. The MPN method demonstrates the best combination of high precision and accuracy.

The results show that the MPN method is unbiased and has the highest degree of accuracy. The Poisson with a cutoff (which always discards counts from the least-diluted samples in these outputs) is nearly unbiased, whereas the naive Poisson (which unrealistically uses all colony counts and included here only for comparison) is biased down due to inclusion of “crowded” data. This illustrates the relative costs of discarding data and using these data inappropriately. The naive Poisson has a similar variance as that of the MPN because both are using all the data points. However, the measure around which the naive Poisson estimator varies is incorrect due to this bias. With the Poisson estimator, increasing accuracy comes at a cost in precision; the Poisson with cutoff has roughly twice the standard error of the MPN method due to the fact that it does not use all the data and throws out the first two counts of each experiment. Next, the traditional averaging method (33) has roughly five times the standard error of the MPN method, due to the fact that it gives lower-precision measurements the same weight as higher-precision large counts in the uncrowded regime. However, it is unbiased. If there are technical replicates, pick-the-best (choosing the largest number of counts in the countable range, over multiple technical replicates at each dilution) is a biased estimator (over-estimating CFUs) and has a standard error roughly 10 times that of the MPN method. Pick-the-best where the best count from each technical replicate is used is equivalent to Poisson with a cutoff, with some loss of precision due to discarding of small counts. Segment averaging (here, counting one-quarter of the plate, and assuming perfect segmenting such that exactly one-quarter of the colonies are counted) resulted in an unbiased estimator with the largest standard error, roughly 13 times the standard error of the MPN method.

These simulations show that the MPN method produces the most precise results and is unbiased. However, the Poisson with a cutoff is a close second, also with high accuracy and precision and with the advantage of being practical to calculate by hand. The bias of the naive Poisson (using all data) serves as a warning: if counts are not in the uncrowded regime, the Poisson assumptions do not apply, and an estimator using only number of colonies counted at each dilution will under-estimate the CFU density in the original sample. Other standard estimators (averaging, segment averaging) using the same data required for the Poisson estimator show universally poorer precision than Poisson with a cutoff and cannot be recommended.

DISCUSSION

We have discussed commonly used point estimator methods for estimating CFUs in a single sample from dilution series data, and we have presented a new method based on the MPN framework. The methods overview given here, while far from comprehensive (19, 4044), is intended as a practical introduction to sampling errors in count-based measurements, particularly for researchers outside environmental and food microbiological surveillance who are likely to have had little if any exposure to existing literature.

We have focused here on technical rationales for choosing a point estimator for CFU density, but it is important to emphasize that the research question and the data taken must be suitable. We assume here that the researcher wishes to obtain a point estimate of bacterial density in each sample, with high accuracy and precision so that comparisons between samples can be made. The Poisson-based estimator used throughout this paper assumes well-mixed, homogeneous samples with high culturability, which may not be true even for shaken broth cultures. This assumption should be verified for the sample type to be used. Departures from the Poisson will appear as deviations from expected dispersal of counts (18, 23), as indicated in the text. If measurements across technical replicates are not Poisson, this may be correctable. For example, if bacteria are known or suspected to clump, proper shaking (6) and/or use of a surfactant like Tween 80 (45, 46) can help to disperse aggregates. As always, any such protocols should be validated under the conditions where they are to be used.

In practice, the choice of point estimator will depend on the precision required for the estimate of CFU density, as well as the tradeoffs between experimental repetition and analytical complexity that a researcher is willing to make. We have provided a summary of the strengths and weaknesses of all methods in Table 1. Traditional pick-the-best estimators are fine for quick imprecise measurements; however, this method has the largest standard error as it does not use a majority of the data collected. Other methods can use more of data collected to provide more precise point estimates of the concentration with smaller standard error. For experiments with reasonably large expected effect size, the simplest mathematically admissible method—the Poisson estimator with a cutoff—is perfectly valid, as long as the dilutions are chosen appropriately to ensure all measurements are in the countable range. Broadly speaking, addition of unbiased data will improve the precision of an estimator. Historically, technical replicates have been used for this purpose—even technical duplication is sufficient to markedly reduce variance of the estimated CFU density, although triplicate plating is preferred to safeguard against accidents and outliers (28) (also see SI). The Poisson model allows data from technical replicates to be combined into a single mathematically interpretable point estimator with definable properties—specifically, a maximum likelihood estimator, which should be an unbiased and minimally variable estimator for the true value. This is as opposed to averaging (33), which produces an estimate whose properties are not well defined. The Poisson method also allows the investigator to incorporate data from dilutions with too few counts, in addition to (not in place of) data from countable wells in the same dilution series—by effectively re-weighting the contribution of these wells by the total volume of original suspension that they contain, these data can be used to improve the accuracy of the estimator even though their sampling variance is high. We have presented several methods for estimating CFUs and we have provided a calculator for these estimators available on Hugging Face spaces, named CFUestimator (30).

TABLE 1.

Table summarizing the strengths and weaknesses of each estimator along with their appropriate regions of validity

Estimator name Strengths and weaknesses
Pick the best This traditional estimator is simple to understand and calculate but has large standard error.
Naive Poisson The naive Poisson is valid at low crowding, but if used with high crowding data can produce a strong bias. It can combine data across multiple different dilutions.
Poisson with cutoff The Poisson with a cutoff is easy to calculate by hand. It can combine data from different dilutions. The resulting bias of the estimator can be controlled by setting the cutoff and is sufficiently small when data are uncrowded.
MPN The MPN method is valid across all crowding levels and uses all available data. It can combine multiple experiments. The same method can be used for colony counts on plates and in tubes, viewing patches of a plate as equivalent to individual tubes. The method requires either a computer program or table to calculate. It produces an unbiased estimator and has the smallest standard error of all discussed methods.

The correspondence shown here between using tubes and gridding a plate into subsections based on colony area allows the usage of estimator techniques typically used for quantal-based measurements of CFU density, specifically the MPN, where positive growth events (e.g., colonies) are explicitly considered to represent one or more originating cells. These techniques have a long history in environmental surveillance microbiology, and statistically well-founded techniques are readily available for analysis of such data (2, 47, 48). If an experimentalist wants tighter bounds for an estimated CFU count, the MPN provides a very low-variance, unbiased estimator at the cost of some extra steps. This estimator allows the experimentalist to incorporate data from normally uncountable (TMTC) plates as well as counts from uncrowded plates, maximizing the amount of information that can be gleaned from a dilution series.

The MPN model requires an estimate of the maximum number of colonies that can be packed into the growth area for each sample; we show (SI) that it is better to over-estimate this maximum than to under-estimate it. If the patch size on a plate is correctly chosen to be around the size of a typical colony, even a spot-plating assay on a 10 by 10 cm plate is equivalent to running hundreds of tubes in parallel. Furthermore, it is necessary to estimate the number of occupied regions in the growth area. In or near the uncrowded regime, this will be equivalent to the number of counts. However, this method does not require that all colonies are individually countable—instead, image analysis (4951) can be used to estimate both the size of an individual colony and the fraction of total area occupied by colony growth. The MPN estimator can therefore potentially provide accurate, precise estimates of CFU density for plates where exact counts cannot be obtained. However, colony size varies across different microorganisms as well as across culture conditions (media type, agar percentage, pad thickness, plate drying time and conditions, growth temperature and atmosphere, etc.) and incubation time on plates, meaning that the size range of colonies may be different even across plates within a single experiment (52, 53). This added complication of properly choosing a grid size or determining the typical size of a colony means that application of the MPN will most likely require parameters estimated for the specific experiment being analyzed. Furthermore, the fact that colony size can decrease under crowding means that heavily crowded plates or plate regions, where few or no distinct colonies are visible, may have very different “average” colony sizes than the same microbes in a less-crowded area. While theory suggests that the MPN estimator will be most precise when the majority of colony-sized locations are occupied [(54), also see SI], this practical limitation suggests that use of the MPN on plate count data will become less accurate with extremes of crowding, and that the best use of the MPN is likely to be in the liminal region between the technically uncrowded and the physically uncountable, where most to all growth is in the form of distinct, countable colonies but crowding produces a measurable bias in these counts.

MATERIALS AND METHODS

Simulations were coded in Python. We provided a custom calculator for all estimators analyzed in this work on Hugging Face spaces, named CFUestimator (30).

ACKNOWLEDGMENTS

This work was funded, in part, by NSF grant no. 2014173. I.N. and K.M.M. were additionally supported by the Simons Foundation Investigator program grant to I.N.

Contributor Information

K. Michael Martini, Email: karl.michael.martini@emory.edu.

Nic M. Vega, Email: nvega@emory.edu.

Stephan Schmitz-Esser, Iowa State University, Ames, Iowa, USA.

SUPPLEMENTAL MATERIAL

The following material is available online at https://doi.org/10.1128/spectrum.03946-23.

Supplemental material. spectrum.03946-23-s0001.pdf.

Maximum likelihood estimators for CFU concentration.

DOI: 10.1128/spectrum.03946-23.SuF1

ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.

REFERENCES

  • 1. McCrady MH. 1915. The numerical interpretation of fermentation-tube results. J Infect Dis 17:183–212. doi: 10.1093/infdis/17.1.183 [DOI] [Google Scholar]
  • 2. Cochran WG. 1950. Estimation of bacterial densities by means of the "most probable number" Biometrics 6:105–116. [PubMed] [Google Scholar]
  • 3. Breed RS, Dotterrer WD. 1916. The number of colonies allowable on satisfactory agar plates. J Bacteriol 1:321–331. doi: 10.1128/jb.1.3.321-331.1916 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Fisher R, Thornton H, Mackenzie W. 1922. The accuracy of the plating method of estimating the density of bacterial populations: with particular reference to the use of thornton’s agar medium with soil samples. Ann Appl Biol 9:325. doi: 10.1111/j.1744-7348.1922.tb05962.x [DOI] [Google Scholar]
  • 5. Halvorson HO, Ziegler NR. 1933. Application of statistics to problems in bacteriology: I. A means of determining bacterial population by the dilution method. J Bacteriol 25:101–121. doi: 10.1128/jb.25.2.101-121.1933 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Jennison MW, Wadsworth GP. 1940. Evaluation of the errors involved in estimating bacterial numbers by the plating method. J Bacteriol 39:389–397. doi: 10.1128/jb.39.4.389-397.1940 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Jones PCT, Mollison JE, Quenouille MH. 1948. A technique for the quantitative estimation of soil micro-organisms: with a statistical note by. Microbiology 2:54–69. doi: 10.1099/00221287-2-1-54 [DOI] [Google Scholar]
  • 8. Skinner FA, Jones PC, Mollison JE. 1952. A comparison of a direct- and a plate counting technique for the quantitative estimation of soil micro-organisms. J Gen Microbiol 6:261–271. doi: 10.1099/00221287-6-3-4-261 [DOI] [PubMed] [Google Scholar]
  • 9. Johnson EA, Brown BW. 1961. The Spearman estimator for serial dilution assays. Biometrics 17:79. doi: 10.2307/2527497 [DOI] [Google Scholar]
  • 10. Taylor J. 1962. The estimation of numbers of bacteria by tenfold dilution series. J Appl Bacteriol 25:54–61. doi: 10.1111/j.1365-2672.1962.tb01119.x [DOI] [Google Scholar]
  • 11. Harris RF, Sommers LE. 1968. Plate-dilution frequency technique for assay of microbial ecology. Appl Microbiol 16:330–334. doi: 10.1128/am.16.2.330-334.1968 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Thomas HA, Woodward RL. 1955. Estimation of coliform density by the membrane filter and the fermentation tube methods. Am J Public Health Nations Health 45:1431–1437. doi: 10.2105/ajph.45.11.1431 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Pipes WO, Ward P, Ahn SH. 1977. Frequency distributions for coliform bacteria in water. J AWWA 69:664–668. doi: 10.1002/j.1551-8833.1977.tb06847.x [DOI] [Google Scholar]
  • 14. Horvitz DG, Thompson DJ. 1952. A generalization of sampling without replacement from a finite universe. J Am Stat Assoc 47:663–685. doi: 10.1080/01621459.1952.10483446 [DOI] [Google Scholar]
  • 15. Hurley MA, Roscoe ME. 1983. Automated statistical analysis of microbial enumeration by dilution series. J Appl Bacteriol 55:159–164. doi: 10.1111/j.1365-2672.1983.tb02660.x [DOI] [Google Scholar]
  • 16. Kirchman D, Sigda J, Kapuscinski R, Mitchell R. 1982. Statistical analysis of the direct count method for enumerating bacteria. Appl Environ Microbiol 44:376–382. doi: 10.1128/aem.44.2.376-382.1982 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Ben-David A, Davidson CE. 2014. Estimation method for serial dilution experiments. J Microbiol Methods 107:214–221. doi: 10.1016/j.mimet.2014.08.023 [DOI] [PubMed] [Google Scholar]
  • 18. Haas N, Rose JB, Gerba CP. 2014. Exposure assessment, p 159–234. In Quantitative microbial risk assessment. John Wiley & Sons, Ltd. [Google Scholar]
  • 19. Christen JA, Parker AE. 2020. Systematic statistical analysis of microbial data from dilution series. J Agric Biol Environ Stat 25:339–364. doi: 10.1007/s13253-020-00397-0 [DOI] [Google Scholar]
  • 20. Haas CN, Heller B. 1988. Averaging of TNTC counts. Appl Environ Microbiol 54:2069–2072. doi: 10.1128/aem.54.8.2069-2072.1988 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Hedges AJ. 2002. Estimating the precision of serial dilutions and viable bacterial counts. Int J Food Microbiol 76:207–214. doi: 10.1016/s0168-1605(02)00022-3 [DOI] [PubMed] [Google Scholar]
  • 22. Corry JEL, Jarvis B, Passmore S, Hedges A. 2007. A critical review of measurement uncertainty in the enumeration of food micro-organisms. Food Microbiol 24:230–253. doi: 10.1016/j.fm.2006.05.003 [DOI] [PubMed] [Google Scholar]
  • 23. El-Shaarawi AH, Esterby SR, Dutka BJ. 1981. Bacterial density in water determined by Poisson or negative binomial distributions. Appl Environ Microbiol 41:107–116. doi: 10.1128/aem.41.1.107-116.1981 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Havelaar AH, Hogeboom WM, Sekhuis VM, van Erne EH. 1987. Proficiency testing of water microbiology laboratories in The Netherlands. J Appl Bacteriol 62:555–564. doi: 10.1111/j.1365-2672.1987.tb02689.x [DOI] [PubMed] [Google Scholar]
  • 25. Luria SE, Delbrück M. 1943. Mutations of bacteria from virus sensitivity to virus resistance. Genetics 28:491–511. doi: 10.1093/genetics/28.6.491 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Haas CN, Heller B. 1988. Test of the validity of the Poisson assumption for analysis of most-probable-number results. Appl Environ Microbiol 54:2996–3002. doi: 10.1128/aem.54.12.2996-3002.1988 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Jarvis B, Hedges AJ, Corry JEL. 2012. The contribution of sampling uncertainty to total measurement uncertainty in the enumeration of microorganisms in foods. Food Microbiol 30:362–371. doi: 10.1016/j.fm.2012.01.002 [DOI] [PubMed] [Google Scholar]
  • 28. Weenk GH. 1992. Microbiological assessment of culture media: comparison and statistical evaluation of methods. Int J Food Microbiol 17:159–181. doi: 10.1016/0168-1605(92)90113-h [DOI] [PubMed] [Google Scholar]
  • 29. Sutton S. 2011. Accuracy of plate counts. J Validation Technol 17:42–46. [Google Scholar]
  • 30. Martini KM. 2023. Calculate colony forming units (CFUs). Available from: https://huggingface.co/spaces/KMichaelMartini/CFUestimator
  • 31. Blodgett R. 2020. BAM appendix 2: most probable number from serial dilutions. FDA. Available from: https://www.fda.gov/food/laboratory-methods- food/bam-appendix-2-most-probable-number-serial-dilutions [Google Scholar]
  • 32. Eisenhart C, Wilson PW. 1943. Statistical methods and control in bacteriology. Bacteriol Rev 7:57–137. doi: 10.1128/br.7.2.57-137.1943 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. USDA . 2015. Quantitative analysis of bacteria in foods as sanitary indicators. USDA. Available from: https://www.fsis.usda.gov/sites/default/files/media_ file/2021-03/MLG-3.pdf [Google Scholar]
  • 34. Canales RA, Wilson AM, Pearce-Walker JI, Verhougstraete MP, Reynolds KA. 2018. Methods for handling left-censored data in quantitative microbial risk assessment. Appl Environ Microbiol 84:e01203-18. doi: 10.1128/AEM.01203-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Gijbels I. 2010. Censored data. WIREs Comput Stats 2:178–188. doi: 10.1002/wics.80 [DOI] [Google Scholar]
  • 36. EPA C. 2014. Evaluation of options for interpreting environmental microbiology field data results having low spore counts. Environmen- tal Protection Agency [Google Scholar]
  • 37. Alexander M. 1983. Most probable number method for microbial populations. Methods Soil Anal Part 2 Chem Microbiol Prop 9:815–820. doi: 10.2134/agronmonogr9.2 [DOI] [Google Scholar]
  • 38. Clarke KR, Owens NJP. 1983. A simple and versatile microcomputer program for the determination of ‘most probable number’. J Microbiol Methods 1:133–137. doi: 10.1016/0167-7012(83)90031-3 [DOI] [Google Scholar]
  • 39. Salama IA, Koch GG, Tolley DH. 1978. On the estimation of the most probable number in a serial dilution experiment. Commun Stat - Theory Methods 7:1267–1281. doi: 10.1080/03610927808827710 [DOI] [Google Scholar]
  • 40. Tremaine SC, Mills AL. 1987. Tests of the critical assumptions of the dilution method for estimating bacterivory by microeucaryotes. Appl Environ Microbiol 53:2914–2921. doi: 10.1128/aem.53.12.2914-2921.1987 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Gronewold AD, Wolpert RL. 2008. Modeling the relationship between most probable number (MPN) and colony-forming unit (CFU) estimates of fecal coliform concentration. Water Res 42:3327–3334. doi: 10.1016/j.watres.2008.04.011 [DOI] [PubMed] [Google Scholar]
  • 42. Jarvis B, Hedges AJ, Corry JEL. 2007. Assessment of measurement uncertainty for quantitative methods of analysis: comparative assessment of the precision (uncertainty) of bacterial colony counts. Int J Food Microbiol 116:44–51. doi: 10.1016/j.ijfoodmicro.2006.12.037 [DOI] [PubMed] [Google Scholar]
  • 43. Jarvis B, Corry JEL, Hedges AJ. 2007. Estimates of measurement uncertainty from proficiency testing schemes, internal laboratory quality monitoring and during routine enforcement examination of foods. J Appl Microbiol 103:462–467. doi: 10.1111/j.1365-2672.2006.03258.x [DOI] [PubMed] [Google Scholar]
  • 44. Jarvis B. 2016. Statistical aspects of the microbiological examination of foods. Academic Press. [Google Scholar]
  • 45. Fenner F. 1951. The enumeration of viable tubercle bacilli by surface plate counts. Am Rev Tuberc 64:353–380. doi: 10.1164/art.1951.64.4.353 [DOI] [PubMed] [Google Scholar]
  • 46. Hassan AN, Frank JF. 2003. Influence of surfactant hydrophobicity on the detachment of Escherichia coli O157:H7 from lettuce. Int J Food Microbiol 87:145–152. doi: 10.1016/s0168-1605(03)00062-x [DOI] [PubMed] [Google Scholar]
  • 47. Garthright WE. 1993. Bias in the logarithm of microbial density estimates from serial dilutions. Biometrical J 35:299–314. doi: 10.1002/bimj.4710350306 [DOI] [Google Scholar]
  • 48. Loyer MW, Hamilton MA. 1984. Interval estimation of the density of organisms using a serial-dilution experiment. Biometrics 40:907–916. doi: 10.2307/2531142 [DOI] [PubMed] [Google Scholar]
  • 49. Chiang PJ, Tseng MJ, He ZS, Li CH. 2015. Automated counting of bacterial colonies by image analysis. J Microbiol Methods 108:74–82. doi: 10.1016/j.mimet.2014.11.009 [DOI] [PubMed] [Google Scholar]
  • 50. Brugger SD, Baumberger C, Jost M, Jenni W, Brugger U, Mühlemann K. 2012. Automated counting of bacterial colony forming units on agar plates. PLoS One 7:e33695. doi: 10.1371/journal.pone.0033695 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Bewes JM, Suchowerska N, McKenzie DR. 2008. Automated cell colony counting and analysis using the circular Hough image transform algorithm (CHiTA). Phys Med Biol 53:5991–6008. doi: 10.1088/0031-9155/53/21/007 [DOI] [PubMed] [Google Scholar]
  • 52. Savage GM, Halvorson HO. 1941. The effect of culture environment on results obtained with the dilution method of determining bacterial population. J Bacteriol 41:355–362. doi: 10.1128/jb.41.3.355-362.1941 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Chacón JM, Möbius W, Harcombe WR. 2018. The spatial and metabolic basis of colony size variation. ISME J 12:669–680. doi: 10.1038/s41396-017-0038-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Strijbosch LW, Buurman WA, Does RJ, Zinken PH, Groenewegen G. 1987. Limiting dilution assays: experimental design and statistical analysis. J Immunol Methods 97:133–140. doi: 10.1016/0022-1759(87)90115-3 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental material. spectrum.03946-23-s0001.pdf.

Maximum likelihood estimators for CFU concentration.

DOI: 10.1128/spectrum.03946-23.SuF1

Articles from Microbiology Spectrum are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES