Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 1997 May 27;94(11):5498–5500.

The Evaluation of Forensic DNA Evidence*

PMCID: PMC33668  PMID: 9190606

Abstract

The following is an excerpt from the Executive Summary of the National Research Council Report.

INTRODUCTION

Nearly a decade has passed since DNA typing methods were first used in criminal investigations and trials. Law enforcement agencies have committed substantial resources to the technology; prosecutors, defense counsel, and judges have struggled with the terminology and ideas of molecular biology, genetics, and statistics. In 1992, a broad-ranging report released by the National Research Council attempted to explain the basics of the relevant science and technology, to offer suggestions for improving forensic DNA testing and its use in law enforcement, and to quiet the controversy that had followed the introduction of DNA profiling in court. Yet, the report did not eliminate all controversy. Indeed, in propounding what the committee regarded as a moderate position—the ceiling principle and the interim ceiling principle—the report itself became the target of criticism from scientists and lawyers on both sides of the debate on DNA evidence in the courts. Moreover, some of the statements in the 1992 report have been misinterpreted or misapplied in the courts.

This committee was formed to update and clarify discussion of the principles of population genetics and statistics as they apply to DNA evidence. Thus, this second report is much narrower than the 1992 report. Issues such as confidentiality and security, storage of samples for future use, the desirability and legality of data banks on convicted felons, and the international exchange of information are not in our charge. Rather, this report deals mainly with the computation of probabilities used to evaluate the implications of DNA test results that incriminate suspects. It focuses on situations where the DNA profile of a suspect (or sometimes a victim) apparently matches that of the evidence DNA. (We use the phrase “evidence DNA” to refer to the sample of biological material, such as blood or semen, usually taken from the crime scene or from the victim). The central question that the report addresses is this: What information can a forensic scientist, population geneticist, or statistician provide to assist a judge or jury in drawing inferences from the finding of a match?

To answer this question, the committee reviewed the scientific literature and the legal cases and commentary on DNA profiling, and it investigated the various criticisms that have been voiced about population data, statistics, and laboratory error. Much has been learned since the last report. The technology for DNA profiling and the methods for estimating frequencies and related statistics have progressed to the point where the reliability and validity of properly collected and analyzed DNA data should not be in doubt. The new recommendations presented here should pave the way to more effective use of DNA evidence.

This report describes both the science behind DNA profiling and the data on the frequency of profiles in the human populations, and it recommends procedures for providing various statistics that may be useful in the courtroom. The procedures are based on population genetics and statistics, and they render the ceiling principle and the interim ceiling principle unnecessary.

This Executive Summary outlines the structure and contents of the full report, and it gives the recommendations. This summary does not constitute a complete exposition, and it is no substitute for a careful reading of the chapters that follow. As the report will reveal, the committee agrees with many recommendations of the 1992 report but disagrees with others. Since the committee has not attempted to review all the statements and recommendations in the 1992 report, the lack of discussion of any statement should not be interpreted as either endorsing or rejecting that statement.

CONTENTS OF THE REPORT

Overview.

The report begins with an extended summary of the chapters that make up the full report. This overview describes the essentials of the subject with the minimum of jargon, statistics, and technical details, and it includes a numerical example that illustrates how the procedures that are discussed and recommended would apply in a typical case. The main report offers fuller explanations, details, and justifications.

Chapter 1.

The first chapter describes the 1992 report, the changes since that report, the uses and validity of DNA typing, differences between DNA typing in criminal cases and in civil paternity litigation, reasons for seemingly contradictory probability estimates that different experts sometimes present in court, and the committee’s approach to the issue of “population structure.”

Chapter 2.

The second chapter describes the genetic and molecular basis of DNA typing. It introduces the fundamental concepts of genetics, and it surveys the genetic systems and the technologies used in DNA profiling.

Chapter 3.

The third chapter concerns laboratory performance. Although our focus is on the statistics that can be used to characterize the significance or implications of a match between two DNA samples, these statistics do not float in a vacuum. They relate to specific claims or hypotheses about the origin of the DNA samples. If DNA from an evidence sample and DNA from a suspect share a profile that has a low frequency in the population, this suggests that the samples came from the same person; the lower the frequency, the stronger the evidence. But the possibility remains that the match is only apparent—that an error has occurred and the profiles differ from what the laboratory has reported. This chapter describes ways that errors can arise and how their occurrence might be minimized. It contains recommendations regarding quality control and assurance, laboratory accreditation, proficiency tests, and confirmatory testing.

Chapter 4.

Much of the controversy about the forensic use of DNA has involved population genetics. The fourth chapter explains the generally applicable principles, then considers the implications of the fact that the population of the United States includes different groups and subgroups with different mixes of genes. The chapter develops and illustrates procedures for taking this fact into account in computing random-match probabilities for an incriminating DNA profile in a population or a subgroup of a population.

Chapter 5.

The fifth chapter considers how the estimated frequency of an incriminating DNA profile relates to conclusions about the source of the DNA in the evidence sample. It discusses how the frequencies are interpreted as probabilities and related quantities, the degree of uncertainty in such estimates, and the type of calculations that might indicate that a profile is unique. It concludes that the abundance of data in different ethnic groups within the major races and the methods outlined in Chapters 4 and 5 imply that the 1992 report’s suggested ceiling principle and interim ceiling principle are unnecessary. In addition, it makes recommendations to help assure the accuracy of estimates for what are known as VNTR profiles and to handle the special situation in which the suspect was identified as a result of a search through a database of DNA profiles of known offenders.

Chapter 6.

The sixth and final chapter discusses the legal implications of the conclusions and recommendations. It describes the most important legal rules that affect the use of DNA evidence, identifies the questions of scientific fact that have been disputed in court, reviews case law on the admissibility of DNA evidence, and explains how the conclusions and recommendations might be used in applying and developing the law. The report makes no recommendations on matters of legal policy, but it does suggest that the formulation of such policy might be assisted by behavioral research into the various ways that DNA test results can be presented in the courtroom.

Appendices.

A glossary of scientific terms is provided at the end of this report.

RECOMMENDATIONS

Major conclusions and recommendations are given at the end of the chapter in which the subject is discussed. For convenience, the report also lists them as a group at the end of the overview. This Executive Summary lists the recommendations only.

Recommendations to Improve Laboratory Performance

Recommendation 3.1.

Laboratories should adhere to high quality standards (such as those defined by the Technical Working Group on DNA Analysis Methods and the DNA Advisory Board) and make every effort to be accredited for DNA work (by such organizations as the American Society of Crime Laboratory Directors—Laboratory Accreditation Board).

Recommendation 3.2.

Laboratories should participate regularly in proficiency tests, and the results should be available for court proceedings.

Recommendation 3.3.

Whenever feasible, forensic samples should be divided into two or more parts at the earliest practicable stage and the unused parts retained to permit additional tests. The used and saved portions should be stored and handled separately. Any additional tests should be performed independently of the first by personnel not involved in the first test and preferably in a different laboratory.

Recommendations for Estimating Random-Match Probabilities

Recommendation 4.1.

In general, the calculation of a profile frequency should be made with the product rule. If the race of the person who left the evidence-sample DNA is known, the database for the person’s race should be used; if the race is not known, calculations for all the racial groups to which possible suspects belong should be made. For systems such as VNTRs, in which a heterozygous locus can be mistaken for a homozygous one, if an upper bound on the frequency of the genotype at an apparently homozygous locus (single band) is desired, then twice the allele (bin) frequency, 2p, should be used instead of p2. For systems in which exact genotypes can be determined, p2 + p(1 − p) Inline graphic should be used for the frequency at such a locus instead of p2. A conservative value of Inline graphic for the U.S. population is 0.01; for some small, isolated populations, a value of 0.03 may be more appropriate. For both systems, 2pipj should be used for heterozygotes.

Recommendation 4.2.

If the particular subpopulation from which the evidence sample came is known, the allele frequencies for the specific subgroup should be used as described in Recommendation 4.1. If allele frequencies for the subgroup are not available, although data for the full population are, then the calculations should use the population-structure equations 4.10 for each locus, and the resulting values should then be multiplied.

Recommendation 4.3.

If the person who contributed the evidence sample is from a group or tribe for which no adequate database exists, data from several other groups or tribes thought to be closely related to it should be used. The profile frequency should be calculated as described in Recommendation 4.1 for each group or tribe.

Recommendation 4.4.

If the possible contributors of the evidence sample include relatives of the suspect, DNA profiles of those relatives should be obtained. If these profiles cannot be obtained, the probability of finding the evidence profile in those relatives should be calculated with formula 4.8 or 4.9.

Recommendations on Interpreting the Results of Database Searches, on Binning, and on Establishing the Uniqueness of Profiles

Recommendation 5.1.

When the suspect is found by a search of DNA databases, the random-match probability should be multiplied by n, the number of persons in the database.

Recommendation 5.2.

If floating bins are used to calculate the random-match probabilities, each bin should coincide with the corresponding match window. If fixed bins are employed, then the fixed bin that has the largest frequency among those overlapped by the match window should be used.

Recommendation 5.3.

Research into the identification and validation of more and better marker systems for forensic analysis should continue with a view to making each profile unique.

Recommendation on Research on Juror Comprehension

Recommendation 6.1.

Behavioral research should be carried out to identify any conditions that might cause a trier of fact to misinterpret evidence on DNA profiling and to assess how well various ways of presenting expert testimony on DNA can reduce any such misunderstandings.

GLOSSARY

Allele one of two or more alternative forms of a gene

Chromosome the structure by which hereditary information is physically transmitted from one generation to the next; the organelle that carries the genes

Deoxyribonucleic acid (DNA) the genetic material of organisms, usually double-stranded—composed of two complementary chains of nucleotides in the form of a double helix; a class of nucleic acids characterized by the presence of the sugar deoxyribose and the four bases adenine, cytosine, guanine, and thymine

Diploid having two sets of chromosomes, in pairs

DNA deoxyribonucleic acid

DNA databank (database) a collection of DNA typing profiles of selected or randomly chosen individuals

DNA probe a short segment of single-stranded DNA labeled with a radioactive or chemical tag that is used to detect the presence of a particular DNA sequence through hybridization to its complementary sequence

Gene the basic unit of heredity; a sequence of DNA nucleotides on a chromosome

Genotype the genetic makeup of an organism, as distinguished from its physical appearance or phenotype

Heterozygote a diploid organism that carries different alleles at one or more genetic loci on its homologous chromosomes

Heterozygous having different alleles at a particular locus; for most forensic DNA probes, the autoradiogram displays two bands if the person is heterozygous at the locus

Homozygote a diploid organism that carries identical alleles at one or more genetic loci on its homologous chromosomes

Homozygous having the same allele at a particular locus; for most forensic DNA probes, the autoradiogram displays a single band if the person is homozygous at the locus

Locus (plural loci) the specific physical location of a gene on a chromosome

Marker a gene with a known location on a chromosome and a clear-cut phenotype that is used as a point of reference in the mapping of other loci

PCR polymerase chain reaction

Polymerase chain reaction (PCR) an in vitro process that yields millions of copies of desired DNA through repeated cycling of a reaction that involves the enzyme DNA polymerase

Population a group of individuals occupying a given area at a given time

Proficiency tests tests to evaluate the competence of technicians and the quality performance of a laboratory; in open tests, the technicians are aware that they are being tested, but in blind tests, they are not aware; internal proficiency tests are conducted by the laboratory itself, and external tests are conducted by an agency independent of the laboratory being tested

Variable number of tandem repeats (VNTR) repeating units of a DNA sequence for which the number varies between individuals

VNTR variable number of tandem repeats

NATIONAL RESEARCH COUNCIL

Virtual Commission

Commission on DNA Forensic Science: An Update

Thomas D. Pollard, Chair, The Johns Hopkins University,  Baltimore, Maryland

Peter J. Bickel, University of California, Berkeley

Michael T. Clegg, University of California, Riverside

Burton Singer, Princeton University, Princeton, New Jersey

NRC Staff

Paul Gilman, Executive Director

Committee on DNA Forensic Science: An Update

James F. Crow, Ph.D.,§ (Chair)

Professor Emeritus of Genetics

University of Wisconsin

Madison, WI

Margaret A. Berger, J.D.

Professor of Law

Brooklyn Law School

Brooklyn, NY

Shari S. Diamond, J.D., Ph.D.

Professor of Psychology and Criminal Justice

University of Illinois

Chicago, IL

David H. Kaye, J.D.

Regents’ Professor of Law

College of Law

Arizona State University

Tempe, AZ

Haig H. Kazazian, Jr., M.D.§

Chairman of the Department of Genetics and

Seymour Gray Professor of Molecular Medicine  in Genetics

University of Pennsylvania School of Medicine

Philadelphia, PA

Arno G. Motulsky, M.D.,§

Professor of Medicine and Genetics

University of Washington

Seattle, WA

Thomas A. Nagylaki, Ph.D.

Professor of Ecology and Evolution and of Genetics

University of Chicago

Chicago, IL

Masatoshi Nei, Ph.D.

Evan Pugh Professor of Biology and

Director Institute of Molecular Evolutionary Genetics

Pennsylvania State University

University Park, PA

George F. Sensabaugh, Jr., D.Crim.

Professor of Forensic Science and Biomedical Sciences

School of Public Health

University of California

Berkeley, CA

David O. Siegmund, Ph.D.

Professor of Statistics and

Associate Dean for the Natural Sciences

Stanford University

Stanford, CA

Stephen M. Stigler, Ph.D.

Ernest DeWitt Burton Distinguished Service Professor

of Statistics

University of Chicago

Chicago, IL

National Research Council Staff

Advisor

Victor A. McKusick, The Johns Hopkins Hospital,  Baltimore, Maryland

NRC Staff

Eric A. Fischer, Study Director

Lee R. Paulson, Senior Staff Officer

Miron L. Straf, Senior Staff Officer

John R. Tucker, Senior Staff Officer

Paulette A. Adams, Senior Project Assistant

Norman Grossblatt, Editor

Footnotes

*

This article is an excerpt from the Executive Summary of the report “The Evaluation of Forensic DNA Evidence”, Committee on DNA Forensic Science: An Update. National Research Council, ISBN 0-309-05395-1, 1996. U.S. $37.95. To order, contact National Academy Press at 1-800-624-6242 or 202-334-3313; 2101 Constitution Avenue, NW, Washington, DC 20418; or http://www.nap.edu.

Abbreviations and technical terms are defined in the glossary. The underlying concepts are explained in the overview and in appropriate chapters in the body of the report.

The 2p rule involves replacing the quantity p2 for a single-banded VNTR locus with the much larger quantity 2p in the product rule. This substitution accounts for cases in which one VNTR band from a heterozygote is not detected, and the person is mistakenly classified as a homozygote. The substitution also ensures that the estimate of the profile frequency will be larger than an estimate from a more precise formula that accounts for the population structure explicitly. The technology for PCR-based systems however, does not have these problems, and the 2p rule is inappropriate for these systems. Therefore, Recommendation 4.1 calls for using p2 + p(1 − p) Inline graphic (rather than 2p) in place of p2 for such systems.

Member, National Academy of Sciences.

§

Member, Institute of Medicine.


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES