Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2017 Aug 8;7:7526. doi: 10.1038/s41598-017-06106-1

Statistical analysis of mutant allele frequency level of circulating cell-free DNA and blood cells in healthy individuals

Ligang Xia 1,#, Zhoufang Li 2,#, Bo Zhou 2,#, Geng Tian 3,#, Lidong Zeng 4, Hongyu Dai 5, Xiaohua Li 5, Chaoyu Liu 5, Shixin Lu 2, Feiyue Xu 5, Xiaonian Tu 5, Fang Deng 6, Yuancai Xie 7, Weiren Huang 8, Jiankui He 2,4,5,
PMCID: PMC5548860  PMID: 28790338

Abstract

Cell-free DNA (cfDNA) in plasma has emerged as a potential important biomarker in clinical diagnostics, particularly in cancer. However, somatic mutations are also commonly found in healthy individuals, which interfere with the effectiveness for cancer diagnostics. This study examined the background somatic mutations in white blood cells (WBC) and cfDNA in healthy controls based on sequencing data from 821 non-cancer individuals and several cancer samples with the aim of understanding the patterns of mutations detected in cfDNA. We determined the mutation allele frequencies in both WBC and cfDNA using a panel of 50 cancer-associated genes that covers 20 K-nucleotide region and ultra-deep sequencing with average depth >40000-fold. Our results showed that most of the mutations in cfDNA were highly correlated to WBC. We also observed that the NPM1 gene was the most frequently mutated gene in both WBC and cfDNA. Our study highlighted the importance of sequencing both cfDNA and WBC to improve the sensitivity and accuracy for calling cancer-related mutations from circulating tumour DNA, and shedded light on developing a strategy for early cancer diagnosis by cfDNA sequencing.

Introduction

A high percentage of cancers are diagnosed at advanced stages, resulting in the failure of curative treatment and poor quality of life for patients1, 2. Thus, the identification of diagnostic markers would enable earlier diagnosis and more successful treatment for cancer patients. Cell-free DNA (cfDNA) in the blood circulation and circulating tumour DNA (ctDNA) in the plasma in cancer patients have emerged as potential important biomarkers for cancer monitoring and treatment3. Multiple groups recently reported that the levels of plasma ctDNA were elevated in cancers4, 5. Previous studies proposed that cfDNA could be used as a non-invasive, sensitive tool for early diagnosis of cancer610.

The circulating cfDNA is a mixture of DNA from blood cells, viruses, solid organs and many other sources. In healthy individuals, over 90% of cfDNA is from debris of blood cells. However, in cancer patients, tumour-related ctDNA comprises 0.1–0.01% of the plasma cfDNA, which means that accurate detection of ctDNA for cancer early diagnosis must involve the removal of the background of somatic mutations that originate from blood cells1113. However, there is limited knowledge available for the somatic mutations in normal cells, particularly in WBCs, which are the main contributors for the background mutations. Human genomic mutations occur at a rate of approximately 2.5 × 108 per base per cell generation14. The Campbell group studied the background mutation spectrum of skin samples and focused on 74 cancer genes in normal skin samples from four individuals; their results proved that somatic mutations accumulate in normal cells, although the mechanisms were poorly understood15. Together this indicates that a better understanding of the baseline spectrum of somatic mutations in healthy individuals is urgently required before the detection of ctDNA can be useful in early cancer diagnosis.

With the dramatically decreased cost of next-generation sequencing (NGS) in recent years, it is now possible to screen a large number of individuals at ultra-deep sequencing depths to identify cancer-related mutations1619. The development of molecular technology to suppress the technique noise also makes it possible to pinpoint the mutations, fusions and copy number variations related to cancer in the ultra-low concentration of cfDNA. Both academic research groups and industry players are chasing the pan-cancer screening by a simple blood draw. However, the reliable and accurate application of ctDNA detection requires better understanding of background somatic information in healthy individuals.

Our studies were designed to reveal the background somatic mutations in cfDNA by ultra-deep sequencing for 50 cancer-associated genes in 1134 non-cancer samples and several cancer samples, providing critical information of the baseline profile of somatic mutations in healthy individuals, which will fill the gap in early cancer diagnosis and personalized cancer therapy.

Results

Validating the detecting sensitivity using the standard reference and tumor samples

To evaluate the sensitivity and precision of our method used in cfDNA analysis, we firstly tested the method by sequencing the Horizon Standard Reference libraries. The reference libraries cover six hotspot mutations in EGFR, NRAS, PIK3CA and KRAS genes. The average size of the reference genome is around 160 bp, mimicking the plasma cfDNA that we derived from blood plasma. We examined the sensitivity and precision of this cfDNA reference standard using our sample and library preparation method and tested the sensitivity of the method at variant allele frequencies of 0.0005, 0.001, 0.005 and 0.01. We sequenced the library at an average depth of >30000-fold coverage. The results demonstrated that our method could detect the majority of variant alleles at a frequency of 0.001; however, the average allele frequency ranged from 0.001 to 0.003, indicating substantial stochastic fluctuation (Fig. 1). When the allele frequency in the reference was 0.01, the performance of our method was much better and the stochastic fluctuation was much smaller (Supplementary Fig. S1 and Supplementary Table S1). Therefore, our method may be used to detect the variant alleles qualitatively at a frequency of 0.001 and quantitatively at a frequency of 0.01.

Figure 1.

Figure 1

Sensitivity test using Horizon Reference Standard for cfDNA. The reference was manufactured from engineered human cancer cell lines with an allele frequency of 0.0005, 0.001, 0.005, and 0.01. We performed deep sequencing and calculated the allele frequency detected by sequencing in five repeated experiments. Error bars are standard errors.

We also demonstrated the possibility of using mutation in cfDNA as the diagnostic marker for revealing the real mutation in tumor tissue. We recruited five new patients with colon cancer and extracted the genomic DNA of WBC, cfDNA and genomic DNA from tumor tissue for each patient. The mutation information of five above patients was summarized in Supplementary Table S2. Then (1) we checked the mutations in two tumor sections collected at different positions from the same solid tumor mass of the same patient; (2) we checked the mutations of cfDNA in blood sample of the same patient. In total, we detected 25 mutations and all these mutations obtained were these filtered by WBC. Among the 25 mutations, (1) there were 6 mutations shared between tumor section 1 and tumor section 2; (2) there were 3 mutations shared between cfDNA and tumor section 1 while 4 mutations shared between cfDNA and tumor section 2. In summary, we do admit that cfDNA may not 100% represent the condition of tumor progress, but it is undoubtedly a powerful tool in clinical practice to help diagnose cancer and evaluate the treatment effect to some extent, particularly when the tumor tissue biopsy is hardly feasible.

The mutations detected in cfDNA are highly correlated with mutations in WBCs

We performed ultra-deep target sequencing on 50 cancer-associated genes for both plasma DNA (cfDNA) and blood cell DNA (Supplementary Fig. S2). We analysed the correlation of the mutant allele frequency in 309 WBC samples and the corresponding cfDNA. In Fig. 2a, each dot represents the average mutant allele frequency of one position in both WBC and cfDNA. Pearson’s correlation study suggested that the mutant allele frequency of cfDNA and WBC is highly correlated (adjusted R2 = 0.92) (Fig. 2a and Supplementary Fig. S3a–c). We plotted the data from only one individual and found that the mutant allele frequency of WBC compared with cfDNA in one individual sample was also highly correlated, with R2 = 0.80 (Fig. 2b). The somatic mutations in the blood cells significantly contribute to mutations in cfDNA. These results also highlight the importance of sequencing both cfDNA and blood cells to remove the background mutations contributed by blood cells.

Figure 2.

Figure 2

Correlation study of mutant allele frequency in ctDNA and white blood cells. (a) We analysed the correlation of mutant allele frequency between cfDNA and genomic DNA of white blood cells. A total of 309 paired samples were examined. Each dot represents the mutant allele frequency of one position in WBC and its corresponding cfDNA, averaged for 309 samples. (b) Correlation study of one individual. Only the total sequencing depth larger than 10000× and mutant allele frequency larger than 0.3% in both white blood cells and cfDNA were selected in this analysis.

Reproducibility

One of the key issues to be addressed is to distinguish the true biological variations from experimental noise or errors. Two technical replicates were performed for both WBC and cfDNA samples from the same individual, and then we compared the mutant allele frequency in the two WBC samples and the two cfDNA samples. The reproducibility validation experiment in the two WBC samples was used to estimate the PCR error rate and sequencing errors in our study cohort. Our data showed that the mutant allele frequency in the WBC samples were highly correlated (Adj R2 = 0.9672), which suggested the mutations with frequency >0.3% were highly reproducible (Fig. 3a and Supplementary Fig. S3). The two cfDNA samples also showed good correlations (Adj R2 = 0.85), but were not as good as the WBC samples (Fig. 3b). The lower correlation of the cfDNA replicates may arise from both biological and technique variations. For example, cfDNA is more complex in composition and comprises debris of different tissues, and each subtype may contain a very small amount of DNA. The tiny amount of cfDNA for experiments could increase the difficulty for effective amplification, and therefore introduce technique variations during the amplification process.

Figure 3.

Figure 3

Reproducibility validation. We compared the sequencing information of two replicate WBC samples (a) and cfDNA samples (b) to evaluate the reproducibility of the methods. Only a total sequencing depth larger than 10000× and mutant allele frequency larger than 0.003 were included in the correlation study.

NPM1 is the most frequently mutated gene

The mutation frequencies of 50 genes were analysed and ranked by comparing 309 cfDNA samples with average reads depth more than 10000X (Fig. 4a). For example, mutation frequency for gene A FA = (F1 + F2 + … + Fi + … + Fn)/n, (n: length of the amplicon of gene A in this study, Fi: total frequency of non reference alleles at position i (i = 1 … n). The top four mutated genes were NPM1, PI3K, KIT and PTEN genes (Fig. 4b and Supplementary Fig. S4a–c). The average mutant allele frequency of the NPM1 gene in each nucleotide was up to 0.12%. This observation was detected in both the cfDNA and WBC (Supplementary Fig. S4d), suggesting that the major origin of NPM1 mutations was from blood cells. This result suggests that caution needs to be taken for these blood prone mutations when using cfDNA as a diagnostic marker.

Figure 4.

Figure 4

Ranking of the average mutant allele frequency of 50 cancer-associated genes. (a) Mutant allele frequency per position for the 50 cancer-associated genes. (b) The mutant allele frequency of each position in the NPM1 gene. Here, only one amplicon was designed for NPM1 in our study.

In order to show the profile of hotspot mutations in cfDNA, we selected 775 cfDNA samples from total 821 cfDNA dataset that with good sequencing quality and calculated the mutation frequency for each gene) (Supplementary Figure S5). To increase the confidence of the called mutations, we here define the mutation as the variant allele frequency larger than 1% and the average depth more than 5000x for demonstration. The ranking for the top mutated genes was slightly different from that when we calculated all the positions in the selected amplicons. The number of hotspot mutations detected in each individual for all 1134 non-cancer samples were summarized in Supplementary Table S3. We also showed one example of the mutant allele frequency of all 2800 hotspots in both WBC and cfDNA (Supplementary Table S4).

Discussion

In recent years, cfDNA and ctDNA has emerged as the research frontier of non-invasive cancer biomarkers for the detection, monitoring and treatment of cancer2025. However, several challenges have prevented the use of cfDNA and ctDNA in early diagnosis strategies for cancer.

The first challenge is the system errors derived both from the PCR and the sequencing. The ctDNAs are released to the peripheral blood in the very beginning of tumour initiation, far before the tumour can be detected by either CT or other clinical tools. However, in the very early stage of tumours, the amount of ctDNA is as low as 0.1–0.01%. The sequencing alone may introduce ~0.1% error by Illumina sequencing chemistry. Several groups have proposed to use digital barcoding of every molecule to reduce the errors of PCR and sequencing26. We also pursued digital barcoding and the results will be published in upcoming articles.

The second challenge is distinguishing the tumour mutations from the background mutations from WBC. More than 90% of cfDNA are from blood cells, and we also observed that mutations in cfDNA were highly correlated to those in the WBC. Therefore, the majority of mutations detected in cfDNA were the background mutations from WBC. The cfDNA also contains many other sources of DNA, and its composition and origin varies by individual and by different disease states. It is critical to perform systematic analysis of WBC somatic mutations and investigate the origin of cfDNA to understand all factors that may contribute to the mutation spectrum of cfDNA. In our study, we demonstrated the importance of sequencing both cfDNA and WBC for every individual.

Somatic mutations in healthy patients are very prevalent, with an average mutation number of around 2–6 mutations/1 M bases6, 15, 26. There is little information available to determine these biologically significant variations in cfDNA. This study revealed the baseline mutation spectrum in WBC and cfDNA of healthy patients and can help fill the gaps for the establishing early cancer diagnosis strategies. We observed an interesting pattern in WBC, in that the NPM1 gene was mutated at a very high rate. The average mutant allele frequency for the NPM1 gene in all 1134 samples of healthy controls was as high as 0.12%. The NPM1 mutations behave as gatekeepers for leukaemia and have been reported to be important genetic contributors in multiple types of leukaemia27. Surprisingly, the TP53 gene was not the most frequently mutated gene in cfDNA samples. This observation can be explained by fact that TP53 mutations are more frequent in solid organ tumours, while the cfDNA is mainly from blood cells. Therefore, we need to carefully calibrate the background mutations in WBC and cfDNA of healthy individuals to reduce a false positive rate during early cancer diagnosis.

Our study can help define the threshold of mutation detection of ctDNA by removing the background mutations in WBC and cfDNA in a healthy population. In the future, three aspects should be further improved: (1) a larger sample size is required to refine the baseline spectrum; (2) experimental methods that can increase the detection limit and remove technique noise are highly desired; and (3) an efficient bioinformatics pipeline needs to be developed to remove the noise and increase the accuracy and sensitivity in the mutation detection of ctDNA.

Methods

Ethics

This study was approved by the Second People’s Hospital of Shenzhen (The First Affiliated Hospital of Shenzhen). All the experiments were performed in accordance with guidelines and regulations of the Second People’s Hospital of Shenzhen. Written consent was obtained from each participant and all analyses were performed anonymously.

Patients

A total 1149 samples were included in this study (Supplementary Table S5), which included 821 cfDNA samples and 313 WBC samples from non-cancer individuals (Only 309 WBC sequencing data, 4 are lost during sample preparation) and other 15 samples from 5 cancer patients. The sample group comprised 56.6% females and 43.4% males. The participant ages ranged from 16 to 86 years, with a median of 42 years old. Our analysis of the patient clinical information revealed no significant genetic diversity in the samples. The statistical analysis of sample information is summarized in Table 1.

Table 1.

Summary of characteristics of 821 non-cancer individuals.

Age years Parameter value
Mean (SD) 43.1 (12.8)
Median (range) 42.5 (16–86)
10–19 n (%) 2 (0.2)
20–29 n (%) 128 (15.6)
30–39 n (%) 233 (28.4)
40–49 n (%) 194 (23.7)
50–59 n (%) 158 (19.3)
60–69 n (%) 87 (10.6)
70–79 n (%) 17 (2.1)
80–89 n (%) 1 (0.1)
Gender
Female n (%) 464 (56.6)
Male n (%) 356 (43.4)
Smoking
Female n (%) 62 (7.6)
Male n (%) 149 (18.1)
Family tumor history
single parent n (%) 81 (9.9)
both parents n (%) 2 (0.2)

Blood plasma isolation

For each sample, 10 ml of blood in a cell-free DNA collection tube (VanGenes, Xiamen, China) was collected and then centrifuged at 1600 g for 10 min at 4 °C to roughly separate the sample into plasma and blood cells. The upper phase was then transferred into a new tube, leaving around 8 mm of “buffering” layer from the buffy coat after the centrifugation and avoid contaminating the plasma layer by blood or cell debris. The plasma was further centrifuged at 16000 g for another 10 min to remove the cell debris. The upper clear layer was then aliquoted into 2-ml tubes, clearly labelled with the patients’ identity and immediately stored at −80 °C for cfDNA extraction.

Extraction of cfDNA

Each extraction of cfDNA was performed from 1 ml of plasma. Extraction of cfDNA was conducted using the E.Z.N. A Circulating DNA Kit (Omega Bio-tek, GA, USA). Genomic DNA of WBCs was extracted by the Qiagen DNA mini kit (Qiagen, Hilden, Germany). The concentration of extracted DNA was measured using the Qubit 2.0, dsDNA high-sensitivity assay (Life Technologies, Carlsbad, CA). All methods were performed according the manufacturers’ instructions.

Spike in control

The sensitivity and precision of the current method were evaluated by Horizon’s Partners Spike-in control (Horizon, Cambridge, UK) using our 207-pair primer set. Briefly, we performed a serial dilution (from 0.0005 to 1) using the wild-type reference genome and the provided reference standard. We performed multiplex PCR, and then library construction and sequenced on the HiseqX10 (Illumina, CA, USA). Six reference variants were included in our primer set: EGFR (L858R, T790M), KRAS (G12D), NRAS (Q61K, A59T) and PIK3CA (E545K). All other primers were used as the background.

Multiplex PCR and sequencing library construction

We developed a multiplex PCR method to enrich cancer-associated genes. Fifty cancer-associated genes were included and were amplified by 207 amplicons and covered a 22027 bp region. A total 2800 mutations were defined as hotspot mutations according to Mayo CliWe used 10 ng of DNA per sample for amplicon production by multiplex PCR. The resulting multiplex PCR reaction pool was further used in sequencing library preparation using the reagents provided in the same kit following the manual’s instructions. The sequencing library was then sent to WuxiNextCODE on the Illumina Hiseq X10 platform (Illumina, Beghelli, CA, USA).

Data filtering and analysis

We performed ultra-deep target sequencing on 50 cancer-associated genes for both cfDNA and WBC DNA. For each sample, the average sequencing depth was 40000× . The sequencing data was first mapped to the human reference genome (human genome build19; hg19) by Burrows–Wheeler transformation (BWA, Version: 0.7.5a-r405) software package, converted to mpileup format for downstream analysis. We set two filtering criteria to filter the reads: (1) read sequences with mutant allele frequency higher than 5% in a single reads were deleted; and (2) bases with base quality lower than 30 were deleted. Loci with sequencing depth less than 10000× were removed for further analysis.

Mutant allele frequency in cfDNA and WBCs in the population and in individuals

We analysed the correlation of the mutant allele frequency between cfDNA and genomic DNA of WBCs. The mutant allele frequency was defined as mutant allele frequency divided by total reads covering the locus. For example, for a particular position, the total sequencing depth is 10000; we obtain 9990 for A, 3 for C, 5 for G, and 2 for T, and thus the mutant allele frequency for this position is (3 + 5 + 2)/10000 = 1/1000. Among the samples from 821 individuals, we only used 309 pairs of samples for which both cfDNA and WBCs were available from the same person (collected at the same time). A total of 22027 nucleotides in 50 genes were included in this study. We calculated the average mutant allele frequency of each position in 309 individuals in WBCs and cfDNA. We removed the positions with mutant allele frequency higher than 0.1.

Reproducibility evaluation

To validate the reproducibility of our methods, we drew blood from an individual, spilt the sample in half, and performed multiplex PCR and sequencing independently. By comparing the sequencing data of the two replicates in WBC samples, we were able to evaluate the stability of the methods and remove background noise. Positions with a sequencing depth lower than 10000× and mutant allele frequency larger than 0.003 were excluded in the correlation study.

Data Availability

The datasets generated during and/or analysed during the current study are available in Sequence Read Archive with accession number SRP105315. The datasets are also available from the corresponding author on reasonable request.

Electronic supplementary material

Table S1 (15.1KB, xlsx)
Table S2 (16.4KB, xlsx)
Table S3 (27.9KB, xlsx)
Table S4 (97.1KB, xlsx)
Table S5 (195KB, xls)

Acknowledgements

The authors thank WuXiNextCODE for the sequencing services. This work was supported by State Key Research Program of China (Grant No: 2016YFA0501604) and the National Natural Science Foundation of China (Grant No: 31401145).

Author Contributions

L.X., G.T., F.D., Y.X. and W.H. collected original blood samples from the volunteers. B.Z., L.Z., S.L., Z.L. and X.T. performed data analysis. H.D. summarized and performed statistical analysis of the clinical information of all the volunteers. X.L., F.X. and C.L. designed and carried out the spike-in experiments. J.H. and Z.L. designed the study. Z.L., X.L. and J.H. wrote and revised the manuscript. All authors read and approved the final manuscript.

Competing Interests

Hongyu Dai, Chaoyu Liu, Xiaohua Li, Feiyue Xu and Xiaonian Tu are employees of Shenzhen GeneHealth Bio Tech Co., Ltd. The other authors declare no competing financial interests.

Footnotes

Ligang Xia, Zhoufang Li, Bo Zhou and Geng Tian contributed equally to this work.

Electronic supplementary material

Supplementary information accompanies this paper at doi:10.1038/s41598-017-06106-1

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Kaczor-Urbanowicz KE, et al. Emerging technologies for salivaomics in cancer detection. Journal of cellular and molecular medicine. 2016 doi: 10.1111/jcmm.13007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Yuan P, et al. Time-to-Progression of NSCLC from Early to Advanced Stages: An Analysis of data from SEER Registry and a Single Institute. Scientific reports. 2016;6:28477. doi: 10.1038/srep28477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Lo YM, et al. Presence of fetal DNA in maternal plasma and serum. Lancet. 1997;350:485–487. doi: 10.1016/S0140-6736(97)02174-0. [DOI] [PubMed] [Google Scholar]
  • 4.Gonzalez-Masia JA, Garcia-Olmo D, Garcia-Olmo DC. Circulating nucleic acids in plasma and serum (CNAPS): applications in oncology. OncoTargets and therapy. 2013;6:819–832. doi: 10.2147/OTT.S44668. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Sato KA, et al. Individualized Mutation Detection in Circulating Tumor DNA for Monitoring Colorectal Tumor Burden Using a Cancer-Associated Gene Sequencing Panel. PloS one. 2016;11:e0146275. doi: 10.1371/journal.pone.0146275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Volik S, Alcaide M, Morin RD, Collins C. Cell-free DNA (cfDNA): Clinical Significance and Utility in Cancer Shaped By Emerging Technologies. Mol Cancer Res. 2016;14:898–908. doi: 10.1158/1541-7786.MCR-16-0044. [DOI] [PubMed] [Google Scholar]
  • 7.Waldron D. Cancer genomics: A nucleosome footprint reveals the source of cfDNA. Nat Rev Genet. 2016;17:125. doi: 10.1038/nrg.2016.3. [DOI] [PubMed] [Google Scholar]
  • 8.Togneri FS, et al. Genomic complexity of urothelial bladder cancer revealed in urinary cfDNA. Eur J Hum Genet. 2016;24:1167–1174. doi: 10.1038/ejhg.2015.281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Earl J, et al. Circulating tumor cells (Ctc) and kras mutant circulating free Dna (cfdna) detection in peripheral blood as biomarkers in patients diagnosed with exocrine pancreatic cancer. BMC Cancer. 2015;15:797. doi: 10.1186/s12885-015-1779-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Kienel A, Porres D, Heidenreich A, Pfister D. cfDNA as a Prognostic Marker of Response to Taxane Based Chemotherapy in Patients with Prostate Cancer. J Urol. 2015;194:966–971. doi: 10.1016/j.juro.2015.04.055. [DOI] [PubMed] [Google Scholar]
  • 11.Drake JW, Charlesworth B, Charlesworth D, Crow JF. Rates of spontaneous mutation. Genetics. 1998;148:1667–1686. doi: 10.1093/genetics/148.4.1667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Nachman MW, Crowell SL. Estimate of the mutation rate per nucleotide in humans. Genetics. 2000;156:297–304. doi: 10.1093/genetics/156.1.297. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Roach JC, et al. Analysis of genetic inheritance in a family quartet by whole-genome sequencing. Science. 2010;328:636–639. doi: 10.1126/science.1186802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Smith T, et al. Extensive Variation in the Mutation Rate Between and Within Human Genes Associated with Mendelian Disease. Human mutation. 2016;37:488–494. doi: 10.1002/humu.22967. [DOI] [PubMed] [Google Scholar]
  • 15.Martincorena I, et al. Tumor evolution. High burden and pervasive positive selection of somatic mutations in normal human skin . Science. 2015;348:880–886. doi: 10.1126/science.aaa6806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Vogelstein B, et al. Cancer genome landscapes. Science. 2013;339:1546–1558. doi: 10.1126/science.1235122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Castera L, et al. Next-generation sequencing for the diagnosis of hereditary breast and ovarian cancer using genomic capture targeting multiple candidate genes. European journal of human genetics: EJHG. 2014;22:1305–1313. doi: 10.1038/ejhg.2014.16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Li S, Mao M. Next generation sequencing reveals genetic landscape of hepatocellular carcinomas. Cancer letters. 2013;340:247–253. doi: 10.1016/j.canlet.2012.09.027. [DOI] [PubMed] [Google Scholar]
  • 19.Nik-Zainal S, et al. Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature. 2016;534:47–54. doi: 10.1038/nature17676. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.ctDNA is a specific and sensitive biomarker in multiple human cancers. Cancer discovery4, OF8, doi:10.1158/2159-8290.CD-RW2014-051 (2014). [DOI] [PubMed]
  • 21.Kidess-Sigal, E. et al. Enumeration and targeted analysis of KRAS, BRAF and PIK3CA mutations in CTCs captured by a label-free platform: Comparison to ctDNA and tissue in metastatic colorectal cancer. Oncotarget, doi:10.18632/oncotarget.13350 (2016). [DOI] [PMC free article] [PubMed]
  • 22.Ma F, et al. ctDNA dynamics: a novel indicator to track resistance in metastatic breast cancer treated with anti-HER2 therapy. Oncotarget. 2016 doi: 10.18632/oncotarget.11791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Romero D. Breast cancer: Tracking ctDNA to evaluate relapse risk. Nature reviews. Clinical oncology. 2015;12:624. doi: 10.1038/nrclinonc.2015.159. [DOI] [PubMed] [Google Scholar]
  • 24.Hutchinson L. Biomarkers: ctDNA-identifying cancer before it is clinically detectable. Nature reviews. Clinical oncology. 2015;12:372. doi: 10.1038/nrclinonc.2015.77. [DOI] [PubMed] [Google Scholar]
  • 25.Rosell R, Karachaliou N. Lung cancer: Using ctDNA to track EGFR and KRAS mutations in advanced-stage disease. Nature reviews. Clinical oncology. 2016;13:401–402. doi: 10.1038/nrclinonc.2016.83. [DOI] [PubMed] [Google Scholar]
  • 26.Newman AM, et al. Integrated digital error suppression for improved detection of circulating tumor DNA. Nature biotechnology. 2016;34:547–555. doi: 10.1038/nbt.3520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.McKerrell T, et al. Leukemia-associated somatic mutations drive distinct patterns of age-related clonal hemopoiesis. Cell reports. 2015;10:1239–1245. doi: 10.1016/j.celrep.2015.02.005. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table S1 (15.1KB, xlsx)
Table S2 (16.4KB, xlsx)
Table S3 (27.9KB, xlsx)
Table S4 (97.1KB, xlsx)
Table S5 (195KB, xls)

Data Availability Statement

The datasets generated during and/or analysed during the current study are available in Sequence Read Archive with accession number SRP105315. The datasets are also available from the corresponding author on reasonable request.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES