DNA fragmentation, damage and type-specific error. a, Left, fragment length distribution of the Anzick-1 DNA sequences mapping to a human reference genome. The maximum read length with the applied chemistry on the Illumina HiSeq platform is 94 bp (100 bp minus the 6-bp index read), hence the large peak at this length simply represents the entire tail of the distribution. Right, the declining part of the distribution for the nuclear DNA, and the fit to an exponential model. The decay constant (λ) is estimated to be 0.018. b, Damage patterns for the Anzick-1 individual in a random 0.5% subset of all mapped reads. Mismatch frequency relative to the reference as a function of read position, C to T in red and G to A in blue. c, Type-specific error rates for the Anzick-1 sample and the individual libraries. Estimates of overall error rates are given on the right-hand side.
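The exponential fit in panel a can be illustrated with a minimal sketch. It assumes the model count ≈ A·exp(−λ·length) and estimates λ by ordinary least squares on the log-transformed counts; the fitting procedure actually used for the figure is not stated in the legend, so this is one plausible approach and not a reproduction of it. The function name `estimate_decay_constant`, the length window of 40 to 93 bp and the synthetic counts are all hypothetical, chosen only so that the example recovers a known λ of 0.018.

```python
import math


def estimate_decay_constant(lengths, counts):
    """Estimate lambda in count ~ A * exp(-lambda * length).

    Uses least squares on log(count) versus length. Bins with a zero
    count are skipped because their logarithm is undefined.
    """
    pairs = [(x, math.log(c)) for x, c in zip(lengths, counts) if c > 0]
    n = len(pairs)
    if n < 2:
        raise ValueError("need at least two non-empty length bins")

    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)

    # The slope of log(count) against length equals -lambda.
    return -sxy / sxx


# Synthetic, noise-free counts (illustrative assumption, not real data).
# The window stops below 94 bp because that bin holds the truncated tail.
lengths = list(range(40, 94))
counts = [1e6 * math.exp(-0.018 * x) for x in lengths]

lam = estimate_decay_constant(lengths, counts)
print(f"estimated lambda = {lam:.4f}")
```

Restricting the fit to the declining part of the distribution matters: including the 94 bp bin, which accumulates every longer fragment, would bias the slope.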