Author manuscript; available in PMC: 2009 Jul 27.
Published in final edited form as: Nature. 2008 Nov 6;456(7218):60–65. doi: 10.1038/nature07484

Table 1.

Data production and alignment results for the YH genome

| Data type | Number of reads | Number of mapped reads | Total bases (Gb) | Mapped bases (Gb) | Effective depth (fold) | Percentage with unique placement | Rate of nucleotide mismatches (%) |
|---|---|---|---|---|---|---|---|
| SE | 2,019,025,890 | 1,921,271,902 | 72 | 64.4 | 22.5 | 83.60 | 1.62 |
| PE | 1,315,249,404 | 1,028,695,924 | 45.7 | 38.5 | 13.5 | 90.20 | 1.16 |
| Total | 3,334,275,294 | 2,949,967,826 | 117.7 | 102.9 | 36 | 86.10 | 1.45 |

Single-end (SE) and paired-end (PE) sequencing reads were aligned to the reference assembly in NCBI build 36.1, allowing at most two mismatches or one contiguous gap of 1–3 bp. Effective depth was calculated as the total number of mapped bases divided by the length of NCBI36 (2,858,013,089 bp, excluding Ns). ‘Unique placement’ means that a read had only one best placement, that is, a single placement with the fewest mismatches and gaps. The rate of nucleotide mismatches is the percentage of mismatched nucleotides among all mapped nucleotides; it includes both sequencing errors and real genetic variation. In total, 487 million reads (14.6%) could not be aligned to the reference genome.
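The effective-depth definition above can be checked against the table with a few lines of arithmetic. The following is a minimal sketch, not the authors' pipeline: the function and variable names are illustrative assumptions, and the inputs are the rounded "Mapped bases (Gb)" values from Table 1, so the results agree with the published depths only to one decimal place.

```python
# Reference length from the table footnote: NCBI36, excluding Ns.
REFERENCE_LENGTH_BP = 2_858_013_089

# Mapped bases in Gb, copied from Table 1 (rounded values).
MAPPED_BASES_GB = {"SE": 64.4, "PE": 38.5, "Total": 102.9}


def effective_depth(mapped_gb, reference_bp=REFERENCE_LENGTH_BP):
    """Return fold coverage: mapped bases divided by reference length.

    Hypothetical helper for illustration; takes mapped bases in gigabases.
    """
    return mapped_gb * 1e9 / reference_bp


# Round to one decimal place to match the precision reported in the table.
depths = {name: round(effective_depth(gb), 1) for name, gb in MAPPED_BASES_GB.items()}

for name, depth in depths.items():
    print(f"{name}: {depth}-fold")
```

Because the SE and PE depths share the same denominator, they sum to the total depth, up to rounding.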