Fig. 7.
Demonstration of how PCR duplicates are removed. The random sequence that was moved to the read identifier line serves as a unique molecular identifier (UMI) for deduplication. Whenever an aligned read is identical to a previously seen read, the UMIs of the two reads are compared. If the UMIs differ, both alignments are kept, as shown on the left. If the UMIs are identical, only one of the alignments is kept.
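
The comparison described in this caption can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline's actual implementation. The `Alignment` record, the `umi_of` helper, and the convention that the UMI is the last underscore-separated field of the read identifier are all assumptions made for the example. Treating two reads as "identical" when chromosome, position, strand, and sequence all match is also an assumption.

```python
from typing import Iterable, List, NamedTuple


class Alignment(NamedTuple):
    """Hypothetical minimal alignment record used only for this sketch."""
    read_id: str  # assumed format: "<read name>_<UMI>"
    chrom: str
    pos: int
    strand: str
    seq: str


def umi_of(read_id: str) -> str:
    """Return the UMI, assumed to be the last '_'-separated field of the ID."""
    return read_id.rsplit("_", 1)[1]


def deduplicate(alignments: Iterable[Alignment]) -> List[Alignment]:
    """Keep the first alignment for each (identical alignment, UMI) pair.

    Identical alignments with different UMIs are all kept; identical
    alignments that share a UMI are treated as PCR duplicates, and only
    the first one encountered is kept.
    """
    seen = set()
    kept = []
    for aln in alignments:
        key = (aln.chrom, aln.pos, aln.strand, aln.seq, umi_of(aln.read_id))
        if key in seen:
            continue  # same alignment and same UMI: PCR duplicate, drop it
        seen.add(key)
        kept.append(aln)
    return kept
```

For example, given three identical alignments whose identifiers end in `_ACGT`, `_ACGT`, and `_GGCA`, `deduplicate` keeps the first and the third: the second shares its UMI with the first and is dropped, while the third carries a different UMI and is retained.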