Figure 2.
One End Anchor reads. Due to the limited insert size, orphan reads are in the neighborhood of anchor reads in paired end sequencing, thus, the location of anchor reads can be used to filter out the false sequences and estimate the location of true sequences. Note that full orphan reads could be used to help detect longer insertion sequences. a) Illustration of OEA reads. b) the anchor reads corresponding to one micSeq candidate are distributed along the chromosome with very modest coherence in one location, it is very likely this is a false micSeq. Candidate micSqs of this type (>50% of reads distributed across the chromosome) are filtered out; c) the anchor reads are highly clustered in one short region, these are brought forward for validation analysis.