Figure 1.
Sequences shared among the three human sperm sncRNA libraries. The proportion of the total number of sequences that are unique to a library, shared with at least one other library or present in all three libraries (in common to all three libraries) are presented for each of the libraries. For each library, contigs were constructed according to information on chromosome, strand, start and end position, allowing new reads to join to a contig if there is overlap or if the read is less than or equal to four nucleotides away on the same chromosome/strand. A contig is synonymous with a unique locus. (A)–(C) Represent counts of reads in common between AS062, AS064 and AS066, based on these contigs. (A). For each library, reads were individually compared with the sets of contigs of the other libraries, resulting in numbers of reads that were unique to the library, or in common to one or both of the other libraries. The results were normalized to a million counts and plotted with a color code to show the percentage for each commonality subset, described in the legend, with purple representing reads common to all libraries and blue for reads that are donor-specific. (B). Identical to 1A, except showing only the reads that are only in one library, and the corresponding percentages. (C). This figure compares the relative percentages of shared reads—the percentages were calculated after the donor-specific reads were subtracted out.