Skip to main content
. 2005 Dec 2;33(21):e185. doi: 10.1093/nar/gni184

Table 3.

Summary of cDNA clones recovered

Classification Clone counta
ORF identical to gene annotationb 43
ORF alters gene annotationc:
    5′ extension 10
    3′ extension 3
    5′ short with upstream in-frame stop codon 2
    Exon variant 12
    Dicistronic 1
    Gene merge 1
Subtotal: high-quality, full-length cDNAs 72
Compromised clones:
    Nucleotide discrepancyd 6
    Shorte
        5′ short 1
        3′ short 5
        5′and 3′ short 1
    Co-ligated insertf 7
    Antisense transcriptg 1
    Genomic contaminanth 4
    Retained introni 3
    SLIP artifact 4
Subtotal: compromised cDNAs 32
Gene-specific clones recovered 104

aOne cDNA clone was selected per target gene.

bThe clone encodes a protein that is identical to the corresponding Release 4.1 annotation.

cThese clones encode proteins that differ from their corresponding annotation. ‘5′ extension’ and ‘3′ extension’ clones encode additional N-terminal and C-terminal residues, respectively, relative to the annotation. ‘5′ short with upstream in-frame stop codon’ clones encode full-length ORFs that are missing sequences encoding N-terminal residues relative to the annotation and may represent alternatively spliced products. ‘Exon variant’ clones contain sequence differences relative to the annotation at internal positions in the CDS and represent alternatively spliced products.

dThe sequence of these clones have nucleotide differences, most likely the result of errors generated by reverse transcriptase during library construction, that introduce a missense or frameshift change in the ORF relative to the annotated CDS.

eThese clones are missing sequences encoding the N-terminal portion of the predicted protein sequence of the annotation for the ‘5′ short’ class, the C-terminal portion for the ‘3′ short’ class, or both for the ‘5′ and 3′ short’ class.

fThese clones contain sequences from two unrelated genes and are almost certainly the result of two cDNA molecules being cloned into the same plasmid vector during library construction. In three such cases, the clones encode proteins that are identical to the targeted annotation.

gThe sequence of the clone overlaps the annotated gene model but is transcribed from the opposite strand.

hThese clones do not include a poly-adenylated tail. These are genomic clones that contaminate the cDNA libraries.

iThese clones are poly-adenylated and include unprocessed intron.