2012 Jun 27;79(6):521–536. doi: 10.1007/s11103-012-9924-z

Table 1.

Summary of sequence assembling, data processing and annotation

Raw data Number
Raw reads 15,778,993
Average read length (bp) 352
Assembled reads 14,435,855
Average assembled read length (bp) 345
Assembling results
Before processing/after processing
 Contigs 83,240/67,651
 Total contig length (bp) 131,955,922/102,128,874
 Average contig length (bp) 1,585/1,510
 The largest contig length (bp) 16,000/16,000
 Contig N50 (bp) 2,042/1,911
 GC content of contigs 44.17 %/44.27 %
 Singletons 755,503/301,978
 Average singleton length (bp) 213/348
 Total singleton length (bp) 161,174,728/104,941,945
Annotation
 Contigs annotated based on the plant UniProt database 52,090
 Contigs annotated based on the NR database 635
 Gene models 30,854
 Contigs annotated with GO terms 36,086
 Contigs annotated with KO identifiers 7,032
 Contigs assigned with EC numbers 5,727
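
The average lengths in Table 1 can be cross-checked from the totals and counts, and the contig N50 follows the usual definition: the length L such that contigs of length L or longer contain at least half of all assembled bases. The sketch below is only an illustration under that standard definition; the table does not state how the authors computed N50, and the function names and the toy contig lengths are hypothetical.

```python
def mean_length(total_bp, count):
    """Average length in bp, rounded to the nearest integer."""
    return round(total_bp / count)


def n50(lengths):
    """N50 under the standard definition (assumed, not taken from the paper):
    walk contigs from longest to shortest and return the length at which
    the running sum first reaches half of the total bases."""
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Averages recomputed from the totals and counts reported in Table 1.
contig_avg_before = mean_length(131_955_922, 83_240)
contig_avg_after = mean_length(102_128_874, 67_651)
singleton_avg_before = mean_length(161_174_728, 755_503)
singleton_avg_after = mean_length(104_941_945, 301_978)

# Hypothetical toy assembly, not data from the paper.
toy_n50 = n50([2, 3, 4, 5, 6])
```

The recomputed averages agree with the tabulated values (1,585/1,510 bp for contigs and 213/348 bp for singletons). The N50 values in the table cannot be reproduced this way because they depend on the full list of contig lengths, which the table does not include.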