. 2012 Jun 27;79(6):521–536. doi: 10.1007/s11103-012-9924-z

Table 1.

Summary of sequence assembling, data processing and annotation

Raw data	Number
Raw reads	15,778,993
Average read length (bp)	352
Assembled reads	14,435,855
Average read length (bp)	345
Assembling results
Before processing/after processing
Contigs	83,240/67,651
Total contig length (bp)	131,955,922/102,128,874
Average contig length (bp)	1,585/1,510
The largest contig length (bp)	16,000/16,000
Contig N50 (bp)	2,042/1,911
GC content of contigs	44,17 %/44.27 %
Singletons	755,503/301,978
Average singleton length (bp)	213/348
Total singleton length (bp)	161,174,728/104,941,945
Annotation
Contigs annotated based on the plant UniProt database	52,090
Contigs annotated based on the NR database	635
Gene models	30,854
Contigs annotated with GO terms	36,086
Contigs annotated with KO identifiers	7,032
Contigs assigned with EC numbers	5,727