Table 1.
Summary of sequence assembling, data processing and annotation
Raw data | Number |
---|---|
Raw reads | 15,778,993 |
Average read length (bp) | 352 |
Assembled reads | 14,435,855 |
Average read length (bp) | 345 |
Assembling results | |
Before processing/after processing | |
Contigs | 83,240/67,651 |
Total contig length (bp) | 131,955,922/102,128,874 |
Average contig length (bp) | 1,585/1,510 |
The largest contig length (bp) | 16,000/16,000 |
Contig N50 (bp) | 2,042/1,911 |
GC content of contigs | 44,17 %/44.27 % |
Singletons | 755,503/301,978 |
Average singleton length (bp) | 213/348 |
Total singleton length (bp) | 161,174,728/104,941,945 |
Annotation | |
Contigs annotated based on the plant UniProt database | 52,090 |
Contigs annotated based on the NR database | 635 |
Gene models | 30,854 |
Contigs annotated with GO terms | 36,086 |
Contigs annotated with KO identifiers | 7,032 |
Contigs assigned with EC numbers | 5,727 |