Table 1.
Summary of sequence assembly, data processing, and annotation
| Raw data | Number |
|---|---|
| Raw reads | 15,778,993 |
| Average read length (bp) | 352 |
| Assembled reads | 14,435,855 |
| Average assembled read length (bp) | 345 |
| Assembly results | |
| Before processing/after processing | |
| Contigs | 83,240/67,651 |
| Total contig length (bp) | 131,955,922/102,128,874 |
| Average contig length (bp) | 1,585/1,510 |
| Largest contig length (bp) | 16,000/16,000 |
| Contig N50 (bp) | 2,042/1,911 |
| GC content of contigs | 44.17 %/44.27 % |
| Singletons | 755,503/301,978 |
| Average singleton length (bp) | 213/348 |
| Total singleton length (bp) | 161,174,728/104,941,945 |
| Annotation | |
| Contigs annotated based on the plant UniProt database | 52,090 |
| Contigs annotated based on the NR database | 635 |
| Gene models | 30,854 |
| Contigs annotated with GO terms | 36,086 |
| Contigs annotated with KO identifiers | 7,032 |
| Contigs assigned EC numbers | 5,727 |
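
The N50 and average-length rows in Table 1 can be illustrated with a minimal sketch. It assumes the usual definition of N50 (the length of the shortest contig in the set of longest contigs that together cover at least half of the total assembled length); the table does not state which tool or definition was used. The function name `n50` and the toy lengths are hypothetical. The averages are recomputed from the totals and counts reported in the table.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Assumes the common definition: walk the lengths from longest to
    shortest and return the length at which the running sum first
    reaches half of the total.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


# Toy contig lengths (hypothetical, not taken from the assembly).
toy_lengths = [100, 200, 300, 400, 500]
toy_n50 = n50(toy_lengths)

# Consistency check on Table 1: total length / count = average length.
avg_contig_before = round(131_955_922 / 83_240)
avg_contig_after = round(102_128_874 / 67_651)
avg_singleton_before = round(161_174_728 / 755_503)
avg_singleton_after = round(104_941_945 / 301_978)
```

Rounded to the nearest base pair, the recomputed averages agree with the contig and singleton averages listed in the table, both before and after processing.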