Figure 1.
Overview of pipeline used for correction of genome annotation and genome assembly using transcriptomic and proteomic data. MS/MS spectra, which did not assign to the known protein database, were further searched against six-frame translated genome, three-frame translated transcripts, and Anopheles gambiae protein database. Further analysis of these peptides resulted in identification of novel protein-coding genes and revised gene annotations, which were compared against 15 other Anopheline species.