Schematic representation of molecular portrait of human papillomavirus 16 (HPV16) genome. The ~8 kb human papillomavirus genome may be found as an episomal or linear integrated form in the nucleus of the infected cell. The viral genome harbours two polyadenylation signals such as early polyadenylation signal (pAE) and late polyadenylation signal (pAL). The pAE signal terminates the transcription of early (E) genes such as E1–E7, whereas pAL signal terminate transcription of late (L) genes L1 and L2. The LCR of the genome contains the origin of DNA replication (ori) and the early viral promoter, p97 while the late promoter, p670, is located in the E7 coding region. eUTR and lUTR represent the early and late 3'UTR respectively. Known 5' splice donor site (SD) like SD226, SD880, SD1302 and SD3632 are shown as green circle with black border whereas 3' splice acceptors (SA) SA409, SA526, SA742, SA2582, SA2709, SA3358 and SA5639 are shown as blue circle with black border. Apart from these, two novel splice donor sites SD174 & SD221 and accepter sites SA718 & SA850 are depicted as green circle with red border and blue circle with red border respectively. Alternative splicing among these splice sites are produce two sets of mRNA transcripts from respective promoter p97 and p670. Red colour E6^E7*I & E6^E7*II represent the novel transcripts. Each transcript represents the most likely candidate mRNA for production of the corresponding proteins.