Skip to main content
Viruses logoLink to Viruses
. 2020 Apr 9;12(4):427. doi: 10.3390/v12040427

In silico Analysis of Genetic Diversity of Human Hepatitis B Virus in Southeast Asia, Australia and New Zealand

Ngoc Minh Hien Phan 1,2,*, Helen Faddy 1,2,3, Robert Flower 1,2, Kirsten Spann 1,4, Eileen Roulis 1,2
PMCID: PMC7232418  PMID: 32283837

Abstract

The extent of whole genome diversity amongst hepatitis B virus (HBV) genotypes is not well described. This study aimed to update the current distribution of HBV types and to investigate mutation rates and nucleotide diversity between genotypes in Southeast Asia, Australia and New Zealand. We retrieved 930 human HBV complete genomes from these regions from the NCBI nucleotide database for genotyping, detection of potential recombination, serotype prediction, mutation identification and comparative genome analyses. Overall, HBV genotypes B (44.1%) and C (46.2%) together with predicted serotypes adr (36%), adw2 (29%) and ayw1 (19.9%) were the most commonly circulating HBV types in the studied region. The three HBV variants identified most frequently were p.V5L, c.1896G>A and double mutation c.1762A>T/c.1764G>A, while genotypes B and C had the widest range of mutation types. The study also highlighted the distinct nucleotide diversity of HBV genotypes for whole genome and along the genome length. Therefore, this study provided a robust update to HBV currently circulating in Southeast Asia, Australia and New Zealand as well as an insight into the association of HBV genetic hypervariability and prevalence of well reported mutations.

Keywords: human hepatitis B virus, genetic diversity, genotypes, serotypes, HbsAg subtypes, recombination, mutations

1. Introduction

Hepatitis B virus (HBV) infection is a global health burden. 257 million people had chronic hepatitis B (CHB) infection worldwide in 2015 [1]. CHB can develop into liver cirrhosis, and hepatocellular carcinoma (HCC), resulting in 720,000, and 470,000, deaths worldwide in 2015, respectively [1].

HBV is a small, enveloped DNA virus of the Hepadnaviridae family. Its genome (3.2 kbp) is circular and partially double-stranded with four overlapping open reading frames (ORFs) encoding for reverse transcriptase (RT), pre-surface/surface (pre-S/S) proteins, precore/core (PC/C) proteins and transcriptional transactivator X protein [2]. The overlapping structure of pre-S/S and RT ORFs is essential for HBV surface antigen (HBsAg) antigenicity and RT functionality [3]. The PC/C region is responsible for formation of two polypeptides: (1) HBV core antigen (HBcAg)—a structural protein of the nucleocapsid translated from pregenomic RNA and (2) HBV e antigen (HBeAg)—a viral secreted protein translated from PC RNA [4,5]. The core promoter is located immediately upstream of the PC/C region and consists of the basal core promoter (BCP) and the upper regulatory region (URR). URR is composed of the negative regulatory element (NRE) and the core upstream regulatory sequence (CURS), with a possible role in regulating core promoter activity [6]. The X ORF encodes for X protein, which is associated with viral regulation, cell growth and carcinogenesis by disrupting several cellular signaling pathways [7,8,9].

Several factors contribute to HBV genetic diversity. HBV possesses a unique replication cycle via an RNA intermediate, using an error-prone RT enzyme [2,10,11]. This results in a nucleotide substitution rate of >10−5 substitution/site/year, which is higher than most DNA viruses and similar to RNA viruses with low mutation rate [2,10,11]. Moreover, it has a rapid replication rate with over 1011 virus particles released in circulation per day [12].

Due to its high sequence heterogeneity, HBV has 10 recognised genotypes (A to I, and a provisional genotype J) with at least 8% nucleotide divergence, and over 35 subgenotypes with at least 4–8% nucleotide divergence over the whole genome [13,14]. Viral strains typically exhibit geographical and ethnic distributions [13,15]. Generally, genotypes A and D are highly prevalent in Europe, North Africa, North America and Asia, whilst in East and Southeast Asia, Australia and New Zealand genotypes B and C are more commonly found [15]. Genotypes E and A predominate in Sub-Saharan Africa, whilst genotypes F, G and H are recorded in Latin America [15]. To date, only a single instance of genotype J was identified in Japan [16].

Recombination is common in HBV, classified into 44 different patterns where at least two different HBV genotypes or subgenotypes exchange genetic information [17,18]. For instance, subgenotypes B2, B3 and B4 are recombinants of genotypes B and C while fragments of genotypes A, C and G are observed in genotype I [2,19]. There is also evidence of recombinants C/D and D/E [20].

HBV can be categorised into different serotypes based on serological reactivity of HBsAg including ayw1, ayw2, ayw3, ayw4, ayr, adw2, adw4, and adr. The presence of subdeterminants, d/y and r/w, depends on variation at amino acid positions 122, 160, 127, 159 and 140 [21,22]. Although the same HBV genotypes can encode different serotypes and one serotype can belong to different genotypes, the distinct variants associated with viral serotypes are consistently found at the genomic level [21,23].

The genetic variability of HBV plays an important role in clinical outcomes and planning HBV prevention strategies. Acute genotype A infections persist longer than genotype D. Genotypes A and D surpass B and C in viral chronicity, possibly due to their different roles in host-viral interactions [24,25,26]. Genotype C infections have an increased risk of higher viral loads and development of cirrhosis and HCC than genotype B [24,27]. Interferon-based treatment was shown to be more effective for genotypes A and B than genotypes C and D, demonstrated through lower expression of HBsAg, HBeAg and decreased HBV DNA levels [28].

Most HBV vaccines are generated from genotype A2 or C strains [29,30]. The vaccines are protective against non-A/C genotype infections by largely boosting the production of antibodies targeted against determinant a at amino acid positions 121–149 which corresponds to the first loop of B-cell epitope of HBsAg for all HBV genotypes and serotypes [3,21,30,31]. However, cases of cross-genotype HBV infection of vaccinated individuals have been recorded, raising questions as to the complete protection of HBV immunoprohylaxis, especially in countries where genotypes A and C are not prevalent [32,33,34].

Several HBV mutations within particular coding regions are associated with important clinical manifestations. Mutations across the major hydrophilic region (MHR) of HBsAg, where determinant a is located, disrupt the conformation of surface antigen, affect protein hydrophilicity and alter the binding of neutralizing antibodies. This allows HBV to escape host immune responses for clearance [35,36], resulting in failed immunoprophylaxis [37,38,39,40,41,42]. RT mutations are involved in the development of drug-resistant viral strains and loss of efficacy of nucleos(t)ide analogs and anti-viral drugs. These include p.M204IVS, p.V173L, p.A181T and p.L80VI for lamivudine, p.N236T and p.181V for adenovir, p.A194T for tenofovir, and p.M250V/p.I169T and p.T184G/p.S202I for entecavir [43,44]. Mutations within the PC region acquired after HBeAg seroconversion (c.1896G>A and p.P130T) can lead to defective HBV replication, decreased or lost secretion of HBeAg, HBeAg-negative CHB and even HCC with poorer prognosis [5,7,45,46,47]. The two HBcAg mutations p.I/F97L and p.P130T are frequently observed concurrently in CHB patients [44], leading to the secretion of virions containing immature genomes and hypermaturations, respectively [48]. Meanwhile, double mutations c.1762A>T / c.1764G>A in the BCP region and mutation c.1613G>A within NRE, are associated with the suppressed expression of HBeAg, increased viral load in HBV carriers and increased risk of liver carcinogenesis [6,49,50]. A study on HBV X mutations reported the presence of the mutant p.V5M/L amongst Korean patients with severe liver diseases such as cirrhosis and HCC [51].

The rate of international travel to Southeast Asia, Australia, New Zealand has increased in recent years. Given the geographically distinct distribution of HBV genotypes and serotypes, this trend could influence the prevalence of HBV strains currently circulating in this region. Moreover, there have been conflicting reports of the predominant genotype in a number of countries within this area. A study in 2004 [13] identified only HBV genotypes B and C in the region, except for Australia, in which genotypes D and C were found. Meanwhile, a recent systematic literature review [15] reported genotypes B, C, D and A circulating in these countries. This study therefore aimed to robustly update the situation of HBV types in this region. Moreover, given the potential complications arising from aforementioned HBV mutations, we sought to estimate a number of mutations across the viral genome by genotypes. We also investigated the nucleotide variability of HBV genotypes in the same region to explore whether there is an association between HBV heterogeneity and mutation rates for each genotype.

2. Materials and Methods

2.1. Data Collection from NCBI Nucleotide Database

A search for human HBV complete genomes listed on the NCBI nucleotide database was conducted in August 2019, firstly using the term [(“HBV” OR “hepatitis B virus”) AND “complete genome”]. We then applied the following filters: viruses for species, gDNA/RNA for molecule types, International Nucleotide Sequence Database Collaboration (INSDC) (GenBank) and [country name]. The studied countries were located in Southeast Asia, Australia and New Zealand, with names as described by the United Nations (https://unstats.un.org/unsd/methodology/m49/). Each genome was manually checked for inclusion, including HBV complete genome, Homo sapiens host, country area, and unique sequences. The country of origin of each genome was checked either by using the information given on NCBI or listed in publications.

2.2. Analysis of HBV Genotypes

For each sequence, we used NCBI HBV genotyping [52] to assess the agreement between genotypes submitted on GenBank and results provided on the HBVdb genotyping tool [53]. We included sequences KF214650|I, EU833891|I, JF899337|I, and AB486012|J for genotypes I and J as references. A genotype was assigned based on (1) concurrence of genotype assigned by three sources (Genbank submission, NCBI HBV genotyping and HBVdb genotyping tool) or (2) the first E-value given by NCBI BLAST < 1 × 10−75 in the case of genotype assignment disagreement from one or more sources/tools. We nominated unverified genotypes or potential recombinants by phylogenetic analysis using FastTree 2.1.11 [54,55] and RAxML 8.2.11 [56]. For FastTree, the default tree building setting was used. For RaxML, we used a general time-reversible (GTR) GAMMA I model, rapid bootstrapping and 1000 replicates. Figure 1 illustrates the steps of genotyping the collected sequences. Sequences were aligned using MAFFT [57,58] and Geneious Prime 2019.2.1 for the whole study.

Figure 1.

Figure 1

Flowchart of genotype assignment for each sequence retrieved from the NCBI nucleotide database.

2.3. HBV Recombination Analysis

We utilised the Recombination Detection Program (RDP) 4 [59] to detect potential recombination events within our data. Five built-in methods (RDP [60], Geneconv [61], Maxchi [62], Chimaera [63] and 3SEQ [64]) were run iteratively with their default settings, to identify potential recombinants by each method. Sequences detected as potential recombinants with overlapping breakpoints by all 5 methods were then pooled together with the sequences of unverified genotypes/recombinants for the phylogenetic analysis as outlined in 2.2. Reference sequences were included (Table S1).

2.4. Representative Phylogenetic Analysis

A number of sequences were randomly selected from the complete dataset for the identified genotypes to establish a representative phylogeny. The number of sequences chosen for the representative phylogeny was proportional to the number of sequences of each genotype in our dataset: 5 sequences for genotype A, 20 for B, 20 for C, 10 for D, 1 for G and 5 for I. Potential recombinants were also included in the population. Approximated maximum-likelihood trees were built by using FastTree 2.1.11 [54,55], RAxML 8.2.11 [56] and PHYML 3.3.20180621 [65]. Model parameters for FastTree and RAxML were as described in 2.2, Model parameters for PHYML were established by using jmodeltest: GTR+I+G, proportion of invariable sites 0.463, gamma distribution of 0.854 and 1000 bootstraps. Reference sequences were included (Table S1).

2.5. HBV Serotype Prediction

Serotypes for each HBV genome were predicted using the decision tree developed by Purdy et al. (2007) [22]. Translated HBsAg sequences were analysed with the mutation reporter tool [66] to determine serotypes. We used FLLVLLDY as an input for anchor motif at amino acid position 93. We confirmed our predictions by using the HBV serotyper tool (http://hvdr.bioinf.wits.ac.za/SmallGenomeTools/) [67] or manually. Fourteen sequences that were not predicted to any serotypes by either of the two methods above were individually investigated for their amino acid composition.

2.6. Nucleotide Diversity

We used DnaSP 6 for a comprehensive analysis of sequence variation by each genotype [68]. All degenerate nucleotides were converted to Ns prior to analysis as required by the software. Sites with gaps in the complete data file were excluded from the analysis. To evaluate the degree of polymorphism within genotypes, we estimated the average number of nucleotide differences per site between two sequences, called nucleotide diversity Pi (μ), with window length of 100 bp and step size of 25 bp.

2.7. Analysis of Nucleotide and Amino Acid Variations

Analysis of HBV variations was performed across five regions—NRE, BCP, PC, C, HBsAg, polymerase/RT, and X. Variations selected for analysis were all reported to be associated with viral life cycle and/or clinical complications (Figure 2) [69,70]. Drug resistance mutations over the RT ORF and escape mutations within HBsAg ORF were identified by HIV-grade HBV drug resistance interpretation (DRI) [70]. Other variants were determined based on nucleotides or amino acids substituted at the selected positions along nucleotide sequences or translated sequences using Geneious Prime 2019.2.1.

Figure 2.

Figure 2

Selected mutations across RT, HBsAg, NRE, BCP, PC, C and X ORFs for analysis. Nucleotide position 1 on an HBV genome corresponds to the HBV EcoR1 restriction site.

3. Results

3.1. Data Collection from NCBI Nucleotide Database

Of the 9182 complete HBV genomes identified on NCBI nucleotide database, 930 genomes met the inclusion criteria (File S1 and Figure 3). Five pairs of HBV genomes, despite having different GenBank accession numbers, were identical, therefore, one sequence of each pair was excluded. No human HBV complete genomes were found for the following geographical locations: Brunei Darussalam, Singapore, Timor-Leste, Christmas Island, Cocos Islands, Norfolk Island, Heard Island and McDonald Island. No relevant information regarding clinical condition was available for any of the sequences collected.

Figure 3.

Figure 3

Number of HBV complete genomes retrieved through each step of data collection.

3.2. Analysis of HBV Genotypes

Over 50% of sequences originated from Malaysia and Vietnam, with 296 and 180 sequences, respectively. We identified six HBV genotypes (A, B, C, D, G and I) circulating in Southeast Asia, Australia and New Zealand (Table 1 and Figure 4). The frequencies of each HBV genotype varied widely between countries. In the studied region, genotypes C and B were the most prevalent, constituting 46.2% and 44.1%. Genotype I (2.3%) was observed only in Laos, Vietnam and Thailand, whilst genotype D (5.3%) was found in New Zealand, Indonesia and Malaysia. Only one sequence was genotyped G and originated in Thailand. Countries with the widest range of circulating HBV genotypes were Malaysia (A, B, C and D) and Thailand (B, C, G and I), followed by Vietnam (B, C and I), Philippines (A, B and C) and Indonesia (B, C and D).

Table 1.

Identified HBV genotypes by country.

Genotype A B C D G I B/BC or C/CB G/GC Total
Country
Australia 1 66 67 (7.2%)
Cambodia 4 24 28 (3%)
Indonesia 82 51 9 142 (15.3%)
Laos 14 14 (1.5%)
Malaysia 2 170 116 4 4 296 (31.8%)
Myanmar 18 18 (1.9%)
New Zealand 6 36 42 (4.5%)
Philippines 9 7 10 26 (2.8%)
Thailand 16 96 1 1 1 2 117 (12.6%)
Vientam 131 43 6 180 (19.4%)
Total 12 (1.3%) 410 (44.1%) 430 (46.2%) 49 (5.3%) 1 (0.1%) 21 (2.3%) 5 (0.5%) 2 (0.2%) 930 (100%)

Figure 4.

Figure 4

Distribution of HBV genotypes and recombinants by country in Southeast Asia, Australia and New Zealand. Pie charts indicate the percentage of HBV genotypes with the diameters. Countries with human HBV complete genomes collectable from NCBI nucleotide database were highlighted in bold and uppercase.

3.3. HBV Recombination Analysis

We ran five automatic detection methods in RDP4 to detect potential recombination within our data (Table 2). In total, 65 genomes were identified as recombinant by all five methods; 35 of them had overlapping breakpoint positions, including 10 with the same major parents who contribute greater proportion of sequence. The 65 suspected recombinants were pooled together with sequences of ambiguous genotype or suspected recombination for further analysis with FastTree and RaxML (Figures S1 and S2). According to these phylogenies, 34 of the 65 sequences were clustered in genotype B, particularly B2 and B3, whilst 13 clustered in genotype C and 12 in genotype A. Of the seven sequences that were initially suspected as B/BC, C/CB or G/CG recombinants (Table 1) by HBV genotyping, two C/CB were located between the clusters of genotypes B2 and C, two G/GC between G and C, while the three B/BC were positioned within the cluster of B2 and B3. Results of recombination and phylogeny analyses are summarised in Table S2. They all shared approximately 92% similarity with their corresponding reference sequences.

Table 2.

HBV recombination detection by five automatic methods built in RDP4.

Detection Method RDP 3SEQ Chimaera Geneconv Maxchi
Events/Recombinants
Recombination events 83 80 49 87 41
Potential recombinants (possibly present in different events) 467 613 744 1199 1355
Potential recombinants (after removing replicate recombinants) 380 566 510 857 548
Recombinants identified by all methods: 83 (65 sequences from our data and 18 reference sequences)
+ with overlapping breakpoint positions: 35 (31 sequences from our data and 4 reference sequences)
+ with at least one same major parent: 10 (8 sequences from our data and 2 reference sequences)

3.4. Representative Phylogenetic Analysis

Generally, all three trees built with FastTree, RAxML and PHYML (Figures S3–S5) were consistent in clustering the sequences of the same genotypes identified in 3.2, supporting our genotyping results. The divisions into genotypes A, B, D and I were highly supported by all three methods. Lower support values were calculated for genotype G (82% by RAxML and 78.1% by PHYML) and C (62% by RAxML and 53.5% by PHYML). PHYML and RAxML trees were generally congruent, except for five sequences, one of genotype B and four of genotype C, though still clustering within clades of their corresponding genotypes. Differently, for FastTree tree, genotype roots were relocated and a bigger number of sequences of genotypes B, C, D and F changed their arrangement within the same clade.

3.5. HBV Serotype Prediction

We predicted serotypes for the 930 HBV genomes using the decision tree of Purdy et al. [22]. Table 3 summarises the distribution of the predicted HBV serotypes by genotypes. Serotypes adr, adw2 and ayw1 were most prevalent in the studied region, accounting for 36%, 29% and 19.9%, respectively. They were predominantly found with HBV genotypes B and C, which are common in Southeast Asia and Australia. All sequences of serotype adr and ayr were within genotype C. Serotypes adw2 and ayw1 were found in a wider range of genotypes than the other serotypes. Serotypes adw3, ayw2 and ayw3 were scattered among genotypes B, C and D with no discernible distribution pattern.

Table 3.

Predicted HBV serotypes by genotype and country in Southeast Asia, Australia and New Zealand.

Serotype adr
(n = 335, 36%)
adw2
(n = 270, 29%)
adw3
(n = 7, 0.8%)
ayr
(n = 6, 0.7%)
ayw1
(n = 185, 19.9%)
ayw2
(n = 45, 4.8%)
ayw3
(n = 70, 7.5%)
Unassigned (n = 12, 1.3%)
Genotype/
Country
A 9 2 1
Australia 1
Malaysia 1 1
Philippines 7 2
B 218 5 172 1 6 8
Cambodia 2 2
Indonesia 46 1 34 1
Malaysia 141 4 20 1 4
Philippines 1 6
Thailand 12 4
Vietnam 16 106 6 3
C 331 29 1 6 60 3
Australia 6 60
Cambodia 23 1
Indonesia 45 3 3
Malaysia 107 5 1 3
Myanmar 17 1
New Zealand 6
Philippines 1 9
Thailand 86 7 3
Vietnam 40 3
D 1 44 4
Indonesia 1 8
Malaysia 4
New Zealand 36
G 1
Thailand 1
I 11 10
Laos 4 10
Thailand 1
Vietnam 6
B/BC 1 1 1
Thailand 1
Malaysia 1 1
C/CB 1 1
Malaysia 1 1
G/GC 2
Thailand 2

3.6. Nucleotide Diversity

Nucleotide diversity analysis indicates a wide range of genetic variability for whole genome (Table 4) and along the HBV genome length (Figure 5). Genotype G was not included because only one sequence of this genotype was identified. Genotypes B and C demonstrated a higher number of segregating sites, Pi values and average number of nucleotide differences, indicating greater genetic diversity compared to other genotypes (Table 4). Despite the lowest number of genomes included in the dataset, genotype A demonstrated Pi and k values greater than genotypes D and I. This suggests that genotype D and I were the most conserved in their overall nucleotide sequences.

Table 4.

DNA polymorphism of HBV genotypes identified in Southeast Asia, Australia and New Zealand.

Genotype Number of Polymorphic (Segregating) Sites Average Number of Nucleotide Differences (k) Nucleotide Diversity (Pi)
Genotype A (n = 12) 44 10.788 0.00775
Genotype B (n = 410) 454 11.515 0.00827
Genotype C (n = 430) 446 12.548 0.00901
Genotype D (n = 49) 68 7.770 0.00558
Genotype I (n = 21) 37 8.148 0.00585
Total N = 930 (including genotype G and recombinants) 676 20.944 0.01505

Figure 5.

Figure 5

Nucleotide diversity (Pi) over HBV genome. The MAFF-aligned genomes had 3434 bp in length; genotypes were colour-coded (sites with gaps in the complete data file were excluded; window length 100 bp, step size 25 bp).

Moreover, we found genetic differences appeared to be unevenly distributed along the genome length (Figure 5). At a genotypic level, genotypes B and C were the most diverse within the overlapping RT and pre-S/S coding genes, while genotype A expressed more genetic variations located near or within X gene and PC/C gene. Genotypes D and I were more conserved than the others over the whole genome, except for a distinct locus in RT ORF.

3.7. Analysis of Single Nucleotide and Amino Acid Variations

Our dataset was investigated for the frequency of the selected variations by genotype (Table 5). We observed the p.V5L variation was present in almost all sequences analysed (≥83% prevalence for all genotypes). c.1896G>A was the second most common variant, particularly with HBV genotypes B, D, I and C, while c.1762A>T/c.1764G>A ranked third but was less genotype-specific. It was noted that the compound nucleotide substitution at positions 1762 and 1764 within BCP was more likely to occur together, rather than to have either substitution singularly. Escape mutations in the viral surface coding gene (14.4%) and c.1613G>A in NRE (12.8%) also occurred frequently, predominantly in genotypes B and C. Additionally, p.P130T in core gene and p.V5M in the X gene were mainly acquired in genotypes B and C, while drug resistance mutations occurred sporadically for genotypes A–D.

Table 5.

Number and percentage of selected mutations by HBV genotype 1.

Mutation Type (Region) A (n = 12) B (n = 410) C (n = 430) D (n = 49) G (n = 1) I (n = 21) B/BC, C/CB, G/GC (n = 7) Total (n = 930)
Drug resistance mutations (polymerase) 1 (8%) 15 (3.7%) 22 (5%) 2 (4%) 0 0 0 40 (4.3%)
Escape mutations (pre-S/S) 2 (16%) 41 (10%) 79 (18%) 8 (16%) 0 2 (10%) 2 (27%) 134 (14.4%)
c.1613G>A (NRE) 2 (16%) 44 (11%) 67 (15%) 0 1 (100%) 4 (19%) 1 (14%) 119 (12.8%)
Only c.1762A>T (BCP) 0 7 (2%) 3 (0.7%) 0 0 0 0 10 (1.1%)
Only c.1764G>A (BCP) 1 (8%) 3 (0.7%) 20 (5%) 1 (2%) 0 0 0 25 (2.7%)
Double c.1762A>T/ c.1764G>A (BCP) 5 (42%) 60 (15%) 159 (37%) 7 (14%) 1 (100%) 5 (24%) 2 (27%) 239 (25.7%)
c.1896G>A (PC) 0 193 (47%) 77 (18%) 13 (27%) 0 4 (19%) 2 (27%) 289 (31.1%)
p.P130T (C) 0 28 (6.8%) 60 (14%) 0 1 (100%) 0 0 89 (9.6%)
p.V5M (X) 0 8 (2%) 11 (2.6%) 0 0 0 1 (14%) 20 (2.2%)
p.V5L (X) 10 (83%) 401 (97%) 395 (92%) 48 (97%) 1 (100%) 21 (100%) 6 (86%) 882 (95%)

1 Note: One genome can acquire one or more variations.

4. Discussion

This in silico study of the distribution of HBV genotypes and recombinants in Southeast Asia, Australia and New Zealand was performed through the phylogenetic, recombination and comparative genomic analysis of 930 human HBV complete genomes. We observed a high prevalence of genotypes C and B in the studied region. This observation is consistent with previous studies [13,15]. The exception is Laos, where we identified genotype I as the dominant genotype, rather than genotypes B and C as reported by Norder [13] and Velkov [15]. We also detected frequent recombinants between genotypes B and C, all from Thailand and Malaysia.

This study also investigated the distribution of HBsAg subtypes in the study region predicted using a decision tree [22]. The three most prevalent serotypes were adr (36%), adw2 (29%) and ayw1 (19.9%). We found serotypes adr, ayr and ayw3 most commonly associated with genotype C, adw2 for B and C, adw3 and ayw1 for B, and ayw2 for D. These findings differ to Norder’s study [13], which reported that the serotypes adw2, ayw1 and ayw3 were more pronounced amongst genotypes A, B and D. Moreover, our observations of HBsAg subtype distribution for genotypes B and D are dissimilar to the finding of Chen et al. [71] who reported that adr and ayw2 were prevalent among genotype B strains while most of genotype D sequences were assigned to adr [71]. This is likely due to the geographical selection of HBV sequences used in this study. We targeted Southeast Asia, Australia and New Zealand, while the Norder study [13] was worldwide and the Chen study [71] focused on China.

To our knowledge, this study is also the first to investigate nucleotide diversity between HBV genotypes. We found that levels of nucleotide diversity varied amongst the different genotypes. Genotypes C and B showed an overall higher genetic diversity and more mutation types than genotypes A, D and I. Furthermore, we found that each genotype demonstrated varying genetic diversity within different ORFs (Figure 5). Specifically, genotypes B and C displayed the highest heterogeneity in the polymerase and pre-S/S regions while genotype A showed the highest diversity within the X and PC/C genes.

Information on the distribution of HBV mutations along genome length amongst genotypes is limited. Our study showed that the frequency of these mutants varied with genotypes (Table 5). This observation was also reported by Bayliss et al. [69]. We found that sequences with mutations, except for c.1896G>A, were mainly associated with to genotype C. However, we observed that p.V5L, c.1896G>A and c.1762A>T/c.1764G>A were the three most prevalent variants, contrasting to c.G1613A, c.1762A>T/c.1764G>A and c.1896G>A in the Bayliss study. These differences are likely attributable to sample selection criteria. In the Bayliss study, only HBeAg-positive patients with CHB were included, which may favour the presence of particular mutations. In our study, there was very little information linked to clinical diseases available on the NCBI nucleotide database for the collected genomes. Interestingly, in our study, escape mutations associated with risk of failed immunoprophylaxis [37,38,39,40,41,42] occurred significantly for genotypes A and C (totally 18%), compared to non-A/C genotypes (totally 10.6%) (X2 = 10.588, df = 1, p = 0.001138), given that most currently used HBV vaccines are generated from genotype A or C strains [29,30]. Due to the unavailability of relevant clinical information for these entries in GenBank, vaccine status for these sequences is unknown.

In considering whether an association between genetic hypervariability and mutation rate exists for each HBV genotype, we observed that the PC/C ORF of genotype A demonstrated high nucleotide diversity had a high proportion of c.1762A>T/c.1764G>A (42%) and c.1613G>A (16%) mutations. By contrast, although the polymerase and pre-S/S genes of genotypes B and C demonstrated the highest diversity compared with other ORFs, the percentage of mutations within these regions, including drug resistance and escape mutations, were much lower than all other mutations within the PC/C region combined (Table 5). This observation may be explained by a finding of Osiowy et al. [10], that the RT region has a higher ratio of synonymous to non-synonymous substitutions (dS/dN) than the core coding region. In other words, nucleotide subtitutions in the RT ORF are less likely to alter amino acid sequence, and therefore are less likely to be associated with clinically significant mutations than variation in the core region. This feature may allow HBV to play critical roles of the overlapping pre-S/S and RT regions on maintaining HBV transcription and HBsAg functionality against host immune system [3].

We recognise that our study has limitations. Our sequence selection was regionally focused, and HBV strains circulating in countries outside our studied area could account for genetic diversity of genotypes and subgenotypes which are absent or less frequent in Southeast Asia, Australia and New Zealand. There is a need for a global study of HBV genetic variability, to better understand contemporary distribution pattern—particularly considering the rise in human migration and ease of global travel.

In conclusion, this investigation is an up-to-date, thorough report on the distribution of HBV types circulating in Southeast Asia, Australia and New Zealand. We highlighted HBV nucleotide variability, and the prevalence of different mutations within genotypes and ORFs. Given that several HBV variants are associated with failure of immunoprophylaxis, drug resistance and liver disease progression, the understanding of variability and mutations within genotypes highlighted in our study may guide clinicians in predicting the risks associated with failed protection and unsuccessful treatment.

Acknowledgments

Computational and data visualization resources and services used in this work were provided by HPC and Research Support Group, Queensland University of Technology, Brisbane, Australia as well as by ICT Services Desk, Australian Red Cross Blood Service, Brisbane, Australia. The authors gratefully acknowledge Elvina Viennet for her advice on using R mapping.

Supplementary Materials

The following are available online at https://www.mdpi.com/1999-4915/12/4/427/s1, Figure S1: FastTree maximum likelihood tree built from sequences of ambiguous genotypes and potential recombinants identified by RDP4, Figure S2: RAxML maximum likelihood tree built from sequences of ambiguous genotypes and potential recombinants identified by RDP4, Figure S3: FastTree maximum likelihood tree built from representative population, Figure S4: RAxML maximum likelihood tree built from representative population, Figure S5: PHYML maximum likelihood tree built from representative population, Table S1: Reference sequences used in this study, Table S2: Features of finally identified potential recombinants, File S1: Summarised results of 930 human HBV genomes from Southeast Asia, Australia and New Zealand.

Author Contributions

Conceptualization, N.M.H.P., E.R., H.F.; methodology, N.M.H.P. and E.R.; investigation, N.M.H.P. and E.R.; data curation, N.M.H.P. and E.R.; writing—original draft preparation, N.M.H.P.; writing—review and editing, all authors; visualization, N.M.H.P., E.R.; supervision, E.R., H.F., R.F. and K.S.; project administration, H.F., R.F. and E.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding. Australian governments fund Australian Red Cross Lifeblood for the provision of blood, blood products and services to the Australian community.

Conflicts of Interest

The authors declare no conflict of interest.

References

  • 1.World Health Organization . Global Hepatitis Report 2017. World Health Organization; Geneva, Switzerland: 2017. [Google Scholar]
  • 2.McNaughton A.L., D’arienzo V., Ansari M.A., Lumley S.F., Littlejohn M., Revill P., McKeating J.A., Matthews P.C. Insights from deep sequencing of the hbv genome—Unique, tiny, and misunderstood. Gastroenterology. 2019;156:384–399. doi: 10.1053/j.gastro.2018.07.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Cento V., Mirabelli C., Dimonte S., Salpini R., Han Y., Trimoulet P., Bertoli A., Micheli V., Gubertini G., Cappiello G., et al. Overlapping structure of hepatitis b virus (hbv) genome and immune selection pressure are critical forces modulating hbv evolution. J. Gen. Virol. 2013;94:143. doi: 10.1099/vir.0.046524-0. [DOI] [PubMed] [Google Scholar]
  • 4.Schödel F., Peterson D., Zheng J., Jones J.E., Hughes J.L., Milich D.R. Structure of hepatitis b virus core and e-antigen. A single precore amino acid prevents nucleocapsid assembly. J. Biol. Chem. 1993;268:1332. [PubMed] [Google Scholar]
  • 5.Kim H., Lee S.A., Do S.Y., Kim B.J. Precore/core region mutations of hepatitis b virus related to clinical severity. World J. Gastroenterol. 2016;22:4287–4296. doi: 10.3748/wjg.v22.i17.4287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Li M.S., LauTerrence C.K., ChanSophie K.P., Wong C.H., NgPatrick K.S., SungJoseph J.Y., ChanHenry L.Y., TsuiStephen K.W. The g1613a mutation in the hbv genome affects hbeag expression and viral replication through altered core promoter activity. PLoS ONE. 2011;6:e21856. doi: 10.1371/journal.pone.0021856. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Zhang Z., Wu C., Chen X., Li X., Li J., Lu M. Genetic variation of hepatitis b virus and its significance for pathogenesis. World J. Gastroenterol. 2016;22:126–144. doi: 10.3748/wjg.v22.i1.126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Wang W., Shi Y.E., Bai G., Tang Y., Yuan Y., Zhang T., Li C. Hbxag suppresses apoptosis of human placental trophoblastic cell lines via activation of the pi3k/akt pathway. Cell Biol. Int. 2016;40:708. doi: 10.1002/cbin.10608. [DOI] [PubMed] [Google Scholar]
  • 9.Cha M.-Y., Kim C.-M., Park Y.-M., Ryu W.-S. Hepatitis b virus x protein is essential for the activation of wnt/beta-catenin signaling in hepatoma cells. Hepatol. (Baltim. Md.) 2004;39:1683. doi: 10.1002/hep.20245. [DOI] [PubMed] [Google Scholar]
  • 10.Osiowy C., Giles E., Tanaka Y., Mizokami M., Minuk G.Y. Molecular evolution of hepatitis b virus over 25 years. J. Virol. 2006;80:10307. doi: 10.1128/JVI.00996-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Park S.G., Kim Y., Park E., Ryu H.M., Jung G. Fidelity of hepatitis b virus polymerase. Eur. J. Biochem. 2003;270:2929–2936. doi: 10.1046/j.1432-1033.2003.03650.x. [DOI] [PubMed] [Google Scholar]
  • 12.Nowak M.A., Bonhoeffer S., Hill A.M., Boehme R., Thomas H.C., Mcdade H. Viral dynamics in hepatitis b virus infection. Proc. Natl. Acad. Sci. USA. 1996;93:4398. doi: 10.1073/pnas.93.9.4398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Norder H., Couroucé A.-M., Coursaget P., Echevarria J.M., Lee S.-D., Mushahwar I.K., Robertson B.H., Locarnini S., Magnius L.O. Genetic diversity of hepatitis b virus strains derived worldwide: Genotypes, subgenotypes, and hbsag subtypes. Intervirology. 2004;47:289–309. doi: 10.1159/000080872. [DOI] [PubMed] [Google Scholar]
  • 14.Okamoto H., Tsuda F., Sakugawa H., Sastrosoewignjo R.I., Imai M., Miyakawa Y., Mayumi M. Typing hepatitis b virus by homology in nucleotide sequence: Comparison of surface antigen subtypes. J. Gen. Virol. 1988;69:2575–2583. doi: 10.1099/0022-1317-69-10-2575. [DOI] [PubMed] [Google Scholar]
  • 15.Velkov S., Ott J., Protzer U., Michler T. The global hepatitis b virus genotype distribution approximated from available genotyping data. Genes. 2018;9:495. doi: 10.3390/genes9100495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Tatematsu K., Tanaka Y., Kurbanov F., Sugauchi F., Mano S., Maeshiro T., Nakayoshi T., Wakuta M., Miyakawa Y., Mizokami M. A genetic variant of hepatitis b virus divergent from known human and ape genotypes isolated from a japanese patient and provisionally assigned to new genotype j. J. Virol. 2009;83:10538. doi: 10.1128/JVI.00462-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Pérez-Losada M., Arenas M., Galán J.C., Palero F., González-Candelas F. Recombination in viruses: Mechanisms, methods of study, and evolutionary consequences. Infect. Genet. Evol. 2015;30:296–307. doi: 10.1016/j.meegid.2014.12.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Shi W., Carr M.J., Dunford L., Zhu C., Hall W.W., Higgins D.G. Identification of novel inter-genotypic recombinants of human hepatitis b viruses by large-scale phylogenetic analysis. Virology. 2012;427:51–59. doi: 10.1016/j.virol.2012.01.030. [DOI] [PubMed] [Google Scholar]
  • 19.Liao H., Li X., Liu Y., Xu Z., Huang P., Nian X., Liu X., Xu D. Intergenotype recombinant analysis of full-length hepatitis b virus genomes from 516 chinese patients with different illness categories. J. Med. Virol. 2017;89:139–145. doi: 10.1002/jmv.24609. [DOI] [PubMed] [Google Scholar]
  • 20.Boyce C., Ganova-Raeva L., Archampong T., Lartey M., Sagoe K., Obo-Akwa A., Kenu E., Kwara A., Blackard J. Identification and comparative analysis of hepatitis b virus genotype d/e recombinants in africa. Virus Genes. 2017;53:538–547. doi: 10.1007/s11262-017-1469-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Norder H., Courouce A.M., Magnius L.O. Molecular basis of hepatitis b virus serotype variations within the four major subtypes. J. Gen. Virol. 1992;73:3141–3145. doi: 10.1099/0022-1317-73-12-3141. [DOI] [PubMed] [Google Scholar]
  • 22.Purdy M.A., Talekar G., Swenson P., Araujo A., Fields H. A new algorithm for deduction of hepatitis b surface antigen subtype determinants from the amino acid sequence. Intervirology. 2007;50:45–51. doi: 10.1159/000096312. [DOI] [PubMed] [Google Scholar]
  • 23.Norder H., Hammas B., Lofdahl S., Courouce A.M., Magnius L.O. Comparison of the amino acid sequences of nine different serotypes of hepatitis b surface antigen and genomic classification of the corresponding hepatitis b virus strains. J. Gen. Virol. 1992;73:1201–1208. doi: 10.1099/0022-1317-73-5-1201. [DOI] [PubMed] [Google Scholar]
  • 24.Lin C.-L., Kao J.-H. Hepatitis b virus genotypes and variants. Cold Spring Harb. Perspect. Med. 2015;5:a021436. doi: 10.1101/cshperspect.a021436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Ito K., Yotsuyanagi H., Yatsuhashi H., Karino Y., Takikawa Y., Saito T., Arase Y., Imazeki F., Kurosaki M., Umemura T., et al. Risk factors for long-term persistence of serum hepatitis b surface antigen following acute hepatitis b virus infection in japanese adults. Hepatology. 2014;59:89–97. doi: 10.1002/hep.26635. [DOI] [PubMed] [Google Scholar]
  • 26.Mayerat C., Mantegani A., Frei P.C. Does hepatitis b virus (hbv) genotype influence the clinical outcome of hbv infection? J. Viral Hepat. 1999;6:299–304. doi: 10.1046/j.1365-2893.1999.00174.x. [DOI] [PubMed] [Google Scholar]
  • 27.Yu M.-W., Yeh S.-H., Chen P.-J., Liaw Y.-F., Lin C.-L., Liu C.-J., Shih W.-L., Kao J.-H., Chen D.-S., Chen C.-J. Hepatitis b virus genotype and DNA level and hepatocellular carcinoma: A prospective study in men. J. Natl. Cancer Inst. 2005;97:265–272. doi: 10.1093/jnci/dji043. [DOI] [PubMed] [Google Scholar]
  • 28.Zhang Y., Wu Y., Ye S., Wang T., Zhao R., Chen F., Abe K., Jin X. The response to interferon is influenced by hepatitis b virus genotype in vitro and in vivo. Virus Res. 2013;171:65–70. doi: 10.1016/j.virusres.2012.10.027. [DOI] [PubMed] [Google Scholar]
  • 29.Cassidy A., Mossman S., Olivieri A., Ridder M.D., Leroux-Roels G. Hepatitis b vaccine effectiveness in the face of global hbv genotype diversity. Expert Rev. Vaccines. 2011;10:1709–1715. doi: 10.1586/erv.11.151. [DOI] [PubMed] [Google Scholar]
  • 30.Hamada-Tsutsumi S., Iio E., Watanabe T., Murakami S., Isogawa M., Iijima S., Inoue T., Matsunami K., Tajiri K., Ozawa T., et al. Validation of cross-genotype neutralization by hepatitis b virus-specific monoclonal antibodies by in vitro and in vivo infection. PLoS ONE. 2015;10:e0118062. doi: 10.1371/journal.pone.0118062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Shokrgozar M.A., Shokri F. Subtype specificity of anti-hbs antibodies produced by human b-cell lines isolated from normal individuals vaccinated with recombinant hepatitis b vaccine. Vaccine. 2002;20:2215–2220. doi: 10.1016/S0264-410X(02)00116-0. [DOI] [PubMed] [Google Scholar]
  • 32.Stramer S.L., Wend U., Candotti D., Foster G.A., Hollinger F.B., Dodd R.Y., Allain J.-P., Gerlich W. Nucleic acid testing to detect hbv infection in blood donors. N. Engl. J. Med. 2011;364:236–247. doi: 10.1056/NEJMoa1007644. [DOI] [PubMed] [Google Scholar]
  • 33.Griffiths E., Reeve C., Marley J.V. Hepatitis b notifications in a vaccinated cohort of aboriginal people in the kimberley region. Med. J. Aust. 2014;201:343–346. doi: 10.5694/j.1326-5377.2014.tb04228.x. [DOI] [PubMed] [Google Scholar]
  • 34.Seed C.R., Jones N.T., Pickworth A.M., Graham W.R. Two cases of asymptomatic hbv “vaccine breakthrough” infection detected in blood donors screened for hbv DNA. Med. J. Aust. 2012;196:651–652. doi: 10.5694/mja11.11589. [DOI] [PubMed] [Google Scholar]
  • 35.Purdy M. Hepatitis b virus s gene escape mutants. Asian J. Transfus. Sci. 2007;1:62–70. doi: 10.4103/0973-6247.33445. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Xiao Y., Sun K., Duan Z., Liu Z., Li Y., Yan L., Song Y., Zou H., Zhuang H., Wang J., et al. Quasispecies characteristic in “a” determinant region is a potential predictor for the risk of immunoprophylaxis failure of mother-to-child-transmission of sub-genotype c2 hepatitis b virus: A prospective nested case–control study. Gut. 2020;69:933–941. doi: 10.1136/gutjnl-2019-318278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Huang X., Lu D., Ji G., Sun Y., Ma L., Chen Z., Zhang L., Huang J., Yu L. Hepatitis b virus (hbv) vaccine-induced escape mutants of hbv s gene among children from qidong area, china. Virus Res. 2004;99:63–68. doi: 10.1016/j.virusres.2003.10.007. [DOI] [PubMed] [Google Scholar]
  • 38.Oon C.J., Chen W. Current aspects of hepatitis b surface antigen mutants in singapore. J. Viral Hepat. 1998;5:17–23. doi: 10.1046/j.1365-2893.1998.0050s2017.x. [DOI] [PubMed] [Google Scholar]
  • 39.Kato H., Sugiyama M., Mizokami M. Hepatitis b virus genotypes. In: Liaw Y.-F., Zoulim F., editors. Hepatitis b Virus in Human Diseases. 1st ed. Springer International Publishing; Cham, Switzerland: 2016. [Google Scholar]
  • 40.Ngui S.L., O’Connell S., Eglin R.P., Heptonstall J., Teo C.G. Low detection rate and maternal provenance of hepatitis b virus s gene mutants in cases of failed postnatal immunoprophylaxis in england and wales. J. Infect. Dis. 1997;176:1360–1365. doi: 10.1086/514133. [DOI] [PubMed] [Google Scholar]
  • 41.Haruki K., Ayano I., Shuichiro U., Tomoyuki T., Tsuyoshi S., Yasuhiro K., Tomoo F. Evaluation of the g145r mutant of the hepatitis b virus as a minor strain in mother-to-child transmission. PLoS ONE. 2016;11:e0165674. doi: 10.1371/journal.pone.0165674. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Mokaya J., McNaughton A.L., Hadley M., Beloukas A., Geretti A., Goedhals D., Matthews P.C. A systematic review of hepatitis b virus (hbv) drug and vaccine escape mutations in africa: A call for urgent action. PLoS Negl. Trop. Dis. 2018;12:e0006629. doi: 10.1371/journal.pntd.0006629. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Shaw T., Bartholomeusz A., Locarnini S. Hbv drug resistance: Mechanisms, detection and interpretation. J. Hepatol. 2006;44:593–606. doi: 10.1016/j.jhep.2006.01.001. [DOI] [PubMed] [Google Scholar]
  • 44.Caligiuri P., Cerruti R., Icardi G., Bruzzone B. Overview of hepatitis b virus mutations and their implications in the management of infection. World J. Gastroenterol. 2016;22:145. doi: 10.3748/wjg.v22.i1.145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Colombatto P., Barbera C., Bortolotti F., Maina A.M., Moriconi F., Cavallone D., Calvo P., Oliveri F., Bonino F., Brunetto M.R. Hbv pre-core mutant in genotype-d infected children is selected during hbeag/anti-hbe seroconversion and leads to hbeag negative chronic hepatitis b in adulthood. J. Med. Virol. 2018;90:1232–1239. doi: 10.1002/jmv.25068. [DOI] [PubMed] [Google Scholar]
  • 46.Funk M.L., Rosenberg D.M., Lok A.S.F. World-wide epidemiology of hbeag-negative chronic hepatitis b and associated precore and core promoter variants. J. Viral Hepat. 2002;9:52–61. doi: 10.1046/j.1365-2893.2002.00304.x. [DOI] [PubMed] [Google Scholar]
  • 47.Xie Y., Liu S., Zhao Y., Zhang L., Liu B., Guo Z. Precore/core region mutations in hepatitis b virus DNA predict postoperative survival in hepatocellular carcinoma. PLoS ONE. 2015;10:e0133393. doi: 10.1371/journal.pone.0133393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Wu S., Chang Y., Chu T., Shih C. Persistence of hepatitis b virus DNA and the tempos between virion secretion and genome maturation in a mouse model. J. Virol. 2019:93. doi: 10.1128/JVI.01001-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Nie H., Evans A.A., London W.T., Block T.M., Ren X.D. Quantitative dynamics of hepatitis b basal core promoter and precore mutants before and after hbeag seroconversion. J. Hepatol. 2012;56:795–802. doi: 10.1016/j.jhep.2011.11.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Yang Z., Zhuang L., Lu Y., Xu Q.N., Tang B., Chen X. Naturally occurring basal core promoter a1762t/g1764a dual mutations increase the risk of hbv-related hepatocellular carcinoma: A meta-analysis. Oncotarget. 2016;7:12525–12536. doi: 10.18632/oncotarget.7123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Kim H.J., Park J.H., Jee Y., Lee S.A., Kim H., Song B.C., Yang S., Lee M., Yoon J.H., Kim Y.J., et al. Hepatitis b virus x mutations occurring naturally associated with clinical severity of liver disease among korean patients with chronic genotype c infection. J. Med. Virol. 2008;80:1337–1343. doi: 10.1002/jmv.21219. [DOI] [PubMed] [Google Scholar]
  • 52.Rozanov M., Plikat U., Chappey C., Kochergin A., Tatusova T. A web-based genotyping resource for viral sequences. Nucleic Acids Res. 2004;32:W654–W659. doi: 10.1093/nar/gkh419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Hayer J., Jadeau F., Deléage G., Kay A., Zoulim F., Combet C. Hbvdb: A knowledge database for hepatitis b virus. Nucleic Acids Res. 2013;41:D566. doi: 10.1093/nar/gks1022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Price M.N., Dehal P.S., Arkin A.P. Fasttree: Computing large minimum evolution trees with profiles instead of a distance matrix. Mol. Biol. Evol. 2009;26:1641–1650. doi: 10.1093/molbev/msp077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Price M.N., Dehal P.S., Arkin A.P. Fasttree 2—Approximately maximum-likelihood trees for large alignments (fasttree 2) PLoS ONE. 2010;5:e9490. doi: 10.1371/journal.pone.0009490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Stamatakis A. Raxml-vi-hpc: Maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–2690. doi: 10.1093/bioinformatics/btl446. [DOI] [PubMed] [Google Scholar]
  • 57.Katoh K., Standley D.M. Mafft multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Katoh K., Misawa K., Kuma K.-I., Miyata T. Mafft: A novel method for rapid multiple sequence alignment based on fast fourier transform. Nucleic Acids Res. 2002;30:3059. doi: 10.1093/nar/gkf436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Martin D.P., Murrell B., Golden M., Khoosal A., Muhire B. Rdp4: Detection and analysis of recombination patterns in virus genomes. Virus Evol. 2015;1:vev003. doi: 10.1093/ve/vev003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Martin D., Rybicki E. Rdp: Detection of recombination amongst aligned sequences. Bioinformatics. 2000;16:562. doi: 10.1093/bioinformatics/16.6.562. [DOI] [PubMed] [Google Scholar]
  • 61.Padidam M., Sawyer S., Fauquet C.M. Possible emergence of new geminiviruses by frequent recombination. Virology. 1999;265:218–225. doi: 10.1006/viro.1999.0056. [DOI] [PubMed] [Google Scholar]
  • 62.Smith J. Analyzing the mosaic structure of genes. J. Mol. Evol. 1992;34:126–129. doi: 10.1007/BF00182389. [DOI] [PubMed] [Google Scholar]
  • 63.David P., Keith A.C. Evaluation of methods for detecting recombination from DNA sequences: Computer simulations. Proc. Natl. Acad. Sci. USA. 2001;98:13757. doi: 10.1073/pnas.241370698. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Lam H.M., Ratmann O., Boni M.F. Improved algorithmic complexity for the 3seq recombination detection algorithm. Mol. Biol. Evol. 2018;35:247–251. doi: 10.1093/molbev/msx263. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Guindon S., Dufayard J.-F., Lefort V., Anisimova M., Hordijk W., Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of phyml 3.0. Syst. Biol. 2010;59:307–321. doi: 10.1093/sysbio/syq010. [DOI] [PubMed] [Google Scholar]
  • 66.Bell T.G., Kramvis A. Mutation reporter tool: An online tool to interrogate loci of interest, with its utility demonstrated using hepatitis b virus. Virol. J. 2013;10:62. doi: 10.1186/1743-422X-10-62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Bell T.G., Kramvis A. Bioinformatics tools for small genomes, such as hepatitis b virus. Viruses. 2015;7:781–797. doi: 10.3390/v7020781. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Rozas J., Ferrer-Mata A., Sánchez-DelBarrio J.C., Guirao-Rico S., Librado P., Ramos-Onsins S.E., Sánchez-Gracia A. Dnasp 6: DNA sequence ppolymorphism analysis of large data sets. Mol. Biol. Evol. 2017;34:3299–3302. doi: 10.1093/molbev/msx248. [DOI] [PubMed] [Google Scholar]
  • 69.Bayliss J., Yuen L., Rosenberg G., Wong D., Littlejohn M., Jackson K., Gaggar A., Kitrinos K.M., Subramanian G.M., Marcellin P., et al. Deep sequencing shows that hbv basal core promoter and precore variants reduce the likelihood of hbsag loss following tenofovir disoproxil fumarate therapy in hbeag-positive chronic hepatitis b. Gut. 2017;66:2013. doi: 10.1136/gutjnl-2015-309300. [DOI] [PubMed] [Google Scholar]
  • 70.Neumann-Fraune M., Beggel B., Kaiser R., Obermeier M. Hepatitis B virus drug resistance tools: One sequence, two predictions. Intervirology. 2014;57:232–236. doi: 10.1159/000361076. [DOI] [PubMed] [Google Scholar]
  • 71.Chen X., Gao J., Ji Z., Zhang W., Zhang L., Xu R., Zhang J., Li F., Li S., Hu S., et al. A description of the hepatitis b virus genomic background in a high-prevalence area in China. Virol. J. 2014;11:101. doi: 10.1186/1743-422X-11-101. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Viruses are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES