Skip to main content
Frontiers in Microbiology logoLink to Frontiers in Microbiology
. 2023 Nov 2;14:1285367. doi: 10.3389/fmicb.2023.1285367

A novel candidate hepatitis C virus genotype 4 subtype identified by next generation sequencing full-genome characterization in a patient from Saudi Arabia

Mariantonietta Di Stefano 1,, Mona H Ismail 2,3,*,, Thomas Leitner 4, Giuseppina Faleo 1, Marwan Jabr Alwazzeh 3,5, Jean Lutamyo Mbisa 6, Josè Ramon Fiore 1,, Teresa Antonia Santantonio 1,
PMCID: PMC10653324  PMID: 38029191

Abstract

Background and aim

Hepatitis C virus (HCV) infection is a major global public health concern, being a leading cause of chronic liver diseases such as chronic hepatitis, cirrhosis, and hepatocellular carcinoma. The virus is classified into 8 genotypes and 93 subtypes, each displaying distinct geographic distributions. Genotype 4 is the most predominant in the Middle East and Eastern Mediterranean and is associated with high rates of hepatitis C infection worldwide. This study used next-generation sequencing to fully characterize the HCV genome and identify a novel subtype within genotype 4 isolated from a 64-year-old Saudi man diagnosed with hepatitis C.

Methods

We analyzed the complete genome of the 141-HCV isolate using whole-genome sequencing.

Results

Our phylogenetic reconstructions, based on the entire genome of HCV-4 strains, revealed that the 141-HCV isolate formed a distinct group within the genotype 4 classification, providing valuable new insights into the variability of HCV.

Conclusion

This discovery of a previously unclassified HCV subtype within genotype 4 sheds light on the ongoing evolution and diversity of the virus. Such knowledge has significant implications for diagnostic and therapeutic approaches, as different subtypes may exhibit varying drug sensitivities and resistance profiles.

Keywords: Hepatitis C virus (HCV), genotypes, subtypes, next generation sequence, Saudi Arabia

Introduction

Hepatitis C virus (HCV) is a single-stranded, positive polarity RNA virus belonging to the genus Hepacivirus, a member of the Flaviviridae family (Li and Lo, 2015). Despite new, highly efficient antiviral drugs, HCV infection is still a major global public health problem, being a major cause of chronic hepatitis, chronic liver diseases, cirrhosis, and hepatocellular carcinoma (Bertino et al., 2016). The Middle East and Eastern Mediterranean (M and E) region have been reported to have the highest rates of HCV infection in the world, with an incidence of 62.5 per 100,000 person-years and a prevalence of 2.3% among the general population. In 2015, it was estimated that approximately one-fourth of 1.75 million newly HCV- infected persons and one-fifth of 71 million chronically infected individuals in the world resided in M and E countries. The median of the anti-HCV seropositivity rate in this region ranged broadly from 0.3% in Iran to 13.0% in Egypt. In Saudi Arabia, the seroprevalence was estimated between 0.4 and 7.3% of the general population (Abdel-Moneim et al., 2012; Alghamdi et al., 2016).

Based on genetic differences, HCV strains are classified into 8 genotypes (Borgia et al., 2018) and 93 subtypes as of March 2022 (International Committee on Taxonimy of Viruses (ICTV), 2018), which vary in geographical distribution. Studying this aspect is vital to define better HCV epidemiology, evolution, and clinically relevant therapeutic approaches. Recently, we characterized several HCV isolates from Saudi Arabia as belonging to different genotype 4 subtypes: “a” (n = 9), “o” (n = 2), “d” (n = 3), two possible recombinants (consisting of GT 4a/GT 4o/GT4a) and one patient with an unclassifiable GT 4 subtype (Di Stefano et al., 2021).

Here, we report a candidate novel HCV genotype 4 variant identified in one of these cases that showed significant genetic diversity and evolution from other characterized HCV genotype 4 isolates. The full-length genome of this novel HCV strain was obtained by using a sequence agnostic whole genome sequence approach, demonstrating that genetically divergent and unknown HCV subtypes are circulating in the human population in Saudi Arabia.

Materials and methods

Subject and sample

In this study, we characterized the 141-HCV isolate obtained from the plasma of a 64-year-old heterosexual Saudi man with no risk factors for HCV infection. The patient tested positive for anti- HCV in January 2010 (Architect HCV enzyme immunoassay, Abbott Diagnostic, Chicago, IL, United States). He was later referred to the Hepatology clinic in April 2019 at King Fahd Hospital of the University, Alkhobar, Saudi Arabia. His initial investigation showed that complete blood count, serum aminotransferase levels, bilirubin, albumin, and prothrombin time were within the normal range. HCV RNA in the plasma sample was 6.81 logs IU/mL using the Abbott RealTime HCV assay (Abbott, Rungis, France). However, his initial assessment of liver fibrosis by Fibroscan (502 Touch, Echosens, Paris, France) in May 2019 revealed advanced fibrosis/cirrhosis F3–F4, S0 [Elastography (E) for fibrosis (F):13.2 kPa, Controlled Attenuation Parameter (CAP) for steatosis (S):196 dB/m]. Initial abdominal ultrasound showed features of an irregular liver surface with no splenomegaly or dilated portal vein. Upper endoscopy showed no signs or features of portal hypertension.

For treatment, the patient received a combination of sofosbuvir and daclatasvir in 2019 for 3 months, which resulted in a cure and a successful sustained virological response. Subsequently, a follow-up Fibroscan in 2020 showed F3 and S1 (E: 10.2 kPa, and CAP: 257 dB/m), while in April 2022, showed a significant improvement and regression of liver fibrosis, with F0/S1 (E:4.5 kPa/252 dB/m), indicating a positive response to HCV anti-viral therapy.

An unclassifiable GT4 subtype was initially detected by amplifying and sequencing three different genes (NS3, NS5A, and NS5B) (Di Stefano et al., 2021). For a more comprehensive virus analysis, the HCV strain’s entire genomic sequence (141) was sequenced using sequential capture technology.

HCV whole genome sequencing

The complete genomic sequence of the HCV isolate was obtained using a previously described state- of-the-art sequence capture methodology. The TAKARA SMARTer Stranded Total RNA-Seq Kit v2 was used to generate libraries from DNAse-digested nucleic acid extracts. Libraries were pooled according to both total mass and HCV-specific fragment frequency. Pools were hybridized to customized HCV-specific biotinylated oligonucleotide probes, designed to cover the diversity of HCV genotypes 1–8, with enriched fragments partitioned onto streptavidin beads and subjected to further cycles of PCR amplification before being sequenced on an Illumina MiSeq instrument. Human reads were removed from the de-multiplexed, adapter-stripped, paired-nend FASTQ datasets by mapping against the hg38 reference. Surviving paired-end reads were mapped to an HCV reference genome set, and the subtypes of each genome match collated to determine the initial subtype composition. In parallel, whole genomes were built with a combination of de novo assembly (Manso et al., 2020) to derive a nucleotide frequency table. From this table, a consensus sequence was inferred using a depth threshold of 30 reads per position (Ns were coded at loci with insufficient reads), and positions with mixtures greater than 15% were coded as IUPAC ambiguities.

Phylogenetic evaluation

The complete genome sequence of the 141-HCV isolate was aligned to 238 HCV reference sequences from the International Committee on Taxonomy of Viruses (ICTV) version 09.03.22, downloaded 2022-10-06, using MAFFT version 7 (Katoh and Standley, 2013). Genotype and subtype were determined by phylogenetic reconstruction using PhyML 3.0 (Guindon et al., 2010) under a GTR + G + I substitution model with NNI + SPR search and aLRT branch support to assess phylogenetic robustness. For assessment of potential recombinant origin, sample 10,002 was also analyzed using PhyloPlace,1 which implements the branching index (BI) to assess phylogenetic relatedness in genomic windows of 400 nt moved in 80 nt steps across the genome and a BI threshold of 0.69 (Hraber et al., 2008). Phylogenetic verification was also done with HCV-Glue.2

Results

As part of a survey of HCV infection in Saudi Arabia, we previously found an unclassifiable subtype within genotype 4 in an isolate based on the sequencing of NS3, NS5A, and NS5B regions (Di Stefano et al., 2021). In agreement with the previous analysis of the three shorter regions, the complete genome sequence of the 141-HCV isolate did not cluster inside any previously known subtype, nor any reference sequences in the ICTV set that may represent other new subtypes, e.g., JX227964 and JX227963 (Figure 1A). The 141-HCV isolate formed a distinct branch with no reference sequences nearby. Furthermore, the 141-HCV isolate was at a typical genotype 4 inter-subtype distance (Figures 1B, p= 0.263, Wilcoxon rank sum test with continuity correction, using 6,483 sequences from 17 genotype 4 subtypes). We investigated the branching index (Wilbe et al., 2003) across the genome to assess the potential recombinant origin. Figure 2 shows that no part of the 141-HCV isolate genome had an origin close enough to any known subtype of genotype 4. All parts of the 141-genome sequence were far below the HCV BI threshold of 0.69, indicating a recombinant origin; thus, this isolate represents a new candidate subtype of HCV genotype 4.

Figure 1.

Figure 1

HCV genotype 4 comparisons to the 141-HCV isolate. (A) Maximum-likelihood tree comparing the 141-HCV isolate (red dot) to ICTV genotype 4 subtype reference sequences, labeled with subtype and GenBank accession number. Robust phylogenetic support is indicated by an open circle at nodes with aLRT 0.95. (B) Pairwise p-distances among known subtypes (blue) and between 141-HCV isolate and all known subtypes (red). p value indicates true location shift is not equal to 0 between the two distributions using a Wilcoxon rank sum test with continuity correction.

Figure 2.

Figure 2

Branching index analysis of the 141-HCV isolate. The red horizontal line indicates significant genotype association. The 141-HCV isolate does not associate with any known genotype across its genome (x-axis).

Discussion

HCV has been shown to have high genetic diversity and is currently classified into 8 genotypes with distinct geographical distributions. The HCV GT 4 is the most dominant in the Middle East and Central and East Africa. In contrast, HCV GT1 is the most common in North and South America, Europe, and Australia, while HCV GT 3 is the most prevalent in the Indian subcontinent. On the other hand, GT 2 is most common in West Africa, GT 5 in South Africa, and GT 6 in Southeast Asia. Finally, the newly discovered genotypes 7 and 8 are dominant in Central Africa and the subcontinent of India, respectively.

Interestingly, increased human migration leads to gradual changes in the distribution of HCV genotype, although the picture of circulating HCV diverging strains is not yet complete. Divergent HCV GT1 has been reported in Germany (Wang et al., 2019) and Cyprus (Oikonomopoulou et al., 2017), while in Canada, several divergent HCV GT 2 were observed, mainly in patients of African origin (Li et al., 2012). However, to classify it as a new HCV subtype, the characterization of three HCV isolates that cluster but are not epidemiologically linked is required (Smith et al., 2014). In the case of HCV genotype 4, four different sub-genotypes (4d, 4r, 4 L, and 4v) were reported in Ethiopia (Hundie et al., 2017). Although subgenotype 5a is the most predominant in South Africa, other subgenotypes and recombinant viruses could be found (Simmonds et al., 2005), and these recombination events could play a significant role in the evolution of RNA viruses. We should also consider that HCV GT 4 epidemiology is changing. It started to expand beyond its strongholds in Africa and the Middle East to several Western countries, especially in Europe, due to changes in demographic structure, immigration, and injection drug use (de Bruijne et al., 2009; Ciccozzi et al., 2012).

Understanding the different genotypes and subtypes of HCV is crucial as they can significantly impact the success of treatment and clinical outcomes of HCV infections (Schinazi et al., 2014; Waldenström et al., 2016). For instance, Interferon-based therapy has shown higher success rates for GT 2 and 3 (Waldenström et al., 2016), while the first- generation HCV protease inhibitors were more effective in GT 1 (Schinazi et al., 2014). Fortunately, second-generation direct acting antivirals have broader coverage across genotypes. However, despite the promise of pangenotypic regimens, they are expensive and often unavailable in low or middle-income countries. As a result, access to these treatments becomes a challenge for many patients in these regions.

These findings demonstrate that through phylogenetic reconstructions using the entire genome of HCV-4 strains, we found that the 141-HCV isolate formed a distinct cluster within the HCV-4 group. This provides valuable new information about the variability of HCV and sheds light on the unique characteristics of this particular strain.

Data availability statement

The data presented in the study are deposited in the GenBank repository, accession number OR594285.

Ethics statement

This study was performed according to the Declaration of Helsinki, and ethical approval was obtained from Imam Abdulrahman Bin Faisal University (IRB-2020-01-313, 26 October 2020). The study was conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

MD: Conceptualization, Supervision, Writing – original draft, Writing – review & editing. MI: Data curation, Investigation, Writing – review & editing. TL: Formal analysis, Funding acquisition, Investigation, Writing – review & editing. GF: Methodology, Investigation, Writing – original draft. JLM: Formal analysis, Investigation, Writing – review & editing. MT: Investigation, Methodology, Writing – review & editing. JF: Conceptualization, Supervision, Writing – original draft, Writing – review & editing. TS: Writing – original draft, Writing – review & editing.

Funding Statement

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. TL was supported by an NIH grant (R01AI087520).

Footnotes

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2023.1285367/full#supplementary-material

References

  1. Abdel-Moneim A. S., Bamaga M. S., Shehab G. M., Abu-Elsaad A. A., Farahat F. M. (2012). HCV infection among Saudi population: high prevalence of genotype 4 and increased viral clearance rate. PLoS One 7:e29781. doi: 10.1371/journal.pone.0029781, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alghamdi A. S., Alghamdi M., Sanai F. M., Alghamdi H., Aba-Alkhail F., Alswat K., et al. (2016). SASLT guidelines: update in treatment of hepatitis C virus infection. Saudi. J. Gastroenterol. 22, S25, 3767.188067–3767.188057. doi: 10.4103/1319- [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bertino G., Ardiri A., Proiti M., Rigano G., Frazzetto E., Demma S., et al. (2016). Chronic hepatitis C: this and the new era of treatment. World J. Hepatol. 8, 92–106. doi: 10.4254/wjh.v8.i2.92, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Borgia S. M., Hedskog C., Parhy B., Hyland R. H., Stamm L. M., Brainard D. M., et al. (2018). Identification of a novel hepatitis C virus genotype from Punjab, India: expanding classification of hepatitis C virus into 8 genotypes. J. Infect. Dis. 218, 1722–1729. doi: 10.1093/infdis/jiy401, PMID: [DOI] [PubMed] [Google Scholar]
  5. Ciccozzi M., Equestre M., Costantino A., Marascio N., Quirino A., Lo Presti A., et al. (2012). Hepatitis C virus genotype 4d in southern Italy: reconstruction of its origin and spread by a phylodynamic analysis. J. Med. Virol. 84, 1613–1619. doi: 10.1002/jmv.23384, PMID: [DOI] [PubMed] [Google Scholar]
  6. de Bruijne J., Schinkel J., Prins M., Koekkoek S. M., Aronson S. J., van Ballegooijen M. W., et al. (2009). Emergence of hepatitis C virus genotype 4: phylogenetic analysis reveals three distinct epidemiological profiles. J. Clin. Microbiol. 47, 3832–3838. doi: 10.1128/JCM.01146-09, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Di Stefano M., Ismail M. H., Leitner T., Faleo G., Elmnan Adem S. A., Elamin M. O. M. E., et al. (2021). Genetic subtypes and natural resistance mutations in HCV genotype 4 infected Saudi Arabian patients. Viruses 13:1832. doi: 10.3390/v13091832, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Guindon S., Dufayard J. F., Lefort V., Anisimova M., Hordijk W., Gascuel O. (2010). New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321. doi: 10.1093/sysbio/syq010, PMID: [DOI] [PubMed] [Google Scholar]
  9. Hraber P., Kuiken C., Waugh M., Geer S., Bruno W. J., Leitner T. (2008). Automatic classification of HCV and HIV-1 sequences with the branching index. J. Gen. Virol. 89, 2098–2107. doi: 10.1099/vir.0.83657-0, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Hundie G. B., Raj V. S., GebreMichael D., Pas S. D., Haagmans B. L. (2017). Genetic diversity of hepatitis C virus in Ethiopia. PLoS One 12:e0179064. doi: 10.1371/journal.pone.0179064, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. International Committee on Taxonimy of Viruses (ICTV) . (2018). Available at: https://talk.ictvonline.org/ictv_wikis/flaviviridae/w/sg_flavi/56/hcv-classification.version09.03.22,downloaded2022-10-06,usingMAFFTversion7.
  12. Katoh K., Standley D. M. (2013). MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780. doi: 10.1093/molbev/mst010, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Li C., Cao H., Lu L., Murphy D. (2012). Full-length sequences of 11 hepatitis C virus genotype 2 isolates representing five subtypes and six unclassified lineages with unique geographical distributions and genetic variation patterns. J. Gen. Virol. 93, 1173–1184. doi: 10.1099/vir.0.038315-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Li H. C., Lo S. Y. (2015). Hepatitis C virus: virology, diagnosis and treatment. World J. Hepatol. 7, 1377–1389. doi: 10.4254/wjh.v7.i10.1377, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Manso C. F., Bibby D. F., Lythgow K., Mohamed H., Myers R., Williams D., et al. (2020). Technical validation of a hepatitis C virus whole genome sequencing assay for detection of genotype and antiviral resistance in the clinical pathway. Front. Microbiol. 11:576572. doi: 10.3389/fmicb.2020.576572, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Oikonomopoulou M., Demetriou V. L., Kostrikis L. G., Paraskevis D. (2017). Re-analysis of unassigned hepatitis C virus (HCV) strain CYHCV025: evidence of a highly divergent lineage within genotype 1. Mol. Phylogenet. Evol. 109, 362–366. doi: 10.1016/j.ympev.2017.02.003 [DOI] [PubMed] [Google Scholar]
  17. Schinazi R., Halfon P., Marcellin P., Asselah T. (2014). HCV direct-acting antiviral agents: the best interferon-free combinations. Liver Int. 34, 69–78. doi: 10.1111/liv.12423 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Simmonds P., Bukh J., Combet C., Deléage G., Enomoto N., Feinstone S., et al. (2005). Consensus proposals for a unified system of nomenclature of hepatitis C virus genotypes. Hepatology 42, 962–973. doi: 10.1002/hep.20819 [DOI] [PubMed] [Google Scholar]
  19. Smith D. B., Bukh J., Kuiken C., Muerhoff A. S., Rice C. M., Stapleton J. T., et al. (2014). Expanded classification of hepatitis C virus into 7 genotypes and 67 subtypes: updated criteria and genotype assignment web resource. Hepatology 59, 318–327. doi: 10.1002/hep.26744, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Waldenström J., Färkkilä M., Rembeck K., Norkrans G., Langeland N., Mørch K., et al. (2016). Short interferon and ribavirin treatment for HCV genotype 2 or 3 infection: NORDynamIC trial and real-life experience. Scand. J. Gastroenterol. 51, 337–343. doi: 10.3109/00365521.2015.1087588, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Wang B., Krüger L., Machnowska P., Eshetu A., Gunsenheimer-Bartmeyer B., Bremer V., et al. (2019). Characterization of a hepatitis C virus genotype 1 divergent isolate from an HIV-1 coinfected individual in Germany assigned to a new subtype 1o. Virol. J. 16:28. doi: 10.1186/s12985-019-1135-7, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Wilbe K., Salminen M., Laukkanen T., McCutchan F., Ray S. C., Albert J., et al. (2003). Characterization of novel recombinant HIV-1 genomes using the branching index. Virology 316, 116–125. doi: 10.1016/j.virol.2003.08.004, PMID: [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The data presented in the study are deposited in the GenBank repository, accession number OR594285.


Articles from Frontiers in Microbiology are provided here courtesy of Frontiers Media SA

RESOURCES