Skip to main content
PLOS One logoLink to PLOS One
. 2022 Mar 21;17(3):e0264307. doi: 10.1371/journal.pone.0264307

Transcriptomic profiling of blood from autoimmune hepatitis patients reveals potential mechanisms with implications for management

Michele May-Sien Tana 1,2,*, Arielle Klepper 1, Amy Lyden 3, Angela Oliveira Pisco 3, Maira Phelps 3, Breann McGee 4, Kelsey Green 4, Sandy Feng 1,2, Joseph DeRisi 1,3, Emily Dawn Crawford 1,3, Craig S Lammert 4
Editor: Gualtiero I Colombo5
PMCID: PMC8936448  PMID: 35312680

Abstract

Autoimmune hepatitis (AIH) is a poorly understood, chronic disease, for which corticosteroids are still the mainstay of therapy and most patients undergo liver biopsy to obtain a diagnosis. We aimed to determine if there was a transcriptomic signature of AIH in the peripheral blood and investigate underlying biologic pathways revealed by gene expression analysis. Whole blood RNA from 75 AIH patients and 25 healthy volunteers was extracted and sequenced. Differential gene expression analysis revealed 249 genes that were significantly differentially expressed in AIH patients compared to controls. Using a random forest algorithm, we determined that less than 10 genes were sufficient to differentiate the two groups in our cohort. Interferon signaling was more active in AIH samples compared to controls, regardless of treatment status. Pegivirus sequences were detected in five AIH samples and 1 healthy sample. The gene expression data and clinical metadata were used to determine 12 genes that were significantly associated with advanced fibrosis in AIH. AIH patients with a partial response to therapy demonstrated decreased evidence of a CD8+ T cell gene expression signal. These findings represent progress in understanding a disease in need of better tests, therapies, and biomarkers.

Introduction

Autoimmune hepatitis (AIH) is an uncommon, chronic inflammatory disorder of the liver that can lead to cirrhosis, liver transplantation, and death. Epidemiologic studies suggest its incidence and prevalence are rising, and our prior work has shown that AIH disproportionately affects people of color in the United States [1, 2]. It is well-recognized that AIH patients are predominantly female. The pathogenesis of AIH remains obscure and may be influenced by genetic predisposition and unknown environmental triggers, potentially viruses, chemicals, or medications. The result is a dysregulated innate and adaptive immune response, with T- and B-cell mediated inflammation. Normal regulatory mechanisms do not maintain tolerance to as yet unknown autoantigens. There is no single, reliable blood test for the diagnosis of AIH, and an invasive liver biopsy is often required to secure the diagnosis and stage disease severity. Management frequently consists of lifelong nonspecific immunosuppression with azathioprine or other salvage therapies [3].

RNA sequencing (RNA-seq) is a powerful and comprehensive tool that provides rapid, affordable, and high-resolution analysis of transcriptomes. RNA-seq has diverse applications across multiple fields of biology and can be used to assess transcriptomic profiles, discover novel biomarkers, and evaluate pathophysiologic mechanisms. Multiple studies have analyzed cytokines of interest and specific lymphocyte populations in patients with AIH, but no research has focused on an unbiased look at gene expression in the peripheral blood.

In this study, we performed whole blood transcriptomic analysis using RNA-Seq to determine whether peripheral blood gene expression signatures could distinguish AIH patients from healthy controls, and to explore biologic pathways underlying AIH. Our goals were to identify signatures that may reflect clinical markers of AIH disease activity and shed light on the pathophysiologic basis of this enigmatic disease.

Methods

Clinical cohort

This work was approved by the institutional review boards at Indiana University and University of California, San Francisco. Data were analyzed in a deidentified manner. RNA was provided by the Genetic Repository of Autoimmune Liver Diseases and Contributing Exposures (GRACE) cohort, established in 2014 at Indiana University [4]. This cohort was developed to strategically warehouse and link biospecimens, high quality clinical data, and environmental exposure histories among autoimmune liver disease patients [5]. AIH patients (cases) included in the GRACE cohort included adults with a diagnosis of AIH meeting at least the simplified AIH criteria [5, 6]. Cases meeting inclusion criteria were serially recruited from in- and outpatient settings. Controls were healthy volunteers, recruited from the community at Autoimmune Hepatitis Association conferences and other events. They were screened for liver disease through self-reported history and through blood tests of liver enzymes. All participants provided informed consent.

Clinical data collected included demographics, time of diagnosis and biospecimen collection, medications, laboratory results, imaging studies, liver histology, and outcomes such as decompensation. Study data were collected and managed using REDCap electronic data capture tools hosted at UCSF [7]. REDCap (Research Electronic Data Capture) is a secure, web-based software platform designed to support data capture for research studies, providing 1) an intuitive interface for validated data capture; 2) audit trails for tracking data manipulation and export procedures; 3) automated export procedures for seamless data downloads to common statistical packages; and 4) procedures for data integration and interoperability with external sources.

Autoimmune hepatitis type was determined through chart review. Type 1 was defined by positivity of antinuclear, anti-smooth muscle, or anti-actin antibodies. Type 2 was defined by antibodies to LKM-1 or LC-1. Cases following exposure to an offending drug with the ability to wean immunosuppression was classified as drug-induced AIH. Because not every patient had a liver biopsy near the time for blood collection, we use the following algorithm to estimate fibrosis stage (S1 Fig). Fibrosis stage was based on liver biopsy (preferred) or transient elastography within six months, either before or after, specimen collection. Patients with values > 19kPa were deemed cirrhotic and those with values < 13 non-cirrhotic. For TE values between 13 and 19, FIB-4 was calculated [8]. When neither biopsy nor TE were available, then FIB-4 value < 2.6 was interpreted as non-cirrhotic [9]. For patients with FIB-4 values > = 2.6, results of ultrasound, computed tomography, or magnetic resonance imaging were reviewed for corroborative features of cirrhosis including nodular liver contour, splenomegaly, enlarged main portal vein, or varices. Subjects for whom the diagnosis of cirrhosis remained uncertain were reviewed by two hepatologists until consensus was reached.

Response to treatment was determined by ALT levels. For patients on AIH therapy for six or more months, complete response was defined as AST < 30 U/L and ALT ≤ 19 U/L or ≤ 30 U/L for women or men, respectively. Partial and non-response were defined as ALT ≤ 60 U/L and ALT >60 U/L, respectively.

Extraction and sequencing library preparation

Blood from GRACE participants had been stored in PAXgene tubes and frozen at -80°. After thawing, RNA was extracted from whole blood using the Qiagen PAXgene Blood RNA System [10]. Quantification was performed using Qubit RNA HS Assay (Thermo Fisher Scientific) and analysis of RNA integrity was performed using Fragment Analyzer DNF-472 High Sensitivity RNA (Advanced Analytical). RNA was normalized to 2ng/uL or less in 5uL based on Qubit values. External RNA Controls Consortium (ERCC) controls (Thermo Fisher Scientific) were spiked into the samples by adding 1μL at 25pg/μL as quantified by Qubit. Reverse transcription and library preparation were performed using New England Biolab’s NEBNext Ultra II RNA Library Prep (E7770), with an RNA fragmentation time of 8 minutes and 18 cycles of indexing PCR using custom TruSeq indexing primers. Samples were prepared in two batches using the Agilent Bravo Automated Liquid Handling Platform. A water control was included in each batch. For each ten of the RNA samples, a technical replicate was included. Prior to deep sequencing on a NovaSeq (Illumina), relative library concentrations of each sample were determined by pooling 1μL of each and sequencing on a MiSeq (Illumina) with an average of 360,000 read pairs/sample (Fig 1A).

Fig 1.

Fig 1

A. Schematic of sample processing for library preparation of whole blood for AIH cohort. B. Heat map displaying gene counts (variance stabilizing transformation, see DESeq2) of the top 1000 genes with highest variance amongst samples. Samples are clustered on the x-axis and protein coding genes are clustered on the y-axis using a Ward D2 method. Relevant metadata included patient status, steroid exposure, and fibrosis groups to classify samples. C. PCA plot of variance stabilizing transformed gene counts colored by sex of patient sample. D. Volcano plot of differential gene expression in DESeq2, showing genes with P < 0.05 and log fold change > 1 in red, and genes with only log fold change > 1 in blue, and genes with only P < 0.05 in grey.

Depletion of Abundant Sequences by Hybridization (DASH)

DASH is a CRISPR-Cas9 technology which allows targeted depletion of uninformative sequences from a pooled library using a custom gRNA set [11]. Initial MiSeq sequencing enabled prediction of the effect of depletion of human nuclear and mitochondrial ribosomal RNA (rRNA) using DASHit. It was found that DASH targeting human rRNA would free up 90.7% of sequencing space on average. Samples were pooled to achieve equimolar non-rRNA concentration. Analysis of library concentration and size was performed using the Tapestation High Sensitivity D5000 (Agilent Technologies). Pools were normalized to 100ng in 23μL using Qubit HS DNA (Thermo Fisher Scientific) and 2 additional amplification cycles were performed using KAPA’s Real-time Library Amplification kit (KK2702) to reduce effects of prior overamplification. Pools were then normalized to 2.8nM using Qubit HS DNA (Thermo Fisher Scientific).

DASH was performed on the pools using a gRNA set targeting human rRNA [11]. Cas9 was purified from E. coli as described, and gRNA were transcribed from templates ordered from Integrated DNA Technologies using T7 polymerase as described. Cas9-gRNA complex was incubated at 37°C for 2 hours with pooled samples and reactions were quenched with Proteinase K. Samples were further amplified using KAPA’s Real-time Library Amplification (KK2702).

Sequencing

Libraries were sequenced on Illumina NovaSeq 6000 using 150bp paired-end reads. Samples received an average of 42 million read pairs/sample.

Gene counts

Sequencing data was quality filtered using PriceSeqFilter [12], with parameters specifying that 90% of nucleotides in a read pair must be called with the flag “-rnf 90” and 85% of nucleotides in a read pair must have a 98% probability or higher of being correct with the flag “-rqf 85 0.98”. TruSeq adaptors were trimmed using cutadapt, with a minimum length of read remaining set to 36 [13]. Data was then run through IDSeq (PIPELINE v2.8, NT/NR: 2018-04-01), a pathogen-detection pipeline developed by the Chan-Zuckerberg Initiative [14]. Gene counts were obtained from the initial step in the IDSeq pipeline, Spliced Transcripts Alignment to a Reference (STAR) [15] alignment to human reference genome 38 (hg38). All hemoglobin genes and genes which did not code for proteins were removed based on annotations in BioMart v2.34.2 [16]. In addition, a filter for genes with low counts was applied to remove genes that had fewer than 10 reads in 20% or more of the samples.

Differential gene expression

Genes which had less than 10 total reads in fewer than 20% of the samples were filtered out of the gene counts. Differential gene expression was performed by inputting unnormalized gene counts into the DESeq function of Bioconductor package DESeq2 (v1.18.1), using default settings without a normalization matrix. The DESeq function estimates size factors (normalization for library size included) and dispersion factors for negative binomial distributed data, then runs a Wald test to determine significance of coefficients of the negative binomial GLM of the data. Design matrix modelled sex, age, library prep batch and our variable of interest (ex: ~ library batch + sex + age + AIH status. Variables of interest used for each analysis is indicated in discussion section and includes AIH vs healthy controls, steroid exposure (healthy vs treatment naive vs steroid-containing treatment vs steroid-sparing treatment) and fibrosis groups (AIH with cirrhosis vs AIH without cirrhosis vs healthy). Differential gene lists were generated by pairwise comparison of the groups (i.e., AIH vs healthy controls or treatment naive vs healthy controls) with the results function of DESeq2, with an adjusted p-value cut-off of 0.05.

Pathway enrichment analysis

Differential gene expression results were imported into Ingenuity Pathway Analysis software (Qiagen, Redwood City, CA, USA). Genes were considered for entry into the analysis if they had an absolute log fold change greater than 0.6 between the two groups of interest (cases vs. controls, or treatment-naïve AIH vs. controls), but no adjusted p-value cutoff was required.

Defining an AIH-specific gene signature using random forest

The random forest method (a classifier that combines many single decision trees) was used to calculate the importance of each gene for defining AIH vs control [17]. The varSelRF R package uses the out-of-bag error as the minimization criterion and carries out variable elimination with random forests by successively eliminating the least important variables (with importance as returned from the random forest analysis) [18]. The algorithm iteratively fits random forests, at each iteration building a new forest after discarding genes with the smallest variable importance; the selected set of genes is the one that yields the smallest out-of-bag error rate. This leads to the selection of small sets of non-redundant variables. To examine the discriminatory power of this metric, we varied the standard deviation cutoff by using multiplication factors from 1 to 2 in steps of 0.25.

Weighted gene correlation network analysis

A total of 8,173 genes (identified as described in gene counts, above), as well as clinical metadata variables, were used to construct a module-trait graph to identify relationships between clusters of genes and clinical variables. This analysis was performed using the Weighted Gene Correlation Network Analysis algorithm in R as previously reported [19]. Briefly, genes were analyzed using the Pearson’s correlation test, and a matrix of similarity was constructed with a soft power of β = 6. The adjacency matrix of gene expression data was clustered using topological overlap matrix analysis. A dendrogram was then constructed using average linkage hierarchical clustering and the dynamic tree cut algorithm was applied to the dendrogram for module identification. To identify hub genes, we used a gene significance cutoff of 0.5 and a measure of centrality within the module, module membership, at a cutoff of 0.7. To assess the strength of the correlation between genes of interest and individual modules, gene significance for the genes in the module were compared to module membership using univariate regression, and correlation p-values were generated.

Cibersort

We performed deconvolution of bulk RNASeq data (filtered normalized protein coding gene counts as described above) using the CIBERSORT algorithm, which we ran through the web interface using default parameters, as previously reported [20].

Results

Clinical cohort

75 AIH patients in the GRACE Cohort provided whole blood samples, from which RNA was isolated. After excluding patients with concomitant liver disease (n = 2), those who had undergone liver transplantation (n = 4), and those whose sequencing were extreme outliers determined by hierarchal clustering and rRNA content (n = 2), there were 67 AIH patients and 25 healthy volunteers with whole blood RNA-Seq data (Table 1). The median age was 54 years for AIH patients, and 50 years for healthy volunteers. The AIH cohort was predominantly female (79%), whereas healthy volunteers were 48% female. Both groups were predominantly white. Median ALT was 31 U/L in the AIH patients and 19 U/L in the healthy volunteers. Among AIH subjects, 56 (84%) had Type 1 AIH, 1 (1%) had Type 2 AIH, 2 (3%) had drug-induced AIH, and 8 (12%) had AIH of unknown type. Autoantibody results were available as follows: 32 of 51 subjects were ANA positive; 41 of 50 subjects were positive for anti-actin antibody; only 1 of 19 subjects was positive for anti-LKM-1 antibody. At the time of blood collection, 27% of the AIH subjects had cirrhosis and 7% were treatment-naïve. 10 AIH patients were off therapy at the time of blood biospecimen collection (5 were treatment-naïve, 3 were in remission and had been withdrawn from therapy, and 2 had been previously treated but were off therapy at the time of sample collection). Of subjects on AIH therapy at the time of sample collection, 28 were on a regimen that included corticosteroids: 21 with and 7 without maintenance medications. 29 of the AIH patients on therapy at the time of sample collection were on a steroid-free regimen consisting only of maintenance medications such as azathioprine, mycophenolate, or tacrolimus. Of those who had been on therapy for ≥6 months at the time of blood collection, 27% were in complete remission.

Table 1. Clinical cohort of AIH patients and healthy controls.

Characteristics AIH (n = 67) Healthy (n = 25)
Age, y, median (range) 54 [20–79] 50 [24–70]
Sex, n (%)
    Men 14 (21%) 13 (52%)
    Women 53 (79%) 12 (48%)
Race/ethnicity
    White 60 (90%) 25 (100%)
    People of Color 7 (10%) 0 (0%)
ALT, U/L, median (range) 31 [4–936] 19 [10–34]
IgG, mg/dL, median (range) 1540 [201–3890]
Cirrhosis 18 (27%)
Decompensated 12 (18%)
Treatment Status
    Treatment-naive 5 (7%)
    Previously-treated 5 (7%)
    On therapy ≥6 months
        Complete Remission 18 (27%)
        Partial Response 18 (27%)
        Non-responder 6 (9%)
Required change in treatment due to prior non-response 12 (18%)
Treatment Type
    Steroid-containing regimen 28 (42%)
    Steroid-free regimen 29 (43%)
    Off treatment 10 (15%)

Gene expression profiles can differentiate AIH patients from healthy controls

To identify peripheral blood gene expression profiles unique to AIH, we compared AIH subjects to healthy controls (Fig 1B). Sequencing yielded an average of 42 million read-pairs per sample. Full data set available at https://github.com/czbiohub/AIH-Project. Unsupervised clustering of the 1000 most variably expressed genes in the dataset did not entirely separate samples based on diagnosis, treatment, or fibrosis stage. Unsupervised clustering analysis on all gene counts did not reveal grouping of samples by demographic factors such as age, sex, and race, nor by RNA preparation batch (Fig 1C, S2 Fig).

Analysis of global trends in gene expression identified 249 differentially-expressed genes (Fig 1D); the top 20 genes with greatest fold-change are listed in S1 Table. Random forest modeling with recursive variable elimination indicated that 9 or fewer genes can reliably differentiate AIH patients from healthy controls (Table 2).

Table 2. Random forest results with increasing standard deviation multiplicative factor separating AIH patients and healthy controls.

Standard Deviation Allowed Number of Genes Identified Genes Identified by Random Forest (% of seeds gene appeared)
1 9 ARHGAP4 (30%), COL5A3 (50%), EZH1 (10%), FLYWCH1 (30%), GIGYF1 (100%), MAPK8IP3 (20%), MEGF6 (20%), PTOV1 (90%), RHOT2 (10%)
1.25 7 ARHGAP4 (50%), COL5A3 (60%), FLYWCH1 (70%), GIGYF1 (90%), MAPK8IP3 (10%), MEGF6 (10%), PTOV1 (100%)
1.5 2 GIGYF1 (100%), PTOV1 (100%)
1.75 5 COL5A3 (10%), FLYWCH1 (30%), GIGYF1 (100%), PTOV1 (80%), SLC4A10 (10%)
2 3 FLYWCH1 (20%), GIGYF1 (90%), PTOV1 (90%)

Interferons are the predominant upstream regulators of gene induction in AIH

We performed pathway analysis in order to identify patterns across genes that were enriched in AIH patients relative to healthy controls. Across all AIH subjects, interferon signaling was the canonical pathway most likely to be differentially regulated (by p-value), activated in AIH subjects relative to healthy controls (p < 0.0001, Fig 2A). Analysis further identified several additional pathways of interest, for example, myeloid signaling, as indicated by activation of triggering receptor expressed on myeloid cells 1 (TREM1) and inflammasome pathways, but inhibition of liver X receptor/retinoid X receptor (LXR/RXR) activation (Fig 2A).

Fig 2.

Fig 2

A. Top ten most significant canonical pathways related to differential gene expression of AIH patients compared to healthy controls. B. kmer-based phylogenetic analysis performed with the IDSeq platform shows relationships between four assembled pegivirus genomes from the GRACE cohort and their closest related publicly available pegivirus NCBI reference genomes, with patient metadata annotated.

To eliminate the impact of immunosuppression on the interferon signal, we next compared treatment-naïve subjects (n = 5) to healthy controls (n = 25). Interferon signaling remained one of the top 10 activated canonical pathways (S3 Fig). However, for the majority of pathways, the directionality (activation vs. inhibition) could not be determined. To identify whether a cascade of transcriptional regulation could explain the observed gene expression changes, we performed upstream regulator analysis in Ingenuity Pathway Analysis which revealed 515 significant pathways (S2 Table). The top 10 activated upstream regulators, sorted by p value, is shown in Table 3A. Of these 10 genes, five were interferon genes (IFNA2, IFNG, IFNL1, IRF7 (a master regulator of type I IFN signaling [21, 22]), and IFN downstream response gene 1 [23], confirming the central role of interferon signaling in autoimmune hepatitis. This is consistent with signatures obtained across other autoimmune diseases such as systemic lupus erythematosus [2426] and with the known association between treatment with IFN-a and development of autoimmune hepatitis [27, 28].

Table 3. A. Top 10 activated upstream regulators identified by pathway analysis for treatment-naïve patients compared to healthy controls.

B. Top 10 inhibited upstream regulators identified by pathway analysis for treatment-naïve patients compared to healthy volunteers.

Upstream Regulator Predicted Activation State Activation z-score p-value of overlap Target molecules in dataset
IRF7 Activated 3.898 1.63E-12 CMPK2,GBP1,IFI44,IFI44L,IFI6,IFIT1,IFIT3,IFITM3,ISG15,LILRA5,OAS3,OASL,RIPK2,RSAD2,S100A8,TNFSF13B
IFNG Activated 3.828 9.57E-09 AIF1,B2M,CMPK2,CYP27A1,FCGR1B,FTL,GBP1,GNAS,HLA-DRA,HLA-DRB5,HSPB1,IFI44,IFI44L,IFI6,IFIT1,IFIT3,IFIT5,IFITM3,ISG15,KLF1,LGALS3,MYH9,OAS3,OASL,PCTP,PHB2,PRDX2,PRPF8,RIPK2,RSAD2,S100A8,S100A9,TFRC,TGFBR3,TNFSF13B
MYCN Activated 3.592 4.33E-15 ABCB10,B2M,CLU,LGALS1,MYH9,RPL23,RPL27,RPL35,RPL41,RPL6,RPL7,RPL9,RPLP0,RPS13,RPS25,RPS27,RPS3A,RPS5,RPS6,RPS8,RPS9,SPARC
IFNA2 Activated 3.57 2.32E-09 ANXA1,B2M,CMPK2,GBP1,GNAS,IFI44,IFI44L,IFI6,IFIT1,IFIT3,IFIT5,IFITM3,ISG15,OAS3,RSAD2
IFNL1 Activated 3.554 3.64E-13 CMPK2,GBP1,IFI44,IFI44L,IFI6,IFIT1,IFIT3,IFIT5,IFITM3,ISG15,OAS3,OASL,RSAD2
IRF1 Activated 3.37 6.03E-08 B2M,CMPK2,IFI44L,IFI6,IFIT1,IFIT3,IFIT5,IFITM3,ISG15,OASL,RSAD2,TNFSF13B
PAF1 Activated 2.646 8.13E-09 HIST1H2BD,IFI44,IFI44L,IFIT3,IFITM3,ISG15,OAS3,OASL
ribavirin Activated 2.63 0.000000062 IFI44L,IFI6,IFIT3,ISG15,OASL,RNF125,RSAD2
PRL Activated 2.515 2.27E-08 B2M,CLU,CMPK2,EPSTI1,IFI44,IFI44L,IFI6,IFIT1,IFIT3,IFIT5,ISG15,OAS3,RSAD2,SPARC,TNFSF13B
TCR Activated 2.041 1.02E-11 BSG,IFI44,IFI44L,IFI6,IFIT1,IFIT3,ISG15,OASL,PASK,PHB2,PRDX6,RPL6,RPL7,RPL9,RPLP0,RPS13,RPS3A,RSAD2,SLC1A5,TNFSF13B,UBA52
sirolimus Inhibited -3.798 7.4E-13 B2M,CDA,COX6B1,CREG1,FUS,ISG15,LGALS3,RPL23,RPL27,RPL35,RPL41,RPLP0,RPS13,RPS18,RPS21,RPS27,RPS27A,RPS29,RPS3A,RPS5,RPS6,RPS8,RPS9,SLC1A5,TUBB2A,UBA52
IL1RN Inhibited -3.317 1.45E-08 GBP1,IFI44,IFI44L,IFI6,IFIT3,IFIT5,OAS3,OASL,RIPK2,RSAD2,S100A9
ST1926 Inhibited -3.317 0.000000153 CHCHD2,COX7A2L,GNAS,LGALS1,PSMF1,RPL41,RPL6,RPS27,RPS27A,TPT1,UBA52
RICTOR Inhibited -3.303 1.89E-13 BSG,COX6B1,COX7A2L,ISG15,PSMF1,RPL23,RPL41,RPL6,RPL7,RPL9,RPLP0,RPS13,RPS18,RPS21,RPS27A,RPS29,RPS5,RPS6,RPS8,RPS9
MAPK1 Inhibited -3.207 0.0000023 GBP1,IFI44,IFI6,IFIT1,IFIT3,IFIT5,IFITM3,ISG15,ITGA2B,LGALS1,LGALS3,OAS3,OASL,RGS2
ACKR2 Inhibited -2.449 0.00000168 IFI44,IFIT3,ISG15,OAS3,OASL,RSAD2
filgrastim Inhibited -2.433 4.7E-12 ANXA1,B2M,CLC,CLU,CYP27A1,CYSTM1,EPSTI1,GBP1,HLA-DRA,HLA-DRB5,IFI44,IFI6,IFIT1,IFIT3,IFIT5,ISG15,LGALS3,LILRA5,OAS3,OASL,PRDX2,RSAD2,SPECC1
ZNF106 Inhibited -2.333 1.09E-10 COX7A2L,HSPB1,IFITM3,LGALS1,LGALS3,LYZ,PRDX2,S100A8,TMEM106B
SP110 Inhibited -2.309 3.22E-09 ATF2,CLU,ETS1,FUS,IFI6,IFIT1,IFIT3,IFITM3,MYH9,OAS3,RIPK2,TGFBR3
TRIM24 Inhibited -2.121 0.00000232 CMPK2,EPSTI1,IFI44,IFIT3,ISG15,LGALS3,OASL,PLEC

Analysis of the top 10 inhibited upstream regulators (Table 3B) showed that four were known therapies for AIH or other disease states: sirolimus, ST1926 (a synthetic retinoic acid), IL1RN (anakinra), and filgrastim [2932]. Given that inhibition of genes downstream of these master regulators are suppressed in AIH patients relative to controls, these upstream regulatory networks may represent prime targets with therapeutic potential in AIH.

Pathogen detection platform reveals infection with pegivirus in a handful of samples, but cannot fully explain the activation of interferon signaling in AIH compared to healthy controls

Of the 75 AIH subjects and 25 healthy controls, IDSeq identified pegivirus (formerly known as hepatitis G) sequences in whole blood specimens from seven patients: five AIH subjects, one AIH and hepatitis C subject, and one healthy control. In four, pegivirus read depth was high enough to place on a kmer-based phylogenetic tree along with their closest NCBI reference sequences using the IDSeq platform (Fig 2B). Variation in viral strains precludes the possibility that these infections represent a cluster among the GRACE cohort population. The presence of pegivirus in whole blood is compelling but is insufficient to explain the extent of interferon induction we observed, given that interferon signaling was activated in many samples beyond those with pegivirus viremia.

Gene expression correlates with fibrosis stage in AIH

Weighted Gene Correlation Network Analysis was used to create clusters of highly correlated genes, also known as gene modules. These gene modules were then related to clinical variables of interest in order to identify relationships between clinical phenotypes and gene expression (Fig 3A). Using this approach, we found 12 gene modules. We observed that gene module 9 correlated with not only the greatest number of clinical variables of interest but also markers of poor AIH outcomes in AIH. Specifically, gene module 9 was associated with decompensated cirrhosis (p < 0.001), the need for liver transplantation (p = 0.01), two surrogates of liver cirrhosis, Fib-4 (p < 0.001) and transient elastography scores (p < 0.001), and a composite variable of cirrhosis, as determined using the algorithm in S1 Fig (p < 0.001). Conversely, module 5 was significantly inversely correlated with clinical variables related to advanced fibrosis (fibroscan score, FIB-4 score, cirrhosis, decompensation, and need for transplant). This correlation between a module of genes and the presence of liver cirrhosis is striking, as cirrhosis on index liver biopsy is known to portend a poor prognosis in patients with AIH [33].

Fig 3.

Fig 3

A. WGCNA module trait graph, with patient metadata on the x-axis and generated gene modules on the y-axis. B. Interconnectivity plot for gene module 9 identified by WGCNA. C. Heatmap displaying gene counts (variance stabilizing transformation, see DESeq2) of the top 12 hub genes identified from interconnectivity metrics as described in Methods. Samples are clustered on the x-axis using a Ward D2 method and color annotated with cirrhosis status and treatment at sample collection.

The correlation between liver cirrhosis and the genes in module 9 is measured by gene significance; in Fig 3B this correlation value is plotted against a measure of how connected that gene is to other genes within module 9 to identify the most central genes within the module. We next identified key hub genes within module 9 that separate AIH subjects with and without cirrhosis as previously reported [34] and described in methods. This yielded a list of 12 top hub genes (Fig 3 and S3 Table). Plotting these 12 genes in a heatmap using hierarchical clustering resulted in partial separation of the cirrhotic and non-cirrhotic AIH subjects (Fig 3C).

Complete response to treatment is associated with higher blood CD8 T cell counts

Application of bulk RNA-Seq analysis to whole blood allows for assessment of both viral and cellular RNA, making it possible to assess immunologic response, in our case interferon signaling, while simultaneously searching for possible viral triggers. In parallel, we sought to determine the contribution of individual immunologic populations on AIH pathobiology through deconvolution of bulk RNASeq data using the CIBERSORT algorithm, which we ran through the web interface using default parameters [20]. We compared the leukocyte profiles in AIH subjects and healthy controls to determine whether subsets varied according to response to therapy. Specifically, while incomplete response to immunosuppression has been linked to poor long-term outcomes among AIH patients [35], specific clinical characteristics or a mechanistic understanding of a priori determinants of response remain limited [33]. We compared gene expression of healthy controls (n = 25) to treatment-naive AIH subjects (n = 5) and AIH subjects with either complete (n = 18) or partial (n = 18) response. AIH subjects with partial response to therapy (n = 18) had a lower CD8 T cell signature in the periphery compared to healthy controls (p = 0.05) and compared to complete responders (p = 0.04) (Fig 4A). After excluding patients on corticosteroids from the analysis, partial responders still had a lower CD8 T cell signature than complete responders (p = 0.05, Fig 4B).

Fig 4.

Fig 4

A. CIBERSORT data deconvoluting absolute CD8+ T-cell counts from bulk RNA-seq, with significant differences in counts between healthy patients and those with a partial response to treatment and between those with a complete compared to a partial response to treatment. B. CIBERSORT data deconvoluting absolute CD8+ T-cell counts from bulk RNA-seq, removing steroid patients and showing a significant difference between those with a complete compared to a partial response to treatment.

Discussion

A major barrier to addressing disparities and improving the management of autoimmune hepatitis (AIH) is the disease’s obscure etiopathogenesis. In an effort to promote precision medicine in a field where prednisone has been a primary therapy for several decades, we applied state-of-the art RNA sequencing to whole blood specimens from a well-described cohort of AIH patients and healthy volunteers.

We found 249 genes that were significantly differentially expressed in the whole blood of AIH patients compared to healthy controls. This relatively small number of differentially expressed genes could be related to the fact that most blood samples were drawn from AIH patients after the initiation of therapy, heterogeneity among cases and/or, or perhaps the need to focus on certain cell types within whole blood. We performed PCA on covariance of the genes and didn’t find obvious clusters when looking at cases vs. controls. While sex was not clearly delineated in PC1 or PC2 of our principal component analysis, this does not rule out the possibility that sex could be a minor confounder. However, we did see hierarchical clustering of genes when we focused on the 1000 most highly expressed genes in the dataset (see Fig 1B). The strongest signal of genes differentially expressed in AIH compared to healthy controls was that of interferon activation. Interferons are pro-inflammatory cytokines and part of the innate immune system, traditionally viewed as the body’s first line of defense against pathogens. Interferons play an important role in the liver infected with hepatitis B or C virus and indeed have been used as an antiviral therapy. In fact, autoimmune hepatitis is a well-recognized complication and relative contraindication to interferon-based antiviral therapy [28]. A prior study by Grant et al. showed that CD4+ cells cultured in vitro from the peripheral blood of treatment-naïve AIH patients produce interferon-gamma for an extended period of time, beyond that seen in healthy samples [36]. In addition, another study of treatment-naïve AIH patients found that serum interferon-gamma-inducible protein-10 (IP-10) levels were significantly correlated with histologic inflammation [37]. AIH was historically coined “lupoid hepatitis,” and recent work has highlighted interferon as a therapeutic target for the drug anifrolumab in systemic lupus erythematosus patients [38].

In light of the robust interferon signal, it was logical to query for the presence of a virus. One significant advantage of sequencing whole blood was the ability to answer this question, and pegivirus sequences were identified in the blood of six patients with AIH. Pegivirus has no confirmed pathogenic effect [39]. It has both liver and immune cell tropism [40], and studies have shown that infection with HBV or HCV increase the risk of pegivirus infection [41]. Pegivirus is more prevalent in patients with liver transplant than partial hepatectomy patients, but pegivirus is not associated with any changes in clinical outcomes [42]. Notably, a recent systematic review and metanalysis of studies of pegivirus RNA prevalence in healthy blood donors in 2019 revealed a global prevalence of 3.1% and a North America prevalence of 1.7% [41]. Prevalence varies significantly by geographic region, with some studies reporting prevalence of over 20% [41]. Anti-pegivirus antibody prevalence is higher, suggesting that the virus is frequently cleared [40, 41]. Given that there is no molecular evidence yet pointing towards pegivirus as a causative agent in AIH or any other disease, we note that increased susceptibility to viral infection in patients treated with immunosuppressive therapy may account for the higher incidence in this AIH cohort [41]. However, the activation of interferon signaling in the AIH group as a whole raises the possibility that viral infection could play a role in disease etiology, possibly as a trigger for molecular mimicry, something which warrants further study. Interestingly, pegivirus is structurally very similar to hepatitis C virus. There is a small body of literature linking HCV to AIH, with recent data suggesting that clearance of HCV can lead to resolution of AIH activity in some patients [43]. Furthermore, TP4502D6 (CYP2D6) is an autoepitope recognized by anti-LKM in patients with Type 2 AIH and patients with HCV [44].

In order to focus on disease pathophysiology, AIH patients who submitted blood samples prior to treatment initiation (treatment-naïve patients) were studied in a separate analysis. Among the multiple findings, interferon signaling was an activated canonical pathway. Regarding inhibition, genes that would be found downstream of the drug sirolimus were inhibited in AIH patients. The most significantly inhibited upstream regulators of gene expression in treatment-naïve patients compared to healthy included several druggable targets. RPTOR Independent Companion Of MTOR Complex 2 (RICTOR) was determined by pathway analysis to be a significantly inhibited upstream regulator of gene expression in treatment-naïve AIH patients, with multiple target molecules supporting this in the dataset. Sirolimus is a macrolide drug that inhibits mTOR, a protein that controls the proliferation and survival of activated lymphocytes. Indeed, mTOR signaling was one of the most significant pathways in our focused analysis of treatment-naïve patients vs. healthy controls (S3 Fig). The Grant et al. study cited above also found that sirolimus increases the in vitro responsiveness of lymphocytes to regulatory T cells [36]. Sirolimus has been studied as a salvage therapy for AIH in reports of very small numbers of patients, with mixed results [29]. However, like many second-line therapies for AIH, it has never been studied in a randomized controlled trial. Our results suggest a biologic basis for the use of sirolimus in AIH and that it merits further study as a therapeutic agent. Filgrastim, or G-CSF, was also predicted to be an inhibited upstream regulator of gene expression in the AIH samples. While never studied in AIH, G-CSF has been studied as a treatment for severe acute alcohol-associated hepatitis with some success [32, 45]. In alcohol-associated hepatitis, G-CSF is believed to mobilize CD34+ stem cells from the bone marrow to the liver, where they promote regeneration in the setting of severe inflammation. Given this mechanism, it is plausible that G-CSF may have a role in the treatment of AIH.

We also used a random forest algorithm to determine a small set of genes that distinguish AIH patients from healthy volunteers. The genes predicted by the model do not specifically indicate AIH disease biology but can potentially serve as biomarkers in a diagnostic test to flag a patient for further analysis of AIH status. The diagnosis of autoimmune hepatitis can be challenging, often requiring an invasive liver biopsy and the calculation of diagnostic scores based on multiple clinical factors [6]. Given the current lack of a single accurate diagnostic test for AIH, a biomarker in the peripheral blood would be immensely helpful to the clinician. However, we recognize that many of these genes appear in less than half of seeds and these may not be stable predictors when applied to validation datasets. We acknowledge that our predictive power is greatly limited by the small size of this pilot cohort, and emphasize that this algorithm does not have clinical significance without further testing and validation.

In multiple studies, cirrhosis has been linked with poor outcomes such as death and liver transplantation in AIH [4648]. 30–40% of AIH patients have cirrhosis on index liver biopsy. While some patients have obvious radiographic evidence of cirrhosis, there are many patients with early, compensated cirrhosis (i.e. without clinically evident complications), for whom fibrosis stage is only apparent on a liver biopsy. Noninvasive methods of fibrosis assessment such as transient elastography and magnetic resonance elastography, while validated in AIH, are not available or accessible for many AIH patients. Therefore, a noninvasive blood signature of cirrhosis in AIH could be very valuable in risk stratifying AIH patients. We used gene correlation network analysis to determine a gene expression signature for advanced disease (stage 3–4 fibrosis) from this dataset. If validated in a different cohort, this 12-gene signature could provide insight into more severe cases of AIH and serve as a prognostic marker. While a blood test for cirrhosis in AIH would be incredibly helpful for predicting prognosis, one caveat is that the gene expression signature identified in this study was associated with cirrhosis in a heterogenous group of AIH patients. Some patients had already decompensated with events such as variceal hemorrhage. Others had been diagnosed several years prior to blood sample collection. In contrast, prior studies linking cirrhosis with worse survival in AIH used biopsy results at the time of diagnosis. Therefore this gene expression signature needs to be validated in a prospective fashion but retains value as a potentially powerful tool to identify AIH patients at higher risk for poor outcomes.

Finally, dividing the AIH cohort by response to treatment and deconvoluting the gene expression data by cell type pointed to a role for CD8+ T cells. Patients with a partial response to therapy displayed a lower CD8+ T cell expression signal. CD8+ T lymphocytes are cytotoxic and induce apoptosis of damaged cells in response to antigen presentation on MHC Class I molecules, and they are a major cell type in areas of interface hepatitis [49]. In Type II AIH, the degree of CD8+ T cell response correlates with disease activity [50]. In vitro studies have shown that immunosuppressive therapy alters the ability of regulatory T cells to modulate CD8+ T cell activity [51]. However, these results are limited by the fact that the effect of AIH on CD8 cell populations is difficult to truly separate from the effect of AIH therapies, i.e. various forms of immunosuppression. Further study is required to better understand the relative contribution of immunosuppression and underlying AIH disease activity on CD8 biology. It has been shown that interferon alpha/beta induction inhibits egress of lymphocytes from lymphoid organs [52]. While complete responders may have a CD8 response that is “restored” or closer in character to healthy controls given their treatment response, partial responders may still have active AIH disease activity, i.e. a CD8 signature more similar to active AIH. Our results suggest that those with incomplete response to therapy have fewer or less functional circulating CD8 T cells. This also raises the possibility that in partial responders, CD8 T cells may have left the periphery and entered the liver, where these cells could lead to further inflammation and tissue destruction. These findings may provide insight into why some patients enjoy a complete response to therapy while others have ongoing disease activity or even experience hepatic decompensation.

A weakness of this pilot study is its relatively small sample size, which did not allow us to divide the patients into training and validation sets. The small number of AIH patients who provided blood samples prior to treatment initiation limited our power to detect disease signals independent of medication effects. With the small number of treatment-naive patients, our analysis of pathways activated or inhibited in this group could not use a p-value cutoff for genes considered in the analysis and was thus exploratory only. For the analysis of AIH cases vs. healthy controls, the pathway results were similar with or without a p-value cutoff for included genes. In addition, this relatively recently formed patient cohort had limited follow-up data, so we were not able to look at gene expression as a predictor of subsequent clinical outcomes. We did not have liver biopsy data on all patients but developed an algorithm for determining if patients had cirrhosis, which raises the possibility of bias in our analysis. Moreover, this pilot study did not include patients with other types of liver diseases aside from AIH. However, further efforts to expand longitudinal cohorts that include AIH patients, clinical data, and blood and liver biospecimens are currently underway, and classical immunophenotyping assays are planned.

Nevertheless, this study represents a unique contribution to the AIH field both by shedding light on molecular mechanisms and opening possibilities of better management. Cutting-edge genomic technology was applied to a large cohort of AIH patients with detailed clinical data. Whole blood samples were utilized, allowing for signals from cells not typically included in immunologic studies of AIH, such as platelets and neutrophils. The unbiased and comprehensive approach of RNA sequencing yielded results that corroborated prior in vitro studies of the role of interferon and CD8+ T cells in AIH. We also uncovered a number of AIH samples with viral sequences, an unexpected finding that raises the possibility of molecular mimicry after viral infection as a mechanism in AIH. We were able to develop two gene signatures—one for disease and one for cirrhosis—which will need to be validated in future studies but serve as proof-of-principle that new biomarkers can be discovered in AIH. This study highlighted three existing drugs, anifrolumab, sirolimus, and G-CSF, as potentially promising therapies for AIH. Future studies using single cell RNA-Seq analysis of whole blood from AIH patients as well as studies of liver biospecimens may shed more light on these results. By applying current genomic technology to biospecimens from AIH patients, we hope to provide answers to the many remaining questions in this challenging disease.

Supporting information

S1 Fig. Flow chart depicting algorithm for determining fibrosis status of AIH patient.

(TIF)

S2 Fig

PCA plot of variance stabilizing transformed gene counts colored by A. age bracket of patient, B. race/ethnicity of patient, and C. library preparation batch for patient sample.

(TIF)

S3 Fig. Top ten most significant canonical pathways related to differential gene expression of treatment naïve AIH patients compared to healthy controls.

(TIF)

S1 Table. Most differentially expressed genes in AIH patients compared to healthy volunteers.

(XLSX)

S2 Table. List of all upstream regulators identified by pathway analysis for treatment-naïve patients compared to healthy controls.

Republished from Ingenuity Pathway Analysis under a CC BY license, with permission from Qiagen, original copyright 2019.

(XLS)

S3 Table. Top 12 hub genes correlated with AIH cirrhosis.

(XLSX)

Acknowledgments

The authors would like to acknowledge UCSF CTSI, Ma Somsouk, Jacquelyn Maher, and Montgomery Bissell.

Abbreviations

AIH

autoimmune hepatitis

RNA-seq

RNA sequencing

GRACE

Genetic Repository of Autoimmune Liver Diseases and Contributing Exposures

rRNA

ribosomal RNA

Data Availability

Full data set available at github link https://github.com/czbiohub/AIH-Project. Raw data will be deposited into dbGaP prior to publication.

Funding Statement

EC and MT received the American Association for the Study of Liver Diseases (aasld.org) Autoimmune Liver Diseases Pilot Reseach Award to perform this work. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Lee B, Holt EW, Wong RJ, Sewell JL, Somsouk M, Khalili M, et al. Race/ethnicity is an independent risk factor for autoimmune hepatitis among the San Francisco underserved. Autoimmunity. 2018. Jul 4;51(5):258–64. doi: 10.1080/08916934.2018.1482884 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Wen JW, Kohn MA, Wong R, Somsouk M, Khalili M, Maher J, et al. Hospitalizations for Autoimmune Hepatitis Disproportionately Affect Black and Latino Americans. Am J Gastroenterol. 2018. Feb;113(2):243–53. doi: 10.1038/ajg.2017.456 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Diagnosis and Management of Autoimmune Hepatitis in Adults and Children: 2019 Practice Guidance and Guidelines From the American Association for the Study of Liver Diseases—Mack—- Hepatology—Wiley Online Library [Internet]. [cited 2020 Jun 8]. Available from: https://aasldpubs.onlinelibrary.wiley.com/doi/10.1002/hep.31065 [DOI] [PubMed]
  • 4.Dakhoul L, Jones KR, Gawrieh S, Ghabril M, McShane C, Vuppalanchi R, et al. Older Age and Disease Duration Are Highly Associated with Hepatocellular Carcinoma in Patients with Autoimmune Hepatitis. Dig Dis Sci. 2019;64(6):1705–10. doi: 10.1007/s10620-018-5441-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Comerford M, Fogel R, Bailey JR, Chilukuri P, Chalasani N, Lammert CS. Leveraging Social Networking Sites for an Autoimmune Hepatitis Genetic Repository: Pilot Study to Evaluate Feasibility. J Med Internet Res [Internet]. 2018. Jan 18 [cited 2019 Feb 28];20(1). Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5795096/ doi: 10.2196/jmir.7683 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Hennes EM, Zeniya M, Czaja AJ, Parés A, Dalekos GN, Krawitt EL, et al. Simplified criteria for the diagnosis of autoimmune hepatitis. Hepatology. 2008. Jul;48(1):169–76. doi: 10.1002/hep.22322 [DOI] [PubMed] [Google Scholar]
  • 7.Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (REDCap)—A metadata-driven methodology and workflow process for providing translational research informatics support. Journal of Biomedical Informatics. 2009. Apr 1;42(2):377–81. doi: 10.1016/j.jbi.2008.08.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Vallet‐Pichard A, Mallet V, Nalpas B, Verkarre V, Nalpas A, Dhalluin‐Venier V, et al. FIB-4: An inexpensive and accurate marker of fibrosis in HCV infection. comparison with liver biopsy and fibrotest. Hepatology. 2007;46(1):32–6. doi: 10.1002/hep.21669 [DOI] [PubMed] [Google Scholar]
  • 9.E Anastasiou O, Büchter M, A Baba H, Korth J, Canbay A, Gerken G, et al. Performance and Utility of Transient Elastography and Non-Invasive Markers of Liver Fiibrosis in Patients with Autoimmune Hepatitis: A Single Centre Experience. Hepat Mon. 2016. Nov;16(11):e40737. doi: 10.5812/hepatmon.40737 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Chai V, Vassilakos A, Lee Y, Wright JA, Young AH. Optimization of the PAXgene blood RNA extraction system for gene expression analysis of clinical samples. J Clin Lab Anal. 2005;19(5):182–8. doi: 10.1002/jcla.20075 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Gu W, Crawford ED, O’Donovan BD, Wilson MR, Chow ED, Retallack H, et al. Depletion of Abundant Sequences by Hybridization (DASH): using Cas9 to remove unwanted high-abundance species in sequencing libraries and molecular counting applications. Genome Biology. 2016;17(1):1–13. doi: 10.1186/s13059-016-0904-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Ruby JG, Bellare P, DeRisi JL. PRICE: software for the targeted assembly of components of (meta) genomic sequence data. G3 Genes Genomes Genetics. 2013;3. doi: 10.1534/g3.113.005967 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal. 2011. May 2;17(1):10–2. [Google Scholar]
  • 14.Infectious Disease Sequencing Platform. github.com/chanzuckerberg/idseq-web [Internet]. Chan Zuckerberg Initiative; 2019 [cited 2019 Mar 11]. Available from: https://github.com/chanzuckerberg/idseq-web
  • 15.Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S. STAR: ultrafast universal RNA-seq aligner. Bioinformatics [Internet]. 2013;29. Available from: 10.1093/bioinformatics/bts635 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Smedley D, Haider S, Durinck S, Pandini L, Provero P, Allen J, et al. The BioMart community portal: an innovative alternative to large, centralized data repositories. Nucleic Acids Research. 2015. Jul 1;43(W1):W589–98. doi: 10.1093/nar/gkv350 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Breiman L. Random Forests. Machine Learning. 2001. Oct 1;45(1):5–32. [Google Scholar]
  • 18.Díaz-Uriarte R, Alvarez de Andrés S. Gene selection and classification of microarray data using random forest. BMC Bioinformatics. 2006. Jan 6;7:3. doi: 10.1186/1471-2105-7-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008. Dec 29;9:559. doi: 10.1186/1471-2105-9-559 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, et al. Robust enumeration of cell subsets from tissue expression profiles. Nature Methods. 2015. May;12(5):453–7. doi: 10.1038/nmeth.3337 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Honda K, Yanai H, Negishi H, Asagiri M, Sato M, Mizutani T, et al. IRF-7 is the master regulator of type-I interferon-dependent immune responses. Nature. 2005. Apr;434(7034):772–7. doi: 10.1038/nature03464 [DOI] [PubMed] [Google Scholar]
  • 22.Ning S, Pagano JS, Barber GN. IRF7: activation, regulation, modification and function. Genes & Immunity. 2011. Sep;12(6):399–414. doi: 10.1038/gene.2011.21 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Kröger A, Köster M, Schroeder K, Hauser H, Mueller PP. Review: Activities of IRF-1. Journal of Interferon & Cytokine Research. 2002. Jan 1;22(1):5–14. doi: 10.1089/107999002753452610 [DOI] [PubMed] [Google Scholar]
  • 24.Banchereau J, Pascual V. Type I Interferon in Systemic Lupus Erythematosus and Other Autoimmune Diseases. Immunity. 2006. Sep 1;25(3):383–92. doi: 10.1016/j.immuni.2006.08.010 [DOI] [PubMed] [Google Scholar]
  • 25.Pascual V, Farkas L, Banchereau J. Systemic lupus erythematosus: all roads lead to type I interferons. Current Opinion in Immunology. 2006. Dec 1;18(6):676–82. doi: 10.1016/j.coi.2006.09.014 [DOI] [PubMed] [Google Scholar]
  • 26.Bennett L, Palucka AK, Arce E, Cantrell V, Borvak J, Banchereau J, et al. Interferon and Granulopoiesis Signatures in Systemic Lupus Erythematosus Blood. J Exp Med. 2003. Mar 17;197(6):711–23. doi: 10.1084/jem.20021553 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Vial T, Descotes J. Clinical Toxicity of the Interferons. Drug-Safety. 1994. Feb 1;10(2):115–50. doi: 10.2165/00002018-199410020-00003 [DOI] [PubMed] [Google Scholar]
  • 28.García-Buey L, García-Monzón C, Rodriguez S, Borque MJ, García-Sánchez A, Iglesias R, et al. Latent autoimmune hepatitis triggered during interferon therapy in patients with chronic hepatitis C. Gastroenterology. 1995. Jun;108(6):1770–7. doi: 10.1016/0016-5085(95)90139-6 [DOI] [PubMed] [Google Scholar]
  • 29.Chatrath H, Allen L, Boyer TD. Use of sirolimus in the treatment of refractory autoimmune hepatitis. Am J Med. 2014. Nov;127(11):1128–31. doi: 10.1016/j.amjmed.2014.06.016 [DOI] [PubMed] [Google Scholar]
  • 30.Ng C-H, Chng W-J. Recent advances in acute promyelocytic leukaemia. F1000Res. 2017;6:1273. doi: 10.12688/f1000research.10736.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Ramírez J, Cañete JD. Anakinra for the treatment of rheumatoid arthritis: a safety evaluation. Expert Opin Drug Saf. 2018. Jul;17(7):727–32. doi: 10.1080/14740338.2018.1486819 [DOI] [PubMed] [Google Scholar]
  • 32.Singh V, Keisham A, Bhalla A, Sharma N, Agarwal R, Sharma R, et al. Efficacy of Granulocyte Colony-Stimulating Factor and N-Acetylcysteine Therapies in Patients With Severe Alcoholic Hepatitis. Clin Gastroenterol Hepatol. 2018. Oct;16(10):1650–1656.e2. doi: 10.1016/j.cgh.2018.01.040 [DOI] [PubMed] [Google Scholar]
  • 33.Ngu JH, Gearry RB, Frampton CM, Stedman CAM. Predictors of poor outcome in patients w ith autoimmune hepatitis: A population-based study. Hepatology. 2013;57(6):2399–406. doi: 10.1002/hep.26290 [DOI] [PubMed] [Google Scholar]
  • 34.Langfelder P, Mischel PS, Horvath S. When Is Hub Gene Selection Better than Standard Meta-Analysis? PLoS One [Internet]. 2013. Apr 17 [cited 2020 Jan 9];8(4). Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3629234/ doi: 10.1371/journal.pone.0061505 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Werner M, Wallerstedt S, Lindgren S, Almer S, Björnsson E, Bergquist A, et al. Characteristics and long-term outcome of patients with autoimmune hepatitis related to the initial treatment response. Scandinavian Journal of Gastroenterology. 2010. Apr 1;45(4):457–67. doi: 10.3109/00365520903555861 [DOI] [PubMed] [Google Scholar]
  • 36.Grant CR, Holder BS, Liberal R, Heneghan MA, Ma Y, Mieli-Vergani G, et al. Immunosuppressive drugs affect interferon (IFN)-γ and programmed cell death 1 (PD-1) kinetics in patients with newly diagnosed autoimmune hepatitis. Clin Exp Immunol. 2017;189(1):71–82. doi: 10.1111/cei.12956 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Nishikawa H, Enomoto H, Iwata Y, Kishino K, Shimono Y, Hasegawa K, et al. B-Cell Activating Factor Belonging to the Tumor Necrosis Factor Family and Interferon-γ-Inducible Protein-10 in Autoimmune Hepatitis. Medicine (Baltimore). 2016. Mar;95(12):e3194. doi: 10.1097/MD.0000000000003194 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Morand EF, Furie R, Tanaka Y, Bruce IN, Askanase AD, Richez C, et al. Trial of Anifrolumab in Active Systemic Lupus Erythematosus. New England Journal of Medicine. 2020. Jan 16;382(3):211–21. doi: 10.1056/NEJMoa1912196 [DOI] [PubMed] [Google Scholar]
  • 39.Marano G, Franchini M, Farina B, Piccinini V, Pupella S, Vaglio S, et al. The human pegivirus: A new name for an “ancient” virus. Can transfusion medicine come up with something new? Acta virologica. 2017;61(04):401–12. doi: 10.4149/av_2017_402 [DOI] [PubMed] [Google Scholar]
  • 40.Chivero ET, Stapleton JT. Tropism of human pegivirus (formerly known as GB virus C/hepatitis G virus) and host immunomodulation: insights into a highly successful viral infection. Journal of General Virology. 2015. Jul 1;96(7):1521–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Yang N, Dai R, Zhang X. Global prevalence of human pegivirus-1 in healthy volunteer blood donors: a systematic review and meta-analysis. Vox Sanguinis [Internet]. [cited 2020. Jan 26];n/a(n/a). Available from: https://onlinelibrary.wiley.com/doi/abs/10.1111/vox.12876 [DOI] [PubMed] [Google Scholar]
  • 42.Izumi T, Sakata K, Okuzaki D, Inokuchi S, Tamura T, Motooka D, et al. Characterization of human pegivirus infection in liver transplantation recipients. Journal of Medical Virology. 2019;91(12):2093–100. doi: 10.1002/jmv.25555 [DOI] [PubMed] [Google Scholar]
  • 43.Simoes CC, Saldarriaga OA, Utay NS, Stueck AE, Merwat SK, Merwat SN, et al. Direct-Acting Antiviral Treatment of Patients with Hepatitis C Resolves Serologic and Histopathologic Features of Autoimmune Hepatitis. Hepatol Commun. 2019. Aug;3(8):1113–23. doi: 10.1002/hep4.1388 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Sugimura T, Obermayer-Straub P, Kayser A, Braun S, Loges S, Alex B, et al. A Major CYP2D6 Autoepitope in Autoimmune Hepatitis Type 2 and Chronic Hepatitis C is a Three-dimensional Structure Homologous to Other Cytochrome P450 Autoantigens. Autoimmunity. 2002. Jan 1;35(8):501–13. doi: 10.1080/0891693021000069556 [DOI] [PubMed] [Google Scholar]
  • 45.Singh V, Sharma AK, Narasimhan RL, Bhalla A, Sharma N, Sharma R. Granulocyte colony-stimulating factor in severe alcoholic hepatitis: a randomized pilot study. Am J Gastroenterol. 2014. Sep;109(9):1417–23. doi: 10.1038/ajg.2014.154 [DOI] [PubMed] [Google Scholar]
  • 46.Hoeroldt B, McFarlane E, Dube A, Basumani P, Karajeh M, Campbell MJ, et al. Long-term outcomes of patients with autoimmune hepatitis managed at a nontransplant center. Gastroenterology. 2011. Jun;140(7):1980–9. doi: 10.1053/j.gastro.2011.02.065 [DOI] [PubMed] [Google Scholar]
  • 47.Werner M, Prytz H, Ohlsson B, Almer S, Björnsson E, Bergquist A, et al. Epidemiology and the initial presentation of autoimmune hepatitis in Sweden: a nationwide study. Scand J Gastroenterol. 2008;43(10):1232–40. doi: 10.1080/00365520802130183 [DOI] [PubMed] [Google Scholar]
  • 48.Kirstein MM, Metzler F, Geiger E, Heinrich E, Hallensleben M, Manns MP, et al. Prediction of short- and long-term outcome in patients with autoimmune hepatitis. Hepatology. 2015. Nov;62(5):1524–35. doi: 10.1002/hep.27983 [DOI] [PubMed] [Google Scholar]
  • 49.Hashimoto E, Lindor KD, Homburger HA, Dickson ER, Czaja AJ, Wiesner RH, et al. Immunohistochemical characterization of hepatic lymphocytes in primary biliary cirrhosis in comparison with primary sclerosing cholangitis and autoimmune chronic active hepatitis. Mayo Clin Proc. 1993. Nov;68(11):1049–55. doi: 10.1016/s0025-6196(12)60897-0 [DOI] [PubMed] [Google Scholar]
  • 50.Longhi MS, Hussain MJ, Bogdanos DP, Quaglia A, Mieli-Vergani G, Ma Y, et al. Cytochrome P450IID6-specific CD8 T cell immune responses mirror disease activity in autoimmune hepatitis type 2. Hepatology. 2007. Aug;46(2):472–84. doi: 10.1002/hep.21658 [DOI] [PubMed] [Google Scholar]
  • 51.Longhi MS, Ma Y, Mitry RR, Bogdanos DP, Heneghan M, Cheeseman P, et al. Effect of CD4+ CD25+ regulatory T-cells on CD8 T-cell function in patients with autoimmune hepatitis. J Autoimmun. 2005. Aug;25(1):63–71. doi: 10.1016/j.jaut.2005.05.001 [DOI] [PubMed] [Google Scholar]
  • 52.Shiow LR, Rosen DB, Brdičková N, Xu Y, An J, Lanier LL, et al. CD69 acts downstream of interferon-α/β to inhibit S1P 1 and lymphocyte egress from lymphoid organs. Nature. 2006. Mar;440(7083):540–4. doi: 10.1038/nature04606 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Frederic Rieux-Laucat

10 Jun 2021

PONE-D-20-39992

Transcriptomic Profiling of Blood from Autoimmune Hepatitis Patients Reveals Potential Mechanisms and Management Options

PLOS ONE

Dear Dr. Tana,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

See reviewers comments and concerns below.

Please submit your revised manuscript by Jul 25 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Frederic Rieux-Laucat, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

3. Please upload a copy of Supporting Information Tables 1-3 which you refer to in your text on pages 26 and 27.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Authors Michele May-Sien Tana et al performed bulk RNA sequencing on whole blood in 75 AIH patients and 25 healthy volunteers.

They identified a set of genes which are differentially expressed in AIH patients as compared to healthy controls, including the interferon signaling pathway.

They also compared gene expression modulation in patients treatment-naive patients as compared to healthy controls in order to identify potential target of well-known drugs. The data suggested that Sirolimus or GM-CSF could be a potential treatment in AIH patient. This study need to be expand and validated in a larger cohort.

Major comments:

1. Concerning the healthy controls, you need to precise that they signed an informed consent prior to be enrolled in the study.

2. The supplemental Tables are missing.

3. For the pathway enrichment analysis: why did the authors choose not to use any p-value cut-off (line183)?

Minor comments:

1. The use of Sirolimus as a potential drug can be discussed in relation to the modulation of the mTOR pathway depicted in figure S3

2. Some typos remain in the manuscript. References should be placed before the full stop at the end of each sentence.

3. These analyses could benefit from comparison of the same explorations on liver biopsies in parallel.

In total, this work paved the way of a better understanding of the physiopathology of AIH, and need to be extend and validated in a larger cohort.

Reviewer #2: As stated by the authors, there is a need in AIH for easily testable biomarkers to help diagnosis and prognosis in AIH and transcriptomic profiling seems indeed a good approach to answer this need. As it stands, the presence of an IFN response and the identification of some genes as potential biomarkers could represent some interesting findings but they are weakened by the lack of validation. The top 12 genes described could be easily tested on other patients and/or on the same patient by quantitative PCR and/or microarrays to validate their significance. IFN signatures could also be investigated using other methods and finally an immune phenotyping of the whole blood seems to be required to validate their in-silico prediction about CD8 T cells.

- Line 2: Title : “management Options” makes the title quite difficult to understand out of context

- Line 99: as not every patient received a liver biopsy to estimate fibrosis stage are there any bias in gene expression profiles based on how cirrhosis was determined ( grops can be established based on the different options listed in fig S1

- Line 182: as already stated by another reviewer, the absence of filtering based on adjusted p-value cutoff seems rather hazardous in particular when it comes to identify genes and pathways potentially used for diagnosis purpose. Answer already provided to reviewer#2 is not clear and the fact that IPA did not return any pathways when using an adjusted p-value regarding treatment-naïve AIH vs healthy is rather concerning. Could authors elaborate more their choice to continue analyzing data without adjusted p-value cutoff?

- Line 223: Type 1, Type 2 and other types of AIH are not clearly defined nor in Introduction, material and methods and those classifications do not appear in Table 1

- Line 238: There is on conclusion drawn from figure 1B which is rather surprising? Does the clustering make sense for the different groups in Y axis for example? Commenting such clustering is rather important as a quality control to ensure that everything is correct and that the analyses can be pushed further.

- Same comment for figure 1C: if there is nothing to be said about fig 1C, then it should be moved to sup fig.

- Line 244: 249 DEG only when comparing cases to controls seem a rather small number based on the fact that the analysis was done on whole blood. It could reflect a quite heterogeneous representation among “cases” or the need to do a more refine analysis in cell subtypes. Could the author comments on this small number of DEG in cases vs controls?

- Line 245: Could the authors explain how parameters were chosen and incremented regarding the random forest approach?

- Line 256: pathway analysis was performed on what? I guess it is on the 249 DEG but not really clear ? the 9 genes found by random forest modeling?

- Line 265: How many genes differentially expressed between treatment-naïve and healthy controls? Why does the authors did not comment on the appearance of other pathways when comparing treatment naïve to healthy subject? EIF2, EIF4 signaling, mTOR? They could all be potentially interesting.

- Line 270: upstream regulator analysis was performed on the 249 DEG? The number of significant pathways seems this time really high for such a low number of genes.

- Line 290: conclusion of this paragraph should rather be that using the pathogen detection platform, no evidence of viral presence was detected in the majority of samples and could not therefore explain the increase of IFN signaling pathways.

- Line 306: Can inverse correlation be interesting in those type of analyses? Clusters 2,3,4 and 5 have quite a strong inverse correlation. Could authors comments on those?

- Line 319: Could the authors define gene significance?

- Line 324: As stated by the authors the segregation of individual with AIH based on cirrhosis status is only partial. If a signature score combining the expression of those top genes is used, could the authors obtained a better segregation?

- Line 337: Using a computational method alone to assess contribution of individual immune population from whole blood bulk RNA-seq data, seems a rather risky approach. Authors should confirm their predictions by looking at immune cells distribution in the blood through classical immunophenotyping approach (flow cytometry, CyTOF for example).

- What are the results obtained with CIBERSORT regarding the other cell types? This would be particularly useful to understand how reliable the results of the deconvolution are?

- Line 402: Sirolimus is a treatment, so author should be careful of the words used to explain Sirolimus as an inhibited upstream regulator. All regulators not linked with transcription factors should have been excluded from their upstream analyses.

- Line 405: conclusions made by the authors would need further supports from their data. RICTOR changes of expressions in all their samples? Predicted target changes of expression?

- Line 476 and 477: those lines should be removed from the discussion as theire is no correlation between pegivirus presence and AIH.

Reviewer #3: - The authors performed RNAseq in the peripheral blood of patients with auto immune hepatitis (AIH) to gain insight into the pathogenesis of this disease.

- This study is very exciting and in my opinion, figure 2A is on of the most interesting finding of the study. It compares the top 10 canonial pathways in patients with AIH vs healthy subjects. One limitation is that all AIH patients are included regardless of the treatment they received, and this can of course interfere with the pathways and be a confounding factor. When the authors perform this analysis only on patients naive of treatment (n=5), they can only draw a conclusion on interferon signaling and not on the other caonical pathways due to lack of power.

- Table 1: Treatment type. I would like the authors to give more detail (in the table or in the text) about treatment type. Treatment received is a main confounding factor in this study, therefore, it is important to have maximum information about treatment received.

-> "Other treatment": I would like to know the name of these treatments

-> "Off treatment": cause ? (remission? Non observance ? )

-> " Steroids". Usually, AIH treatment consists of the association of steroids AND another drug (usually azathioprine). Patients are not often treated with steroids only. I would like more information about the treatment associated with steroids (if any) for each of the 28 patients on steroids.

- Figure 4: Treatment received can explain a lower CD8 count. Excluding patients on steroids (and not on other treatments that may have an impact of CD8 count such as I think azathioprine) is not enough (Figure 4 B).I think this should be mentionned in the discussion. If you want to have results more interpretable, you should repeat this analysis but for a given treatment (ie: compare count of CD8+ of patients on steroids only (or another drug but the same one), to show if there is a difference between complete and partial response).

- 384 "and that pegivirus is more prevalent in patients with liver transplant that hepatectomy patients" -> I do not understand what the authors means by "hepatectomy patients" (hepatectomy means the removal of one's liver).

- 432 "in many patients with early cirrhosis fibrosis stage in only apparent ...". I am not sure to understand what the authors mean by "early cirrhosis".

- 433: I would like to point out that fibroScan and shear wave devices are recognised and validated non invasive methods for the diagnosis of cirrhosis.

- 435 "and advanced disease". I am not sure to understand what the authors means by "advanced disease".

- Table 2 and 3A are blurry

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes: Claire MAYER

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 Mar 21;17(3):e0264307. doi: 10.1371/journal.pone.0264307.r003

Author response to Decision Letter 0


28 Jul 2021

Reviewer #1: Authors Michele May-Sien Tana et al performed bulk RNA sequencing on whole blood in 75 AIH patients and 25 healthy volunteers.

They identified a set of genes which are differentially expressed in AIH patients as compared to healthy controls, including the interferon signaling pathway.

They also compared gene expression modulation in patients treatment-naive patients as compared to healthy controls in order to identify potential target of well-known drugs. The data suggested that Sirolimus or GM-CSF could be a potential treatment in AIH patient. This study need to be expand and validated in a larger cohort.

Major comments:

1. Concerning the healthy controls, you need to precise that they signed an informed consent prior to be enrolled in the study.

We specified that the healthy controls and AIH cases had also provided informed consent.

2. The supplemental Tables are missing.

We uploaded the supplemental tables.

3. For the pathway enrichment analysis: why did the authors choose not to use any p-value cut-off (line183)?

We have clarified the Methods and modified our Discussion to explain this choice.

“Differential gene expression results were imported into Ingenuity Pathway Analysis software (Qiagen, Redwood City, CA, USA). Genes were considered for entry into the analysis if they had an absolute log fold change greater than 0.6, but no adjusted p-value cutoff was required.”

“With the small number of treatment-naive patients, our analysis of pathways activated or inhibited in this group could not use a p-value cutoff for genes considered in the analysis and was thus exploratory only. For the analysis of AIH cases vs. healthy controls, the pathway results were similar with or without a p-value cutoff for considered genes.”

Minor comments:

1. The use of Sirolimus as a potential drug can be discussed in relation to the modulation of the mTOR pathway depicted in figure S3

We referenced Figure S3 and the mTOR pathway in the part of the Discussion about sirolimus.

“Sirolimus is a macrolide drug that inhibits mTOR, a protein that controls the proliferation and survival of activated lymphocytes. Indeed, mTOR signaling was one of the most significant pathways in our focused analysis of treatment-naïve patients vs. healthy controls (Figure S3).”

2. Some typos remain in the manuscript. References should be placed before the full stop at the end of each sentence.

We fixed the typos and placed superscripts before the full stop of each sentence.

3. These analyses could benefit from comparison of the same explorations on liver biopsies in parallel.

We added liver biopsies as a future direction in the Discussion:

“further efforts to expand longitudinal cohorts that include AIH patients, clinical data, and blood and liver biospecimens are currently underway”

In total, this work paved the way of a better understanding of the physiopathology of AIH, and need to be extend and validated in a larger cohort.

Reviewer #2: As stated by the authors, there is a need in AIH for easily testable biomarkers to help diagnosis and prognosis in AIH and transcriptomic profiling seems indeed a good approach to answer this need. As it stands, the presence of an IFN response and the identification of some genes as potential biomarkers could represent some interesting findings but they are weakened by the lack of validation. The top 12 genes described could be easily tested on other patients and/or on the same patient by quantitative PCR and/or microarrays to validate their significance. IFN signatures could also be investigated using other methods and finally an immune phenotyping of the whole blood seems to be required to validate their in-silico prediction about CD8 T cells.

- Line 2: Title : “management Options” makes the title quite difficult to understand out of context

We changed the title to “Transcriptomic Profiling of Blood from Autoimmune Hepatitis Patients Reveals Potential Mechanisms with Implications for Management.”

- Line 99: as not every patient received a liver biopsy to estimate fibrosis stage are there any bias in gene expression profiles based on how cirrhosis was determined ( grops can be established based on the different options listed in fig S1

We acknowledged this potential for bias in the Discussion:

“We did not have liver biopsy data on all patients but developed an algorithm for determining if patients had cirrhosis, which raises the possibility of bias in our analysis.”

- Line 182: as already stated by another reviewer, the absence of filtering based on adjusted p-value cutoff seems rather hazardous in particular when it comes to identify genes and pathways potentially used for diagnosis purpose. Answer already provided to reviewer#2 is not clear and the fact that IPA did not return any pathways when using an adjusted p-value regarding treatment-naïve AIH vs healthy is rather concerning. Could authors elaborate more their choice to continue analyzing data without adjusted p-value cutoff?

We did not use a p-value threshold for genes to include in the pathway analysis, but the software did assign p-values to individual pathways that it identified. The number of treatment-naïve AIH patients in the cohort was small (n=5) because once a diagnosis is made, physicians start therapy rapidly. Hence, most patients donated blood after treatment had been initiated. Therefore, we did not have the statistical power to generate low p-values for many genes. However, we wanted to include differentially expressed genes (log-fold 0.6 or greater) in the pathway analysis. We acknowledge these limitations in the Discussion, and state that this was an exploratory pilot study.

Please see our response to Reviewer #1’s major comment #3 about line 183.

“Differential gene expression results were imported into Ingenuity Pathway Analysis software (Qiagen, Redwood City, CA, USA). Genes were considered for entry into the analysis if they had an absolute log fold change greater than 0.6, but no adjusted p-value cutoff was required.”

“With the small number of treatment-naive patients, our analysis of pathways activated or inhibited in this group could not use a p-value cutoff for genes considered in the analysis and was thus exploratory only. For the analysis of AIH cases vs. healthy controls, the pathway results were similar with or without a p-value cutoff for included genes.”

- Line 223: Type 1, Type 2 and other types of AIH are not clearly defined nor in Introduction, material and methods and those classifications do not appear in Table 1

We added definitions for Type 1 and Type 2 AIH in the Methods:

“Autoimmune hepatitis type was determined through chart review. Type 1 was defined by positivity of antinuclear, anti-smooth muscle, or anti-actin antibodies. Type 2 was defined by antibodies to LKM-1 or LC-1. Cases following exposure to an offending drug with the ability to wean immunosuppression was classified as drug-induced AIH.”

- Line 238: There is on conclusion drawn from figure 1B which is rather surprising? Does the clustering make sense for the different groups in Y axis for example? Commenting such clustering is rather important as a quality control to ensure that everything is correct and that the analyses can be pushed further.

We enhanced the Results to comment on the clustering in Figure 1B:

“Unsupervised clustering of the 1000 most variably expressed genes in the dataset did not entirely separate samples based on diagnosis, treatment, or fibrosis stage.”

- Same comment for figure 1C: if there is nothing to be said about fig 1C, then it should be moved to sup fig.

We enhanced the Results to comment on the clustering in Figure 1C:

“Unsupervised clustering analysis on all gene counts did not reveal grouping of samples by demographic factors such as age, sex, and race, nor by RNA preparation batch (Figure 1C, Figure S2).”

- Line 244: 249 DEG only when comparing cases to controls seem a rather small number based on the fact that the analysis was done on whole blood. It could reflect a quite heterogeneous representation among “cases” or the need to do a more refine analysis in cell subtypes. Could the author comments on this small number of DEG in cases vs controls?

We added comments on the number of differentially expressed genes to the Discussion:

“We found 249 genes that were significantly differentially expressed in the whole blood of AIH patients compared to healthy controls. This relatively small number of differentially expressed genes could be related to the fact that most blood samples were drawn from AIH patients after the initiation of therapy, heterogeneity among cases and/or, or perhaps the need to focus on certain cell types within whole blood.”

- Line 245: Could the authors explain how parameters were chosen and incremented regarding the random forest approach?

We used varSelRF with a large number of trees as that has been shown to be optimal for the random forest algorithm [Breiman, Machine Learning, 2001, https://link.springer.com/article/10.1023/A:1010933404324]. In brief, we used a large enough forest to increase the consistency of the results. The OOB error is the parameter to which the RF is more sensitive to and that is what is actually minimized during the process of variable elimination. The reason for 2000 iterations is to have a robust estimate of the variable importance. Random forests do not overfit in general and there is no need to perform cross-validation because that is built in the algorithm. For a more complete explanation please refer to the notes of the original implementation of the algorithm (https://www.stat.berkeley.edu/~breiman/RandomForests/cc_home.htm). We have added the Breiman, Machine Learning 2001 paper as a reference in the manuscript.

- Line 256: pathway analysis was performed on what? I guess it is on the 249 DEG but not really clear ? the 9 genes found by random forest modeling?

Pathway analysis was performed on all protein-coding genes with an absolute log fold change (between the groups being compared) greater than 0.6. We clarified this in the Methods.

“Genes were considered for entry into the analysis if they had an absolute log fold change greater than 0.6 between the two groups of interest (AIH cases vs. healthy controls, or treatment-naïve AIH vs. healthy controls)”

- Line 265: How many genes differentially expressed between treatment-naïve and healthy controls? Why does the authors did not comment on the appearance of other pathways when comparing treatment naïve to healthy subject? EIF2, EIF4 signaling, mTOR? They could all be potentially interesting.

We added more commentary on activated pathways to the Results section.

“Analysis further identified several additional pathways of interest, for example, myeloid signaling, as indicated by activation of triggering receptor expressed on myeloid cells 1 (TREM1) and inflammasome pathways, but inhibition of liver X receptor/retinoid X receptor (LXR/RXR) activation (Fig. 2A)”

- Line 270: upstream regulator analysis was performed on the 249 DEG? The number of significant pathways seems this time really high for such a low number of genes.

As above, we clarified in the Methods that IPA (Ingenuity Pathway Analysis) used all protein-coding genes with a log-fold change of .6 or greater between AIH cases and healthy controls or tx-naïve AIH and healthy.

- Line 290: conclusion of this paragraph should rather be that using the pathogen detection platform, no evidence of viral presence was detected in the majority of samples and could not therefore explain the increase of IFN signaling pathways.

We modified the paragraph heading per the reviewer’s suggestion:

“Pathogen detection platform reveals infection with pegivirus in a handful of samples, but cannot fully explain the activation of interferon signaling in AIH compared to healthy controls”

- Line 306: Can inverse correlation be interesting in those type of analyses? Clusters 2,3,4 and 5 have quite a strong inverse correlation. Could authors comments on those?

Yes, inverse correlation can also be significant in WGCNA. We commented on gene module 5 in the Results.

“Conversely, module 5 was significantly inversely correlated with clinical variables related to advanced fibrosis (fibroscan score, FIB-4 score, cirrhosis, decompensation, and need for transplant).”

- Line 319: Could the authors define gene significance?

Yes, the Methods section was updated to clarify what the p-value represents as below:

“To identify hub genes, we used a gene significance cutoff of 0.5 and a measure of centrality within the module, module membership, at a cutoff of 0.7. To assess the strength of the correlation between genes of interest and individual modules, gene significance for the genes in the module were compared to module membership using univariate regression, and correlation p-values were generated.”

- Line 324: As stated by the authors the segregation of individual with AIH based on cirrhosis status is only partial. If a signature score combining the expression of those top genes is used, could the authors obtained a better segregation?

Yes, a goal for future directions is to obtain additional patient samples to create a validation cohort where we can test identified signatures, on a more selected set of top genes in order to improve segregation of phenotypes and assess whether these genes may have predictive value. Unfortunately, we did not have the sample size for this in our current study.

- Line 337: Using a computational method alone to assess contribution of individual immune population from whole blood bulk RNA-seq data, seems a rather risky approach. Authors should confirm their predictions by looking at immune cells distribution in the blood through classical immunophenotyping approach (flow cytometry, CyTOF for example).

Yes, this would be ideal. Unfortunately, the whole blood specimens used in this study were collected and frozen in Paxgene tubes in order to preserve intact RNA extraction, which does not allow for analysis of individual cells / cell surface proteins by flow cytometry/CyTOF. A goal for future study would be to perform cell-type specific analysis (eg flow cytometry, scRNA Seq, etc). We added to the discussion that classical immunophenotyping will be a future step.

“However, further efforts to expand longitudinal cohorts that include AIH patients, clinical data, and blood and liver biospecimens are currently underway, and classical immunophenotyping assays are planned.”

- What are the results obtained with CIBERSORT regarding the other cell types? This would be particularly useful to understand how reliable the results of the deconvolution are?

Below is our data on a few additional immunological cell types of interest for review:

- Line 402: Sirolimus is a treatment, so author should be careful of the words used to explain Sirolimus as an inhibited upstream regulator. All regulators not linked with transcription factors should have been excluded from their upstream analyses.

We modified our wording about sirolimus:

“Among the multiple findings, interferon signaling was an activated canonical pathway. Regarding inhibition, genes that would be found downstream of the drug sirolimus were inhibited in AIH patients.”

- Line 405: conclusions made by the authors would need further supports from their data. RICTOR changes of expressions in all their samples? Predicted target changes of expression?

We modified our wording about RICTOR in the Discussion:

“RPTOR Independent Companion Of MTOR Complex 2 (RICTOR) was determined by pathway analysis to be a significantly inhibited upstream regulator of gene expression in treatment-naïve AIH patients, with multiple target molecules supporting this in the dataset. Sirolimus is a macrolide drug that inhibits mTOR, a protein that controls the proliferation and survival of activated lymphocytes. Indeed, mTOR signaling was one of the most significant pathways in our focused analysis of treatment-naïve patients vs. healthy controls (Figure S3).”

- Line 476 and 477: those lines should be removed from the discussion as theire is no correlation between pegivirus presence and AIH.

We modified the wording on pegivirus and AIH.

“We also uncovered a number of AIH samples with viral sequences, an unexpected finding that raises the possibility of molecular mimicry after viral infection as a mechanism in AIH.”

Reviewer #3: - The authors performed RNAseq in the peripheral blood of patients with auto immune hepatitis (AIH) to gain insight into the pathogenesis of this disease.

- This study is very exciting and in my opinion, figure 2A is on of the most interesting finding of the study. It compares the top 10 canonial pathways in patients with AIH vs healthy subjects. One limitation is that all AIH patients are included regardless of the treatment they received, and this can of course interfere with the pathways and be a confounding factor. When the authors perform this analysis only on patients naive of treatment (n=5), they can only draw a conclusion on interferon signaling and not on the other caonical pathways due to lack of power.

- Table 1: Treatment type. I would like the authors to give more detail (in the table or in the text) about treatment type. Treatment received is a main confounding factor in this study, therefore, it is important to have maximum information about treatment received.

-> "Other treatment": I would like to know the name of these treatments

We added information on “other treatments” to the text.

-> "Off treatment": cause ? (remission? Non observance ? )

We added information as to why certain patients were “off treatment” to the text.

-> " Steroids". Usually, AIH treatment consists of the association of steroids AND another drug (usually azathioprine). Patients are not often treated with steroids only. I would like more information about the treatment associated with steroids (if any) for each of the 28 patients on steroids.

We clarified that “steroids” indicates patients were on an AIH treatment regimen that included corticosteroids at any dose.

“10 AIH patients were off therapy at the time of blood biospecimen collection (5 were treatment-naïve, 3 were in remission and had been withdrawn from therapy, and 2 had been previously treated but were off therapy at the time of sample collection). Of subjects on AIH therapy at the time of sample collection, 28 were on a regimen that included corticosteroids: 21 with and 7 without maintenance medications. 29 of the AIH patients on therapy at the time of sample collection were on a steroid-free regimen consisting only of maintenance medications such as azathioprine, mycophenolate, or tacrolimus.”

- Figure 4: Treatment received can explain a lower CD8 count. Excluding patients on steroids (and not on other treatments that may have an impact of CD8 count such as I think azathioprine) is not enough (Figure 4 B).I think this should be mentionned in the discussion. If you want to have results more interpretable, you should repeat this analysis but for a given treatment (ie: compare count of CD8+ of patients on steroids only (or another drug but the same one), to show if there is a difference between complete and partial response).

Yes, this is an important point, we have added the following to the discussion to clarify this point. Unfortunately, we did not have a great enough sample size to perform analysis of individual subgroups by treatment type.

“Finally, dividing the AIH cohort by response to treatment and deconvoluting the gene expression data by cell type pointed to a role for CD8+ T cells. Patients with a partial response to therapy displayed a lower CD8+ T cell expression signal. CD8+ T lymphocytes are cytotoxic and induce apoptosis of damaged cells in response to antigen presentation on MHC Class I molecules, and they are a major cell type in areas of interface hepatitis.(48) In Type II AIH, the degree of CD8+ T cell response correlates with disease activity.(49) In vitro studies have shown that immunosuppressive therapy alters the ability of regulatory T cells to modulate CD8+ T cell activity (50). However, these results are limited by the fact that the effect of AIH on CD8 cell populations is difficult to truly separate from the effect of AIH therapies, eg various forms of immunosuppression. Further study is required to better understand the relative contrition of immunosuppression and underlying AIH disease activity on CD8 biology.”

- 384 "and that pegivirus is more prevalent in patients with liver transplant that hepatectomy patients" -> I do not understand what the authors means by "hepatectomy patients" (hepatectomy means the removal of one's liver).

We clarified the reference on pegivirus:

“Pegivirus is more prevalent in patients with liver transplant than partial hepatectomy patients, but pegivirus is not associated with any changes in clinical outcomes(41).”

- 432 "in many patients with early cirrhosis fibrosis stage in only apparent ...". I am not sure to understand what the authors mean by "early cirrhosis".

We clarified this sentence to read

“While some patients have obvious radiographic evidence of cirrhosis, there are many patients with early, compensated cirrhosis (i.e. without clinically evident complications), for whom fibrosis stage is only apparent on a liver biopsy.”

- 433: I would like to point out that fibroScan and shear wave devices are recognised and validated non invasive methods for the diagnosis of cirrhosis.

We acknowledged this point in the Discussion:

“Noninvasive methods of fibrosis assessment such as transient elastography and magnetic resonance elastography, while validated in AIH, are not available or accessible for many AIH patients.”

- 435 "and advanced disease". I am not sure to understand what the authors means by "advanced disease".

We clarified this to read “advanced disease (stage 3-4 fibrosis).”

- Table 2 and 3A are blurry

We have replaced these tables to make them clearer.

Attachment

Submitted filename: Response to Reviewers 7-23-21.docx

Decision Letter 1

Gualtiero I Colombo

9 Feb 2022

Transcriptomic Profiling of Blood from Autoimmune Hepatitis Patients Reveals Potential Mechanisms with Implications for Management

PONE-D-20-39992R1

Dear Dr. Tana,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Gualtiero I. Colombo, M.D., Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #3: All previous comments have been adressed.

This study is very interesting and exciting and I hope that it is a first step to a bigger study adressing the same questions.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #3: Yes: Dr Claire Mayer

Acceptance letter

Gualtiero I Colombo

11 Mar 2022

PONE-D-20-39992R1

Transcriptomic Profiling of Blood from Autoimmune Hepatitis Patients Reveals Potential Mechanisms with Implications for Management

Dear Dr. Tana:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Gualtiero I. Colombo

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Flow chart depicting algorithm for determining fibrosis status of AIH patient.

    (TIF)

    S2 Fig

    PCA plot of variance stabilizing transformed gene counts colored by A. age bracket of patient, B. race/ethnicity of patient, and C. library preparation batch for patient sample.

    (TIF)

    S3 Fig. Top ten most significant canonical pathways related to differential gene expression of treatment naïve AIH patients compared to healthy controls.

    (TIF)

    S1 Table. Most differentially expressed genes in AIH patients compared to healthy volunteers.

    (XLSX)

    S2 Table. List of all upstream regulators identified by pathway analysis for treatment-naïve patients compared to healthy controls.

    Republished from Ingenuity Pathway Analysis under a CC BY license, with permission from Qiagen, original copyright 2019.

    (XLS)

    S3 Table. Top 12 hub genes correlated with AIH cirrhosis.

    (XLSX)

    Attachment

    Submitted filename: Response to Reviewers 12-20-20.docx

    Attachment

    Submitted filename: Response to Reviewers 7-23-21.docx

    Data Availability Statement

    Full data set available at github link https://github.com/czbiohub/AIH-Project. Raw data will be deposited into dbGaP prior to publication.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES