Methods
Patients
There were 48 HBsAg-negative FC patients and 27 HBsAg-positive NFC patients in the clinic of our department. Three patients in the FC group were treated with NAs alone, and all the other patients were treated with Peg-IFN, switching from NAs with undetectable HBV DNA and HBsAg below 1,500 IU/mL. All patients in the FC group discontinued Peg-IFN after HBsAg disappeared. The patients in the NFC group were switched to NA monotherapy, and liver biopsies were performed on all patients. The liver biopsies were performed between 12 January 2017, and 11 August 2020. The patients were followed by regular visits. Forty-one positive control samples were originally stored in the liver tissue specimen bank of our department, which was retained before antiviral therapy in treatment-naïve CHB-group patients, and synchronous serum was collected. Specimens were stored in a refrigerator at −80°C. Liver biopsies were performed between 21 August 2010, and 26 May 2014. Ten patients outside the above cohorts were included for deep whole-genome sequencing of liver tissue, including four hepatitis B e antigen (HBeAg)-positive patients, three HBeAg-negative patients with treatment-naïve CHB, and three with an FC after antiviral treatment. The study protocol was consistent with the International Conference on Harmonization Guidelines, applicable regulations, and the ethical guidelines of the Declaration of Helsinki. The protocol and consent forms were approved by the Research Ethics Committee of the Third Affiliated Hospital of Sun Yat-sen University, China (2016, 2-129). Patients with antiviral therapy were obtained from a real-world study registered at clinicaltrails.gov (NCT02745704). All authors had access to the study data and reviewed and approved the final manuscript.
Serological and virological parameters
qHBsAg was measured with Elecsys HBsAg II Quant reagent kits (Roche Diagnostics, Indianapolis, IN, USA). Serum HBV DNA was assayed with a Roche COBAS AmpliPrep/COBAS TaqMan HBV Test v2.0 (Roche Molecular Diagnostics, Branchburg, NJ, USA). All procedures were performed following the manufacturer’s instructions. Serum hepatitis B core related antigen (HBcrAg) levels were measured with an automated chemiluminescent enzyme immunoassay (CLEIA) immunoassay analyzer and Lumipulse G HBcrAg kits (both from Fujirebio, Inc., Tokyo, Japan). HBcrAg values were expressed as log U/mL, and the measurement range was 3.0–7.0 U/mL. Serum samples were diluted with normal human serum and retested if the HBcrAg levels exceeded the upper limit.
Quantification of serum HBV pgRNA
Serum HBV pre-genomic RNA (pgRNA) levels were measured with a HBV pgRNA quantification kit (Supbio, Inc., Guangzhou, China) as previously described.7,8 HBV pgRNA in serum was isolated, treated with DNase I, and then reverse-transcribed with a commercial kit according to the manufacturer’s instructions. Finally, qPCR was performed without the use of reverse transcriptase. The lower limit of detection was 25 copies/mL.
Quantification of intrahepatic cccDNA
Liver biopsies were performed with ultrasound-guided techniques, and the tissue samples were preserved in liquid nitrogen. HBV cccDNA was purified from liver tissue using QIAamp DNA Mini Kits (Qiagen, Hilden, Germany) following the manufacturer’s instructions. Enzyme treatment, and PCR amplification were performed with the HBV cccDNA quantitative detection kits (Supbio, Inc., Guangzhou, China). In brief, 10 µL of extracted DNA was removed, 1 µL plasmid-safe ATP-dependent DNase (PSAD) enzyme and 39 µL of buffer were added, with incubation at 37°C for 30 m, and immediate transfer to a 70°C incubator for 30 m followed by centrifugation at 13,000 rpm for 1 minute, and storage at −20±5°C. qPCR reactions were performed with an Applied Biosystems 7500 real-time PCR System (Thermo Fisher Scientific, Waltham, MA, USA). The lower limit of detection was 600 copies/106 cells, and the quantitative range was 1×103 to 5×107 copies/106 cells.
Clinical samples and DNA extraction for deep sequencing of the liver genome
DNA was extracted from liver tissue using TIANamp Genomic DNA Kits (Tiangen Biotech, Beijing, China), and 1 µg genomic DNA was sheared using Covaris M220 (Covaris, Inc., Woburn, MA, USA). Sequencing was performed on the Illumina Novaseq 6000 platform (Illumina Inc., San Diego, CA, USA) with 150 bp paired-end reads. Deep sequencing was prepared for HBV integration analysis and HBV genotyping.
HBV integration sites
Clean reads obtained were mapped to the human reference genome (hg38) and the HBV genome. We used the paired-end nature of the reads to search for HBV integration. If a paired-end read uniquely mapped to hg38 at one end and the HBV genome at the other end, it was divided into discordant paired reads and chimeric reads. Chimeric reads were considered as integration sites. The position of the breakpoint was defined as the junction between the human and HBV sequences in the reads assembled at the end of the pairs (chimeric reads). The reads in each category were clustered according to their coordinates if the minimum distance between the edges of the two reads was <50 bp (HBV genome) or 100 bp (human genome) in both paired-end reads.9 Our goal was to sequence 2 billion read pairs. Approximately 2.6 billion paired-end 150 bp reads were obtained for each sample. Humans were partially annotated in HBV integrations from the National Center for Biotechnology Information (NCBI), the University of California, Santa Cruz (UCSC) genome browser home page, Ensemble, HumCFS database,10 and the DriverDBv2 database.11 Reads only mapped to HBV genome were used to analyze HBV genotype.
Statistical analysis
SPSS 20.0 (IBM Corp., Armonk, NY, USA) was used to establish the database and perform the statistical analysis. Normally distributed variables were reported as means±SD. Independent samples were compared using t-tests or one-way analysis of variance. Non-normally distributed measurements were reported as medians and compared using the rank-sum test. Statistical analysis of HBV integration was performed with R3.6.0. Pathway enrichment analysis of integrated genes based on the Kyoto Encyclopedia of Genes and Genomes (KEGG) and gene ontology (GO) was performed with the R package ClusterProfiler. Chi-square tests were used to compare the ratios. Differences were considered statistically significant if p was <0.05.
Results
Intrahepatic cccDNA and serum viral markers in different groups
There were no differences in age and sex among the three groups. The levels of ALT were significantly higher in the CHB group than in the other two groups (p<0.05). Serum HBV DNA was not detected in the FC and NFC groups, compared with 5.8 log10 IU/mL in the CHB group. HBsAg was detected in both the NFC and CHB groups, with the latter being significantly higher (1.4 vs. 3.4 log10 IU/mL, p<0.05). The FC group was negative for pgRNA. Three of the 21 patients in the NFC group and 38 of the 39 patients in the CHB group were positive, with mean levels of 3.0 and 4.2 log10 copies/mL respectively. Serum HBcrAg levels were significantly lower in the FC and NFC groups than in the CHB group (p<0.05). Intrahepatic cccDNA was negative in 13 patients in the FC group and three in the NFC group. None of the patients in the CHB group had undetectable intrahepatic cccDNA. Intrahepatic cccDNA levels were significantly higher in the CHB group than in the other two groups, but the difference in the levels of the FC and NFC groups was not significant (Table 1).
Table 1Demographic and virological features of patients in the study groups
Parameter | FC (N=48) | NFC (N=27) | CHB (N=41) | p-value |
---|
Age, years, mean (SD) | 37.2 (8.2) | 39.7 (9.3) | 38.2 (8.0) | 0.507 |
Sex (F/M) | 7/41 | 4/23 | 8/33 | 0.796 |
ALT, median (IQR) | 25.5 (19.3, 34.5) | 29.5 (21.0, 35.0) | 60.0 (30.0, 148) | <0.001 |
Serum HBV DNA (log10 IU/mL) | ND | ND | 5.8 (1.5) | NA |
qHBsAg (log10 IU/mL) | ND | 1.4 (1.1) | 3.4 (0.6) | <0.001 |
Serum pgRNA (ND/Positive) | 48/0 | 18/3 (n=21) | 1/38 (n=39) | NA |
pgRNA (log10 copies/mL) | ND | 3.0 (1.0) (n=3) | 4.2 (1.6) | 0.193 |
HBcrAg (log10 IU/mL) | 3.6 (0.5) | 3.3 (1.0) (n=11) | 6.1 (1.7) | <0.001 |
Intrahepatic cccDNA (ND/Positive) | 13/35 | 3/24 | 0/41 | 0.105 |
Intrahepatic cccDNA (log10 copies/cell) | −1.3 (0.9) (n=35) | −0.9 (0.6) (n=24) | 0.3 (0.9) | <0.001 |
Clearance of intrahepatic cccDNA in the FC group and its possible influencing factors
Of the 48 FC patients, cccDNA was undetectable in 13, a clearance rate of 27%. There were still 73% of patients with positive intrahepatic cccDNA at levels in the logarithmic range of 0.00365 to 1.54 copies/cell, or −2.44 to 0.19 log10 copies/cell, significantly lower than in the CHB groups (p<0.001). Patients were further divided into cccDNA-negative and cccDNA-positive groups according to the cccDNA status. Sex, age, family history, duration of hepatitis B disease, and antiviral treatment in the two groups were compared. There were no differences between the two groups in sex, age, duration of hepatitis B disease, history of NAs treatment, type of interferon used, or the time to HBsAg clearance. More patients in the cccDNA-positive group had a family history of hepatitis B than in the negative group, but the difference was not statistically significant (45.7% vs. 23.1%, p=0.145). Before interferon treatment, more patients in the cccDNA-negative group had lower levels of HBsAg (≤200 IU/mL) than those in the positive group, but the difference was not statistically significant (p=0.153, Table 2).
Table 2Comparison of demographic features in cccDNA-negative and positive groups
| cccDNA-negative (N=13) | cccDNA-positive (N=35) | p-value |
---|
Age (years) Mean (SD) | 37.5 (6.5) | 37.1 (8.8) | 0.875 |
Sex (F/M) | 2/11 | 5/30 | 1.000 |
Family history of hepatitis B*, n (%) | 3 (23.1%) | 16 (45.7%) | 0.197 |
NAs duration (years) median (IQR) | 6.0 (3.0, 8.0) | 3.0 (2.0, 8.0) | 0.197 |
Type of interferon (N=45) | | | 0.721 |
PEG α-2a | 4 (30.8%) | 8 (25.0%) | |
PEG α-2b | 9 (69.2%) | 24 (75.0%) | |
HBsAg prior to IFN (N=44) | | | 0.153 |
>1,000, ≤1,500 | 2 (15.4%) | 2 (6.3%) | |
>200, ≤1,000 | 2 (15.4%) | 15 (46.9%) | |
≤200 | 9 (69.2%) | 15 (46.9%) | |
Maximal ALT level on treatment, U/L, mean (SD) | 108.5 (72.2) | 108.4 (65.7) | 0.998 |
Minimal WBC ×109/L, mean (SD) | 2.9 (0.8) | 3.0 (0.7) | 0.813 |
Minimal PLT ×109/L, mean (SD) | 98.0 (37.0) | 101.1 (33.6) | 0.792 |
Time of HBsAg clearance | | | 0.751 |
≤24 weeks | 6 (46.2%) | 13 (40.6%) | |
>24 weeks | 7 (53.8%) | 19 (59.4%) | |
IFN duration, weeks, median (IQR) | 37 (30, 54) | 44 (33, 48) | 0.870 |
Relationships between virological markers, pathological changes, and cccDNA clearance
To understand whether virological markers and pathological changes were associated with cccDNA clearance, we compared those parameters in the cccDNA-positive and cccDNA-negative groups. The results showed that the patients in the cccDNA-negative group had more advanced liver fibrosis (p<0.05). However, there were no differences in virological markers or other pathological changes, including serum HBcrAg, hepatitis B surface antibody (HBsAb) levels, liver inflammation, and HBsAg/hepatitis B core antigen (HBcAg) immunohistochemistry (p>0.05; Supplementary Table 1).
Association between viral markers and HBsAg relapse
The 48 patients with clearance of HBsAg were followed for more than 24 weeks after discontinuation of the drug (Fig. 1). A total of seven patients showed HBsAg seroreversion (i.e. >0.05 IU/mL). qHBsAg increased to 0.05–1 IU/mL in one case and to 1–20 IU/mL in four cases, with HBV DNA remaining negative (Fig. 2). qHBsAg increased to 1.99 IU/mL in one case with 0.0216 copies/cell of intrahepatic cccDNA, 12 weeks after discontinuation of the drug, and HBV DNA remained undetectable. However, qHBsAg increased to 130 IU/mL, and HBV DNA rebounded to 1.09e8 IU/mL with normal ALT at week 24 after discontinuation of the drug. Another patient with 0.0487 copies of intrahepatic cccDNA/cell had a relapse of HBsAg at week 12 after drug discontinuation, with a qHBsAg of 0.419 IU/mL, while HBV DNA remained undetectable. qHBsAg increased to 18.96 IU/mL at week 24, and HBV DNA was negative. However, at week 48, qHBsAg reached 37.36 IU/mL, HBV DNA rebounded to 347 IU/mL, and ALT was normal. Two patients with virological relapse were retreated with NAs and became undetectable for HBV DNA at 24 weeks.
Levels of intrahepatic cccDNA, serum HBcrAg, HBsAb, and HBsAg
Immunohistochemistry results in patients with and without HBsAg relapse are shown in Supplementary Table 2. Seven of 35 patients (20%) with positive intrahepatic cccDNA experienced HBsAg relapse, while no relapse occurred in patients with negative intrahepatic cccDNA. No relapses occurred in those with HBsAb above 100 IU/mL, otherwise, the relapse rate was 24.1% (7/29, p<0.05). There was no statistical effect of HBcrAg and HBsAg immunohistochemistry on relapse.
Characteristics of patients with deep sequencing
We performed deep whole genome sequencing (WGS) to search for HBV integration sites across the human genome. A total of 10 patients were included, including three CHB patients with an FC, four HBeAg-positive, and three HBeAg-negative treatment-naïve CHB patients (Supplementary Tables 3 and 4). Patient F808 received 38 months of entecavir (ETV) monotherapy, switching to a combination of Peg-IFN and ETV therapy for 12 weeks, followed by 36 weeks of Peg-IFN monotherapy. HBsAg disappeared at week 12 of Peg-IFN monotherapy. Patient R228 was treated with 13 months of tenofovir disoproxil fumarate (TDF) and was switched to a combination of Peg-IFN and TDF for 72 weeks. HBsAg disappeared at week 44 of combination therapy. Patient F807 experienced acute on chronic liver failure and was treated with ETV for 22 months. HBsAg disappeared in the eighth month. Seventy percent of the patients had at least one family member with HBV-associated disease. Based on WGS with HBV reads, we examined the HBV genotypes of all patients. Excluding F807, which lacked HBV reads, we found five cases with genotype B and four cases with genotype C, consistent with the predominant genotypes in South China.12
Integration of viruses into the human genome
There were 439 viral integration sites in 10 liver samples (Supplementary Table 5). The numbers of integration sites in the HBeAg-positive CHB group, the HBeAg-negative CHB group, and the FC group were 132, 304, and 3, respectively. There was a different conclusion about whether HBV integration was distributed throughout the genome. To answer that question, we compared the observed chromosome distribution with the expected group, which was a random selection of 444 sites from the entire human genome after removing assembly gap sites from the genome. Integration sites were preferentially located on chromosomes four and 11, and fewer integration sites were located on chromosome X than expected (Fig. 3). Overall, 45.79% of the integration sites were located in protein-coding genes. The promoters were defined as 5 kb upstream of the transcription start sites. HBV integration was not preferentially located in the promoters, exons, introns, and 3′UTR, affecting gene expression.
Breakpoints and integrative links in the HBV genome
HBV reads were detected in liver samples from one (33.3%) FC patients and seven (100%) CHB patients. The number of HBV reads per patient was much lower in the FC group than in the HBeAg-positive CHB group and the HBeAg-negative CHB group (Fig. 4). For HBV integration, 28.0% of HBV breakpoints were mapped within the 1,500 nt to 1,900 nt range of the HBV genome, which contains DR1 and DR2, which play a crucial role in integrating HBV DNA into the host cell genome.13 The viral genome had a clear preference for breakpoints near the 3′ end of HBx and the 5′ end of the Precore/Core gene, which may lead to HBV-human fusion reads. However, all three breakpoints were located in the S/P region in the FC group. It is thought that double-strand break repair plays a role in HBV integration, which assumes that linearized double viral DNA ends are joined to the human genome via non-homologous end joining and microhomology-mediated end joining using microhomologous sequences during the alignment of the break ends prior to joining.14 Based on the hypothesis, we searched for microhomologous sequences between human sequences and inserted HBV DNA at integration sites in the chimeric reads, and compared them with the expected group. We found that 39.64% of HBV fragments were linked to the human genome by microhomologous sequences greater than or equal to 2 bp.
HBV integration in the driver gene and functional annotation of the HBV integration-targeted gene
Four integration sites in the HBeAg-positive CHB group, 23 in the HBeAg-negative CHB group, and one in the FC group were located in the liver cancer driver gene. The liver cancer driver gene was identified in the DriverDBv2 database.11 In one patient who achieved an FC, HBV DNA remained integrated in the driver gene, which may indicate that HBV integration might be a potential risk for liver carcinogenesis despite the patient achieving the ultimate goal of HBV infection.15 KEGG analysis of integrated genes showed enrichment in cancer-related pathways, such as small-cell lung cancer and choline metabolism in cancer (we selected only the top 30 based on p-values, Supplementary Fig. 1). The cancer pathways imply that influencing the expression or function of integrated genes associated with oncogenic pathways may be an oncogenic factor for HBV integration.
Discussion
Our study showed that cccDNA was undetectable in 27% (13 of 48) of CHB patients after achieving an FC. For patients with positive intrahepatic cccDNA, the levels were significantly reduced compared with treatment-naïve CHB patients. Many studies have shown that NAs and IFNs significantly reduce intrahepatic cccDNA levels.16,17 Our data confirm that result, and more importantly, all patients who achieved cccDNA clearance did not experience virological relapse, which implies that cccDNA clearance can be achieved with current treatment and is essential to predict whether CHB patients will relapse after drug discontinuation in short-term follow-up. Although HBsAg seroreversion was achieved in seven patients, only two experienced a virological relapse. All HBsAg relapses occurred in the cccDNA-positive group, which is not surprising because the original template is required for transcription and replication. Our study shows that CHB patients have a very low risk of relapse after FC. We hypothesized that might be because the cccDNA pool became smaller and inactive (Fig. 5). Since some patients in our study were followed up for only 24 weeks, their dynamics still need to be observed over time.
To better illustrate the levels of serum or intrahepatic viral markers, the methodologies should be appropriate and accurate. Currently, there are various methods to detect serum pgRNA determined by targets used by the assays. Shi Liu et al.18 reported that methodologies targeting both the 5′and 3′ends of the HBV pgRNA, the S region, and the X-region had similar performance in predicting clinical relapse and HBeAg loss. Targeting polyA tail of HBV RNA was not satisfactory. In this study, the assay targeted the X-region and performed well in both treatment-naïve and experienced CHB patients.7,8 However, there is no acknowledged standard to detect HBV pgRNA. Our method had a detection limit of 25 copies/mL, and methodologies having lower detection limits are required. Likewise, multiple strategies to detect intrahepatic cccDNA have been reported. Among them, droplet digital PCR (ddPCR) had higher sensitivity and specificity than qPCR.19 However, its high cost, complicated procedure, and unstable upper limit, its availability hindered its application. The method we used in this study was relative quantification using primers that crossed the gap of relaxed circular DNA (rcDNA). We also added PSAD to degrade rcDNA to exclude the amplification of rcDNA. Although it would not completely exclude the possibility of interference by rcDNA, we still aimed to minimize its influence and present a relatively reliable and cost-effective result.
Several serum viral parameters have been reported to be associated with cccDNA levels.20,21 We also analyzed the relationship between cccDNA and serum viral markers. Our results showed that none of the serum viral markers were associated with cccDNA levels. Most published results were obtained from treatment-naïve CHB patients or those experienced patients with high levels of viral markers and intrahepatic cccDNA who did not obtain an FC. Therefore, we believe that this contradiction is due to the low levels of viral markers and intrahepatic cccDNA as well as the small sample size.
Understanding the mechanisms of cccDNA clearance and finding the parameters that show cccDNA clearance is crucial for hepatologists. We analyzed a variety of parameters, including family history, type of treatment, duration of treatment, serum viral markers, and histological parameters. Unfortunately, the results showed that none of the above parameters was associated with cccDNA clearance. Therefore, more effort should be made to expand the sample size or explore other factors, such as immune indices.
To further understand HBV integration after an FC and whether a sterilizing cure can be achieved, we performed deep sequencing of the liver genome. Our data show that HBV integration occurred more frequently in HBeAg-negative CHB-infected patients than in other infected patients, consistent with previous reports.22 As HBV-associated liver disease progresses, the percentage of HBV double stranded linear DNA in the serum is presumed to be a precursor to HBV integration, which gradually increases.23 In addition, it has been proposed that the frequency of integrated DNA increases with HBV infection duration.24,25 Although integrated HBV DNA has lost its ability to produce viral progeny, it can still produce HBsAg, particularly in patients with negative HBeAg.26 Although Summers and Mason25 reported a marked decrease in viral cccDNA, but the integrated viral DNA in a woodchuck hepatitis model did not show a significant decrease during NAs treatment.25 In contrast, Ning et al.27 found that HBV DNA integration was significantly reduced after NAs treatment, which may explain the decreased risk of HCC after NAs treatment.27 Although HBsAg was not detectable in patient R228, three integration sites were still present in the liver after therapeutic stress. It is well documented that integrated DNA is stable in vitro after at least 17 cell divisions, unlike cccDNA, which can be diluted by cell division.28 The most noteworthy finding was that the number of HBV integrations was significantly lower in patients with FCs than in treatment-naïve CHB patients.
Unlike a previous study by Yang et al.29 showing that HBV integration was more common on chromosomes 1 and 2 in non-tumor tissue with HBV infection, our data show that integration sites were preferentially located on chromosomes four and 11. As the depth of sequencing increases, more integrations are detected.30 Alu repeats are usually separated by >3,000 bp, leading to scarce detection by Alu PCR, especially in low clonal samples that can produce more Alu-Alu products.31,32 However, deeper WGS with genome-wide coverage provides an objective and comprehensive view of HBV integration. Discrepancies associated with different experimental approaches to detect HBV integration may lead to different conclusions regarding the distribution of integration sites throughout the genome.
Chimeric reads showed breakpoints for HBV integration enriched in the HBV genome between 1,500 bp and 1,900 bp, consistent with previous studies.33–35 The breakpoints of functionally cured were within the S/P open reading frame, different from the type in CHB patients, which revealed a preferential pattern within the 3′ end of the X gene, and the C-terminal truncated X protein (Ct-HBx) was shown to initiate hepatocarcinogenesis.36 In addition, breakpoints in S region would lead to the failure to express HBsAg by integrated HBV fragments. Whether patients with an FC share the same patterns of integration needs to be validated. We are planning to enroll more FC and NFC patients after antiviral treatment to better understand their differences.
A female patient who survived acute on chronic liver failure was not found to have any cccDNA or HBV in the liver after entecavir treatment, implying a potential sterilizing cure. That may indicate a different trend in those patients who lost HBsAg through different immune statuses at the start of treatment had a distinguishable outcome of HBV DNA integration. Patient R228 achieved an FC and had fewer HBV integration sites after combined therapy with Peg-IFN and tenofovir. However, noteworthy HBV integration occurred at ATF7IP, a cancer driver gene, which could potentially contribute to the development of HCC and needs to be validated in future studies. Functional annotation analysis of host genes involved in HBV integration sites showed that some functional terms belong to cancer-related pathways. Therefore, the oncogenicity of HBV integration may be determined by targeting hepatocellular carcinoma-associated host genes.
HBV infection can exist for decades, during which time DNA damage is enhanced by liver inflammation and hepatocyte regeneration, which may provide more available DNA ends in host genomic DNA and promote HBV integration. Thus, there may be a trend toward increased HBV integration during long-term HBV infection. Our data show that patients had more integration sites after HBeAg seroconversion. It is generally accepted that liver inflammation and fibrosis may improve to some extent when patients achieve an FC; however, some integration remains in hepatocytes, and some of these integration sites play a potential role in hepatocellular carcinoma. This will require long-term cohort studies to observe whether HBV integration is progressively reduced over the course of treatment.
The study has some limitations. First, the number of functionally cured patients was not large enough for HBV integration analysis and may have produced selection bias. The data need to be validated in a larger population. Second, although deep sequences were used, some HBV integrations may have been missed across the human genome. Efforts are underway to combine deep WGS and RNA sequences to mitigate these deficiencies.