Copyright
©The Author(s) 2016. Published by Baishideng Publishing Group Inc. All rights reserved.
Genetic variation of hepatitis B virus and its significance for pathogenesis
Zhen-Hua Zhang, Chun-Chen Wu, Xin-Wen Chen, Xu Li, Jun Li, Meng-Ji Lu
Zhen-Hua Zhang, Xu Li, Department of Infectious Diseases, the First Affiliated Hospital, Anhui Medical University, Hefei 230022, Anhui Province, China
Zhen-Hua Zhang, Jun Li, School of Pharmacy, Anhui Medical University, Hefei 230022, Anhui Province, China
Chun-Chen Wu, Xin-Wen Chen, State Key Lab of Virology, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan 430071, Hubei Province, China
Meng-Ji Lu, Institute of Virology, University Hospital of Essen, University of Duisburg-Essen, 45122 Essen, Germany
ORCID number: $[AuthorORCIDs]
Author contributions: Zhang ZH contributed to analysis and interpretation of data, and drafting the article; Wu CC contributed to drafting and revising the article for important intellectual content; Chen XW, Li X and Li J contributed to revising the article for important intellectual content; Lu MJ contributed to conception and design, analysis and interpretation of data, drafting and revising the article for important intellectual content.
Conflict-of-interest statement: The authors have declared that no potential conflict of interest exists.
Open-Access: This article is an open-access article which was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Correspondence to: Meng-Ji Lu, PhD, Institute of Virology, University Hospital of Essen, University of Duisburg-Essen, Hufelandstrasse 55, 45122 Essen, Germany. mengji.lu@uni-due.de
Telephone: +49-201-7233530 Fax: +49-201-7235929
Received: April 27, 2015
Peer-review started: May 4, 2015
First decision: August 26, 2015
Revised: September 25, 2015
Accepted: November 13, 2015
Article in press: November 13, 2015
Published online: January 7, 2016
INTRODUCTION
Hepatitis B virus (HBV) is the prototype member of a family of viruses called hepadnaviruses. It has a relaxed partially double-stranded circular DNA genome of approximately 3200 bases with four overlapping open reading frames (ORFs): pre-S/S (surface proteins), pre-C/C (pre-core/core), X (transcriptional co-activator) and P (DNA polymerase). The pre-S/S ORF is contained completely within the P ORF, but is translated in a different reading frame. ORFs C and X overlap the ORF P by 1/4 and 1/3 of their respective sequence lengths[1]. The preS/S ORF encodes three different, structurally related envelope proteins, which are synthesized from alternative initiation codons and termed Large (L), Middle (M) and Small (S) protein, respectively. The S protein [hepatitis B surface antigen (HBsAg)] consists of 226 amino acids (aa), and the M protein has an extra N-terminal extension of 55 aa, whereas the L protein has a further N-terminal extension of 108 or 119 aa depending on the genotype. The preC/C ORF codes for two distinct products derived from in-frame alternative initiation sites on the same transcript: the core protein forming the protein shell of the nucleocapsid [hepatitis B core antigen (HBcAg)] and the precore protein which is targeted into the cell’s secretary pathway, processed at both ends and eventually found in the serum of infected individuals [hepatitis B e antigen (HBeAg)]. The X ORF encodes the small regulatory X protein, which is essential for viral replication, and plays roles in modulating host and viral gene expression. The P ORF encodes protein P, the viral DNA polymerase[2].
HBV relies on protein P, which is also a specialized reverse transcriptase (RT), to replicate its genomic DNA via a RNA intermediate[3]. Protein P consists of four domains: a terminal protein that is covalently linked to the DNA primer during negative-strand DNA synthesis, a spacer domain that is tolerant to mutations, the RT domain, and the ribonuclease H (RNase H) domain. The RT and RNase H domains have sequences highly conserved among proteins with similar enzymatic functions such as the HIV RT[4,5]. Similar to the HIV RT, HBV RT lacks proofreading activity[6,7]. As a result, HBV exhibits an estimated mutation rate of 1.4 × 10-5-3.2 × 10-5 nucleotides per site per year, which is more than 10-fold higher than other DNA viruses[8-10]. The high error rate of HBV RT causes frequent nucleotide substitutions during viral replication, resulting in genetic diversity in the form of genotypes, sub-genotypes, quasispecies and a large number of mutations in different regions of the HBV genome. With a spontaneous error rate of 10-5 substitution/base/cycle, viral mutants are generated every day in a mixture of viral quasispecies within the same patient. These mutants usually confer disadvantages to the replication, assembly, secretion or infectivity of the virus; however, in the context of pressure due to antiviral immune response or therapy, the mutants can be selected and become the dominant species.
The natural history of HBV infection can vary dramatically depending on both host and viral factors. Most people do not experience any symptoms during the acute infection phase. However, some people have acute illness with symptoms that last several weeks. A small subset of people with acute hepatitis can develop acute liver failure which can lead to death. Acute HBV infection is characterized by the presence of HBsAg and immunoglobulin M antibody to the core antigen, HBcAg. During the initial phase of infection, patients are also seropositive for HBeAg. HBeAg is usually a marker of high levels of virus replication. The presence of HBeAg indicates that the blood and body fluids of the infected individual are highly contagious. In adults, about 5% of otherwise healthy persons who are infected with HBV as adults will develop chronic infection. Chronic infection is characterized by the persistence of HBsAg for at least 6 mo (with or without concurrent HBeAg). Persistence of HBsAg is the principal marker of risk for developing chronic liver disease and HCC. The 20%-30% of adults who are chronically infected will develop cirrhosis and/or hepatocellular carcinoma (HCC). Some people can develop occult HBV infection. Occult HBV infection is defined as the presence of HBV DNA in the liver tissue of HBsAg-negative individuals[11,12].
Abundant evidence has shown that the genetic diversity of HBV plays critical roles in modulating the pathogenesis in HBV infection (Figure 1). HBV genotypes, sub-genotypes and mutations in certain regions of the HBV genome have been found to influence the HBeAg seroconversion rates, HBcAg seroconversion, viremia levels, immune escape, emergence of mutants, pathogenesis of liver disease, response and resistance to antiviral therapy, and vaccination against the virus (Figure 2).
Figure 1 Interventions, mutations and clinical outcomes in the natural history of hepatitis B virus infection.
The natural history of hepatitis B virus (HBV) infection is typically classified into 4 phases: immune tolerant, immune clearance, inactive and recovery phase. Patients who receive HBV vaccine and/or hepatitis B immunoglobulin (HBIg) injection may develop chronic HBV infection with PreS and/or S gene mutations, this phenomenon is called occult HBV infection (OBI); during the immune clearance phase, long-term use of NAs can select for mutations in the reverse transcriptase (RT) and S regions; HBV reactivation may occur during the inactive phase in patients with hepatitis B surface antigens (HBsAg) and/or anti-HBc positive after chemotherapy or immunosuppression. HBV isolated from these patients may harbor mutations in the PreS, S, basal core promoter (BCP) or Pre-C regions. The causal relationship between these mutations and chemotherapy or immunosuppression is unclear; with increasing age, some patients can develop BCP, Pre-C or X region mutations or HBV DNA integration leading to hepatitis B e antigen (HBeAg) negativity or tumor.
Figure 2 Hepatitis B virus genome and the major gene mutation types in hepatitis B virus open reading frames.
The hepatitis B virus (HBV) genome is a 3.2 kb double-stranded DNA molecule that is organized into four open reading frames: the polymerase, the envelope, the precore and X. The deletion mutations in the PreS gene region and/or some point mutations in the major hydrophilic region (MHR) of S gene can lead to immune escape and occult HBV infection; mutations in reverse transcriptase (RT) can lead to drug resistance after long-term use of nucleotide analogues and this drug-resistant HBV typically has an altered viral envelope of hepatitis B surface antigen because of the overlap between the polymerase and envelope, for example, A181T/V mutations in the RT region can cause W172* (stop codon mutation), W172L and L173F mutations in the S region. M204V/I mutations in the RT region can result in I195M, W196* (stop codon mutation), W196S and W196L mutations in the S region; A1762T and G1764A mutations in the base core promotor (BCP) or G1896A mutation in PreC can increased viral replication, abolish or decrease HBeAg expression. Some mutations in the CTL epitope of HBV core gene can cause T cell immune escape; some point mutations or truncated mutants in the HBV X gene can cause tumorigenesis or other end-stage liver disease.
GENOTYPES AND SUB-GENOTYPES
HBV was formerly classified into nine serological subtypes according to the antigenic determinants[13]. In 1988, Okamoto et al[14] compared the full nucleotide sequence of 18 HBV strains and classified them into four groups, genotype A to D, by a divergence of more than 8% between genotypes. Since then, at least 10 genotypes (A to J) have been identified. By a divergence of 4%, HBV genotypes can be further classified into sub-genotypes. This approach has resulted in HBV genotype A (A1-A7), genotype B (B1-B9), genotype C (C1-16), genotype D (D1-D8), and genotype F (F1-F4)[15-17]. Similar to HBV serological subtypes, HBV genotypes and sub-genotypes also have distinct geographical distributions. HBV sub-genotype B1 dominates in Japan, B2 dominates in China and Vietnam, B3 is confined to Indonesia, and B4 is confined to Vietnam[18]. B7, B8, and B9 have been found in an island in Southeast Asia[19]. HBV/C1 (Cs) is found mainly in Southeast Asia, whereas C2 (Ce) is predominant in East Asia[20]. HBV/C3 was confined to Oceania, while C4 (Caus) was exclusively found in Australia and regarded as the most divergent sub-genotype within HBV/C[21]. Sub-genotypes C5 and C7 were found in Philippines, while C6 and C8 to C16 were isolated from Indonesia[22-27]. This pattern of defined geographical distribution was less evident for D1-D4, where the sub-genotypes were widely spread in Europe, Africa, and Asia[18]. Moreover, as was pointed out in a recent review article, immigration has become an important confounding factor of global HBV distribution and has been substantially changing the geographic pattern of HBV sub-genotypes[28]. Notably, the intergenotype recombination has also been described previously, which plays an important role in the evolutionary history of HBV. Recombination is favored in particular geographical regions[29,30]. For instance, B/C recombinants are prevalent in Southeast Asia and East Asia[31]. Other intergenotype recombinants such as A/D, A/E, C/D and G/C recombinants have also been observed in different geographical regions[29-31]. Therefore, it is logical to predict that distinguishing exotic (sub)genotypes from native ones will become more and more important for improved prophylaxis, diagnosis and treatment.
Not only are HBV genotypes and sub-genotypes related to geographical distribution, mounting evidence has shown that they are also associated with the pathogenesis and outcome of HBV infection. An early study from Europe found that genotype A infection was associated with a significantly higher rate of sustained biochemical remission, HBV DNA clearance, and HBsAg clearance in patients with chronic HBV infection than genotype D infection[32]. Similarly, a study from China has also shown that genotype A and B patients have a higher rate of HBsAg sero-clearance than genotype C and D patients[33]. A study from India has also suggested that genotype D was associated with more severe liver diseases and HCC in younger patients than genotype A[34]. Furthermore, a study from Alaska, where five of the ten HBV genotypes are present, showed that the rate of complications, including HCC, for those infected with genotype A appeared to be less than that found in individuals infected with genotype D, C, or F1[35]. A study by the same group also revealed that HBeAg seroconversion occurred about 3 decades later in women infected with genotype C than those infected with genotypes A2, B6, D, and F1[36]. Multiple cross-sectional studies have suggested that patients with genotype C experience HBeAg seroconversion at older ages and are more likely to be HBeAg positive at any given age than HBV genotype B[37-39]. HBV genotype C has also been associated with increased risk of liver inflammation, flares of hepatitis, liver fibrosis, and cirrhosis[37,38,40]. Prospective studies compared the outcome in those infected with genotypes B and C further and have also shown that HBeAg seroconversion occurred at a significantly younger age for those infected with genotype B than genotype C[36,41-43], and that increased risk of fibrosis was associated with genotype C[42,44]. In addition, infection with genotype C has been identified as an independent risk factor for the development of HCC[45]. Taken together, it can be concluded that individuals infected with HBV genotype C seroconvert from HBeAg later in life and have an increased risk of liver inflammation, liver fibrosis, and HCC. In an acute liver failure study in the United States, genotype D was found to be an independent risk factor for fulminant hepatitis[46]. Genotype G is the most uncommon of all HBV genotypes. This genotype is almost exclusively found in persons co-infected with another HBV genotype, most commonly genotype A, with the only exception being a single report of a transfusion-associated case[47,48]. HBV genotypes F and H are the ‘‘New World’’ genotypes found primarily in indigenous populations of North and South America. In a nested case-control study of a cohort of 1176 Alaskan Natives with chronic HBV infection followed up for 20 years, a significantly higher proportion of persons infected with either genotype F1 or genotype C2 developed HCC than those infected with genotype A2, B6, or D[35]. Genotype H is most closely related to genotype F and likely evolved from this genotype.
The current definitive method for HBV genotyping is PCR amplification and sequencing of the entire genome followed by phylogenetic analysis[14,49-51]. Studies on HBV concerning different genotypes frequently face problems regarding representativeness of reference strains. Taking advantage of large number of sequences deposited in the GenBank database, we have developed a strategy to establish reference sequences for different genotypes. Briefly, sequences were clustered and genotyped using phylogenetic analyses first, and sequences belonging to the same genotype were then aligned with each other and the most common nucleotides in each position were chosen to form the reference sequence for this particular genotype[52]. Although HBV consensus sequences can be generated by sequence alignment, they may not exist in nature or can not usually be isolated from patient samples. To solve this problem, we have adopted a chemical synthesis strategy to generate the consensus HBV genome for certain genotypes. In our recent study, genotype B consensus sequence was established by comparing 42 full-length HBV genotype B sequences and the genome was generated by chemical synthesis. A plasmid carrying a 1.3 × full-length chemically synthesized HBV consensus genome was constructed. This consensus genome was fully replication competent when transfected into hepatoma cells. After this plasmid was hydrodynamically injected into BALB/c mice, HBsAg, anti-HBs, HBeAg, anti-HBe, anti-HBc and HBV genome replication were all detected. Thus, this approach represents a novel strategy to design and create HBV genomes for future studies[53].
PRE-S/S MUTATIONS
The preS/S ORF encodes proteins L, M and S. The Pre-S1 domain is unique for the L protein. The Pre-S2 domain is the shared sequence with the M protein and the S domain is seen in all three proteins. The L and S proteins are essential for virion formation and the M protein can increase the efficiency of virion secretion[8,54,55]. The dominant epitopes of HBsAg, which are the targets of neutralizing B cell responses, are located in the “α” determinant (aa 124-147) within the HBsAg major hydrophilic region (MHR), which covers aa 99-169[56-62]. Mutations in the MHR, particularly the ‘‘a’’ determinant, are known to be associated with immune escape due to conformational changes in the epitope resulting in reduced binding affinity between the HBsAg and the antibody to HBsAg. Carman et al[56] first reported the G145R mutation in the ‘‘a’’ determinant of HBsAg in a child who became infected with HBV despite active and passive immunoprophylaxis. Subsequently, several other mutations within and outside the “α” determinant were reported to have reduced binding affinity to anti-HBs[57,63,64]. To date, escape mutations in the HBsAg MHR have been comprehensively analyzed[65-67].
In 2003, we noted that a renal transplant recipient was persistently positive for HBV despite preexisting anti-hepatitis B surface antibodies (anti-HBs). We cloned the HBsAg gene that was found to be mutated from the patient and compared the antigenicity and immunogenicity of the mutant HBsAg with wild-type HBsAg (wtHBsAg) in mice using a genetic vaccination approach. Our results showed that both B cell and T cell immune responses were impaired by these mutations, suggesting that mutations within HBsAg may enable HBV to escape immunological control[68]. Subsequently, in an agenetic vaccination animal model, we demonstrated that serum samples from mice immunized with aa substitution G145R could recognize plasma-derived mtHBsAg, suggesting HBsAg with aa substitutions may be immunogenic, but with changed specificity[69]. We further investigated the impact of some naturally occurring HBsAg mutations around position 120 to 123 on the antigenicity and detection of HBsAg. Strikingly, aa substitution K122I abolished the reactivity of HBsAg in all immunoassays tested. mtHBsAg G145R was clearly detected with four different enzyme-linked immunosorbent assays that were based on monoclonal anti-HBs antibodies (MAbs) with high affinity. Positive immunofluorescence staining of mtHBsAg K122I was achieved only by polyclonal anti-HBs, while all tests using MAbs failed. mtHBsAg T123N showed a low reactivity in immunoassays and appeared to be secretion deficient. The aa substitution P120T reduced the binding of anti-HBs, but did not completely prevent the detection of mtHBsAg by anti-HBs MAbs. Thus, our data showed that the presence of aa substitutions within the region of 120 to 123 is essential for HBsAg antigenicity and strongly associated with impaired detection in immunoassays[70]. In another study, we systematically investigated a variety of aa substitutions that have been identified around or within the “α” determinant of HBsAg, such as K122I and G145R, on HBsAg expression, secretion and antibody binding. The results showed that the hydrophobicity, the presence of the phenyl group, and the charges in the side chain of the aa residues at position 145 reduced HBsAg secretion and impaired reactivity with anti-HBs antibodies. Only the substitution K122I at position 122 affected HBsAg secretion and recognition by anti-HBs antibodies. Genetic immunization in mice further demonstrated that the priming of anti-HBs antibody response was strongly impaired by the substitutions K122I, G145R, and others, such as G145I, G145W, and G145E. Moreover, when mice that had been pre-immunized with wtHBsAg or variant HBsAg (vtHBsAg) were challenged by hydrodynamic injection (HI) with a replication-competent HBV clone, HBsAg persisted in peripheral blood for at least 3 d after HI in mice pre-immunized with vtHBsAg, but was undetectable in mice pre-immunized with wtHBsAg, indicating that vtHBsAgs fail to induce proper immune responses for efficient HBsAg clearance. Therefore, the biochemical properties of aa residues at positions 122 and 145 of HBsAg have a major effect on antigenicity and immunogenicity. In addition, the presence of proper anti-HBs antibodies is essential for the neutralization and clearance of HBsAg during HBV infection[71].
Viral envelope N-glycosylation modification has been known to play important roles in the biogenesis, stability, antigenicity and infectivity in HIV, HCV and influenza virus[72,73]. Previously, a few studies indicated that a potential N-glycosylation site (Asn-X-Ser/Thr, where X is any aa except Pro) could be created by some mutations within the HBsAg MHR[62,74]. In a recent study, we systematically examined the effects of naturally occurring N-glycosylation-related aa substitutions K122I, T123N, A159G and K160N. T123N and K160N substitutions resulted in additional N-glycosylated forms of HBsAg, while the other mutations produced more heavily glycosylated HBsAg compared with the wild-type (wt). These results showed that vtHBsAg with K122I could not be recognized by HBsAg immunoassays, ELISA or immunofluorescence staining, vtHBsAg with T123N, A159G, K160N and A159G/K160N could be detected, but showed reduced antigenicity. DNA immunization in BALB/c mice revealed that wtHBsAg and vtHBsAg with T123N and K160N were able to induce antibodies to HBsAg (anti-HBs), whereas K122I and A159G greatly impaired the ability of HBsAg to trigger anti-HBs responses. The cellular immune response to the HBsAg aa 29-38 epitope was enhanced by the K160N substitution. Replication competent clones of HBV, T123N and A159G substitutions were shown to strongly reduce virion assembly. The aa substitution K160N appeared to compensate for the negative effect of A159G on virion production. Thus, our study revealed complex effects of aa substitutions on the biochemical properties of HBsAg, on antigenicity and immunogenicity, and on the replication of HBV[75]. Another recent study investigated the molecular and clinical characteristics of HBV immune escape mutants in a Chinese cohort of chronically infected patients. By investigating 216 patients with double positive HBsAg and anti-HBs and 182 HBV carriers without anti-HBs as a control group, the authors found that the frequency of N-glycosylation mutations in the patient group was much higher than that in the control group (47/216 vs 1/182). Using a chemiluminescent microparticle enzyme immunoassay, they showed that HBsAg mutants reacted weakly with anti-HBs compared with wtHBsAg. Their native gel analysis of secreted virion in supernatants of transfected Huh7 cells further revealed that mutants had better virion envelopment and secretion capacity than wt HBV[76]. Together these studies strongly suggested that N-glycosylation mutations on HBsAg greatly contribute to immune escape.
Numerous studies including ours have demonstrated that immune escape mutations were the underlying mechanisms for HBsAg and anti-HBs double positive status[60,77,78]. These mutations are also responsible for a large portion of failure in immunoprophylaxis. In Taiwan, the prevalence of surface gene mutants was found to be approximately 20% in HBsAg carrier children[79]. A study from China reported that the prevalence of aa substitutions among immunoprophylaxis failed children was 20%[80]. Another study reported that the nucleotide diversity rate was 11.4% in those children in Jiangsu Province[81]. The mutation rate in children who failed to be protected by perinatal prophylaxis in Singapore, Japan, and the United States has been reported to be 39.0%, 30.8% and 25.5%, respectively[64,82,83].
Variants within the MHR of HBsAg have also been associated with occult HBV infection (OBI). A study from China by Hou et al[84] found that in 46 cases of OBI, 32 aa substitutions were found between positions 100-160 within the MHR. In addition to the G145R, 11 positions inside and 5 positions outside the “α” determinant were involved. Combined mutations were also detected in some patients. Another two patients had insertion mutations immediately before the “α” determinant. Another study conducted in China in recent years identified another 8 escape mutations associated with OBI, in addition to the G145R, mainly located at positions 120, 126, 129, 130, 133, 134, 137, 140, 143 and 144[66]. In a recent cohort study conducted in South Korea, mutations in the “α” determinant were also found to be related to OBI[85]. Another recent study from China compared the characteristics of 61 patients with OBI to 153 HBsAg (+) carriers with low titers of serum HBsAg (HBsAg-L group) and 54 samples with high serum HBsAg (HBsAg-H group). MHR mutations were seen significantly more frequently in OBI cases (55.7%) compared to the HBsAg-L (34.0%) or the HBsAg-H groups (17.1%). 13 representative MHR mutations were observed in patients with OBI. Of which, 4 mutations strongly decreased the analytical sensitivity of 7 commercial HBsAg immunoassays and 10 mutations significantly impaired virion and/or S protein secretion in both Huh7 cells and mice[86].
Mutations in the pre-S region, especially deletions, have been associated with a lack of detectable HBsAg in the serum. Deletions in the pre-S region can result in reduced expression of HBV surface proteins and help viral persistence by eliminating HLA-restricted B-cell and T-cell epitopes. Pre-S1/pre-S2 mutations are also frequently detected in OBI[85,87,88]. In one study, a 183-bp deletion (nt 3019 to 3201) in the pre-S1 region was detected in occult HBV patients. The deletion covered the CCAAT element that is required for transcription factor binding. The association of deletions in the pre-S gene with a lack of secreted HBsAg was demonstrated using functional analysis by transfection into hepatocyte cell lines[89]. In a similar study, Xu et al[90] showed that a 129-bp in-frame deletion in the S promoter region was associated with reduced levels of M and S protein transcripts, resulting in a marked reduction in the expression of the two proteins. The roles of S promoter mutations and deletions in HBsAg production and secretion in OBI have also been reported in other studies[87,91]. In a recent study, a male patient from China with a chronic hepatitis B infection for 30 years was diagnosed with HB-LF. We amplified the HBV sequences from this patient and found that 2 major variants coexisted in the ratio of 1 to 4: the first variant harbored the A1762T/G1764A 257 double mutation in the basal core promoter (BCP) region and the G1896A mutation in the preC region; the second variant contained 2 deletions in the preS1 region (nt 2976-3102) and the preS2 promoter and preS2 ORF region (nt 3203-3215, nt 1-31), resulting in a stop codon mutation in the ORF of large HBsAg and the deletion of the start codon in the ORF of middle HBsAg, respectively. When we transfected replication competent plasmids harboring these variants into Huh7 cells, we observed phenotypes one would normally predict. However, when both constructs were co-transfected into Huh7 cells, new phenotypes arose. For example, coexistence of both variants increased HBV replication and led to the predominant nuclear localization of HBcAg. Moreover, mice mounted significantly stronger antibody and cytotoxic T lymphocyte (CTL) responses to HBsAg when both variants were co-applied in the HI mouse model. Thus, the coexistence of preS deletion mutants with other variants may significantly modulate specific host immune responses and may enhance immune-mediated liver damage under some circumstances, which represents a novel mechanism for the immunopathogenesis of HBV infection[92].
Numerous studies have linked preS, especially preS2 deletions to the occurrence of HCC[93-98]. At least three mechanisms have been suggested to be involved in the pathogenesis of preS deletion-associated HCC: Firstly, the preS1 and preS2 regions contain several epitopes for T and B cells, and play essential roles in the interaction with host immune responses. Therefore, preS deletion mutations may result in an inefficient immune response[99,100]. Secondly, pre-S1 and pre-S2 mutations may cause overproduction and accumulation of L protein in the endoplasmic reticulum (ER), resulting in significant ER stress that may induce DNA damage and genomic instability leading to hepatocarcinogenesis[101]. Thirdly, preS2 mutants have been shown to directly activate tumor promoting pathways such as the VEGF/AKT/mTOR pathway and the p27/retinoblastoma/Cdk2/cyclin A, D pathway, among others[102-106]. In fact, the pre-S mutants have been shown to be capable of inducing dysplasia in hepatocytes and the development of HCC in transgenic mice[107].
S region mutations have also been associated with acute exacerbation of liver diseases leading to fatal liver failure. In our study, a Chinese patient who suffered from acute liver failure after discontinuation of lamivudine treatment was described. The patient was treated with lamivudine for 4 mo and ceased treatment without consulting. After receiving lamivudine, the patient developed anti-HBs and became negative for hepatitis B surface antigens (HBsAg). However, the patient suffered from a severe exacerbation about two months after cessation of treatment and died due to acute liver failure. Sequencing of HBV isolates revealed mutations including G145R and stop codon mutations at the aa positions 74 and 199 in the HBsAg sequences in all clones. F134S and F134V substitutions within the a-determinant were also detected in some clones. Various aa substitutions were present outside the a-determinant. No wt clone was found among the six cloned sequences. Importantly, no lamivudine resistance-associated mutation was found in the RT region coding for the HBV polymerase protein. The data suggested that anti-HBs antibody which appeared during the lamivudine treatment might be the selective force for the emergence of HBV mutants, and HBV replication resumed after the cessation of lamivudine treatment in this patient might have triggered the process leading to liver failure[108].
In a recently published study, a correlation was revealed between some HBsAg-mutations and serum HBV-DNA levels in HBV chronically-infected drug-naive patients. In this study, 187 patients were stratified into the following ranges of serum HBV-DNA: 12-2000 IU/mL, 2000-100000 IU/mL, and > 100000 IU/mL. The S gene of HBV was isolated and sequenced. Mutant and wt HBV genomes were expressed in Huh7 cells and HBsAg production was determined in cell-supernatants 3 d post-transfection. The results showed that HBsAg-mutations M197T, S204N, Y206C/H and F220L were significantly correlated with serum HBV-DNA < 2000 IU/mL (posterior-probability > 90%, P < 0.05), and the presence of Y206C/H and/or F220L was also associated with lower median (IQR) HBsAg-levels and lower median (IQR) transaminases [for HBsAg: 250 (115-840) IU/mL for Y206C/H and/or F220L vs 4300 (640-11838) IU/mL for wt, P = 0.023; for ALT: 28 (21-40) IU/mL vs 53 (34-90) IU/mL, P < 0.001). These mutations were localized in the HBsAg C-terminus, known to be involved in virion and/or HBsAg secretion. Thus, specific HBsAg-mutations in the HBsAg C-terminus correlated with low-serum HBV-DNA and HBsAg-levels. These mutations may represent an important mechanism underlying low HBV replication and the inactive-carrier state[109].
P REGION MUTATIONS
The P ORF encodes the viral DNA polymerase protein P, which is a specialized RT. Except for interferon-α and pegylated interferon-α, all five drugs approved for HBV treatment are nucleos(t)ide analogues (NAs) that target the DNA polymerase activity of this protein. Treatment with these NAs is generally efficient and well tolerated. However, resistance to some of these agents is a major issue affecting long-term therapy. Lamivudine resistance occurs frequently and is observed in up to 80% of patients treated for 5 years[110-112]. Among adefovir-treated patients, the cumulative incidence of resistance over 5 years has been reported to be 29% in HBeAg-negative patients and 42% in HBeAg-positive patients[113,114]. With 25% of HBeAg-positive and 11% of HBeAg-negative patients experiencing virological breakthrough due to resistance after 2 years of treatment, telbivudine resistance is relatively slower to emerge[115]. Resistance to entecavir has been shown to remain low (1.2%) after 6 years of therapy in NA-naıve patients[116]. No tenofovir resistance has been observed after 4 years of treatment in the registration studies[117].
Antiviral drug resistance is associated with the selection of adaptive mutations which reduces the sensitivity of the mutants to the inhibitory effects of a drug. The barrier to resistance can be defined as the difficulty with which the resistance mutants are selected and the barrier to resistance increases as the number of specific mutations required for drug resistance increase[118]. The main aa change associated with lamivudine and telbivudine-resistance is rtM204V/I, which is located in the YMDD motif of the RT. In the context of lamivudine resistance, the compensatory mutations rtL180M and rtV173L frequently occur in domain B and C of the viral polymerase[4]. The rtA181V/T, rtL80I/V and rtM204Q mutants identified recently showed that primary lamivudine-resistance mutations can also occur outside the YMDD motif[119,120]. Lamivudine-resistance mutations have been reported to result in the selection of high replicative HBV variants leading to exacerbation of disease during chronic HBV infections. In an early study, full-length HBV genomes were analyzed from four chronic hepatitis B patients who developed resistance to lamivudine [-2’-deoxy-3’-thiacytidine, LMV] accompanied by acute exacerbation of disease. Paired full-length HBV isolates were cloned from the sera of patients prior to LMV treatment and after drug resistant breakthrough. Compared to the isolates before treatment, isolates from all four patients during exacerbation showed a marked increase in replicative competence in a cell transfection study. Viral genome amplification and sequencing showed that while all isolates shared mutations at the YMDD motif, the isolates from the one patient who recovered from the exacerbation showed a lower number of mutations, and in particular, lacked BCP mutations at 1762/1764. In contrast, BCP mutations were found in isolates from the other three patients. Thus, in patients with acute exacerbation, high replicative strains were selected from the total HBV quasispecies during treatment, and among these strains, those with core promoter mutations at 1762/1764 were most likely to be associated with severe clinical exacerbations[121]. Adefovir resistance is characterized by rtN236T and/or rt181V/T selected in the D and B domain of the polymerase, respectively[122-124]. The rtI233V mutation was found to confer resistance to adefovir[124]. Entecavir is a NA with a high barrier of resistance as multiple mutations are required to confer a high level of resistance to this drug. Entecavir resistance tends to emerge in a stepwise manner with the rtI169T, rtT184S, rtS202I, and rtM250I/V changes occurring sequentially in the virus already carrying lamivudine resistance mutations[125,126].
No tenofovir resistance has been described after 3 and 4 years of therapy, but rt181T/V and rtN236T, the main adefovir-associated resistance mutations, do reduce its sensitivity clinically and virologically[117,122,127]. Tenofovir (TDF) has been a first-line antiretroviral drug for human immunodeficiency virus (HIV) since 2001 and it is well known that this drug induces resistance mutations leading to treatment failure. Taking advantages of the high homology between the HIV and HBV RTs and the determination of the crystal structure of HIV-1 RT complexed with TDF, we built the homology model for HBV-RT[5,128]. Based on the modeled HBV-RT structure, we designed some mutants and tested their effects on TDF susceptibility in vitro in Huh7 cells and in vivo in a mouse model. Our results showed that HBV mutants with rtP177G and rtF249A reduced susceptibility to tenofovir in vitro with a resistance index of 2.53 and 12.16, respectively, and the testing result based on the HI mouse model revealed the antiviral effect of TDF against wt and mutated HBV genomes, and confirmed the reduced susceptibility of mutant HBV to TDF[129]. In a recent population-based cross-sectional study from China, serum samples from 179 patients who developed virological breakthrough while receiving treatment with NAs were obtained and analyzed for NA-resistant mutations in the RT region[130]. NA-resistant mutations were detected in 89.4% (160/179) of these patients. The prevalence of HBV mutations at rtM204 was 93.0% (106/114) in patients on lamivudine/telbivudine-based therapy, with rtM204I being more frequently associated with rtL80I/V mutations [rtM204I + rtL80I/V (50.0%, 32/64) vs rtM204V + rtL80I/V (27.3%, 9/33), P = 0.032]; rtN236 mutations were found in 76.1% (35/46) of patients receiving adefovir/dipivoxil-based therapies, with rtM204V mutations being more frequently associated with the rtL180M mutations [rtM204V + rtL180M (100%, 33/33) vs rtM204I + rtL180M (60.9%, 39/64), P < 0.001]. In addition, rtA181 mutations were observed in 19.3% (22/114) of patients receiving lamivudine/telbivudine-based therapy and 23.9% (11/46) of patients receiving adefovir/dipivoxil-based therapy.
The RT and HBsAg ORFs overlap at RT aa 8-236, and the HBsAg ORF shifts downstream by 1 nucleotide. Not surprisingly, some resistance mutations also affect the overlapping HBsAg. For instance, the rtA181T mutant selected by adefovir, lamivudine or telbivudine typically results in a stop codon in the envelope gene (sW172stop), causing a dominant negative secretion defect in HBsAg leading to an altered viral rebound profile[131]. Studies have also shown that common antiviral drug selected mutations may confer changes in the antigenicity of the overlapping HBsAg. A pioneering study by Torresi et al[132] reported that resistance mutations rtV173L + rtL180M + rtM204V that resulted in mutations sE164D + sI195M in HBsAg reduced antigen-antibody binding. A few years later, Sloan et al[133] confirmed their observations and further revealed that these mutations caused immune evasion through disruption of the “α” determinant on HBsAg. In recent years, similar observations have been made in studies from Brazil and China[134,135]. On the other hand, the overlapping changes in surface genes may potentially affect the impact of HBV polymerase mutation on replication and drug resistance. For example, the sW172 stop mutation could result in decreased viral replication and increased drug resistance[136,137]. Additionally, wt HBV and drug-resistant HBV such as the sW172 stop mutation could complement each other to maintain viral replication and rescue virion production under NAs treatment, thus facilitating HBV survival and persistence under NAs pressure (our unpublished data).
C REGION
The HBV core gene is divided into the precore (PC) region and the basic core region (BCP) by two in-frame initiating ATG codons. This results in the transcription of either the pregenomic RNA that is essential for HBV replication and translates into the nucleocapsid protein HBcAg or the PC RNA that translates into HBV e antigen (HBeAg) protein. Studies have linked defect core protein expression to PC and/or BCP mutations. The most prevalent PC mutation is a guanine-to-adenine transition at nucleotide position 1896 (G1896A), which creates a TAG stop codon at codon 28 of the PC protein and abolishes HBeAg expression at the translational level[138]. The most common BCP mutation is the double A1762T and G1764A nucleotide exchange, which results in a decrease in HBeAg expression of up to 70%, but enhanced viral genome replication[138-140].
The core protein HBcAg of HBV is a potent immune stimulator, stimulating a strong neutralizing immune response[141,142]. Cytotoxic T cells (CTLs) play a key role in the control of HBV infection and viral clearance. The HBV-specific CD8+ T lymphocytes (CTL)-mediated immune response is multi-specific, polyclonal, and vigorous during acute hepatitis B (AHB), which plays a vital role in viral control and viral clearance, as well as disease pathogenesis[143-145]. In contrast, the HBV-specific CTL response is minimal or undetectable in chronic hepatitis B (CHB) with viral persistence and immune tolerance, indicating the key role of HBV-specific T-cell response in the determination of disease progression and outcome[146,147]. Analysis of CTLs specific for viral epitopes within core, envelope, polymerase, and X proteins showed that the highly conserved HBV core protein (HBc) elicits the strongest CTL responses compared with other viral proteins, suggesting that the HBc-specific T cell response may play a leading role in viral control and clearance[148-152]. However, mutations in HBcAg may lead to the production of immune escape variants, resulting in the persistence of HBV[2,153]. In a recent study, the HBV core gene was amplified and sequenced from 148 patients with chronic HBV infection, and the human leukocyte antigen (HLA) class I genotype (A and B loci) of the patients was determined. Using a statistical approach with a novel analysis package SeqFeatR, residues under selection pressure in the presence of particular HLA class I alleles were identified. With this approach, nine residues in HBV core under selection pressure in the presence of 10 different HLA class I alleles were identified. Immunological experiments confirmed that seven of these residues were located inside epitopes targeted by patients with chronic HBV infection carrying the relevant HLA class I allele. Consistent with viral escape, the selected substitutions reproducibly impaired recognition by HBV-specific CD8 T cells[154].
HBeAg is not required for HBV replication in vitro, but is secreted into the blood. Acting as both an immunogen and a tolerogen, it has profound effects on the natural history and pathogenesis of HBV infection[155]. In general, HBV replicates more actively in carriers with HBeAg than those with anti-HBe, and carriers with HBeAg have a higher activity to transmit HBV than those with anti-HBe[156]. In chronic hepatitis B infection, a key event in the natural history of progression is HBeAg seroconversion to HBeAb with a marked reduction of HBV replication followed by gradual histological improvement[157]. However, a proportion of patients who undergo HBeAg seroconversion demonstrate a recurrence of high HBV DNA levels and intermittent or persistent ALT level elevations. These individuals harbor a mutant form of HBV that does not produce HBeAg, due to a mutation in the precore or core promoter region. In Asia, the Middle East, Mediterranean basin and southern Europe, about 15% to 20% of these carriers have elevated alanine aminotransferase and viral DNA[158]. HBeAg-negative chronic hepatitis B (precore mutant) emerges as the predominant species during the course of typical HBV infection with wt virus and is selected during the immune clearance phase (HBeAg seroconversion)[159]. Sustained spontaneous remission is rare (6% to 15%) in these individuals, and spontaneous HBsAg clearance is only about 0.5% per year[160]. Therefore, long-term prognosis is poorer among HBeAg-negative individuals than their HBeAg-positive counterparts. In fact, HBeAg-negative chronic hepatitis B is currently the main type worldwide as well as the most difficult to treat in terms of achieving sustained virological response.
In China, hepatitis B-related ACLF (HB-ACLF) patients account for more than 80% of ACLF cases as a result of the high incidence of chronic HBV infection, with a high mortality rate of 60%-80% in the absence of liver transplantation, causing 22600 deaths annually[161,162]. Characterized by increased viral load and a fierce immune response, HB-ACLF is very often associated with mutations in the BCP and PC regions, because the BCP mutations may enhance HBV replication and the PC mutation abrogates translation of HBeAg, which is considered a tolerogen and immune repressor buffering the immune attack on the infected hepatocyte[163-168]. In fact, a higher prevalence of the BCP double mutation A1762T/G1764A and the G1896A PC mutation have been reported in ALF than in acute hepatitis B patients[169-173]. In addition, single mutations including the T1753V (C/A/G), C1766T, T1768A, G1862T and G1899A in the BCP/PC region have been reported to be associated with increased HBV replication capacity and/or reduced HBeAg expression in vitro, and in some cases associated with ALF in the clinic[46,163,173-176]. A recent study reported that T1846 and A/G1913 mutations are associated with ACLF in patients infected with HBV genotypes B and C[177].
X REGION
Chronic HBV infection is the dominant global cause of HCC, accounting for 55% of cases worldwide and 80% or more in the eastern Pacific region and sub-Saharan Africa[178]. Accumulating evidence has shown that HBxAg, the viral product of the X ORF, plays critical roles in the pathogenesis of HCC[179]. HBxAg promotes carcinogenesis by interacting with cellular proteins resulting in dysregulation of multiple signaling pathways involved in cell cycle progression, cell growth and apoptosis. To date HBxAg has been found to interfere with cellular signaling pathways including Src, pRb/E2F, p53, NF-κB, PI3K, Jak1/STAT, ERK and PI3K/AKT and Wnt/β-catenin[180-187]. During the last decade or so, several studies have indicated that the C-end truncated X protein often occurs in patients with HCC[188-194]. Further investigations revealed that the truncated HBxAg lost the proapoptotic activity of the full length form and thus acquired stronger cellular transformation activity in vitro and tumor promoting activity in vivo[189]. A recent study reported that, relative to WTHBxAg, naturally occurring truncated mutant HBxΔ127 strongly enhanced cell proliferation and migration in HCC[195]. In addition to truncated HBxAg mutants, insertions in the HBx gene may play a pivotal role in hepatocarcinogenesis. A Korean cohort study showed that the prevalence of insertions was significantly higher in patients with severe liver disease, HCC, or cirrhosis of the liver compared to patients who were carriers or had chronic hepatitis. Four novel types of insertions including PKLL, GM, FFN, and tt, were observed in six patients, which were accompanied by double mutations in the BCP region[196]. Moreover, site mutations have also been associated with HCC. Studies from Korea have reported that HCC risk increased in the presence of ≥ 6 mutations of eight key mutations in Korean chronic HBV genotype C2 carriers. The eight key mutations comprise G1613A, C1653T, T1753V, A1762T, G1764A, A1846T, G1896A and G1899A that are located throughout the core promoter and the proximal portion of the precore gene (X/preC region)[197,198]. A recent study assessed the postoperative prognostic value of HBV mutations in HBxAg in HBV associated HCC patients and found that eight mutational sites, including 1383, 1461, 1485, 1544, 1613, 1653, 1719, and 1753, could serve as independent predictors of HCC survival[199].
Mutations in reactivation of HBV infection upon chemotherapy and immunosuppression
Reactivation of HBV infection is a well-documented complication among cancer patients undergoing cytotoxic chemotherapy. In recent years, it has become clear that HBV mutations associated with severe liver diseases are frequently found in patients on chemotherapy. The most common mutations associated with chemotherapy-related HBV reactivation include the G to A mutation at nt 1896 in the preC/C region, the nt 1762 (A to T) and nt 1764 (G to A) mutations in the preC promoter region[200-204]. A study of ours suggested that immune escape mutations may also be involved in chemotherapy-associated HBV reactivation[205].
HBV reactivation can also occur during immunosuppression. A few years ago, we reported a case in which a non-Hodgkin lymphoma patient who had displayed positive anti-HBs and anti-HBc before immunosuppressive therapy developed HBV reactivation after receiving a rituximab-based regimen. Our sequencing data revealed genotype D with two known escape mutations P120S and S145P and three other mutations Y134K, I150T and T189I, which had not been found in the usual escape setting[206]. In a recent study, the genetic features of HBsAg were investigated by population-based and ultradeep sequencing (UDS) of HBV DNA from 93 patients: 29 developed HBV reactivation and 64 consecutive patients with chronic HBV infection (as controls)[207]. Of the HBV-reactivated patients, 51.7% were treated with rituximab, 34.5% with different chemotherapeutics, and 13.8% with corticosteroids only for inflammatory diseases. In total, 75.9% of HBV-reactivated patients (vs 3.1% of control patients; P < 0.001) carried HBsAg mutations localized in immune-active HBsAg regions. Of the 13 HBsAg mutations found in these patients, 8 of 13 (M103I-L109I-T118K-P120A-Y134H-S143L-D144E-S171F) reside in the MHR where neutralizing antibodies target. The remaining five (C48G-V96A-L175S-G185E-V190A) are localized in class I/II-restricted T-cell epitopes, suggesting a role in HBV escape from T-cell-mediated responses. Using UDS, these mutations occurred in HBV-reactivated patients with a median intra-patient prevalence of 73.3% (range, 27.6%-100%) vs 4.6% (range, 2.5%-11.3%; P < 0.001) in control patients, supporting their fixation in the viral population as a predominant species. Moreover, additional N-linked glycosylation sites within the MHR were found in 24.1% of HBV-reactivated patients (vs 0% of chronic patients; P < 0.001). Thus, data from this study suggest that HBV reactivation occurs upon immunosuppression, correlating with HBsAg mutations endowed with enhanced capability to evade immune response. Another study cloned and sequenced the full length HBV genome from an HBsAg-negative patient who developed HBV reactivation following chemotherapy with rituximab[208]. The results showed that the number of aa substitutions in HBV from this patient was much higher than that reported for occult HBV infection or vaccine escape. It is worth noting that this study detected not only known “α” determinant mutations such as Q129H, F134Y, D144E and the preC G1896A mutation, but also a large number of mutations in other regions including preS, P, X and C, suggesting the possibility that mutations in other regions may also play some roles in immunosuppression-associated HBV reactivation.
Co-infection with HBV and human immunodeficiency virus (HIV) is not uncommon. It is estimated by the Joint United Nations Program on HIV/AIDS that 10% of 33 million HIV-infected patients has concurrent chronic HBV infection[209]. A higher proportion of chronic HBs antigenemia has been found in HIV-infected patients because HIV destroys CD4 cells which compromises host immunity against HBV[210]. Clinical observational studies have demonstrated that HIV/HBV-co-infected patients may have faster progression of hepatic fibrosis and a higher risk of cirrhosis, end-stage liver disease, and HCC than HBV-mono-infected patients[211,212]. The recurrence of HBV replication due to withdrawal of lamivudine therapy and administration of glucocorticosteroids in HIV/HBV-co-infected patients has been described previously[213-216]. Further observations including ours revealed that even slight suppression of host immunity by HIV infection at a level that did not require drug therapy could cause HBV reactivation[217,218]. It is worth noting that immune escape mutations were detected in most of these studies.
SUMARRY AND PERSPECTIVES
Most HBV genotypes and sub-genotypes have distinct geographical distributions. Abundant evidence has shown that genotypes and sub-genotypes are associated with the pathogenesis and outcome of HBV infection. Generally, HBV genotype C has been associated with an increased risk of liver inflammation, flares of hepatitis, liver fibrosis and HCC. Compared to other genotypes, patients with genotypes D, C, and F1 are more likely to develop complications such as liver cirrhosis and HCC. In addition, HBeAg seroconversion occurred much later in patients infected with genotype C compared to other genotypes. As shown in Figures 1 and 2, mutations in the preS/S region are associated with vaccine failure, immune escape, occult HBV infection and the occurrence of HCC. Mutations in the P region may cause drug resistance to NA antivirals. Mutations in the preC/C region are related to HBeAg negativity, immune escape, and persistent hepatitis. Mutations in the X region play critical roles in promoting HCC.
Investigations of HBV genetic variability and pathogenic implications of specific mutations have resulted in significant advances over the past decade. A significant increase in the body of knowledge regarding HBV genetic variability has greatly improved the way HBV infection is managed and treated. However, there are questions that remain unanswered and obstacles that need to be overcome. For example, much of our current understanding regarding HBV genetic variability was inferred from molecular epidemiological analyses. Due to constant viral evolution as a result of interactions among the host, virus and drug therapy, the results of these types of analyses can be confounded by many known or unknown factors. We believe that more physical experiments using reference strains that are really representative of their respective genotypes and sub-genotypes can help overcome this problem and provide more detailed and reliable information. In addition, as both viral and host factors affect HBV pathogenesis, reliable biomarkers and convenient methods need to be established to monitor both the viral and host factors to, ideally, achieve personalized management and treatment. As an example, in a recent pioneering study, Gong et al[219] compared the performance of next-generation sequencing and clone-based sequencing (CBS) in analyzing HBV RT quasispecies heterogeneity. In that study, HBV genomic DNA was extracted from serum samples obtained from 31 antiviral treatment-naive patients with chronic hepatitis B. The RT region quasispecies were analyzed in parallel using CBS and ultradeep pyrosequencing (UDPS). Their data showed that the number of qualified strains obtained by UDPS was much larger than that obtained by CBS (P < 0.001), and the complexity value derived from UDPS data was higher than that derived from CBS data (P < 0.001). A study on the prevalence of variations within the RT region showed that CBS detected an average of 9.7 ± 1.1 aa substitutions/sample and UDPS detected an average of 16.2 ± 1.4 aa substitutions/sample. This study clearly demonstrated that viral heterogeneity determination by the UDPS technique is more sensitive and efficient in detecting low-abundance variations than that by the CBS method, and thus has shed some light on the future clinical application of next generation sequencing in HBV quasispecies evaluation[219]. Additionally, it is important to continue research on the identification of novel therapeutic targets in the life cycle of HBV or in the host immune system to stimulate the development of new antiviral agents and immunotherapies. These can be antiviral agents targeting HBV entry, cccDNA, capsid formation, viral morphogenesis and virion secretion, as well as therapeutic vaccines.