Introduction
Liver diseases are a major cause of morbidity and mortality worldwide. In particular, hepatocellular carcinoma (HCC) is the fifth most common malignancy worldwide, and its incidence is increasing. Successful therapy for HCC patients, with a concomitant increase in survival rate, strongly depends on the timing and accuracy of diagnosis. Therefore, biomarkers for early and accurate diagnosis of HCC using either body fluid samples or tissue specimens are essential for the development of effective treatment strategies. Over the years, several serum marker candidates have been reported for HCC, of which α-fetoprotein (AFP)1 and des-γ-carboxy prothrombin2 are the most prominent and widely used. Regarding histopathological diagnosis of HCC, glypican-3, heat shock protein 70, and glutamine synthetase are established biomarkers for HCC diagnosis using immunohistochemistry.3 However, there is still a need for further optimization, as none of the known markers exhibits 100% specificity and sensitivity, and the diagnostic performance is still insufficient for early-stage and small HCC.
To identify novel biomarker candidates, proteomics-based approaches offer a useful and versatile analytical platform, as shown by the numerous HCC studies utilizing this particular “omics”-based technology. Apart from biomarker and drug target discovery, proteome analysis is also useful for the study of the molecular processes underlying HCC progression, metastasis, and recurrence.
The current review gives a summary of the use of proteomics for the investigation of HCC. To begin, we review the general aspects of proteomics and strategies of quantitative proteome analysis. Then, we highlight selected examples of different HCC-related proteomic studies demonstrating the broad applicability of proteomics to analyze diverse sample types.
Proteomic studies of hepatocellular carcinoma
Cell culture models
For HCC proteomic studies, HCC cell lines have been extensively studied as model systems, because of their homogeneity relative to the heterogeneous nature of tumor tissues. In addition, cell lines can be transformed, providing the opportunity to perform experiments under well-defined and manipulated conditions (e.g. knockdown or knockout of a particular gene). Furthermore, cultured cells are easily accessible and provide an endless source of protein samples. However, the main limitation of proteomic data derived from cell culture is the inherent difference between in vivo and in vitro conditions.4
Aiming to discover progression biomarkers for HCC invasion, Wu et al. analyzed membrane proteins using SDS-PAGE and electrospray-ionization mass spectrometry (ESI-MS/MS) to study human HCC cell lines with different metastatic potentials. A total of 14 differentially regulated proteins were identified between the HCC cell lines with high (HCCLM9) and low (MHCC97L) metastatic potential. Using immunohistochemistry, overexpression of the protein Coronin-1C was successfully verified in clinical HCC tumor tissue samples with metastasis and invasive behavior.5 This study and others6–9 demonstrated the applicability of the cell culture model in biomarker discovery studies.
Taking advantage of the homogeneity of cell culture models, Yokoo et al. analyzed the differences in protein expression between nine AFP-producing cell lines and seven AFP-negative cell lines using 2D-DIGE and matrix-assisted laser desorption-ionization time-of-flight mass spectrometry (MALDI-TOF MS). Of 2000 quantified proteins, 11 proteins were identified and reported to be differentially expressed between these two groups of cell lines, most of which were involved in glucose metabolism, apoptosis, cytoskeleton, and translation.10
Another advantage of using cell culture models is the ability to monitor the effect of exogenous or endogenous chemicals on protein expression and associated pathways. Recently, Fu et al. used 2D-DIGE and MALDI-TOF MS and MS/MS to identify the targets of 1,3,6,7-tetrahydroxyxanthone (TTA) in HCC cells. In total, 18 proteins were differentially regulated between treated and untreated cells. Of interest, β-tubulin and translationally controlled tumor protein were downregulated, whereas 14-3-3σ and P16 proteins were upregulated.11
Cell culture models can provide informative data regarding the effects of changing the expression of a specific enzyme on other proteins or whole pathways. This approach was applied by Zhang et al. to study the effect of hepatitis B virus X protein (HBx) on protein expression correlated with angiogenesis in HCC cell lines.12 Using isobaric tags for relative and absolute quantitation (iTRAQ) labeling for quantification, 16 proteins were reported to be differentially expressed between HepG2 cells transfected with the HBV genome and cells transfected with an empty vector as control. In particular, fumarate hydratase and tryptophan-tRNA ligase, both previously described in association with angiogenesis, were upregulated in HBV-transfected cells in this study.
Secreted proteins play important roles in cell migration, communication, and signaling.13 Thus, the cancer cell secretome has been studied as a potential source of HCC biomarkers. Slany et al. applied LC-MS/MS analysis to spots isolated by 2D-PAGE from the secretomes of primary human hepatocytes (PHH), HepG2 cells, and Hep3B cells.14 An overlap between the secretome and plasma proteins was reported for all three cell types. Of the 72 proteins identified in the PHH secretome, 46 were also reported in plasma. For the HepG2 and Hep3B secretomes, 55 and 24 plasma proteins were detected, respectively. Among the identified secreted proteins, many had already been reported in association with HCC, including ephrin-A1, neuroserpin, glypican-3, and gastricsin. In addition, since stanniocalcin-2 precursor and bactericidal/permeability-increasing protein-like 1 demonstrated a restricted expression profile in the HepG2 secretome, they were identified as novel potential HCC biomarkers.14
Animal models
HCC is a multifactorial disease and is ideally studied under controlled environmental conditions, where data regarding disease progression, management, and cause can be generated more reliably. Animal HCC models are well characterized and easy to handle, making them an important source of in vivo proteomic data. However, induction of HCC in animals takes place over a short period of time, while HCC in humans normally develops over years. Consequently, not all HCC-relevant proteome changes in humans can be reflected in animal models.15
HBV is a risk factor for HCC; it produces HBx, a protein implicated in HCC development.16–18 To further understand the relationship between HBx and HCC, a proteomic study comparing liver tissue from HBx transgenic and control mice has been performed. Here, seven proteins were found by 2D-DIGE and MALDI-TOF MS to be upregulated in tumor tissue of the p21-HBx transgenic mice relative to control mice. The overexpression of PSMB4, CTSB, PSMA1, and PSMC3 in HCC was confirmed using RT-PCR in tumor tissue from transgenic mice and in human liver tissue.19
Lai et al.20 were the first to apply proteomics to compare changes in extracellular matrix (ECM) proteins during the development of HCC from fibrosis and steatosis. They used liver tissue samples obtained from two transgenic mouse models of disease progression, PDGFC transgenic and Pten null mice. Proteins were extensively separated by 2D-HPLC and SDS-PAGE and identified by nanoflow LC-MS/MS, yielding 10,707 protein isoforms. Twenty-six collagen proteins demonstrated a significant change during HCC progression, especially collagen types IV, VI, XIV, XV, and XVI. Among non-collagenous ECM proteins, laminin 521 was the most abundant laminin in HCC, and the receptor protein integrin α3 was reported only in the Pten null mouse tumor.20
Proteome analysis of animal models can also facilitate HCC diagnosis and implementation of appropriate therapeutic strategies. As partial hepatectomy (PH) is a first-line therapy for a certain population of HCC patients,21 Cao et al. performed proteomic analysis in mouse models with 50% PH in order to detect changes in protein expression profiles during liver regeneration.22 Using a two-dimensional set-up comprising SDS-PAGE and LC-MS/MS analysis, an average of 441 proteins were identified as differentially regulated between liver tissue of the sham-operated control group and the 50% PH group. One day after PH, the expression of 87 proteins had changed: 50 proteins were upregulated and 37 were downregulated in the 50% PH group. All of these proteins were related to critical biological pathways.
Microcystins (MCs) are known inducers of HCC.23–24 To understand the carcinogenic processes underlying induction of HCC by MCs, a proteomic analysis was performed in animal models. MicroRNA and protein expression changes were monitored in the livers of mice treated with microcystin-LR. Using two-dimensional gel electrophoresis, 62 proteins were reported to be significantly regulated in microcystin-LR-treated mice relative to untreated controls. MALDI-TOF was used to identify 42 proteins. Of the identified proteins, two tumorigenesis-associated proteins, transglutaminase 2 (TGM2) and N-myc downstream regulated gene 2 (NDRG2), were verified using Western blot analysis.25
Chu et al. investigated the proteins involved in peroxisome proliferation using surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF MS) to compare the protein profiles of wild-type control mice, acyl-CoA oxidase-deficient (AOX-/-) mice, and wild-type mice treated with the peroxisome proliferator Wy-14,643.26 In other work, 2D-DIGE and MALDI-MS analyses were used to compare the protein regulation profiles of HCC in rat liver tissue induced by carcinogenesis and non-carcinogenesis compounds. Zeindl-Eberhart et al. reported aldose reductase-like proteins (ARLPs) 1–4 to be more highly expressed in both HCC-induced models than in normal rat liver.27 As suggested by the authors, ARLPs may be viable immunohistochemical diagnostic biomarkers of HCC in humans.28
To characterize proteome changes during liver cancer progression, tissue samples from rats at six progression stages of liver cancer were analyzed by 2D-DIGE and MALDI-TOF/TOF MS.15 Interestingly, most of the proteins differentially expressed in malignant stages were already altered in the pre-tumorous phase. This finding could aid in the identification of reliable biomarker candidates for the early diagnosis of HCC.
Body fluids
Despite its low sensitivity of 39%–65%, AFP is the only commonly used diagnostic serum biomarker for HCC.29 This highlights an urgent need for the identification of new HCC serum markers with higher sensitivity and specificity. Plasma contains thousands of proteins that could potentially reflect the pathological and physiological status of patients, such as aberrant secretions and tissue leakage proteins. Plasma proteins span a wide dynamic range of concentrations. For example, albumin is present at 35–40 mg/ml, whereas interleukin 6 is present at 0–5 pg/ml. In most cases, potential biomarkers are present at low concentrations; however, more than 99% of the total protein content of serum is represented by only 22 proteins. This makes proteomic analysis of plasma a very challenging task, even with advanced analytical technologies such as 2D-DIGE and LC-MS/MS.30
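The scale of this dynamic range can be made concrete with a short calculation. The sketch below is illustrative only: it uses the concentration ranges quoted above, takes the upper bound of the interleukin 6 range (the lower bound of 0 pg/ml cannot be used in a ratio), and the function and variable names are hypothetical.

```python
import math

# Unit conversions to grams per milliliter.
MG_TO_G = 1e-3
PG_TO_G = 1e-12


def orders_of_magnitude(high_g_per_ml: float, low_g_per_ml: float) -> float:
    """Return log10 of the concentration ratio between two analytes."""
    if high_g_per_ml <= 0 or low_g_per_ml <= 0:
        raise ValueError("concentrations must be positive to form a ratio")
    return math.log10(high_g_per_ml / low_g_per_ml)


# Concentration ranges quoted in the text.
albumin_low = 35 * MG_TO_G   # 35 mg/ml
albumin_high = 40 * MG_TO_G  # 40 mg/ml
il6_high = 5 * PG_TO_G       # 5 pg/ml (upper bound; lower bound is 0)

span_min = orders_of_magnitude(albumin_low, il6_high)
span_max = orders_of_magnitude(albumin_high, il6_high)

print(f"Albumin vs. IL-6: {span_min:.2f} to {span_max:.2f} orders of magnitude")
```

With these figures, albumin exceeds interleukin 6 by nearly ten orders of magnitude in concentration.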
In biomarker studies, high-abundance proteins are usually removed by immunodepletion in order to enrich the remaining low-abundance serum proteins. Feng et al. depleted albumin and immunoglobulin G (IgG) from the sera of 20 HBV patients, 20 HCC patients, and 20 healthy controls to identify potential HCC biomarkers in serum. Proteins were analyzed by 2D-DIGE, and of 40 differentially expressed proteins, eight were identified with MALDI-TOF MS/MS. Heat-shock protein 27 (HSP 27) was verified as a potential biomarker in sera by Western blotting.31 A more extensive sample preparation workflow can also be used to discover novel HCC biomarkers in serum, including immunodepletion of the six most abundant proteins (transferrin, albumin, antitrypsin, IgG, IgA, and haptoglobin), fluorophore labeling, and 3D protein separation comprising anion exchange and reversed-phase LC followed by online 1D electrophoresis.32
In a more recent study aimed at discovering novel HCC markers in plasma,33 immunodepletion of IgG and albumin was applied to nine pooled plasma samples from 120 controls, 99 liver cirrhosis patients, and 120 HCC patients. To decrease proteome complexity, each pooled sample was further fractionated into 24 fractions, of which only seven were chosen for label-free analysis by LC-MS/MS. In total, 1,307 proteins were identified, and 26 proteins were found to be differentially regulated among the experimental groups. Four potential HCC biomarker candidates (apolipoprotein A1, complement component 3, alpha-1-antitrypsin, and hemopexin) were subsequently verified using ELISA. The expression trends of these proteins were also confirmed in an independent patient cohort.
Despite being widely used, depletion of high-abundance proteins from plasma or serum is controversial in proteomics. This is due to its insufficient reproducibility and to the ability of some high-abundance proteins, such as albumin, to bind other proteins that are then unintentionally co-depleted. If such proteins are completely removed by co-depletion, information regarding alterations in their expression is lost. If they are only partially removed, artificially generated changes in protein levels that are not associated with the disease may be detected, yielding misleading results. For example, Feng et al. demonstrated this problem when they showed that α-fetoprotein was non-specifically depleted with the albumin-IgG fraction.31
In addition to depletion techniques, enrichment of proteins in plasma and serum is also widely used in proteomic studies. For example, Yang et al. used a Con A-magnetic particle conjugate-based method to isolate glycoproteins from the sera of 60 healthy controls and 60 HCC patients to study the differences in the glycoproteome associated with HCC. Ninety-three glycoproteins were identified by LC-ESI-MS/MS in the controls compared to 85 in HCC patients’ sera, and 72 glycoproteins overlapped between the two sample sets. In addition, the authors investigated the different N-glycan characteristics linked to HCC by MALDI-TOF/MS.34 N-glycans in HCC were reported to be hyperfucosylated and associated with a reduction in terminal N-acetylneuraminic acid (Neu5Ac).
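The glycoprotein counts reported above imply how many identifications were specific to each group. The short sketch below works through that arithmetic; it uses only the three counts given in the text, and the function name and dictionary keys are hypothetical.

```python
def overlap_summary(n_controls: int, n_hcc: int, n_shared: int) -> dict:
    """Derive group-specific and total counts from two set sizes and their overlap."""
    if n_shared > min(n_controls, n_hcc):
        raise ValueError("overlap cannot exceed the size of either set")
    return {
        "only_controls": n_controls - n_shared,
        "only_hcc": n_hcc - n_shared,
        "total_distinct": n_controls + n_hcc - n_shared,
    }


# Counts quoted in the text: 93 in controls, 85 in HCC sera, 72 shared.
summary = overlap_summary(n_controls=93, n_hcc=85, n_shared=72)
print(summary)
```

From the reported counts, 21 glycoproteins were identified only in controls and 13 only in HCC sera, for 106 distinct glycoproteins overall.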
In the past few years, targeted mass spectrometry-based approaches such as multiple reaction monitoring (MRM) have become more widely used in proteomic studies of body fluids. However, such targeted methods are not suitable for untargeted discovery studies; prior identification of protein or peptide targets using other proteomic techniques is required. Using isotopically labeled forms of the target that are spiked into a given sample, the MRM approach can be used to absolutely quantify the target protein in a complex, non-depleted matrix. In a recent study, MRM was used to verify candidate HCC diagnostic proteins in plasma that had been nominated by global data mining of proteins linked to HCC. Of the 4,658 proteins screened in an extensive data filtration process, nine proteins were found to be differentially expressed using MRM for quantification in the plasma of 18 HCC patients before and after a three-month treatment regimen and 36 healthy controls. Compared with AFP alone, a multimarker panel of actin-binding protein anillin (ANLN) and filamin-B (FLNB) was reported to better discriminate the healthy controls from the HCC group before treatment.35
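The quantification principle behind spiking an isotopically labeled standard (stable isotope dilution) can be sketched as follows. This is a minimal illustration, assuming that the endogenous (light) and labeled (heavy) peptide forms give an equal signal response and that a single spiked amount serves as the calibration point. The peak areas, the spiked amount, and the function name are hypothetical and are not taken from the cited study.

```python
from typing import Sequence


def quantify_by_isotope_dilution(
    light_areas: Sequence[float],
    heavy_areas: Sequence[float],
    heavy_spiked_fmol: float,
) -> float:
    """Estimate the endogenous peptide amount from light/heavy MRM peak areas.

    The areas of all monitored transitions are summed per form, and the
    light-to-heavy ratio is scaled by the known amount of spiked standard.
    """
    heavy_total = sum(heavy_areas)
    if heavy_total <= 0:
        raise ValueError("heavy standard signal must be positive")
    ratio = sum(light_areas) / heavy_total
    return ratio * heavy_spiked_fmol


# Hypothetical peak areas for three transitions of one target peptide.
light = [12000.0, 8000.0, 4000.0]  # endogenous peptide
heavy = [6000.0, 4000.0, 2000.0]   # isotopically labeled standard

endogenous_fmol = quantify_by_isotope_dilution(light, heavy, heavy_spiked_fmol=50.0)
print(f"Estimated endogenous amount: {endogenous_fmol:.1f} fmol")
```

Because the light and heavy forms co-elute and are measured in the same run, taking their ratio cancels much of the run-to-run variation, which is one reason the approach can work in a non-depleted matrix.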
Human tissue samples
Proteome analysis of clinical tissue samples is the most common source of HCC proteomic data: most researchers apply comprehensive proteomic analysis to HCC tumor and adjacent non-tumor tissue to address different clinical questions (Fig. 3).32,36–40
Very recently, our group used two different proteomic approaches to characterize protein changes associated with HCC in tissue and to identify novel biomarker candidates.36 In a comprehensive proteomic analysis, seven HCC and adjacent non-tumorous liver tissue samples were quantitatively analyzed by 2D-DIGE and label-free LC-MS/MS. As a result, we identified 476 and 148 proteins that were differentially expressed between normal and HCC tissue in the label-free and 2D-DIGE analysis, respectively. Following verification using Western blotting, two of six selected candidates, major vault protein (MVP) and betaine-homocysteine S-methyltransferase 1 (BHMT), were verified in an independent and larger patient cohort.
Chemical labeling strategies, such as iTRAQ-based quantitative proteome analysis, can be used to identify differentially regulated proteins in HCC tissue relative to adjacent non-tumorous controls. Using this technique, over 600 proteins were identified in an LC-MS/MS analysis by Chaerkady et al.41 Based on alterations in the iTRAQ reporter ion intensities, 59 proteins were found to be upregulated and 92 proteins downregulated in HCC. Interestingly, myeloid-associated differentiation marker protein (MYADM) showed 100% specificity and 63% sensitivity in a subsequent verification screen using a tissue microarray consisting of 53 HCC tissue samples and 20 non-tumorous liver tissue samples.41
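Sensitivity and specificity figures such as these follow from the standard definitions: sensitivity is the fraction of diseased samples that test positive, and specificity is the fraction of non-diseased samples that test negative. The sketch below shows the calculation. The counts are purely illustrative and are not the per-sample results of the cited study, which are not given here; the function name is likewise hypothetical.

```python
def sensitivity_specificity(tp: int, fn: int, tn: int, fp: int) -> tuple:
    """Compute (sensitivity, specificity) from confusion-matrix counts.

    tp/fn: diseased samples scored positive/negative.
    tn/fp: non-diseased samples scored negative/positive.
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one diseased and one non-diseased sample")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity


# Illustrative counts only (hypothetical, not from the cited study):
# 50 tumor samples, 30 of them marker-positive; 20 controls, none positive.
sens, spec = sensitivity_specificity(tp=30, fn=20, tn=20, fp=0)
print(f"sensitivity = {sens:.0%}, specificity = {spec:.0%}")
```

A marker with 100% specificity produces no false positives among the control samples, while a sensitivity below 100% means that a share of tumor samples is still missed.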
To provide further insight into the mechanism underlying hepatocarcinogenesis, Li et al. carried out a differential proteomic analysis of HCC that developed from liver cirrhosis (LC) or chronic hepatitis B (CHB) and of the adjacent non-tumorous liver tissue. Twelve LC-developed HCC and six CHB-developed HCC tissue samples and adjacent non-tumorous liver tissue were analyzed using 2D-DIGE and MALDI-TOF MS. Seventeen proteins were differentially regulated between the cancerous and non-cancerous tissue. In particular, ADP/ATP carrier protein and c-Jun N-terminal kinase 2 were reported to be upregulated only in CHB-developed HCC samples, while Rho-GTPase-activating protein 4 was downregulated. In contrast, insulin-like growth factor binding protein 2 was downregulated in LC-developed HCC.37
Bai et al. searched for prognostic markers in 252 HCC patients who underwent liver transplantation (LT). During a median follow-up of 42 months, 79 patients demonstrated recurrence. Laser capture microdissection (LCM) was performed to isolate HCC hepatocytes from liver tissue samples, and cleavable isotope-coded affinity tags (cICAT) were used to quantify changes in protein expression in a 2D-LC-MS/MS analysis. Of 52 proteins differentially expressed between patients with and without recurrence after LT, calpain small subunit 1 (CAPN4) overexpression was correlated with HCC recurrence and metastasis, making it a potential marker of poor HCC prognosis.42
More recently, Mizuno et al. searched for novel HCC prognostic proteins related to human telomerase reverse transcriptase (hTERT). A specific proteomic approach was used that included fractionation of protein lysates using gel filtration columns to isolate hTERT proteins and analysis with LC-MS/MS. The expression of hnRNP A2/B1 protein was reported to be a potential marker for HCC survival prediction following surgical resection.43
Proteomic profiling of HCC tissue with different degrees of histological differentiation was conducted by Orimo et al. to monitor key proteins involved in HCC progression. They analyzed microdissected cells from six well-differentiated, 14 moderately differentiated, and seven poorly differentiated HCC tissue samples, in addition to 11 adjacent non-tumorous liver tissue samples and seven normal liver tissue samples. Proteins were profiled using 2D-DIGE, and 21 proteins from 41 selected 2D-DIGE spots were identified using LC-MS/MS. APC-binding protein EB1 (EB1) was nominated as a potentially important protein in HCC and was therefore tested in an additional 145 HCC samples using immunohistochemistry. Interestingly, EB1 was reported only in moderately and poorly differentiated HCC tissue, and its expression was associated with poor HCC prognosis.44
In another study characterizing proteome changes associated with HCC progression in order to establish new therapeutic strategies for HCC, 2D-DIGE and MALDI-TOF MS were used to profile protein expression changes in 27 HCC tissue samples with different histological backgrounds and their adjacent non-tumorous liver tissue. Of 43 selected spots, 22 differentially expressed proteins were identified following MS analysis. Of these proteins, peroxiredoxin 3 (PRDX3) was reported to be overexpressed in poorly differentiated HCC tissue, and it may have a role in HCC progression.45