Introduction
Hepatocellular carcinoma (HCC) accounts for roughly 90% of primary liver cancers and ranks among the most lethal malignancies, with the fourth-highest cancer-related mortality rate globally.1 Its primary causes include viral hepatitis, alcohol-induced cirrhosis, and fatty liver disease, among others.2 Despite notable advances in treatment modalities in recent years, the five-year overall survival (OS) rate for HCC patients remains disappointingly low, with only 5% to 15% of early-stage cases qualifying for surgical resection.3 Even after surgery, patients face a substantial risk of recurrence. Due to HCC’s insidious onset and rapid progression, it is often diagnosed at advanced stages. Currently, more than 90% of HCC patients receive treatments such as chemotherapy, immunotherapy, transarterial chemoembolization, and tyrosine kinase inhibitors.4,5 However, the clinical effectiveness of these therapies remains suboptimal. Effectively curbing tumor growth and preventing metastasis continue to be formidable challenges. Furthermore, the heterogeneity of HCC complicates prognosis prediction and clinical decision-making. Therefore, there is an urgent need to identify new and reliable screening methods to improve diagnostic accuracy, better predict patient outcomes, and provide a foundation for personalized treatment strategies.
Cell proliferation is tightly regulated by a series of conserved cell cycle control mechanisms to ensure the generation of two genetically identical daughter cells.6 Cell cycle checkpoints (CCCs) serve as guardians of DNA integrity, preventing the accumulation and propagation of genetic errors during division. These checkpoints can arrest cell cycle progression or, in cases of irreparable DNA damage, trigger cell cycle exit or apoptosis.7 Cancer is characterized by uncontrolled cellular hyperproliferation, with CCC dysfunction playing a pivotal role in its pathogenesis.8 CHK1, a DNA damage checkpoint kinase, is crucial for regulating DNA replication, phase transitions, and mitotic events. Elevated CHK1 expression is significantly associated with clinical outcomes, including prognosis, relapse rates, and drug resistance across various malignancies.9 Centromere proteins (CENPs) are essential for mitosis, participating in centromere formation and chromosome segregation. Previous studies have reported a notable increase in CENPA levels in HCC, correlating with disease progression by upregulating cyclin D1 and neuropilin 2 through YY1 transcriptional activation and collaboration with YY1.10 While the roles of certain cell cycle checkpoint-related genes (CCCRGs) in HCC progression have been extensively studied, most research has focused on individual genes. However, the prognostic significance of the collective transcriptional profile of CCCRGs in HCC remains inadequately explored. Moreover, the functional contributions of many CCCRGs in HCC warrant further investigation.
The tumor immune landscape reflects a complex interplay among tumor cells, diverse immune cell subsets, and the cytokines they secrete. Immune cells can be broadly categorized by function into pro-inflammatory cells, such as effector CD8+ T cells and natural killer (NK) cells, and immunosuppressive cells, including M2 macrophages, regulatory T cells, and myeloid-derived suppressor cells. Tumors employ multiple mechanisms to modulate immune cell infiltration and depletion, while reciprocal interactions also occur among these immune cells.11 Recent advances in HCC immunotherapy have shown that while immune checkpoint inhibitors elicit strong antitumor responses in certain patient populations, combination therapies incorporating immune checkpoint inhibitors with other agents generally achieve superior outcomes. Notably, the atezolizumab–bevacizumab combination has become a first-line treatment for HCC. Conversely, other immunotherapeutic strategies, including adoptive T-cell therapy, cancer vaccines, and oncolytic virotherapy, have demonstrated limited and inconsistent clinical efficacy to date.11 Furthermore, the reconfiguration of the immune microenvironment in HCC has drawn increasing attention. For instance, one study revealed that interactions between SPP1-positive macrophages and cancer-associated fibroblasts form a tumor immune barrier within HCC, which impedes CD8+ T-cell infiltration into the tumor core and significantly diminishes the therapeutic efficacy of immunotherapy.12 Despite extensive research on immune microenvironment reconfiguration in HCC, the role of CCCRGs in this process remains poorly understood.
In this study, we employed a robust bioinformatics strategy, incorporating weighted gene co-expression network analysis (WGCNA) and related methods,13 to identify 22 CCCRGs significantly associated with poor prognosis in HCC. These key genes were then employed to construct a prognostic risk model aimed at enhancing clinicians’ ability to predict patient survival and therapeutic responses with greater accuracy. Among these, CENPI emerged as a critical target gene, yet its functional role and underlying mechanisms in HCC progression remained unclear. Subsequent experimental investigations demonstrated that CENPI promotes HCC cell proliferation, migration, and invasion by modulating the Hippo signaling pathway and regulating epithelial-mesenchymal transition (EMT). Importantly, our findings underscore CENPI’s role in reshaping the tumor immune microenvironment (TIME) and suppressing the infiltration of pro-inflammatory immune cells, including CD8+ T lymphocytes and NK cells. These findings suggest that CENPI represents a promising therapeutic target for HCC management.
Methods
Bioinformatics processing and analysis
High-quality HCC gene expression datasets were selected from The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) databases according to strict inclusion criteria. Samples were excluded if they had incomplete follow-up data, missing survival information, survival durations under 30 days, or redundant sequencing from the same patient. Ultimately, 343 tumor samples from the TCGA-Liver Hepatocellular Carcinoma (LIHC) dataset and 226 from the ICGC-Liver Cancer-Riken-Japan (LIRI-JP) dataset were included. A list of 292 CCCRGs was also obtained from pathcards.genecards.org. Employing WGCNA and differential expression analysis, we identified genes strongly correlated with tumorigenesis and poor prognosis, and cross-referenced these with the CCCRG list to pinpoint key genes.13 Lasso regression and Random Forest (RF) algorithms were applied to further refine and isolate hub genes.14,15 TCGA HCC tumor samples were designated as the training set, while ICGC-LIRI-JP samples served as the validation set. A prognostic model incorporating these hub genes was constructed using the training set and validated with the validation set. Kaplan–Meier curves were generated to evaluate OS across different groups.16 Nomograms and calibration curves were constructed to assess the model’s predictive accuracy for survival probabilities.17 Decision curve analysis was conducted to evaluate the clinical utility of the model, and receiver operating characteristic (ROC) curves were plotted to assess its predictive performance for postoperative survival.18 Gene set enrichment analysis and gene set variation analysis were employed to explore relevant pathways.19 The ESTIMATE algorithm was used to evaluate and compare stromal and immune scores between groups,20 while single sample gene set enrichment analysis was applied to assess variations in immune cell infiltration patterns.21 The maftools package was utilized to visualize mutational landscapes of different risk cohorts.22
The GSE149614 single-cell RNA sequencing dataset included tissue samples from 10 HCC patients, encompassing primary tumors, portal vein tumor thrombi, metastatic lymph nodes, and healthy liver tissues.23 Using the “Seurat” package in R software, 10x single-cell RNA sequencing data were converted into Seurat objects. Raw counts underwent rigorous quality control (QC) to exclude low-quality cells, retaining only those with gene counts between 500 and 5000 and mitochondrial gene expression below 10%. Following the standard Seurat workflow, the data were normalized, subjected to principal component analysis, and batch effects were corrected using Harmony. Clustering was performed at an optimal resolution determined via UMAP visualization, and cell subtypes were identified based on unique molecular expression profiles. The CellChat package was then used for a detailed analysis of cell–cell communication.24
Specimens from patients
Liver tissue samples (n = 30) were collected from HCC patients after treatment, along with an equal number of matched non-neoplastic tissue controls, at the Second Affiliated Hospital of Harbin Medical University (Harbin, China). The study protocol was approved by the hospital’s Ethics Committee, and all participants provided written informed consent (YJSKY2023-148).
Cell culture
Human HCC cell lines and normal liver cells were obtained from Zhongqiao Xinzhou Biotechnology Co., Ltd. (Shanghai, China). Cells were cultured in Dulbecco’s modified Eagle medium supplemented with 10% fetal bovine serum (FBS) and 1% penicillin–streptomycin (Beyotime, Shanghai, China) at 37°C in a 5% CO2 incubator.25
Gene knockdown
For CENPI knockdown, shRNA and negative control (shNC) sequences were designed as follows: shCENPI-1: GCTCTTCTTTACATCAACCAT, shCENPI-2: CTGCTCTGATTTCAGTATCTT, shCENPI-3: CTGAAAGAGCTATTGCAGAAT, shCENPI-4: AGGCTTTGTTGTCACTGTATA, shCENPI-5: TTGCAAATGGCAGTGGGATAT, shNC: TTCTCCGAACGTGTCACGT.26
Western blot analysis
Proteins were extracted from tissue and cellular samples using RIPA buffer (Beyotime, Shanghai, China) supplemented with protease and phosphatase inhibitors (Roche, Switzerland). Protein concentration was determined with a bicinchoninic acid assay kit (Beyotime, Shanghai, China). Proteins were separated by SDS–PAGE, transferred to nitrocellulose membranes, and blocked according to standard protocols. Membranes were incubated overnight at 4°C with the following primary antibodies: β-actin (#8480, CST, 1:1000), CENPI (ab118796, Abcam, 1:1,000), E-cadherin (TA0131, Abmart, 1:1,000), N-cadherin (T55015, Abmart, 1:1,000), Vimentin (#5741, CST, 1:1,000), YAP1 (T55381, Abmart, 1:5,000), and phospho-YAP1 (T55743, Abmart, 1:5,000). After three washes with TBST buffer, membranes were incubated with diluted secondary antibodies for 1 h, washed again, and then exposed to ECL developing solution. After a two-minute incubation in the dark, membranes were imaged using an automated chemiluminescence imaging system.27
CCK-8
Cells were harvested, centrifuged, and resuspended in complete culture medium for counting. Subsequently, 5 × 103 cells in 100 µL of medium were seeded into each well of a 96-well plate and incubated for 24, 48, and 72 h. At each time point, 10 µL of CCK-8 solution (Beyotime, Shanghai, China) was added to each well, followed by a 2-h incubation. The optical density at 450 nm was measured and recorded for each well.28
EdU
Cells were cultured in 6-well plates until adherent, then incubated with 2 mL of 1× EdU working solution for 2 h. Following incubation, cells were washed, fixed with 4% paraformaldehyde, and permeabilized with PBS containing 0.3% Triton X-100. They were then incubated with a Click reaction mixture for 30 m in the dark, stained with 1× Hoechst 33342 to visualize nuclei, and examined using a confocal laser scanning microscope.29
Colony formation assay
When cells in each group reached the desired density, they were harvested and centrifuged. A total of 1,000 cells per well were seeded into 6-well plates and cultured continuously for 12 days. Colonies were then fixed with 4% paraformaldehyde for 30 m and stained with 1% crystal violet for 20 m. The number of colonies was subsequently counted and recorded.30
Wound healing assay
After transfection, cells were grown in 6-well plates until they reached confluence. A scratch was made across the monolayer using a 200 µL pipette tip, and non-adherent cells were removed by washing with PBS. The initial wound edge (0-h time point) was imaged at 40× magnification. Cells were then incubated in serum-free medium, and wound closure was monitored and photographed at 24 and 48 h using a 40× microscope.31
Transwell assay
For the invasion assay, the upper chamber of the transwell insert was coated with approximately 50 µL of Matrigel and allowed to solidify. Cells were resuspended in serum-free medium, and 200 µL of the suspension was added to the upper chamber. The lower chamber was filled with 800 µL of Dulbecco’s modified Eagle medium containing 10% fetal bovine serum. After 24 and 48 h of incubation, invasive cells that had migrated to the lower chamber were fixed with methanol and stained with 1% crystal violet. Cell invasion was then assessed and photographed under a microscope at 100× magnification.32
Flow cytometry
Apoptosis was evaluated using the Annexin V–FITC/PI Apoptosis Kit (BD, USA). Cells were harvested at the desired density, centrifuged, and washed with pre-chilled PBS. After counting, an appropriate volume of the suspension was re-centrifuged, and the supernatant was discarded. The cell pellet was resuspended in 100 µL of binding buffer and transferred into flow cytometry tubes. Cells were then stained with Annexin V–FITC and PI, followed by 15 m of incubation at room temperature in the dark. After incubation, 400 µL of binding buffer was added, and apoptosis was analyzed by flow cytometry.33
Immunofluorescence
Cells were seeded at optimal density in 24-well plates and allowed to adhere. After fixation with 4% paraformaldehyde, cells were permeabilized and blocked. The primary antibody against YAP was incubated with the cells overnight at 4°C. The following day, a fluorescent secondary antibody was applied, and nuclei were counterstained with DAPI. Stained cells were visualized under a fluorescence inverted microscope.34
Formalin-fixed, paraffin-embedded tissue specimens were sectioned at 4 µm. After dewaxing with xylene and rehydration through graded ethanol, antigen retrieval was performed using 0.1 M citrate buffer (pH 6.0) with heat-induced epitope recovery. Sections were incubated overnight at 4 °C with primary antibodies against CD3 (ZM-0417) and CD8 (ZA-0508), followed by incubation with fluorescently conjugated secondary antibodies at 37°C for 60 m. Nuclei were counterstained with DAPI, and fluorescence was quantified using epifluorescence microscopy.35
Animal experiments
Once cells reached confluence, they were harvested, centrifuged, and resuspended in pre-chilled PBS for counting. A total of 5 × 106 Hep3B cells were resuspended in 200 µL of pre-chilled PBS and implanted subcutaneously into the axillae of six-week-old male nude mice. Mice were housed under standard conditions with free access to food and water and maintained on a 12-h light/dark cycle. Tumor size was measured every three days starting on day 6 post-implantation. After three weeks, mice were euthanized, and tumors were excised, weighed, and recorded for analysis.36 All animal experiments were approved by the Ethics Committee of the Second Affiliated Hospital of Harbin Medical University (YJSDW2023-067).
Quantitative reverse transcription polymerase chain reaction analysis
Total RNA was extracted from Hep3B cells using TRIzol reagent (Invitrogen) and reverse-transcribed into cDNA with the Transcriptor First Strand cDNA Synthesis Kit (Roche, Penzberg, Germany). FastStart Universal SYBR Green Master (Roche) was used to amplify each sample in a 20 µL reaction mixture. The fold changes were converted using the 2−ΔΔCt technique. Expression levels were determined by calculating and normalizing them to the endogenous GAPDH.37 The primer sequences are listed in Table 1.
Table 1The sequences of primers
 | Genes | Forward primer (5′-3′) | Reverse primer (5′-3′) | 
|---|
| GAPDH | ACCGGGAAGGAAATGAATGG | CCCAATACGACCAAATCAGAGA | 
| B2M | AAAGATGAGTATGCCTGCCG | CGGCATCTTCAAACCTCCAT | 
| CXCL9 | CCAATACAGGAGTGACTTGGA | CTCACTACTGGGGTTCCTTG | 
| CXCL10 | AGTGGCATTCAAGGAGTACC | ACGTGGACAAAATTGGCTTG | 
| CXCL11 | ACAGTTGTTCAAGGCTTCCC | CTTGCTTGCTTCGATTTGGG | 
Statistical analysis
Data are presented as mean ± standard deviation from at least three independent experiments. Statistical differences between two groups were analyzed using an independent-samples t-test, while one-way ANOVA was used for comparisons among multiple groups. Analyses were performed with GraphPad Prism 8.0 and R version 4.2.3. Statistical significance was defined as p < 0.05.
Results
Weighted gene co-expression network construction
We analyzed normal and tumor samples from the TCGA-LIHC cohort to identify regulatory genes involved in HCC initiation. To delineate densely connected gene clusters within the microarray samples, correlation network analyses were performed. WGCNA was used to construct and systematically analyze active tumor-related networks. Following sample clustering, an appropriate threshold (cutHeight = 210) was applied to remove samples exhibiting conspicuous anomalies, as illustrated in Figure 1A. After outlier removal, a sample clustering dendrogram was constructed (Fig. 1B). For subsequent analyses, the top 8,000 genes were selected. An adjacency matrix was generated using a soft-threshold power of β = 8 (R2 = 0.85), ensuring a scale-free network topology for gene distribution, as depicted in Figure 1C, thereby preserving crucial connectivity information. Using parameters of minModuleSize = 30 and mergeCutHeight = 0.25, we identified 12 distinct modules (Fig. 1D). Module connectivity was calculated, and clustering analysis incorporating grouping information yielded a heatmap (Fig. 1E). To explore module–phenotype associations, we computed correlation coefficients between each module and tumor traits, revealing six modules with statistically significant links to HCC. Notably, the “blue” (r = 0.63, p = 4e−48) and “turquoise” (r = 0.55, p = 3e−34) modules showed the strongest correlations with tumor status (Fig. 1F). We then performed module membership and gene significance correlation analyses on these two modules, observing a strong positive correlation between module membership and gene significance (Fig. 1G). By merging gene sets from both modules, we compiled a final subset of 3826 module-associated genes for further investigation.
The identification of hub CCCRGs implicated in HCC development and associated with unfavorable prognoses
Differential expression analysis was performed to compare gene expression profiles between normal and tumor samples from the TCGA-LIHC dataset. Gene expression fold changes (FCs) were transformed to log2 scale, and a stringent cutoff of |log2FC| ≥ 2 with p < 0.05 was applied to identify differentially expressed genes. Under these criteria, 1,159 genes were identified as potentially involved in tumor progression, including 471 upregulated and 688 downregulated genes (Fig. 2A). Upregulated genes were prioritized for subsequent investigation. Similarly, differential expression analysis of tumor samples stratified by clinical stage was conducted, applying thresholds of |log2FC| ≥ 0.2 and p < 0.05. This yielded 918 stage-associated genes, including 387 upregulated and 531 downregulated genes (Fig. 2B), with upregulated genes selected for further analysis. To identify genes significantly influencing postoperative OS, survival analysis was performed on tumor samples, retaining genes with p < 0.01. Through integrative analysis of differentially expressed genes, module genes, and CCCRGs, 29 key genes were identified (Fig. 2C). LASSO Cox regression analysis was then employed to select the most informative diagnostic features, identifying 26 candidates (Fig. 2D and E). The RF algorithm was subsequently applied to rank gene importance, generating a list of 29 genes. The top 25 genes with the highest importance scores were selected for further investigation (Fig. 2F). By intersecting results from LASSO Cox regression and RF analyses, 22 optimal gene signatures were identified and defined as hub genes (Fig. 2G). Box plots illustrated differential expression of these hub genes between normal and tumor samples (Fig. 2H). ROC curve analysis demonstrated robust diagnostic performance (Supplementary Fig. 1). Correlation analysis revealed potential interactions among most hub genes (Fig. 2I), suggesting coordinated regulatory roles in tumor progression.
Comprehensive analysis of the risk score in HCC patients in conjunction with clinical parameters
We employed the TCGA-LIHC dataset as the training cohort to develop a predictive model by integrating hub gene expression profiles with patient survival data, while the ICGC-LIRI-JP dataset was utilized for external validation. Initially, the optimal cutoff point was determined within the training set (Fig. 3A). Based on this threshold, patients were stratified into high-risk and low-risk subgroups. Patients above the cutoff point exhibited significantly higher risk scores, lower survival rates, and elevated expression of hub genes (Fig. 3B). Cox regression analysis revealed a hazard ratio of 2.53 for the risk score (95% CI: 1.98–3.2; p < 0.001), surpassing the prognostic value of clinical factors such as age, gender, and stage (hazard ratio = 1.55; 95% CI: 1.24–1.9; p < 0.001), indicating the risk score’s robustness as a prognostic indicator (Fig. 3C). Kaplan–Meier analysis showed significantly reduced OS in the high-risk group (p < 0.0001), with very few survivors beyond five years (Fig. 3D). Similar results were obtained in the validation cohort (Supplementary Fig. 2A–D). To further refine prognostic prediction, we integrated the risk score with clinical factors (age, gender, and stage) to construct a nomogram (Fig. 3E). The calibration plot demonstrated strong agreement between nomogram-predicted and observed one-, three-, and five-year OS, confirming its reliability (Fig. 3F). ROC curve analysis further validated the predictive accuracy of the risk score for one-, three-, and five-year OS (Fig. 3G). Decision curve analysis emphasized the clinical utility of the risk score in guiding treatment decisions (Fig. 3H). As most patients in the validation cohort had OS between one and three years, we assessed time points at one, two, and three years, achieving consistent results (Supplementary Fig. 2E–H).
Analysis of immune cell infiltration and genetic mutation landscapes in model-delineated high- and low-risk groups
Using the ESTIMATE algorithm, we analyzed the infiltration levels of immune and stromal cell populations within the tumor microenvironment. As shown in Figure 4A–C, high-risk patients exhibited a significant reduction in stromal scores compared to their low-risk counterparts, while no notable differences were observed in immune scores or ESTIMATE scores between the two groups. Although these findings suggested comparable overall immune scores, they did not accurately reflect the infiltration status of individual immune cell subsets. To address this limitation, we utilized single sample gene set enrichment analysis to quantify the distribution of 28 immune cell subsets within tumor tissues from high- and low-risk cohorts (Fig. 4D). Notably, the high-risk group demonstrated elevated infiltration of type 2 T helper cells, NK T cells, effector memory CD4+ T cells, and activated CD4+ T cells. Conversely, monocytes, central memory CD4+ T cells, CD56dim NK cells, plasmacytoid dendritic cells (DCs), NK cells, effector memory CD8+ T cells, type 1 T helper cells, and eosinophils were more abundant in the low-risk group. Furthermore, a comparative analysis of immune cell exhaustion marker expression revealed significant upregulation in the high-risk cohort (Fig. 4E). We then conducted an in-depth analysis of genetic mutation profiles in the high- and low-risk groups, identifying the top 10 genes with the highest mutation frequencies. CTNNB1 exhibited the highest mutation frequency in the low-risk group (29%), while TP53 was the most frequently mutated gene in the high-risk group (64%) (Fig. 4F and G). Finally, tumor mutation burden analysis revealed a significantly higher tumor mutation burden in the high-risk group compared to the low-risk group (Fig. 4H and I).
CENPI was overexpressed in HCC tissues and cell lines and associated with poor prognosis
Among the 22 previously identified hub genes, the expression pattern and functional role of CENPI in HCC remain inadequately characterized, with limited validation and mechanistic insights into its contribution to disease progression. To address this gap, we focused our subsequent investigations on CENPI. Initially, we observed significant overexpression of CENPI in tumor samples from the TCGA-LIHC dataset (Fig. 2H). To further elucidate its expression pattern in patient tissues, we analyzed the single-cell sequencing dataset GSE149614. Using PC = 30 for dimensionality reduction via UMAP, followed by clustering at a resolution of 2, we identified 55 distinct subgroups from cells that passed QC (Supplementary Fig. 3A and B). These clusters were annotated as “B cell”, “CD4+ T cell”, “CD8+ T cell”, “DC”, “endothelial cell”, “fibroblast”, “hepatocyte”, “macrophage”, “monocyte”, and “NK cell” (Fig. 5A). Characteristic gene expression profiles for each cell type are presented in Supplementary Figure 3C and D. CENPI expression was predominantly detected in hepatocytes, with moderate levels in endothelial cells, DCs, and macrophages (Fig. 5B). To minimize the confounding effects of stromal cells, we extracted hepatocyte data for further analysis, which confirmed significantly elevated CENPI expression in tumor tissues compared to normal tissues across most samples. Interestingly, elevated CENPI expression was also observed in the metastatic lymph nodes and portal vein tumor thrombi of some patients, suggesting a potential role in distant metastasis (Fig. 5C). These bioinformatics findings were further validated in vitro, confirming upregulated CENPI expression in HCC tissues and cell lines (Fig. 5D). Using predefined expression thresholds, we evaluated the correlation between CENPI expression and TNM stage, revealing that high CENPI expression correlated with advanced disease (Fig. 5E). Moreover, survival analysis revealed that patients with elevated CENPI expression exhibited poorer OS (Fig. 5F and G). Gene set enrichment analysis demonstrated that CENPI overexpression was closely related to HCC proliferation, metastasis, and EMT (Fig. 5H).
CENPI facilitated HCC cell proliferation
To investigate the functional role of CENPI in tumor cell behavior, we selected Hep3B (TP53-null) and HCCLM3 (highly metastatic) cell lines for silencing experiments, given their relatively high endogenous CENPI expression. Efficient CENPI knockdown was achieved in both cell lines, as confirmed by experimental validation (Fig. 6A and B). The two clones with the strongest knockdown efficiency from each cell line were selected for subsequent experiments. A series of in vitro assays, including CCK-8 proliferation, EdU incorporation, and colony formation, consistently revealed that CENPI silencing significantly impaired proliferative capacity and clonogenic potential, consistent with the bioinformatics analyses (Fig. 6C–J). Furthermore, flow cytometry showed that CENPI knockdown induced apoptosis in HCC cells, supporting its pro-tumorigenic role (Fig. 6K–N). In an in vivo subcutaneous xenograft model, CENPI knockdown markedly suppressed tumor growth, as evidenced by significantly reduced tumor weight and volume compared to controls (Fig. 6O and P).
CENPI promoted HCC cell migration, invasion, and EMT
To assess the effects of CENPI knockdown on tumor cell motility and invasiveness, we performed wound healing and transwell assays. The wound healing assay revealed a significant reduction in migratory capacity in CENPI knockdown groups compared to controls (Fig. 7A–D). Similarly, transwell invasion assays demonstrated markedly reduced invasive potential, with fewer cells traversing the membrane in knockdown groups versus controls (Fig. 7E and F). Additionally, CENPI suppression significantly altered EMT marker expression, downregulating mesenchymal markers while upregulating epithelial markers, indicating impaired metastatic potential (Fig. 7G–J).
CENPI promoted the malignant biological behavior of HCC cells via the Hippo pathway
We investigated the molecular mechanisms underlying CENPI’s role in HCC proliferation and metastasis. Gene set variation analysis of CENPI confirmed its significant enrichment in the Hippo pathway (Fig. 8A). The Hippo pathway played a pivotal role in regulating liver size, regeneration, stem cell self-renewal, and HCC progression.38 Extensive previous research has demonstrated a close association between the Hippo pathway and HCC proliferation and metastasis.39,40 For example, reduced succinate dehydrogenase (SDH) activity has been linked to succinate accumulation and poor HCC prognosis, with SDHA and SDHB downregulation promoting HCC progression via the YAP1/TAZ oncogenic signaling axis through impaired proteasomal degradation pathways.41 In our study, CENPI knockdown significantly increased phosphorylated YAP levels while decreasing total YAP expression in HCC cells (Fig. 8B–E). Furthermore, CENPI suppression reduced nuclear localization of YAP (Fig. 8F and G). These findings suggest that CENPI regulates the Hippo signaling pathway by inhibiting YAP phosphorylation and facilitating its nuclear translocation, thereby promoting HCC cell proliferation and metastatic potential.
CENPI mediated immune escape in HCC patients
We analyzed the impact of CENPI on the TIME in HCC patients by extracting all tumor samples from the single-cell sequencing dataset GSE149614. Using PC = 30, we performed UMAP dimensionality reduction on cells passing QC and identified 62 subgroups at a resolution of 3 (Supplementary Fig. 4A and B). These 62 clusters were classified as “B cell”, “CD4+ T cell”, “CD8+ T cell”, “DC”, “endothelial cell”, “fibroblast”, “HCC”, “macrophage”, “monocyte”, and “NK cell” (Fig. 9A). Characteristic gene expressions for each cell type are depicted in Supplementary Figure 4C and D. To exclude the influence of CENPI expression in other cell types, we extracted the HCC subset from the tumor samples and analyzed CENPI expression. Patients were classified into low- and high-CENPI expression groups based on their expression profiles (Fig. 9B). Figure 9C shows the proportion of different cell types in the two groups. A notable rise in the percentage of HCC cells was observed in the high-expression cohort, consistent with the finding that CENPI enhanced tumor proliferation. Furthermore, the low-expression group exhibited increased proportions of endothelial cells and fibroblasts. Notably, several immune cell types, including NK cells, DCs, and CD4+ T cells, were markedly reduced in the high-expression group. Multiparametric immune profiling revealed significant associations between CENPI expression and infiltration of 28 immune cell subsets within the TCGA-LIHC cohort (Fig. 9D and E). Most cell types showed reduced infiltration in the high-CENPI group, including monocytes, central memory CD4+ T cells, CD56dim NK cells, plasmacytoid DCs, central memory CD8+ T cells, immature DCs, NK cells, gamma delta T cells, activated CD8+ T cells, effector memory CD8+ T cells, type 1 T helper cells, eosinophils, and neutrophils. Given the critical roles of these cells in antitumor immunity, these findings suggested that CENPI might promote immune escape in HCC. Furthermore, patients were stratified into high- and low-CENPI expression groups based on their tumor tissue CENPI levels. Immunohistochemical analysis of corresponding paraffin-embedded sections revealed significantly reduced CD8+ T cell infiltration in tumors with elevated CENPI expression (Fig. 9F). Analysis of immune exhaustion markers and T cell exhaustion status showed that the high-CENPI group exhibited higher levels of exhaustion markers and a greater number of exhausted T cells (Fig. 9G–I). The distribution of exhaustion markers is depicted in Supplementary Figure 5.
CENPI affected cell-to-cell interactions in the TIME
To elucidate the mechanisms of CENPI-mediated immune escape in HCC, we conducted a comprehensive analysis of intercellular communication networks. While no significant differences were observed in the total number of cell-cell interactions between high- and low-CENPI groups, the strength of these interactions was notably diminished in the high-expression cohort (Fig. 10A). Supplementary Figure 6A and B provide detailed visualizations of intercellular interactions, with line thickness indicating interaction frequency and intensity. We integrated interaction data from both groups to highlight differences, using red to denote stronger interactions in the CENPI-overexpressing group and blue for weaker interactions (Fig. 10B). Figure 10C and Supplementary Figure 6C and D show the activation status of signaling pathways, while Supplementary Figure 6E displays the senders and receivers of cell signals in both groups. Given the pivotal role of MHC-I, CXCL, and CCL signaling pathways in immune cell recruitment, we focused on these pathways. In the high-CENPI group, MHC-I signaling was significantly weaker, indicating that antigen-presenting cells presented tumor antigens to effector cells less frequently. For CXCL signaling, enhanced CD8+ T cell recruitment by other cell types was observed in the low-CENPI group (Fig. 10D and E). In contrast, CCL signaling analysis revealed no significant differences in effector cell recruitment (Supplementary Fig. 6F). Further analysis of signaling pathway-associated molecules revealed that CENPI knockdown in HCC cells significantly upregulated B2M (a key component of MHC-I) and the chemokines CXCL9 and CXCL10 (Fig. 10F). These findings provide mechanistic insight into CENPI’s role in immune escape.
Discussion
In clinical practice, pathological staging remains the cornerstone for predicting long-term survival and guiding treatment decisions in HCC patients. However, due to the inherent heterogeneity of HCC, patients within the same pathological stage often exhibit divergent clinical outcomes, highlighting the limitations of traditional staging systems for personalized prognosis and therapy. The advent of next-generation sequencing has revolutionized cancer prognosis prediction,42 with an increasing number of mRNA- and non-coding RNA-based prognostic biomarkers being developed to forecast patient survival outcomes.43,44 CCCs play a critical role in maintaining genomic stability by ensuring accurate DNA replication and proper chromosome segregation during eukaryotic cell division. Within the various stages of the cell cycle, CCCs include DNA damage response checkpoints, DNA replication stress response checkpoints, and mitotic checkpoints. Dysregulation of CCCRGs at any of these stages can disrupt genetic material transmission, potentially leading to uncontrolled cell proliferation or cell death.45 In this study, we systematically analyzed mRNA expression profiles of 292 CCCRGs in HCC. We subsequently developed and rigorously validated a prognostic model incorporating 22 CCCRGs. Comprehensive evaluation demonstrated that this model effectively predicts OS in HCC patients, providing valuable insights to inform personalized treatment strategies. This approach addresses the limitations of conventional staging systems by capturing molecular heterogeneity within HCC, offering a more precise tool for clinical decision-making and patient stratification.
Over the past decades, accumulating evidence has highlighted the critical role of the CENP family in regulating cancer initiation and progression.46–48 Among its members, CENPI exhibits aberrant expression across multiple malignancies. For example, studies have demonstrated its upregulation in gastric cancer, where it promotes tumor proliferation and migration.49 Similarly, elevated CENPI expression in lung adenocarcinoma (LUAD) has been shown to drive tumor growth while suppressing apoptosis, with the added finding that CENPI modulates immune cell infiltration in the LUAD tumor microenvironment, a previously unreported observation.26 Despite these insights, the precise functions of CENPI in HCC remain poorly characterized. In this study, we addressed this knowledge gap through a comprehensive investigation combining multi-omics analysis of TCGA-LIHC and GSE149614 datasets with extensive in vitro and in vivo experimental validation. Our findings provide novel mechanistic insights into CENPI’s role in HCC pathogenesis and tumor immune regulation, establishing a foundation for future therapeutic targeting strategies.
Initially, we observed significantly elevated CENPI protein expression in both HCC tumor tissues and cell lines compared to normal controls. Subsequent functional experiments demonstrated that CENPI knockdown markedly suppressed malignant phenotypes, including reduced tumor cell proliferation, migration, invasion, and EMT, while simultaneously enhancing apoptotic susceptibility. Mechanistically, CENPI facilitated the malignant biological behaviors of HCC by regulating the Hippo pathway. Given that immunotherapy has emerged as a vital treatment modality for HCC,11,50 and the well-established influence of the TIME on therapeutic response and clinical outcomes, we further investigated the effects of CENPI on the infiltration and exhaustion status of immune cells within HCC tumor tissue. Patients with high CENPI expression frequently exhibited reduced presence of immune effector cells, such as effector CD8+ T cells and NK cells, within their tumors. Moreover, these immune cells were often in a more advanced state of exhaustion, suggesting that elevated CENPI expression may contribute to immune evasion by weakening antitumor immune responses. The underlying mechanisms appear to involve two key processes: (1) impaired antigen presentation by antigen-presenting cells in CENPI-high tumors and (2) diminished chemokine-mediated recruitment of CD8+ T cells to the tumor site. Nevertheless, further in-depth research and experimental validation are required. These findings partially align with observations in LUAD, where high CENPI expression was associated with decreased CD8+ T cell and NKT cell infiltration, coupled with increased myeloid-derived suppressor cell accumulation. Collectively, our results and existing literature indicate that CENPI functions as an oncogenic driver across multiple malignancies by fostering an immunosuppressive tumor microenvironment that hinders effective immune surveillance and clearance of cancer cells. These insights position CENPI as a potential therapeutic target for reversing immune suppression and enhancing immunotherapy efficacy in HCC.
Supporting information
Supplementary Fig. 1
ROC curves of hub genes. AUC, Area under curve.
(JPG)
Supplementary Fig. 2
Validation of the risk scoring model in the testing set.
(A) The optimal cutoff point calculated within the testing set. (B) A dot plot of the risk score distribution, a scatter plot of patient survival status, and a heat map of the model’s gene expression. (C) Cox regression analysis. (D) Survival analysis. (E) A nomogram. (F) A calibration curve for the nomogram. (G) ROC curves for one-, two-, and three-year OS. (H) DCA for one, two, and three years. AUC, Area under curve; DCA, Decision curve analysis; OS, Overall survival; ROC, Receiver operating characteristic.
(JPG)
Supplementary Fig. 3
Preprocessing of all sample data from GSE149614.
(A) PCA performed on all sample data from GSE149614. (B) The UMAP algorithm used to segment the cells from all sample data into 55 distinct clusters. (C) Characteristic gene expressions for each cluster. (D) Characteristic gene expressions for each cell type. DC, Dendritic cell; NK, Natural killer; PCA, Principal component analysis; UMAP, Uniform manifold approximation and projection.
(JPG)
Supplementary Fig. 4
Preprocessing of tumor sample data from GSE149614.
(A) PCA performed on tumor sample data from GSE149614. (B) The UMAP algorithm used to segment the cells from tumor sample data into 62 distinct clusters. (C) Characteristic gene expressions for each cluster. (D) Characteristic gene expressions for each cell type. DC, Dendritic cell; HCC, Hepatocellular carcinoma; NK, Natural killer; PCA, Principal component analysis; UMAP, Uniform manifold approximation and projection.
(JPG)
Supplementary Fig. 5
Distribution of exhaustion markers in the CENPI-high and CENPI-low groups.
UMAP, Uniform manifold approximation and projection.
(JPG)
Supplementary Fig. 6
Differential cell-to-cell interactions between CENPI-high and CENPI-low groups.
(A) Network diagram showing the number of interactions between the two groups. (B) Network diagram showing the strength of interactions between the two groups. (C) Outgoing signaling patterns in the two groups. (D) Incoming signaling patterns in the two groups. (E) A dot plot showing the senders and receivers of cell signals in the two groups. (F) Network plots of the CCL signaling pathway. DC, Dendritic cell; EC, Endothelial cell; HCC, Hepatocellular carcinoma; NK, Natural killer.
(JPG)