Background: Prognostic stratification of hepatocellular carcinoma (HCC) remains difficult because the disease is highly heterogeneous and complete matched multi-omics data are not always available in clinical cohorts. Objective: To develop a weakly supervised multi-omics framework that derives prognostic subtype labels from a public reference cohort and transfers them to cohorts with incomplete data. Methods: This study has analyzed 363 patients in TCGA-LIHC with matched mRNA, miRNA, DNA methylation, and clinical data. Overall-survival-related features were selected by univariate Cox regression, integrated by similarity network fusion (SNF), and clustered by spectral clustering to generate pseudo-labels. TCGA-LIHC was then split 6:4 into training and test sets for supervised modeling. External validation used LIRI-JP, GSE14520, GSE54236, and GSE31384, with models rebuilt on features shared with each cohort. Prognostic performance was evaluated by Kaplan-Meier analysis, log-rank testing, and the concordance index (C-index). Results: A total of 3,890 mRNA features, 150 miRNA features, and 1,889 methylation features were retained. SNF plus spectral clustering identified two subtypes: S1 (n = 257) and S2 (n = 106). S2 had significantly worse overall survival than S1 (log-rank P = 3.891 × 10-9; C-index = 0.866). In internal validation, XGBoost showed the highest AUC (0.983). In external validation, the predicted subtypes remained prognostically informative, with C-index values of 0.857 in LIRI-JP, 0.875 in GSE14520, 0.930 in GSE54236, and 0.883 in GSE31384. Conclusions: In the public datasets included in this study, this weakly supervised framework identified two prognostically distinct HCC subtypes and retained prognostic discrimination after transfer to external cohorts with incomplete data.
Bertuccio P, Turati F, Carioli G, et al., 2017, Global Trends and Predictions in Hepatocellular Carcinoma Mortality. Journal of Hepatology, 67(2): 302–309.
Villanueva A, 2019, Hepatocellular Carcinoma. New England Journal of Medicine, 380(15): 1450–1462.
Wu Y, Liu Z, Xu X, 2020, Molecular Subtyping of Hepatocellular Carcinoma: A Step Toward Precision Medicine. Cancer Communications, 40(12): 681–693.
Zhang Q, Lou Y, Yang J, et al., 2019, Integrated Multiomic Analysis Reveals Comprehensive Tumour Heterogeneity and Novel Immunophenotypic Classification in Hepatocellular Carcinomas. Gut, 68(11): 2019–2031.
Lee J, Chu I, Heo J, et al., 2004, Classification and Prediction of Survival in Hepatocellular Carcinoma by Gene Expression Profiling. Hepatology, 40(3): 667–676.
Hoshida Y, Nijman S, Kobayashi M, et al., 2009, Integrative Transcriptome Analysis Reveals Common Molecular Subclasses of Human Hepatocellular Carcinoma. Cancer Research, 69(18): 7385–7392.
Toffanin S, Hoshida Y, Lachenmayer A, et al., 2011, MicroRNA-Based Classification of Hepatocellular Carcinoma and Oncogenic Role of miR-517a. Gastroenterology, 140(5): 1618–1628.
Wang B, Mezlini A, Demir F, et al., 2014, Similarity Network Fusion for Aggregating Data Types on a Genomic Scale. Nature Methods, 11(3): 333–337.