Hepatocellular Carcinoma Prediction using Large Language Model and Meditron

Suleiman Zeila Dubow; Abraham Matheka

doi:10.36948/ijfmr.2026.v08i03.79040

Hepatocellular Carcinoma Prediction using Large Language Model and Meditron

Author(s)	Mr. Suleiman Zeila Dubow, Dr. Abraham Matheka
Country	Kenya
Abstract	This project investigated the integration of machine learning, large language models (LLMs), and the Meditron medical language model to enhance early prediction and interpretation of hepatocellular carcinoma (HCC) in resource-constrained healthcare settings such as Kenya, where late diagnosis and limited access to advanced diagnostic tools remain major challenges. Using a design science and experimental research approach, an AI-driven clinical decision-support prototype was developed to process unstructured clinical conversations, extract relevant medical features, predict HCC risk, and generate clinically interpretable explanations. A Random Forest classifier was trained on a balanced secondary clinical dataset comprising equal proportions of HCC and non-HCC cases, with preprocessing techniques including median imputation and feature scaling applied to ensure data quality and model robustness. The trained model achieved an accuracy of 0.8780, an F1-score of 0.8780, and an Area Under the Receiver Operating Characteristic Curve (AUC) of 0.9452, demonstrating strong predictive performance. We applied an LLM to convert free-text clinical notes into structured features suitable for the trained predictive model allowing automated HCC risk prediction from conversational input. Then the prediction outputs comprising probabilistic risk scores were submitted to the Meditron model that gave medically grounded interpretations of the results by contextualizing anticipated risk in respect to known clinical indications of HCC. We integrated LLM-based data extraction, machine learning-based prediction, and Meditron-driven interpretation, which enhanced transparency, explainability, and clinical relevance of the system. Results demonstrate that hybrid AI architectures that combine predictive modeling and domain-specific reasoning with medical language can provide accurate, interpretable, and scalable tools for early HCC risk assessment with great potential to support clinical decision-making and improve cancer care delivery in low-resource healthcare settings.
Field	Computer Applications
Published In	Volume 8, Issue 3, May-June 2026
Published On	2026-05-23
DOI	https://doi.org/10.36948/ijfmr.2026.v08i03.79040

View / Download PDF File

E-ISSN 2582-2160

doi

CrossRef DOI is assigned to each research paper published in our journal.

IJFMR DOI prefix is
10.36948/ijfmr

Downloads

Research Paper Format Copyright Permission Form and Undertaking Form Cover Page Vol 8 Isu 3 Cover Page Vol 8 Isu 2 Cover Page Vol 8 Isu 2

All research papers published on this website are licensed under Creative Commons Attribution-ShareAlike 4.0 International License, and all rights belong to their respective authors/researchers.

CC-BY-SA

About IJFMR Fees & Payment Current Issue Publication Archive	Submit Research Paper Track Submission Status Publication Guidelines Publication Ethics Peer Review & Plagiarism	Join as a Reviewer Editors & Reviewers Reviewer Referral Program Get Reviewer Membership Certi.	Website/Journal Policies Usage Policy Content Policies Privacy Policy

Contact Us		+91-9687-828-838	editor@ijfmr.com

International Journal For Multidisciplinary Research

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Hepatocellular Carcinoma Prediction using Large Language Model and Meditron

Share this