This study integrates Kaplan–Meier survival analysis with the Stochastic and Augmented Interpretable Health Analytics (SAIHA) framework to model long-term survival in pancreatic cancer, a malignancy characterized by late diagnosis, rapid progression, and poor prognosis. The Kaplan–Meier estimator was first employed to nonparametrically characterize empirical survival probabilities across the observed follow-up period, capturing censoring patterns and short-term mortality dynamics without imposing distributional assumptions. This step provided a transparent baseline representation of survival up to approximately four years post-diagnosis, where empirical data density remains sufficient for reliable estimation. To address the limitations of traditional Kaplan–Meier analysis in extrapolating beyond observed follow-up, the SAIHA framework was then applied using a Weibull survival model to propagate uncertainty, incorporate population heterogeneity, and generate probabilistic survival projections into the long-term horizon. The Weibull distribution was selected for its flexibility in modeling monotonic hazard functions commonly observed in aggressive cancers and for its interpretability within clinical contexts. Parameter uncertainty was explicitly modeled to reflect variability in disease progression and treatment response across patients. The combined model predicts a pronounced decline in survival beyond year four, with the most likely five-year survival probability estimated near 3% and a median six-year survival approaching 1.5%. These projections align with known epidemiological patterns of pancreatic cancer and underscore the persistent lethality of the disease despite advances in therapy. Importantly, the SAIHA framework provides full survival distributions rather than point estimates, enabling clinicians and researchers to assess uncertainty bounds and tail risks associated with long-term outcomes. Overall, the integrated Kaplan–Meier–SAIHA approach extends classical survival analysis by combining empirical rigor with stochastic, distribution-aware forecasting. This methodology offers a robust and interpretable framework for high-risk clinical prediction, supporting more informed decision-making in oncology research, population health modeling, and precision medicine applications.
| Published in | Cancer Research Journal (Volume 13, Issue 4) |
| DOI | 10.11648/j.crj.20251304.14 |
| Page(s) | 173-185 |
| Creative Commons |
This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
| Copyright |
Copyright © The Author(s), 2025. Published by Science Publishing Group |
Prediction Analytics, Kaplan-Meier Method, Weibull Distribution, SAIHA Framework
| [1] | Kaplan, E. L. (1958). Nonparametric Estimation from Incomplete Observations. Journal of the American Statistical Association: |
| [2] | Schober P, V. T. (2018). Survival Analysis and Interpretation of Time-to-Event Data. National Library of Medicine: |
| [3] | Cox, D. R. (1972). Regression models and life‐tables. Journal of the Royal Statistical Society: Series B (Methodological), 34(2), 187–220. |
| [4] | Schober P, Vetter TR. Survival Analysis and Interpretation of Time-to-Event Data: The Tortoise and the Hare. Anesth Analg. 2018 Sep; 127(3): 792-798. |
| [5] | Pang M, Platt RW, Schuster T, Abrahamowicz M. Spline-based accelerated failure time model. Stat Med. 2021 Jan 30; 40(2): 481-497. |
| [6] | Fleming, S. T. (2020). Managerial Epidemiology: Cases and Concepts (4th ed.). Health Administration Press. Harrell, F. E. (2015). Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis (2nd ed.). Springer. |
| [7] | Arcuri LJ, Souza Santos FP, Perini GF, Hamerschlak N. (2020). Fine and Gray or Cox model? Blood Adv. 2024 Mar 26; 8(6): 1420-1421. |
| [8] | She Y, Jin Z, Wu J, Deng J, Zhang L, Su H, Jiang G, Liu H, Xie D, Cao N, Ren Y, Chen C. (2020). Development and Validation of a Deep Learning Model for Non-Small Cell Lung Cancer Survival. JAMA Netw Open. 2020 Jun 1; 3(6): e205842. |
| [9] | Steingrimsson JA, Morrison S. (2020). Deep learning for survival outcomes. Stat Med. Author manuscript. 2020 Apr 13; 39(17): 2339-2349. |
| [10] | Lee C, Yoon J, Schaar MV. Dynamic-DeepHit: A Deep Learning Approach for Dynamic Survival Analysis With Competing Risks Based on Longitudinal Data. IEEE Trans Biomed Eng. 2020 Jan; 67(1): 122-133. |
| [11] | De Melo, P (2025). Prediction Modeling: Basic Metabolic Panel. Advances in Bioscience and Biotechnology, 16, 360-378. |
| [12] |
National Cancer Institute. (2023). Pancreatic cancer treatment (PDQ®)–Patient version.
https://www.cancer.gov/types/pancreatic/patient/pancreatic-treatment-pdq |
| [13] | American Cancer Society. (2025). Survival rates for pancreatic cancer. |
| [14] | Siegel, R. L., Miller, K. D., Wagle, N. S., & Jemal, A. (2024). Cancer statistics, 2024. CA: A Cancer Journal for Clinicians, 74(1), 12–43. |
| [15] | Bakasa W, Viriri S. Pancreatic Cancer Survival Prediction: A Survey of the State-of-the-Art. Comput Math Methods Med. 2021 Sep 30; 2021:1188414. |
| [16] | Osareh A., Shadgar B. Machine learning techniques to diagnose breast cancer. 2010 5th international symposium on health informatics and bioinformatics; 2010; Ankara, Turkey. pp. 114–120. |
| [17] | Safiyari A., Javidan R. Predicting lung cancer survivability using ensemble learning methods. in 2017 Intelligent Systems Conference (IntelliSys); 2017; London, UK. pp. 684–688. |
| [18] | Osman M. H. Pancreatic cancer survival prediction using machine learning and comparing its performance with TNM staging system and prognostic nomograms. AACR Annual Meeting 2019; 2019; Atlanta, GA. |
| [19] | Song W., Miao D.-L., Chen L. Nomogram for predicting survival in patients with pancreatic cancer. Oncotargets and Therapy. 2018; 11:539–545. |
| [20] | Chaves DO, Bastos AC, Almeida AM, Guerra MR, Teixeira MTB, Melo APS, Passos VMA. The increasing burden of pancreatic cancer in Brazil from 2000 to 2019: estimates from the Global Burden of Disease Study 2019. Rev Soc Bras Med Trop. 2022 Jan 28; 55(suppl 1): e0271. |
| [21] | De Melo, P., (2024) Public Health Informatics and Technology, Library of Congress. Washington DC. |
| [22] | Lanjewar, MG., Panchbhai, KG., Patle, LB., (2024), Fusion of transfer learning models with LSTM for detection of breast cancer using ultrasound images, Computers in biology and medicine, 2024-02, Vol. 169, p. 107914-107914, Article 107914. |
| [23] | de Melo, P. and St. Rose, M. (2025) Accurate Classification of Diabetes via PM Generative AI. Advances in Bioscience and Biotechnology, 16, 379-409. |
| [24] | Ying GS, Heitjan DF. Weibull prediction of event times in clinical trials. Pharm Stat. 2008 Apr-Jun; 7(2): 107-20. |
APA Style
Melo, P. D., DiLella, M., Holman, T., McElveen, S. (2025). Accurate Prediction of Survival Based on Kaplan–Meier Analytics. Cancer Research Journal, 13(4), 173-185. https://doi.org/10.11648/j.crj.20251304.14
ACS Style
Melo, P. D.; DiLella, M.; Holman, T.; McElveen, S. Accurate Prediction of Survival Based on Kaplan–Meier Analytics. Cancer Res. J. 2025, 13(4), 173-185. doi: 10.11648/j.crj.20251304.14
@article{10.11648/j.crj.20251304.14,
author = {Philip de Melo and Michele DiLella and Tameka Holman and Shakira McElveen},
title = {Accurate Prediction of Survival Based on Kaplan–Meier Analytics},
journal = {Cancer Research Journal},
volume = {13},
number = {4},
pages = {173-185},
doi = {10.11648/j.crj.20251304.14},
url = {https://doi.org/10.11648/j.crj.20251304.14},
eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.crj.20251304.14},
abstract = {This study integrates Kaplan–Meier survival analysis with the Stochastic and Augmented Interpretable Health Analytics (SAIHA) framework to model long-term survival in pancreatic cancer, a malignancy characterized by late diagnosis, rapid progression, and poor prognosis. The Kaplan–Meier estimator was first employed to nonparametrically characterize empirical survival probabilities across the observed follow-up period, capturing censoring patterns and short-term mortality dynamics without imposing distributional assumptions. This step provided a transparent baseline representation of survival up to approximately four years post-diagnosis, where empirical data density remains sufficient for reliable estimation. To address the limitations of traditional Kaplan–Meier analysis in extrapolating beyond observed follow-up, the SAIHA framework was then applied using a Weibull survival model to propagate uncertainty, incorporate population heterogeneity, and generate probabilistic survival projections into the long-term horizon. The Weibull distribution was selected for its flexibility in modeling monotonic hazard functions commonly observed in aggressive cancers and for its interpretability within clinical contexts. Parameter uncertainty was explicitly modeled to reflect variability in disease progression and treatment response across patients. The combined model predicts a pronounced decline in survival beyond year four, with the most likely five-year survival probability estimated near 3% and a median six-year survival approaching 1.5%. These projections align with known epidemiological patterns of pancreatic cancer and underscore the persistent lethality of the disease despite advances in therapy. Importantly, the SAIHA framework provides full survival distributions rather than point estimates, enabling clinicians and researchers to assess uncertainty bounds and tail risks associated with long-term outcomes. Overall, the integrated Kaplan–Meier–SAIHA approach extends classical survival analysis by combining empirical rigor with stochastic, distribution-aware forecasting. This methodology offers a robust and interpretable framework for high-risk clinical prediction, supporting more informed decision-making in oncology research, population health modeling, and precision medicine applications.},
year = {2025}
}
TY - JOUR T1 - Accurate Prediction of Survival Based on Kaplan–Meier Analytics AU - Philip de Melo AU - Michele DiLella AU - Tameka Holman AU - Shakira McElveen Y1 - 2025/12/29 PY - 2025 N1 - https://doi.org/10.11648/j.crj.20251304.14 DO - 10.11648/j.crj.20251304.14 T2 - Cancer Research Journal JF - Cancer Research Journal JO - Cancer Research Journal SP - 173 EP - 185 PB - Science Publishing Group SN - 2330-8214 UR - https://doi.org/10.11648/j.crj.20251304.14 AB - This study integrates Kaplan–Meier survival analysis with the Stochastic and Augmented Interpretable Health Analytics (SAIHA) framework to model long-term survival in pancreatic cancer, a malignancy characterized by late diagnosis, rapid progression, and poor prognosis. The Kaplan–Meier estimator was first employed to nonparametrically characterize empirical survival probabilities across the observed follow-up period, capturing censoring patterns and short-term mortality dynamics without imposing distributional assumptions. This step provided a transparent baseline representation of survival up to approximately four years post-diagnosis, where empirical data density remains sufficient for reliable estimation. To address the limitations of traditional Kaplan–Meier analysis in extrapolating beyond observed follow-up, the SAIHA framework was then applied using a Weibull survival model to propagate uncertainty, incorporate population heterogeneity, and generate probabilistic survival projections into the long-term horizon. The Weibull distribution was selected for its flexibility in modeling monotonic hazard functions commonly observed in aggressive cancers and for its interpretability within clinical contexts. Parameter uncertainty was explicitly modeled to reflect variability in disease progression and treatment response across patients. The combined model predicts a pronounced decline in survival beyond year four, with the most likely five-year survival probability estimated near 3% and a median six-year survival approaching 1.5%. These projections align with known epidemiological patterns of pancreatic cancer and underscore the persistent lethality of the disease despite advances in therapy. Importantly, the SAIHA framework provides full survival distributions rather than point estimates, enabling clinicians and researchers to assess uncertainty bounds and tail risks associated with long-term outcomes. Overall, the integrated Kaplan–Meier–SAIHA approach extends classical survival analysis by combining empirical rigor with stochastic, distribution-aware forecasting. This methodology offers a robust and interpretable framework for high-risk clinical prediction, supporting more informed decision-making in oncology research, population health modeling, and precision medicine applications. VL - 13 IS - 4 ER -