Machine learning reveals chronic graft--host disease phenotypes and stratifies survival after stem cell transplant for hematologic malignancies.

TitleMachine learning reveals chronic graft--host disease phenotypes and stratifies survival after stem cell transplant for hematologic malignancies.
Publication TypeJournal Article
Year of Publication2019
AuthorsGandelman, JS, Byrne, MT, Mistry, AM, Polikowsky, HG, Diggins, KE, Chen, H, Lee, SJ, Arora, M, Cutler, C, Flowers, M, Pidala, J, Irish, JM, Jagasia, MH
JournalHaematologica
Volume104
Issue1
Pagination189-196
Date Published2019 Jan
ISSN1592-8721
KeywordsAdult, Biomarkers, Chronic Disease, Consensus, Female, Graft vs Host Disease, Hematologic Neoplasms, Hematopoietic Stem Cell Transplantation, Humans, Machine Learning, Male, Middle Aged, National Institutes of Health (U.S.), Prospective Studies, Transplantation, Homologous, United States
Abstract

The application of machine learning in medicine has been productive in multiple fields, but has not previously been applied to analyze the complexity of organ involvement by chronic graft--host disease. Chronic graft--host disease is classified by an overall composite score as mild, moderate or severe, which may overlook clinically relevant patterns in organ involvement. Here we applied a novel computational approach to chronic graft--host disease with the goal of identifying phenotypic groups based on the subcomponents of the National Institutes of Health Consensus Criteria. Computational analysis revealed seven distinct groups of patients with contrasting clinical risks. The high-risk group had an inferior overall survival compared to the low-risk group (hazard ratio 2.24; 95% confidence interval: 1.36-3.68), an effect that was independent of graft--host disease severity as measured by the National Institutes of Health criteria. To test clinical applicability, knowledge was translated into a simplified clinical prognostic decision tree. Groups identified by the decision tree also stratified outcomes and closely matched those from the original analysis. Patients in the high- and intermediate-risk decision-tree groups had significantly shorter overall survival than those in the low-risk group (hazard ratio 2.79; 95% confidence interval: 1.58-4.91 and hazard ratio 1.78; 95% confidence interval: 1.06-3.01, respectively). Machine learning and other computational analyses may better reveal biomarkers and stratify risk than the current approach based on cumulative severity. This approach could now be explored in other disease models with complex clinical phenotypes. External validation must be completed prior to clinical application. Ultimately, this approach has the potential to reveal distinct pathophysiological mechanisms that may underlie clusters. .

DOI10.3324/haematol.2018.193441
Alternate JournalHaematologica
PubMed ID30237265
PubMed Central IDPMC6312024
Grant ListU54 CA217450 / CA / NCI NIH HHS / United States
P30 CA077598 / CA / NCI NIH HHS / United States
R01 CA118953 / CA / NCI NIH HHS / United States
R00 CA143231 / CA / NCI NIH HHS / United States
U24 CA076518 / CA / NCI NIH HHS / United States
U24 HL138660 / HL / NHLBI NIH HHS / United States