Can machine learning algorithms identify patients at risk of a delay in starting cancer treatment?

Download PDF Copy

By Dr. Sanchari Sinha Dutta, Ph.D.Reviewed by Benedette Cuffari, M.Sc.Aug 17 2023

A study published in JAMA Network Open describes the utility of multi-level machine learning models in estimating the risk of delay between cancer diagnosis and treatment initiation in a large group of cancer patients.

Study: Development of a Multilevel Model to Identify Patients at Risk for Delay in Starting Cancer Treatment. Image Credit: Peshkova / Shutterstock.com

Background

Cancer patients with poor socioeconomical background and those living in low-resource neighborhoods often experience delays in treatment initiation after diagnosis, which significantly affects clinical outcomes.

The timely implementation of effective treatments can be achieved by identifying patients who are at an increased risk of health disparities. This must be accompanied by improvements in care coordination and patient navigation services; however, these approaches are resource intensive. Thus, a more effective approach would be identifying patients who are at a greater risk of diagnostic delays and subsequently targeting them for timely treatment.

In the current study, scientists evaluate whether machine learning models incorporating clinical and demographic data of cancer patients and neighborhood-level social determinants of health data can be used to identify patients who are at a greater risk for treatment initiation delay.

About the study

The researchers investigated the predictive efficacy of four different machine learning models, including group least absolute shrinkage and selection operator, Bayesian additive regression tree, gradient boosting, and random forest. Adult patients with breast, lung, colorectal, bladder, or kidney cancer who were diagnosed between 2013 and 2019 and subsequently treated at Fox Chase Cancer Center in Philadelphia were included in the study.

Patient data related to cancer diagnosis-first treatment interval, health and demographic characteristics including race, ethnicity, laboratory findings, and comorbidities, as well as neighborhood-level health variables, were incorporated into the machine learning models.

Based on a previous observation that a 60-day delay between diagnosis and treatment initiation can increase cancer mortality, scientists investigated whether these models can predict the likelihood of a treatment delay of more than 60 days after diagnosis.

Three factors, including discrimination, calibration, and interoperability, were applied to select the optimal machine learning model for the study analysis. This led to the selection of group least absolute shrinkage and selection operator (LASSO) as the final model.

Important observations

A total of 6,409 patients were included in the study, 14% of whom belonged to the most socioeconomically deprived neighborhoods. About 25% of the study cohort experienced a delay of more than 60 days between cancer diagnosis and treatment initiation.

The selected group LASSO model incorporating clinical, demographic, and neighborhood-level social determinants of health data was associated with high effectiveness in identifying patients who were at risk of experiencing a delay of more than 60 days between diagnosis and treatment.

The model predicted that patients were less likely to experience a delay if they were diagnosed at the treating center, had the index cancer as their first malignant neoplasm, were Asian or Pacific Islander or White, had private insurance, or had late-stage disease. In contrast, patients with certain comorbidities or increased creatinine levels were more likely to experience a delay. The model showed similar effectiveness in predicting delays for patients diagnosed internally or externally.

Regarding neighborhood-level social determinants, the model predicted that patients belonging to the most socioeconomically deprived areas were more likely to experience a delay as compared to those belonging to the least socioeconomically deprived areas. While neighborhoods with high Hispanic populations were identified as a risk factor for treatment delays, patients residing in areas with a high Black population were less likely to experience a delay.

As compared to the predictions made for the overall population, the model showed lower effectiveness in predicting delays for Black patients, other than non-Hispanic White patients, and those residing in the most deprived areas.

Study significance

Machine learning models that incorporate multi-level data sources can effectively identify cancer patients who are at a greater risk of experiencing treatment delays of more than 60 days after their initial cancer diagnosis.

Although neighborhood-level social determinants of health are incorporated in the study model as contributing variables, no significant impact of these factors was observed on the model performance. Furthermore, the model exhibits lower predictive effectiveness in vulnerable populations.

Future studies should include a higher proportion of vulnerable populations and more relevant social variables to improve the model performance.

Journal reference:

Frosch Z. A. K., Hasler, J., Handorf, E., et al. (2023). Development of a Multilevel Model to Identify Patients at Risk for Delay in Starting Cancer Treatment. JAMA Network Open. doi:10.1001/jamanetworkopen.2023.28712, https://jamanetwork.com/journals/jamanetworkopen/fullarticle/2808249

Posted in: Device / Technology News | Medical Science News | Medical Research News | Medical Condition News | Healthcare News

Comments (0)

Written by

Dr. Sanchari Sinha Dutta

Dr. Sanchari Sinha Dutta is a science communicator who believes in spreading the power of science in every corner of the world. She has a Bachelor of Science (B.Sc.) degree and a Master's of Science (M.Sc.) in biology and human physiology. Following her Master's degree, Sanchari went on to study a Ph.D. in human physiology. She has authored more than 10 original research articles, all of which have been published in world renowned international journals.

Download PDF Copy

Citations

Please use one of the following formats to cite this article in your essay, paper or report:

APA
Dutta, Sanchari Sinha Dutta. (2023, August 17). Can machine learning algorithms identify patients at risk of a delay in starting cancer treatment?. News-Medical. Retrieved on October 22, 2025 from https://www.news-medical.net/news/20230817/Can-machine-learning-algorithms-identify-patients-at-risk-of-a-delay-in-starting-cancer-treatment.aspx.
MLA
Dutta, Sanchari Sinha Dutta. "Can machine learning algorithms identify patients at risk of a delay in starting cancer treatment?". News-Medical. 22 October 2025. <https://www.news-medical.net/news/20230817/Can-machine-learning-algorithms-identify-patients-at-risk-of-a-delay-in-starting-cancer-treatment.aspx>.
Chicago
Dutta, Sanchari Sinha Dutta. "Can machine learning algorithms identify patients at risk of a delay in starting cancer treatment?". News-Medical. https://www.news-medical.net/news/20230817/Can-machine-learning-algorithms-identify-patients-at-risk-of-a-delay-in-starting-cancer-treatment.aspx. (accessed October 22, 2025).
Harvard
Dutta, Sanchari Sinha Dutta. 2023. Can machine learning algorithms identify patients at risk of a delay in starting cancer treatment?. News-Medical, viewed 22 October 2025, https://www.news-medical.net/news/20230817/Can-machine-learning-algorithms-identify-patients-at-risk-of-a-delay-in-starting-cancer-treatment.aspx.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.

Post a new comment

(Logout)

Post

Sign in to keep reading

We're committed to providing free access to quality science. By registering and providing insight into your preferences you're joining a community of over 1m science interested individuals and help us to provide you with insightful content whilst keeping our service free.