Impact of Lifestyle Patterns on Breast Cancer Prognosis: Evidence from a UK Biobank Survival Study
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Background: Understanding how lifestyle habits influence breast cancer survival is crucial for improving long-term outcomes and guiding individualized care. The purpose of this study is to comprehensively understand how dietary habits, exercise frequency, sleep quality and the frequency of tobacco and alcohol use affect the survival of breast cancer patients by analyzing the synergistic effects of multiple lifestyle habits. Methods: We analyzed data from 14,901 female breast cancer patients in the UK Biobank, a large-scale population-based cohort. Three feature selection methods were applied to identify key prognostic variables. Five survival risk models were compared to investigate the relationship between breast cancer survival and lifestyle habits. Kaplan-Meier survival analysis and SHapley Additive exPlanations (SHAP) interpretability analysis were performed on risk scores calculated from the best-performing model. Results: Through feature selection, we identified that age, number of medications taken, Body Mass Index (BMI), alcohol intake, diastolic blood pressure and smoking status were pinpointed as critical characteristics influencing the survival of breast cancer patients. Among all models, the eXtreme Gradient Boosting (XGBoost) model demonstrated the highest predictive performance (3 years: AUC = 0.72, 6 years: AUC = 0.727, 9 years: AUC = 0.749). Kaplan-Meier analysis showed that patients classified as high-risk based on the median risk score had significantly worse survival outcomes ( P < 0.0001). SHAP analysis further confirmed the dominant influence of age and BMI on mortality risk. Conclusions: This study highlights the prognostic value of lifestyle habits in breast cancer survival. By integrating routine health indicators into interpretable machine learning models, our findings provide a practical foundation for individualized risk assessment and lifestyle-based intervention strategies.