Survival Prediction of Patients with Breast Cancer: Comparisons of Decision Tree and Logistic Regression Analysis

Article Type:
Research/Original Article (دارای رتبه معتبر)
Abstract:
Background
Breast cancer is the first cause of cancer-related deaths among women in Iran.
Objectives
The aim of the present study was to compare the traditional statistical analysis and data mining technique as the research methods for identifying the prognostic factors regarding the survival time of patients with breast cancer. Decision tree method is one of the predictive models that used in the medical field. The most used algorithms are classification and regression trees (CART), the quick, unbiased, efficient statistical tree (QUEST), Chi-square automatic interaction detector (CHAIDs) algorithm, and the C5.0 algorithm.
Methods
We used data for 438 patients, who were referred to cancer research center in Shahid Beheshti University of Medical Sciences. The patients were visited and treated during 1992 to 2012 and followed up until October 2014. The data were analyzed by regression logistic and decision tree method. Six measures for evaluation of predictive performance of different models were used.
Results
The C5.0 algorithm performed better than CHAID, QUEST, CART algorithms, and the logistic regression in predicting breast cancer survival. The multiple logistic regression results indicated that the factors of age at diagnosis, histologic grade, axillary lymph node status, and type of surgery were statistically significant with regard to the probability of death in patients with breast cancer. Moreover, based on C4.5 they reported that tumor size, age of menarche, hormonal therapy, axillary nodal status, and histological grade are the most prominent variables.
Conclusions
The more precise methods can identify the more accurate predictors. The decision tree method was able to predict the probability of death more accurately compared with the conventional logistic regression. Some improvements for classical classification tree such as boosting and bagging have been developed in order to obtain better predictive performance. We suggest that the modern classification tree method in the breast cancer context be the focus of future studies.
Language:
English
Published:
International Journal of Cancer Management, Volume:11 Issue: 7, Jul 2018
Page:
2
magiran.com/p1863623  
دانلود و مطالعه متن این مقاله با یکی از روشهای زیر امکان پذیر است:
اشتراک شخصی
با عضویت و پرداخت آنلاین حق اشتراک یک‌ساله به مبلغ 1,390,000ريال می‌توانید 70 عنوان مطلب دانلود کنید!
اشتراک سازمانی
به کتابخانه دانشگاه یا محل کار خود پیشنهاد کنید تا اشتراک سازمانی این پایگاه را برای دسترسی نامحدود همه کاربران به متن مطالب تهیه نمایند!
توجه!
  • حق عضویت دریافتی صرف حمایت از نشریات عضو و نگهداری، تکمیل و توسعه مگیران می‌شود.
  • پرداخت حق اشتراک و دانلود مقالات اجازه بازنشر آن در سایر رسانه‌های چاپی و دیجیتال را به کاربر نمی‌دهد.
دسترسی سراسری کاربران دانشگاه پیام نور!
اعضای هیئت علمی و دانشجویان دانشگاه پیام نور در سراسر کشور، در صورت ثبت نام با ایمیل دانشگاهی، تا پایان فروردین ماه 1403 به مقالات سایت دسترسی خواهند داشت!
In order to view content subscription is required

Personal subscription
Subscribe magiran.com for 70 € euros via PayPal and download 70 articles during a year.
Organization subscription
Please contact us to subscribe your university or library for unlimited access!