Predictions of the pathological response to neoadjuvant chemotherapy in patients with primary breast cancer using a data mining technique

M. Takada, M. Sugimoto, S. Ohno, K. Kuroi, N. Sato, H. Bando, N. Masuda, H. Iwata, M. Kondo, H. Sasano, L. W.C. Chow, T. Inamoto, Y. Naito, M. Tomita, M. Toi

Research output: Contribution to journal, Article

14 Citations (Scopus)

Abstract

The nomogram, a standard technique that uses multiple characteristics to predict treatment efficacy and the likelihood of a specific status in an individual patient, has been used to predict the response to neoadjuvant chemotherapy (NAC) in breast cancer patients. The aim of this study was to develop a novel computational technique to predict the pathological complete response (pCR) to NAC in primary breast cancer patients. A mathematical model using alternating decision trees, a variant of the decision tree, was developed using 28 clinicopathological variables that were retrospectively collected from patients treated with NAC (n = 150), and validated using an independent dataset from a randomized controlled trial (n = 173). The model selected 15 variables to predict the pCR, yielding area under the receiver operating characteristic curve (AUC) values of 0.766 (95 % confidence interval (CI) 0.671-0.861, P value < 0.0001) in cross-validation using the training dataset and 0.787 (95 % CI 0.716-0.858, P value < 0.0001) in the validation dataset. Among three subtypes of breast cancer, the luminal subgroup showed the best discrimination (AUC = 0.779, 95 % CI 0.641-0.917, P value = 0.0059). The developed model (AUC = 0.805, 95 % CI 0.716-0.894, P value < 0.0001) outperformed multivariate logistic regression (AUC = 0.754, 95 % CI 0.651-0.858, P value = 0.00019) on the validation data without missing values (n = 127). Several analyses, including bootstrap analysis, showed that the developed model was insensitive to missing values and tolerant to distribution bias among the datasets. Our model based on clinicopathological variables showed high predictive ability for pCR. This model might improve the prediction of the response to NAC in primary breast cancer patients.
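To illustrate how figures such as "AUC with 95 % CI" can be obtained, the following minimal Python sketch computes an AUC from predicted scores and an approximate confidence interval from the Hanley-McNeil standard error. The scores are made-up toy values, not study data, and the abstract does not state which interval method the authors used, so the Hanley-McNeil choice and the function names are assumptions made for illustration only.

```python
import math


def auc_mann_whitney(scores_pos, scores_neg):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


def hanley_mcneil_ci(auc, n_pos, n_neg, z=1.96):
    """Approximate CI for an AUC using the Hanley-McNeil standard error.

    Assumption: a normal approximation, clipped to the [0, 1] range.
    """
    q1 = auc / (2.0 - auc)
    q2 = 2.0 * auc * auc / (1.0 + auc)
    variance = (
        auc * (1.0 - auc)
        + (n_pos - 1) * (q1 - auc * auc)
        + (n_neg - 1) * (q2 - auc * auc)
    ) / (n_pos * n_neg)
    se = math.sqrt(variance)
    return max(0.0, auc - z * se), min(1.0, auc + z * se)


# Hypothetical predicted pCR scores (toy values, not from the study).
pcr_scores = [0.9, 0.8, 0.7, 0.4]           # patients who achieved pCR
non_pcr_scores = [0.6, 0.5, 0.3, 0.2, 0.1]  # patients who did not

auc = auc_mann_whitney(pcr_scores, non_pcr_scores)
ci_low, ci_high = hanley_mcneil_ci(auc, len(pcr_scores), len(non_pcr_scores))
print(f"AUC = {auc:.3f}, 95% CI {ci_low:.3f}-{ci_high:.3f}")
```

With only nine toy cases the interval is very wide; the narrower intervals in the abstract reflect the larger training and validation cohorts.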

Original language: English
Pages (from-to): 661-670
Number of pages: 10
Journal: Breast Cancer Research and Treatment
Volume: 134
Issue number: 2
DOIs: https://doi.org/10.1007/s10549-012-2109-2
Publication status: Published - 2012 Jul 1

Keywords

  • Breast cancer
  • Data mining
  • Neoadjuvant chemotherapy
  • Nomogram
  • Prediction model

ASJC Scopus subject areas

  • Oncology
  • Cancer Research

Cite this

Takada, M., Sugimoto, M., Ohno, S., Kuroi, K., Sato, N., Bando, H., Masuda, N., Iwata, H., Kondo, M., Sasano, H., Chow, L. W. C., Inamoto, T., Naito, Y., Tomita, M., & Toi, M. (2012). Predictions of the pathological response to neoadjuvant chemotherapy in patients with primary breast cancer using a data mining technique. Breast Cancer Research and Treatment, 134(2), 661-670. https://doi.org/10.1007/s10549-012-2109-2