Novel statistical classification model of type 2 diabetes mellitus patients for tailor-made prevention using data mining algorithm

Koichi Miyaki, Izumi Takei, Kenji Watanabe, Hiroshi Nakashima, Kiyoaki Watanabe, Kazuyuki Omae

Research output: Contribution to journalArticle

28 Citations (Scopus)

Abstract

To estimate the usefulness of data mining algorithms for extracting risk predictors of diabetic vascular complications in proper order in the future, we tried applying the Classification and Regression Trees (CART) method to the prevalence data of 165 type 2 diabetic outpatients and already known risk factors. Among the 6 categorical and 15 continuous risk factors, age (cutoff: 65.4) was the best predictor for classifying patients into groups with and without macroangiopathy (p=0.000). Body weight (cutoff: 53.9) was the best predictor (p=0.006) in the older group (age >65.4), whereas systolic blood pressure (cutoff: 144.5) was the best predictor in the remaining group (p=0.002). Age (cutoff: 64.8) was also the best predictor for categorizing them into groups with and without microangiopathy (p=0.000). In the older group (age >64.8), BMI (cutoff: 21.5) was the best predictor (p=0.001), whereas morbidity term (cutoff: 15.5) was the best predictor in the other group (p=0.010). Because the orders and values of all risk factors and cutoff points mined were reasonable clinically, this method may have the potential to highlight predictors in order of importance to apply tailor-made prevention of diabetic vascular complications.

Original languageEnglish
Pages (from-to)243-248
Number of pages6
JournalJournal of Epidemiology
Volume12
Issue number3
Publication statusPublished - 2002 May

Fingerprint

Data Mining
Statistical Models
Type 2 Diabetes Mellitus
Diabetic Angiopathies
Age Groups
Blood Pressure
Outpatients
Body Weight
Morbidity

Keywords

  • Classification and regression trees
  • Complications
  • Data mining
  • Tailor-made prevention
  • Type 2 diabetes

ASJC Scopus subject areas

  • Epidemiology

Cite this

Miyaki, K., Takei, I., Watanabe, K., Nakashima, H., Watanabe, K., & Omae, K. (2002). Novel statistical classification model of type 2 diabetes mellitus patients for tailor-made prevention using data mining algorithm. Journal of Epidemiology, 12(3), 243-248.

Novel statistical classification model of type 2 diabetes mellitus patients for tailor-made prevention using data mining algorithm. / Miyaki, Koichi; Takei, Izumi; Watanabe, Kenji; Nakashima, Hiroshi; Watanabe, Kiyoaki; Omae, Kazuyuki.

In: Journal of Epidemiology, Vol. 12, No. 3, 05.2002, p. 243-248.

Research output: Contribution to journalArticle

Miyaki, K, Takei, I, Watanabe, K, Nakashima, H, Watanabe, K & Omae, K 2002, 'Novel statistical classification model of type 2 diabetes mellitus patients for tailor-made prevention using data mining algorithm', Journal of Epidemiology, vol. 12, no. 3, pp. 243-248.
Miyaki, Koichi ; Takei, Izumi ; Watanabe, Kenji ; Nakashima, Hiroshi ; Watanabe, Kiyoaki ; Omae, Kazuyuki. / Novel statistical classification model of type 2 diabetes mellitus patients for tailor-made prevention using data mining algorithm. In: Journal of Epidemiology. 2002 ; Vol. 12, No. 3. pp. 243-248.
@article{64d1ff67d7f84808beb42b4ebec453e8,
title = "Novel statistical classification model of type 2 diabetes mellitus patients for tailor-made prevention using data mining algorithm",
abstract = "To estimate the usefulness of data mining algorithms for extracting risk predictors of diabetic vascular complications in proper order in the future, we tried applying the Classification and Regression Trees (CART) method to the prevalence data of 165 type 2 diabetic outpatients and already known risk factors. Among the 6 categorical and 15 continuous risk factors, age (cutoff: 65.4) was the best predictor for classifying patients into groups with and without macroangiopathy (p=0.000). Body weight (cutoff: 53.9) was the best predictor (p=0.006) in the older group (age >65.4), whereas systolic blood pressure (cutoff: 144.5) was the best predictor in the remaining group (p=0.002). Age (cutoff: 64.8) was also the best predictor for categorizing them into groups with and without microangiopathy (p=0.000). In the older group (age >64.8), BMI (cutoff: 21.5) was the best predictor (p=0.001), whereas morbidity term (cutoff: 15.5) was the best predictor in the other group (p=0.010). Because the orders and values of all risk factors and cutoff points mined were reasonable clinically, this method may have the potential to highlight predictors in order of importance to apply tailor-made prevention of diabetic vascular complications.",
keywords = "Classification and regression trees, Complications, Data mining, Tailor-made prevention, Type 2 diabetes",
author = "Koichi Miyaki and Izumi Takei and Kenji Watanabe and Hiroshi Nakashima and Kiyoaki Watanabe and Kazuyuki Omae",
year = "2002",
month = "5",
language = "English",
volume = "12",
pages = "243--248",
journal = "Journal of Epidemiology",
issn = "0917-5040",
publisher = "Japan Epidemiology Association",
number = "3",

}

TY - JOUR

T1 - Novel statistical classification model of type 2 diabetes mellitus patients for tailor-made prevention using data mining algorithm

AU - Miyaki, Koichi

AU - Takei, Izumi

AU - Watanabe, Kenji

AU - Nakashima, Hiroshi

AU - Watanabe, Kiyoaki

AU - Omae, Kazuyuki

PY - 2002/5

Y1 - 2002/5

N2 - To estimate the usefulness of data mining algorithms for extracting risk predictors of diabetic vascular complications in proper order in the future, we tried applying the Classification and Regression Trees (CART) method to the prevalence data of 165 type 2 diabetic outpatients and already known risk factors. Among the 6 categorical and 15 continuous risk factors, age (cutoff: 65.4) was the best predictor for classifying patients into groups with and without macroangiopathy (p=0.000). Body weight (cutoff: 53.9) was the best predictor (p=0.006) in the older group (age >65.4), whereas systolic blood pressure (cutoff: 144.5) was the best predictor in the remaining group (p=0.002). Age (cutoff: 64.8) was also the best predictor for categorizing them into groups with and without microangiopathy (p=0.000). In the older group (age >64.8), BMI (cutoff: 21.5) was the best predictor (p=0.001), whereas morbidity term (cutoff: 15.5) was the best predictor in the other group (p=0.010). Because the orders and values of all risk factors and cutoff points mined were reasonable clinically, this method may have the potential to highlight predictors in order of importance to apply tailor-made prevention of diabetic vascular complications.

AB - To estimate the usefulness of data mining algorithms for extracting risk predictors of diabetic vascular complications in proper order in the future, we tried applying the Classification and Regression Trees (CART) method to the prevalence data of 165 type 2 diabetic outpatients and already known risk factors. Among the 6 categorical and 15 continuous risk factors, age (cutoff: 65.4) was the best predictor for classifying patients into groups with and without macroangiopathy (p=0.000). Body weight (cutoff: 53.9) was the best predictor (p=0.006) in the older group (age >65.4), whereas systolic blood pressure (cutoff: 144.5) was the best predictor in the remaining group (p=0.002). Age (cutoff: 64.8) was also the best predictor for categorizing them into groups with and without microangiopathy (p=0.000). In the older group (age >64.8), BMI (cutoff: 21.5) was the best predictor (p=0.001), whereas morbidity term (cutoff: 15.5) was the best predictor in the other group (p=0.010). Because the orders and values of all risk factors and cutoff points mined were reasonable clinically, this method may have the potential to highlight predictors in order of importance to apply tailor-made prevention of diabetic vascular complications.

KW - Classification and regression trees

KW - Complications

KW - Data mining

KW - Tailor-made prevention

KW - Type 2 diabetes

UR - http://www.scopus.com/inward/record.url?scp=0036559582&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0036559582&partnerID=8YFLogxK

M3 - Article

C2 - 12164327

AN - SCOPUS:0036559582

VL - 12

SP - 243

EP - 248

JO - Journal of Epidemiology

JF - Journal of Epidemiology

SN - 0917-5040

IS - 3

ER -