TY - JOUR
T1 - Language patterns in Japanese patients with Alzheimer disease
T2 - A machine learning approach
AU - Momota, Yuki
AU - Liang, Kuo ching
AU - Horigome, Toshiro
AU - Kitazawa, Momoko
AU - Eguchi, Yoko
AU - Takamiya, Akihiro
AU - Goto, Akiko
AU - Mimura, Masaru
AU - Kishimoto, Taishiro
N1 - Funding Information:
We are grateful to the participants of this study and to Ms Sayaka Hanashiro, Ms Yuriko Kaise, Ms Noriko Maegaichi, Mr Yoshitaka Yamaoka, and Ms Kelley Cortright at Keio University Hospital, Mr Shimpei Isa at Keio University School of Medicine, Ms Yumi Hattori, Ms Mai Minami, and Ms Kaede Ouchi at Komagino Hospital, and Ms Mai Osawa at Tsurugaoka Garden Hospital for their support during this study. This study is supported by the Japan Agency for Medical Research and Development (AMED) under grant number JP18he1102004 (T.K.).
Funding Information:
The authors declare no conflict of interest in relation to the submission of this manuscript. Within the past 3 years, Dr Horigome has received consultant fees from FRONTEO, Dr Eguchi has received speaker's honoraria from Eisai and Otsuka, Dr Takamiya has received grants or contracts from KAKENHI, SENSHINIYAKU, DAIWA SHOKEN, and Asteras, and speaker's honoraria from Otsuka and Dainippon Sumitomo, Dr Mimura has received grants and/or speaker's honoraria from Daiichi Sankyo, Dainippon‐Sumitomo Pharma, Eisai, Eli Lilly, FRONTEO, FUJIFILM RI Pharma, Janssen Pharmaceutical, Mochida Pharmaceutical, MSD, Nippon Chemipher, Novartis Pharma, Ono Pharma, Otsuka Pharmaceutical, Pfizer, Takeda Pharma, Tsumura, and Yoshitomi Pharma; Dr Kishimoto has received consultant fees from Dainippon Sumitomo, FRONTEO, KYOWA pharmaceutical industry, Novartis, and Otsuka, and speaker's honoraria from Banyu, Eli Lilly, Dainippon Sumitomo, Janssen, Novartis, Otsuka, and Pfizer. He has received grant support from Pfizer Health Research Foundation, Dainippon Sumitomo, Otsuka, and Mochida.
Funding Information:
We are grateful to the participants of this study and to Ms Sayaka Hanashiro, Ms Yuriko Kaise, Ms Noriko Maegaichi, Mr Yoshitaka Yamaoka, and Ms Kelley Cortright at Keio University Hospital, Mr Shimpei Isa at Keio University School of Medicine, Ms Yumi Hattori, Ms Mai Minami, and Ms Kaede Ouchi at Komagino Hospital, and Ms Mai Osawa at Tsurugaoka Garden Hospital for their support during this study. This study is supported by the Japan Agency for Medical Research and Development (AMED) under grant number JP18he1102004 (T.K.).
Publisher Copyright:
© 2022 The Authors. Psychiatry and Clinical Neurosciences published by John Wiley & Sons Australia, Ltd on behalf of Japanese Society of Psychiatry and Neurology.
PY - 2023/5
Y1 - 2023/5
N2 - Aim: The authors applied natural language processing and machine learning to explore the disease-related language patterns that warrant objective measures for assessing language ability in Japanese patients with Alzheimer disease (AD), while most previous studies have used large publicly available data sets in Euro-American languages. Methods: The authors obtained 276 speech samples from 42 patients with AD and 52 healthy controls, aged 50 years or older. A natural language processing library for Python was used, spaCy, with an add-on library, GiNZA, which is a Japanese parser based on Universal Dependencies designed to facilitate multilingual parser development. The authors used eXtreme Gradient Boosting for our classification algorithm. Each unit of part-of-speech and dependency was tagged and counted to create features such as tag-frequency and tag-to-tag transition-frequency. Each feature's importance was computed during the 100-fold repeated random subsampling validation and averaged. Results: The model resulted in an accuracy of 0.84 (SD = 0.06), and an area under the curve of 0.90 (SD = 0.03). Among the features that were important for such predictions, seven of the top 10 features were related to part-of-speech, while the remaining three were related to dependency. A box plot analysis demonstrated that the appearance rates of content words–related features were lower among the patients, whereas those with stagnation-related features were higher. Conclusion: The current study demonstrated a promising level of accuracy for predicting AD and found the language patterns corresponding to the type of lexical-semantic decline known as ‘empty speech’, which is regarded as a characteristic of AD.
AB - Aim: The authors applied natural language processing and machine learning to explore the disease-related language patterns that warrant objective measures for assessing language ability in Japanese patients with Alzheimer disease (AD), while most previous studies have used large publicly available data sets in Euro-American languages. Methods: The authors obtained 276 speech samples from 42 patients with AD and 52 healthy controls, aged 50 years or older. A natural language processing library for Python was used, spaCy, with an add-on library, GiNZA, which is a Japanese parser based on Universal Dependencies designed to facilitate multilingual parser development. The authors used eXtreme Gradient Boosting for our classification algorithm. Each unit of part-of-speech and dependency was tagged and counted to create features such as tag-frequency and tag-to-tag transition-frequency. Each feature's importance was computed during the 100-fold repeated random subsampling validation and averaged. Results: The model resulted in an accuracy of 0.84 (SD = 0.06), and an area under the curve of 0.90 (SD = 0.03). Among the features that were important for such predictions, seven of the top 10 features were related to part-of-speech, while the remaining three were related to dependency. A box plot analysis demonstrated that the appearance rates of content words–related features were lower among the patients, whereas those with stagnation-related features were higher. Conclusion: The current study demonstrated a promising level of accuracy for predicting AD and found the language patterns corresponding to the type of lexical-semantic decline known as ‘empty speech’, which is regarded as a characteristic of AD.
KW - Alzheimer disease
KW - dementia
KW - machine learning
KW - natural language processing
KW - speech-language pathology
UR - http://www.scopus.com/inward/record.url?scp=85147593915&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85147593915&partnerID=8YFLogxK
U2 - 10.1111/pcn.13526
DO - 10.1111/pcn.13526
M3 - Article
C2 - 36579663
AN - SCOPUS:85147593915
SN - 1323-1316
VL - 77
SP - 273
EP - 281
JO - Psychiatry and Clinical Neurosciences
JF - Psychiatry and Clinical Neurosciences
IS - 5
ER -