TY - JOUR
T1 - High speed error log control method in in-memory cluster computing platform
AU - Saito, Ryuichi
AU - Haruyama, Shinichiro
N1 - Funding Information:
This research was conducted with aid from the Keio University Doctrate Student Grant-in-Aid Program in 2017 and 2018.
Publisher Copyright:
© 2020 Information Processing Society of Japan.
PY - 2020/5
Y1 - 2020/5
N2 - Since 2010, in-memory cluster computing platform has been increasingly used in firms and research institutions to analyze large amounts of datasets within a short amount of time. In these methods, unexpected errors cause the load to exceed the assumption for computer infrastructures such as a monitoring system, owing to the execution of multithreading, assigning divided datasets to multiple nodes, and storing them in in-memory spaces. In this research, we propose a method that notifies administrators with only information needed to understand the situation in a short period by eliminating duplications of numerous application error logs for that period and clustering messages using an unsupervised learning k-means method with an in-memory cluster computing framework “Apache Spark.” By implementing this method, we can demonstrate that it is possible to eliminate duplications of error messages by 93% on an average compared with conventional methods. Further, we can extract significant messages from the application error messages and notify the administrators in an average of 4.2 min from the time of occurrence of the error.
AB - Since 2010, in-memory cluster computing platform has been increasingly used in firms and research institutions to analyze large amounts of datasets within a short amount of time. In these methods, unexpected errors cause the load to exceed the assumption for computer infrastructures such as a monitoring system, owing to the execution of multithreading, assigning divided datasets to multiple nodes, and storing them in in-memory spaces. In this research, we propose a method that notifies administrators with only information needed to understand the situation in a short period by eliminating duplications of numerous application error logs for that period and clustering messages using an unsupervised learning k-means method with an in-memory cluster computing framework “Apache Spark.” By implementing this method, we can demonstrate that it is possible to eliminate duplications of error messages by 93% on an average compared with conventional methods. Further, we can extract significant messages from the application error messages and notify the administrators in an average of 4.2 min from the time of occurrence of the error.
KW - Distributed System
KW - Error logs
KW - K-means
KW - Spark
KW - TF-IDF
UR - http://www.scopus.com/inward/record.url?scp=85087165654&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85087165654&partnerID=8YFLogxK
U2 - 10.2197/ipsjjip.28.310
DO - 10.2197/ipsjjip.28.310
M3 - Article
AN - SCOPUS:85087165654
SN - 0387-5806
VL - 28
SP - 310
EP - 319
JO - Journal of Information Processing
JF - Journal of Information Processing
ER -