TY - JOUR
T1 - Time-sequential action recognition using pose-centric learning for action-transition videos
AU - Suzuki, Tomoyuki
AU - Aoki, Yoshimitsu
N1 - Copyright:
Copyright 2017 Elsevier B.V., All rights reserved.
PY - 2017
Y1 - 2017
N2 - In this paper, we propose a method of human action recognition for videos in which actions are continuously transitioning. First, we make pose estimator which has learned joint coordinates using Convolutional Neural Networks (CNN) and extract feature from intermediate structure of it. Second, we train action recognizer structured by Long Short-Term Memory (LSTM), using pose feature and environmental feature as inputs. At that time, we propose Pose-Centric Learning. In addition, from pose feature we calculate Attention that represents importance of environmental feature for each element, and filtering latter feature by Attention to make this effective one. When modeling action recognizer, we structure Hierarchical model of LSTM. In experiments, we evaluated our method comparing to conventional method and achieve 15.7% improvement from it on challenging action recognition dataset.
AB - In this paper, we propose a method of human action recognition for videos in which actions are continuously transitioning. First, we make pose estimator which has learned joint coordinates using Convolutional Neural Networks (CNN) and extract feature from intermediate structure of it. Second, we train action recognizer structured by Long Short-Term Memory (LSTM), using pose feature and environmental feature as inputs. At that time, we propose Pose-Centric Learning. In addition, from pose feature we calculate Attention that represents importance of environmental feature for each element, and filtering latter feature by Attention to make this effective one. When modeling action recognizer, we structure Hierarchical model of LSTM. In experiments, we evaluated our method comparing to conventional method and achieve 15.7% improvement from it on challenging action recognition dataset.
KW - Action recognition
KW - Neural network
KW - Time-sequential analysis
KW - Video analysis
UR - http://www.scopus.com/inward/record.url?scp=85037088907&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85037088907&partnerID=8YFLogxK
U2 - 10.2493/jjspe.83.1156
DO - 10.2493/jjspe.83.1156
M3 - Article
AN - SCOPUS:85037088907
SN - 0912-0289
VL - 83
SP - 1156
EP - 1165
JO - Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering
JF - Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering
IS - 12
ER -