Abstract
Motivated by the remarkable performance achieved using deep learning strategies in solving action recognition tasks, an effective, yet simple method is proposed for encoding the spatiotemporal information of skeleton sequences into color texture images, referred to as Skeletal Optical Flows (SOFs). SOFs collectively represent the kinetic energy, predefined angles and pair-wise displacements between joints over consecutive frames of skeleton data, as color variations to capture meaningful temporal information and make them highly interpretable. A novel Convolutional Neural Network with Correctness-Vigilant Regularizer (CVR-CNN) is then employed to exploit the discriminative features of SOFs for human action recognition. Empirical results show that the efficiency of the proposed method is superior in terms of the generalizability of the generated model, the training convergence speed, and the resulting classification accuracy on commonly used action recognition datasets, such as MHAD, HDM05 and NTU RGB+D.
Original language | English |
---|---|
Title of host publication | 2018 IEEE 3rd International Conference on Signal and Image Processing, ICSIP 2018 |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 7-12 |
Number of pages | 6 |
ISBN (Electronic) | 9781538663943 |
DOIs | |
Publication status | Published - 2 Jul 2018 |
Externally published | Yes |
Event | 2018 IEEE 3rd International Conference on Signal and Image Processing, ICSIP 2018 - Shenzhen, China Duration: 13 Jul 2018 → 15 Jul 2018 Conference number: 3rd |
Publication series
Name | 2018 IEEE 3rd International Conference on Signal and Image Processing, ICSIP 2018 |
---|
Conference
Conference | 2018 IEEE 3rd International Conference on Signal and Image Processing, ICSIP 2018 |
---|---|
Abbreviated title | ICSIP |
Country/Territory | China |
City | Shenzhen |
Period | 13/07/18 → 15/07/18 |