Access the full text.
Sign up today, get DeepDyve free for 14 days.
11 Comparison of the skeleton graph in raw data, G3D and MS-AAGCN-3D for the "walk" action
Zheng Shou, Jonathan Chan, Alireza Zareian, K. Miyazawa, Shih-Fu Chang (2017)
CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Amir Shahroudy, Jun Liu, T. Ng, G. Wang (2016)
NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Wei Peng, Xiaopeng Hong, Haoyu Chen, Guoying Zhao (2019)
Learning Graph Convolutional Network for Skeleton-based Human Action Recognition by Neural SearchingArXiv, abs/1911.04131
Sitao Luan, Mingde Zhao, X. Chang, Doina Precup (2019)
Break the Ceiling: Stronger Multi-scale Deep Graph Convolutional Networks
Xiang Gao, Wei Hu, Jiaxiang Tang, Jiaying Liu, Zongming Guo (2018)
Optimized Skeleton-based Action Recognition via Sparsified Graph RegressionProceedings of the 27th ACM International Conference on Multimedia
Ke Cheng, Yifan Zhang, Xiangyu He, Weihan Chen, Jian Cheng, Hanqing Lu (2020)
Skeleton-Based Action Recognition With Shift Graph Convolutional Network2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
X. Wang, A. Gupta (2018)
Videos as Space-Time Region GraphsArXiv, abs/1806.01810
Basura Fernando, E. Gavves, José Oramas, Amir Ghodrati, T. Tuytelaars (2015)
Modeling video evolution for action recognition2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Raviteja Vemulapalli, Felipe Arrate, R. Chellappa (2014)
Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group2014 IEEE Conference on Computer Vision and Pattern Recognition
Hailun Xia, Xinkai Gao (2021)
Multi-Scale Mixed Dense Graph Convolution Network for Skeleton-Based Action RecognitionIEEE Access, 9
Felix Wu, Tianyi Zhang, A. Souza, Christopher Fifty, Tao Yu, Kilian Weinberger (2019)
Simplifying Graph Convolutional Networks
Wenbin Du, Yali Wang, Y. Qiao (2017)
RPAN: An End-to-End Recurrent Pose-Attention Network for Action Recognition in Videos2017 IEEE International Conference on Computer Vision (ICCV)
Ziyu Liu, Hongwen Zhang, Zhenghao Chen, Zhiyong Wang, Wanli Ouyang (2020)
Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Tae Kim, A. Reiter (2017)
Interpretable 3D Human Action Analysis with Temporal Convolutional Networks2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu (2018)
Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Bo Li, Yuchao Dai, Xuelian Cheng, Huahui Chen, Yi Lin, Mingyi He (2017)
Skeleton based action recognition using translation-scale invariant image mapping and multi-scale deep CNN2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)
Mathias Niepert, Mohamed Ahmed, Konstantin Kutzkov (2016)
Learning Convolutional Neural Networks for Graphs
Fanfan Ye, Shiliang Pu, Qiaoyong Zhong, Chaoyu Li, Di Xie, Huiming Tang (2020)
Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action RecognitionProceedings of the 28th ACM International Conference on Multimedia
Sijie Yan, Yuanjun Xiong, Dahua Lin (2018)
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition
Sami Abu-El-Haija, Bryan Perozzi, Amol Kapoor, Hrayr Harutyunyan, N. Alipourfard, Kristina Lerman, G. Steeg, A. Galstyan (2019)
MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing
Chenyang Si, Ya Jing, Wei Wang, Liang Wang, T. Tan (2018)
Skeleton-Based Action Recognition with Spatial Reasoning and Temporal Stack Learning
Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu (2019)
Skeleton-Based Action Recognition With Directed Graph Neural Networks2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Thomas Kipf, M. Welling (2016)
Semi-Supervised Classification with Graph Convolutional NetworksArXiv, abs/1609.02907
Yue Zhao, Yuanjun Xiong, Limin Wang, Zhirong Wu, Xiaoou Tang, Dahua Lin (2017)
Temporal Action Detection with Structured Segment NetworksInternational Journal of Computer Vision, 128
M. Defferrard, X. Bresson, P. Vandergheynst (2016)
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu (2019)
Skeleton-Based Action Recognition With Multi-Stream Adaptive Graph Convolutional NetworksIEEE Transactions on Image Processing, 29
Yu Lubin, Lianfang Tian, Qiliang Du, J. Bhutto (2021)
Multi-stream adaptive spatial-temporal attention graph convolutional network for skeleton-based action recognitionIET Comput. Vis., 16
Action recognition methods based on spatial-temporal skeleton graphs have been applied extensively. The spatial and temporal graphs are generally modeled individually in previous approaches. Recently, many researchers capture the correlation information of temporal and spatial dimensions in spatial-temporal graphs. However, the existing methods have several issues such as 1. The existing modal graphs are defined based on the human body structure which is not flexible enough; 2. The approach to extracting non-local neighborhood features is insufficiently powerful; 3. Attention modules are limited to a single scale; 4. The fusion of multiple data streams is not sufficiently effective. This work proposes a novel multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition that improves the aforementioned issues. The method utilizes an adaptive topology graph with an adaptive connection coefficient to adaptively optimize the topology of the graph during the training process according to the input data. An optimal high-order adjacency matrix is constructed in our work to balance the weight bias, which captures non-local neighborhood features precisely. Moreover, we design a multi-scale attention mechanism to aggregate information from multiple ranges, which makes the graph convolution focus on more efficient nodes, frames, and channels. To further improve the performance of the model, a novel multi-stream framework is proposed to aggregate the high-order information of the skeleton. The experiment results on the NTU-RGBD and Kinetics-Skeleton prove that our proposed method reveals better results than existing methods.
Applied Intelligence – Springer Journals
Published: Jun 1, 2023
Keywords: Graph convolution; Convolutional Neural Network; Adaptive; Attention module; Action recognition
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.