1 Beijing Sport University 2 Shandong Jianzhu University
Visual-based human action analysis is an important research topic in the field of computer vision,
and has great application prospect in sports performance analysis.
Currently available 3D action analysis datasets have a number of limitations in sports application,
including the lack of special sports actions, distinct class or score labels and variety of samples.
Existing researches mainly use various special RGB videos for sports action analysis,
but analysis with 2D features is less effective than 3D representation. In this paper,
we introduce a new 3D yoga pose dataset (3D-Yoga) with more than 3,792 action samples and 16,668
RGB-D key frames, collected from 22 subjects performing 117 kinds of yoga poses with two RGB-D cameras.
We have reconstructed 3D yoga poses with sparse multi-view data and carried out experiments with the
proposed cascade two-stream adaptive graph convolutional neural network (Cascade 2S-AGCN) to recognize and
assess these poses. Experimental results have shown the advantage of applying our 3D skeleton fusion and hierarchical analysis
methods on 3D-Yoga, and the accuracy of Cascade 2S-AGCN outperforms the state-of-theart methods.
The introduction of 3D-Yoga will enable the community to apply,
develop and adapt various methods for visual-based sports activity analysis.
The capturing of yoga poses in 3D-Yoga. (a) Scene layout. (b) Samples of yoga pose. (c) Scene elements: 1 and 2 are two indoor scenes; A, B, C, D, E, and F are the textures of the wall; a, b, c, and d are the textures of yoga mates. (d) Proportions of three light types. (e) Proportions of four cloth types.
3D-Yoga dataset contains 117 categories of yoga poses performed by 22 subjects in various indoor environments.
The framework constructed by using 2S-AGCN models and fully connected layers with Dropout and ReLU.
The trunk network and branches of Cascade 2S-AGCN are trained together.
The input is the skeleton data, and the outputs of each level are the predicted results of Classification I,
Classification II, and completion score respectively.
If someone wants to download the 3D-Yoga dataset, please fill in the
agreement,
and email Rui Cao <caorui@bsu.edu.cn>
or Jianwei Li <jianwei@bsu.edu.cn> to request the download link.
@inproceedings{2022 3DYoga,
title={3D-Yoga: A 3D Yoga Dataset for Hierarchical SportsAction Analysis},
author={ Li, Jianwei and Hu, Haing and Li, Jinyang and Zhao, Xiaomei},
booktitle={Asian Conference on Computer Vision (ACCV)},
year={2022},
}