Action recognition from 3d pose data has gained increasing attention since the data is readily available for depth or RGB-D videos. The most successful approaches so far perform an expensive feature selection or mining approach for training. In this work, we introduce an algorithm that is very efficient for training and testing. The main idea is that rich structured data like 3d pose does not require sophisticated feature modeling or learning. Instead, we reduce pose data over time to histograms of relative location, velocity, and their correlations and use partial least squares to learn a compact and discriminative representation from it. Despite of its efficiency, our approach achieves state-of-the-art accuracy on four different benchmarks. We further investigate differences of 2d and 3d pose data for action recognition.
Overview of the framework.
By installing, copying, or otherwise using this Software, you agree to be bound by the terms of the license agreement (non-commercial use only). If you do not agree, do not install copy or use the Software. The Software is protected by copyright and other intellectual property laws and is licensed, not sold.
Redistribution and use in source and binary forms, with or without
modification, are permitted for any non-commercial purpose provided that
the following conditions are met:
1. Redistributions of source code must retain the above copyright
notice, this list of conditions and the following disclaimer.
2. Redistributions in binary form must reproduce the above copyright
notice, this list of conditions and the following disclaimer in the
documentation and/or other materials provided with the distribution.
Some purposes which can be non-commercial are teaching, academic research,
public demonstrations and personal experimentation. You may also distribute
this Software with books or other teaching materials, or publish the Software
on websites, that are intended to teach the use of the Software for academic
or other non-commercial purposes.
You may not use or distribute this Software or any derivative works in any
form for commercial purposes. Examples of commercial purposes would be running
business operations, licensing, leasing, or selling the Software, distributing
the Software for use with commercial products, using the Software in the
creation or use of commercial products or any other activity which purpose is to
procure a commercial gain to you or others.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL JUERGEN GALL AND CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
Please consider referencing the corresponding paper (PDF) when using this software.
If you have questions concerning the source code, please contact Abdalrahman Eweiwi.
Eweiwi A., Cheema S., Bauckhage C., and Gall J., Efficient Pose-based Action Recognition (PDF), Asian Conference on Computer Vision (ACCV'14), To appear.