Towards Human Pose Semantic Synthesis in 3D based on Query Keywords

Mo'taz Al-Hami, Rolf Lakaemper

2015

Abstract

The work presented in this paper is part of a project to enable humanoid robots to build a semantic understanding of their environment adopting unsupervised self-learning techniques. Here, we propose an approach to learn 3-dimensional human-pose conformations, i.e. structural arrangements of a (simplified) human skeleton model, given only a minimal verbal description of a human posture (e.g. "sitting", "standing", "tree pose"). The only tools given to the robot are knowledge about the skeleton model, as well as a connection to the labeled images database "google images". Hence the main contribution of this work is to filter relevant results from an images database, given a human-pose specific query words, and to transform the information in these (2D) images into a 3D pose that is the most likely to fit the human understanding of the keywords. Steps to achieve this goal integrate available 2D human-pose estimators using still images, clustering techniques to extract representative 2D human skeleton poses, and the 3D-pose from 2D-pose estimation. We evaluate the approach using different query keywords representing different postures.

Download


Paper Citation


in Harvard Style

Al-Hami M. and Lakaemper R. (2015). Towards Human Pose Semantic Synthesis in 3D based on Query Keywords . In Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 3: VISAPP, (VISIGRAPP 2015) ISBN 978-989-758-091-8, pages 420-427. DOI: 10.5220/0005258704200427

in Bibtex Style

@conference{visapp15,
author={Mo'taz Al-Hami and Rolf Lakaemper},
title={Towards Human Pose Semantic Synthesis in 3D based on Query Keywords},
booktitle={Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 3: VISAPP, (VISIGRAPP 2015)},
year={2015},
pages={420-427},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005258704200427},
isbn={978-989-758-091-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 3: VISAPP, (VISIGRAPP 2015)
TI - Towards Human Pose Semantic Synthesis in 3D based on Query Keywords
SN - 978-989-758-091-8
AU - Al-Hami M.
AU - Lakaemper R.
PY - 2015
SP - 420
EP - 427
DO - 10.5220/0005258704200427