From Depth Data to Head Pose Estimation: A Siamese Approach

Marco Venturelli, Guido Borghi, Roberto Vezzani, Rita Cucchiara

2017

Abstract

The correct estimation of the head pose is a problem of the great importance for many applications. For instance, it is an enabling technology in automotive for driver attention monitoring. In this paper, we tackle the pose estimation problem through a deep learning network working in regression manner. Traditional methods usually rely on visual facial features, such as facial landmarks or nose tip position. In contrast, we exploit a Convolutional Neural Network (CNN) to perform head pose estimation directly from depth data. We exploit a Siamese architecture and we propose a novel loss function to improve the learning of the regression network layer. The system has been tested on two public datasets, Biwi Kinect Head Pose and ICT-3DHP database. The reported results demonstrate the improvement in accuracy with respect to current state-of-the-art approaches and the real time capabilities of the overall framework.

Download


Paper Citation


in Harvard Style

Venturelli M., Borghi G., Vezzani R. and Cucchiara R. (2017). From Depth Data to Head Pose Estimation: A Siamese Approach . In Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, (VISIGRAPP 2017) ISBN 978-989-758-226-4, pages 194-201. DOI: 10.5220/0006104501940201

in Bibtex Style

@conference{visapp17,
author={Marco Venturelli and Guido Borghi and Roberto Vezzani and Rita Cucchiara},
title={From Depth Data to Head Pose Estimation: A Siamese Approach},
booktitle={Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, (VISIGRAPP 2017)},
year={2017},
pages={194-201},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006104501940201},
isbn={978-989-758-226-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, (VISIGRAPP 2017)
TI - From Depth Data to Head Pose Estimation: A Siamese Approach
SN - 978-989-758-226-4
AU - Venturelli M.
AU - Borghi G.
AU - Vezzani R.
AU - Cucchiara R.
PY - 2017
SP - 194
EP - 201
DO - 10.5220/0006104501940201