TY - GEN
T1 - Combined framework for real-time head pose estimation using facial landmark detection and salient feature tracking
AU - Díaz Barros, Jilliam María
AU - Garcia, Frederic
AU - Mirbach, Bruno
AU - Varanasi, Kiran
AU - Stricker, Didier
N1 - Publisher Copyright:
© 2018 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved.
PY - 2018
Y1 - 2018
N2 - This paper presents a novel approach to address the head pose estimation (HPE) problem in real world and demanding applications. We propose a new framework that combines the detection of facial landmarks with the tracking of salient features within the head region. That is, rigid facial landmarks are detected from a given face image, while at the same time, salient features are detected within the head region. The 3D coordinates of both set of features result from their intersection on a simple geometric head model (e.g., cylinder or ellipsoid). We then formulate the HPE problem as a perspective-n-point problem that we separately solve by minimizing the reprojection error of each 3D features set and their corresponding facial or salient features in the next face image. The resulting head pose estimations are then combined using Kalman Filter, which allows us to take advantage of the high accuracy when using facial landmarks while enabling us to handle extreme head poses by using salient features. Results are comparable to those from the related literature, with the advantage of being robust under real world situations that might not be covered in the evaluated datasets.
AB - This paper presents a novel approach to address the head pose estimation (HPE) problem in real world and demanding applications. We propose a new framework that combines the detection of facial landmarks with the tracking of salient features within the head region. That is, rigid facial landmarks are detected from a given face image, while at the same time, salient features are detected within the head region. The 3D coordinates of both set of features result from their intersection on a simple geometric head model (e.g., cylinder or ellipsoid). We then formulate the HPE problem as a perspective-n-point problem that we separately solve by minimizing the reprojection error of each 3D features set and their corresponding facial or salient features in the next face image. The resulting head pose estimations are then combined using Kalman Filter, which allows us to take advantage of the high accuracy when using facial landmarks while enabling us to handle extreme head poses by using salient features. Results are comparable to those from the related literature, with the advantage of being robust under real world situations that might not be covered in the evaluated datasets.
KW - Fusion
KW - Head pose estimation
KW - Real time
UR - http://www.scopus.com/inward/record.url?scp=85047830917&partnerID=8YFLogxK
U2 - 10.5220/0006628701230133
DO - 10.5220/0006628701230133
M3 - Conference contribution
AN - SCOPUS:85047830917
T3 - VISIGRAPP 2018 - Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
SP - 123
EP - 133
BT - VISAPP
A2 - Imai, Francisco
A2 - Tremeau, Alain
A2 - Braz, Jose
PB - SciTePress
T2 - 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2018
Y2 - 27 January 2018 through 29 January 2018
ER -