Laser & Optoelectronics Progress, Volume. 56, Issue 13, 131003(2019)
Head Pose Estimation Based on Multi-Scale Convolutional Neural Network
The accuracy of head pose estimation is easy to be affected by illumination, occlusion and other disturbances in practical applications and a large number of calculations are difficult to meet timeliness of practical applications. In order to solve these problems, a method based on multi-scale convolutional neural network is proposed. The feature extraction of the input head pose image is performed by using different scale convolution kernels, which enriches the image features while preserving the image information, and enhances the robustness of the algorithm to the interference factors. At the same time, the 1×1 convolution is introduced to reduce the network structure parameters, reduce the computational complexity of the system, and improve the timeliness of the algorithm. The result of experiment shows that the recognition rates of the proposed algorithm on Pointing'04 and CAS-PEAL-R1 databases are 96.5% and 98.9%, respectively. The method shows good robustness to illumination, expression, occlusion and other disturbances, and has better operation and speed.
Get Citation
Copy Citation Text
Lingyu Liang, Tiantian Zhang, Wei He. Head Pose Estimation Based on Multi-Scale Convolutional Neural Network[J]. Laser & Optoelectronics Progress, 2019, 56(13): 131003
Category: Image Processing
Received: Dec. 6, 2018
Accepted: Jan. 24, 2019
Published Online: Jul. 11, 2019
The Author Email: Liang Lingyu (liangly@shanghaitech.edu.cn), He Wei (wei.he@mail.sim.ac.cn)