Laser & Optoelectronics Progress, Volume. 56, Issue 13, 131003(2019)

Head Pose Estimation Based on Multi-Scale Convolutional Neural Network

Lingyu Liang1,2,3、**, Tiantian Zhang1,3, and Wei He1、*
Author Affiliations
  • 1 Key Laboratory of Wireless Sensor Network and Communication, Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, Shanghai 201800, China
  • 2 School of Information Science and Technology, ShanghaiTech University, Shanghai 200120, China
  • 3 University of Chinese Academy of Sciences, Beijing 100049, China
  • show less

    The accuracy of head pose estimation is easy to be affected by illumination, occlusion and other disturbances in practical applications and a large number of calculations are difficult to meet timeliness of practical applications. In order to solve these problems, a method based on multi-scale convolutional neural network is proposed. The feature extraction of the input head pose image is performed by using different scale convolution kernels, which enriches the image features while preserving the image information, and enhances the robustness of the algorithm to the interference factors. At the same time, the 1×1 convolution is introduced to reduce the network structure parameters, reduce the computational complexity of the system, and improve the timeliness of the algorithm. The result of experiment shows that the recognition rates of the proposed algorithm on Pointing'04 and CAS-PEAL-R1 databases are 96.5% and 98.9%, respectively. The method shows good robustness to illumination, expression, occlusion and other disturbances, and has better operation and speed.

    Tools

    Get Citation

    Copy Citation Text

    Lingyu Liang, Tiantian Zhang, Wei He. Head Pose Estimation Based on Multi-Scale Convolutional Neural Network[J]. Laser & Optoelectronics Progress, 2019, 56(13): 131003

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Image Processing

    Received: Dec. 6, 2018

    Accepted: Jan. 24, 2019

    Published Online: Jul. 11, 2019

    The Author Email: Liang Lingyu (liangly@shanghaitech.edu.cn), He Wei (wei.he@mail.sim.ac.cn)

    DOI:10.3788/LOP56.131003

    Topics