Journal of Applied Optics, Volume. 42, Issue 5, 867(2021)

Object detection and tracking algorithm based on audio-visual information fusion

Zhanhua HUANG... Zhilin CHEN, Hanxiao ZHANG, Yusheng CAO and Muhong SHEN |Show fewer author(s)
Author Affiliations
  • Key Laboratory of Opto-electronics Information Technology (Ministry of Education), School of Precision Instruments and Opto-electronics Engineering, Tianjin University, Tianjin 300072, China
  • show less
    References(17)

    [1] Hongpeng YIN, Bo CHEN, Yi CHAI, . Vision-based object detection and tracking: a review. Acta Automatica Sinica, 42, 1466-1489(2016).

    [2] Wanjun XU, Zhiqiang HOU, Wangsheng YU, . Fusing multi-feature for object tracking algorithm based on color and space information. Journal of Applied Optics, 36, 755-761(2015).

    [3] Chenlin SHAO, Weiping YANG, Zhilong ZHANG. Meanshift tracking algorithm based on SLIC superpixel. Journal of Applied Optics, 38, 193-199(2017).

    [4] Weiwei CUN, Zhigang CAO, Jianqiang WEI. Time delay estimation techniques in source location. Journal of Data Acquisition and Processing, 22, 90-99(2007).

    [5] [5] VERMAAK J, BLAKE A, GANG M, et al. Sequential Monte Carlo fusion of sound vision f speaker tracking[C]Proceedings of the 8th IEEE International Conference on Computer Vision(ICCV2001). Vancouver, Canada: IEEE Press, 2001: 741746.

    [6] Y CHEN, Y RUI. Real-time speaker tracking using particle filter sensor fusion. IEEE, 92, 485-494(2004).

    [7] [7] LI Xin. Human tracking based on audio visual infmation fusion its application[D]. Beijing: Tsinghua University, 2005.

    [8] [8] XIE Jing. Algithm of localization tracking based on audio visual Fusion[D]. Tianjin: Tianjin University, 2009.

    [9] [9] GIRSHICK R, DONAHUE J, DARREL T, et al. Rich feature hierarchies f accurate object detection semantic segmentation[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition. USA: IEEE: 2014: 580587.

    [10] [10] REN S, HE K, GIRSHICK R, et al. Faster RCNN: towards realtime object detection with region proposal wks[C]Advances in Neural Infmation Processing Systems.USA:IEEE, 2015: 9199.

    [11] [11] LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot multibox detect[C]European Conference on Computer Vision. Berlin, Heidelberg: Springer, 2016: 2137.

    [12] [12] REDMON J, FARHADI A. YOLO9000: better, faster, stronger[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition. USA: IEEE, 2017: 72637271.

    [13] Yong SHI, Chongzhao HAN. Adaptive UKF method with applications to target tracking. Acta Automatica Sinica, 37, 755-759(2011).

    [14] Hongyan XING, Xu YANG, Jinyu ZHANG. Sound source omnidirectional location algorithm based on four-element microphone array. Chinese Journal of Scientific Instrument, 39, 43-50(2018).

    [15] Jianhong SUN, Tao ZHANG, Chen JIAO. Influence of array and the number of microphones on the localization performance of sound source. Journal of Electronic Measurement and Instrumentation, 33, 14-21(2019).

    [16] Meng’en ZAN, Hang ZHOU, Dan HAN, . Survey of particle filter target tracking algorithms. Computer Engineering and Applications, 55, 14-23+65(2019).

    [17] Jie CAO, Jingrun ZHENG. Speaker tracking based on audio-video information fusion. Computer Engineering and Applications, 48, 118-124(2012).

    Tools

    Get Citation

    Copy Citation Text

    Zhanhua HUANG, Zhilin CHEN, Hanxiao ZHANG, Yusheng CAO, Muhong SHEN. Object detection and tracking algorithm based on audio-visual information fusion[J]. Journal of Applied Optics, 2021, 42(5): 867

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: OE INFORMATION ACQUISITION AND PROCESSING

    Received: May. 6, 2021

    Accepted: --

    Published Online: Sep. 23, 2021

    The Author Email:

    DOI:10.5768/JAO202142.0502007

    Topics