Journal of Applied Optics, Volume. 42, Issue 5, 867(2021)
Object detection and tracking algorithm based on audio-visual information fusion
[1] YIN Hongpeng, CHEN Bo, CHAI Yi, . Vision-based object detection and tracking: a review[J]. Acta Automatica Sinica, 42, 1466-1489(2016).
[2] XU Wanjun, HOU Zhiqiang, YU Wangsheng, . Fusing multi-feature for object tracking algorithm based on color and space information[J]. Journal of Applied Optics, 36, 755-761(2015).
[3] SHAO Chenlin, YANG Weiping, ZHANG Zhilong. Meanshift tracking algorithm based on SLIC superpixel[J]. Journal of Applied Optics, 38, 193-199(2017).
[4] CUN Weiwei, CAO Zhigang, WEI Jianqiang. Time delay estimation techniques in source location[J]. Journal of Data Acquisition and Processing, 22, 90-99(2007).
[5] [5] VERMAAK J, BLAKE A, GANG M, et al. Sequential Monte Carlo fusion of sound vision f speaker tracking[C]Proceedings of the 8th IEEE International Conference on Computer Vision(ICCV2001). Vancouver, Canada: IEEE Press, 2001: 741746.
[6] CHEN Y, RUI Y. Real-time speaker tracking using particle filter sensor fusion[J]. IEEE, 92, 485-494(2004).
[7] [7] LI Xin. Human tracking based on audio visual infmation fusion its application[D]. Beijing: Tsinghua University, 2005.
[8] [8] XIE Jing. Algithm of localization tracking based on audio visual Fusion[D]. Tianjin: Tianjin University, 2009.
[9] [9] GIRSHICK R, DONAHUE J, DARREL T, et al. Rich feature hierarchies f accurate object detection semantic segmentation[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition. USA: IEEE: 2014: 580587.
[10] [10] REN S, HE K, GIRSHICK R, et al. Faster RCNN: towards realtime object detection with region proposal wks[C]Advances in Neural Infmation Processing Systems.USA:IEEE, 2015: 9199.
[11] [11] LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot multibox detect[C]European Conference on Computer Vision. Berlin, Heidelberg: Springer, 2016: 2137.
[12] [12] REDMON J, FARHADI A. YOLO9000: better, faster, stronger[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition. USA: IEEE, 2017: 72637271.
[13] SHI Yong, HAN Chongzhao. Adaptive UKF method with applications to target tracking[J]. Acta Automatica Sinica, 37, 755-759(2011).
[14] XING Hongyan, YANG Xu, ZHANG Jinyu. Sound source omnidirectional location algorithm based on four-element microphone array[J]. Chinese Journal of Scientific Instrument, 39, 43-50(2018).
[15] SUN Jianhong, ZHANG Tao, JIAO Chen. Influence of array and the number of microphones on the localization performance of sound source[J]. Journal of Electronic Measurement and Instrumentation, 33, 14-21(2019).
[16] ZAN Meng’en, ZHOU Hang, HAN Dan, . Survey of particle filter target tracking algorithms[J]. Computer Engineering and Applications, 55, 14-23+65(2019).
[17] CAO Jie, ZHENG Jingrun. Speaker tracking based on audio-video information fusion[J]. Computer Engineering and Applications, 48, 118-124(2012).
Get Citation
Copy Citation Text
Zhanhua HUANG, Zhilin CHEN, Hanxiao ZHANG, Yusheng CAO, Muhong SHEN. Object detection and tracking algorithm based on audio-visual information fusion[J]. Journal of Applied Optics, 2021, 42(5): 867
Category: OE INFORMATION ACQUISITION AND PROCESSING
Received: May. 6, 2021
Accepted: --
Published Online: Sep. 23, 2021
The Author Email: