Laser & Optoelectronics Progress, Volume. 58, Issue 24, 2415002(2021)
Exposing DeepFake Video Detection Based on Convolutional Long Short-Term Memory Network
[1] Afchar D, Nozick V, Yamagishi J et al. MesoNet: a compact facial video forgery detection network[C]. //2018 IEEE International Workshop on Information Forensics and Security (WIFS), December 11-13, 2018, Hong Kong, China(2018).
[2] Wang J X, Lei Z C. A convolutional neural network based on feature fusion for face recognition[J]. Laser & Optoelectronics Progress, 57, 101508(2020).
[3] Zhang H, Goodfellow I, Metaxas D et al. Self-attention generative adversarial networks[C]. //Proceedings of the 36th International Conference on Machine Learning, June 9-15, 2019, Long Beach, California, USA, 7354-7363(2019).
[4] Zhu J Y, Park T, Isola P et al. Unpaired image-to-image translation using cycle-consistent adversarial networks[C]. //2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy., 2242-2251(2017).
[6] Sabir E, Cheng J, Jaiswal A et al. Recurrent convolutional strategies for face manipulation detection in videos[C]. //IEEE Conference on Computer Vision and Pattern Recognition Workshops, June 16-20, 2019, Long Beach, California, USA., 80-87(2019).
[7] Zhang Y X, Li G, Cao Y et al. A method for detecting human-face-tampered videos based on interframe difference[J]. Journal of Cyber Security, 5, 49-72(2020).
[8] Zhu M K, Lu X L. Human action recognition algorithm based on Bi-LSTM-Attention model[J]. Laser & Optoelectronics Progress, 56, 151503(2019).
[9] Zhang K P, Zhang Z P, Li Z F et al. Joint face detection and alignment using multitask cascaded convolutional networks[J]. IEEE Signal Processing Letters, 23, 1499-1503(2016).
[10] He K M, Zhang X Y, Ren S Q et al. Deep residual learning for image recognition[C]. //2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA., 770-778(2016).
[11] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[C]. //3rd International Conference on Learning Representations, May 7-9, 2015, San Diego, California, USA. [S.l.: s.n.](2015).
[12] Tan M X, Le Q V. Efficientnet: rethinking model scaling for convolutional neural networks[C]. //Proceedings of the 36th International Conference on Machine Learning, June 9-15, 2019, Long Beach, California, USA. [S.l.: s.n.], 6105-6114(2019).
[13] Sutskever I, Vinyals O, Le Q V. Sequence to sequence learning with neural networks[C]. //Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8-13, 2014, Montreal, Quebec, Canada, 3104-3112(2014).
[14] Yang Y, Zhou J, Ai J B et al. Video captioning by adversarial LSTM[J]. IEEE Transactions on Image Processing, 27, 5600-5611(2018).
[15] Cornia M, Baraldi L, Serra G et al. Predicting human eye fixations via an LSTM-based saliency attentive model[J]. IEEE Transactions on Image Processing, 27, 5142-5154(2018).
[16] Xu L H, Li Z, Jiang J J et al. High-precision and lightweight facial landmark detection algorithm[J]. Laser & Optoelectronics Progress, 57, 241026(2020).
[17] Ma Y K, Peng H Y, Cambria E et al. Targeted aspect-based sentiment analysis via embedding commonsense knowledge into an attentive LSTM[C]. //Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, February 2-7, 2018, New Orleans, Louisiana, USA., 5876-5883(2018).
[18] Xu K, Ba J, Kiros R et al. Show, attend and tell: neural image caption generation with visual attention[C]. //Proceedings of the 32nd International Conference on Machine Learning, July 6-11, 2015, Lille, France, 2048-2057(2015).
[19] Shi X J, Chen Z R, Wang H et al. Convolutional LSTM network: a machine learning approach for precipitation nowcasting[C]. //Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, December 7-12, 2015, Montreal, Quebec, Canada, 802-810(2015).
[20] Hochreiter S, Schmidhuber J. Long short-term memory[J]. Neural Computation, 9, 1735-1780(1997).
[21] Rössler A, Cozzolino D, Verdoliva L et al. FaceForensics++: learning to detect manipulated facial images[C]. //2019 IEEE/CVF International Conference on Computer Vision (ICCV), October 27-November 2, 2019, Seoul, Korea (South)., 1-11(2019).
[22] Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks[C]. //Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012, December 3-6, 2012, Nevada, United States, 1106-1114(2012).
[23] Amerini I, Li C T, Caldelli R. Social network identification through image classification with CNN[J]. IEEE Access, 7, 35264-35273(2019).
[26] Vaswani A, Shazeer N, Parmar N et al. Attention is all you need[C]. //Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, California, USA., 5998-6008(2017).
[27] Amerini I, Galteri L, Caldelli R et al. Deepfake video detection through optical flow based CNN[C]. //2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), October 27-28, 2019, Seoul, Korea (South)., 1205-1207(2019).
[28] Wang Y H, Bilinski P, Bremond F et al. G3AN: disentangling appearance and motion for video generation[C]. //2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 13-19, 2020, Seattle, WA, USA, 5263-5272(2020).
Get Citation
Copy Citation Text
Bowen Zheng, Huawei Xia, Ruidong Chen, Qiankun Han. Exposing DeepFake Video Detection Based on Convolutional Long Short-Term Memory Network[J]. Laser & Optoelectronics Progress, 2021, 58(24): 2415002
Category: Machine Vision
Received: Jan. 5, 2021
Accepted: Mar. 2, 2021
Published Online: Nov. 29, 2021
The Author Email: Xia Huawei (xiahuawei@tju.edu.cn), Chen Ruidong (20517610@qq.com), Han Qiankun (15822563807@163.com)