Laser & Optoelectronics Progress, Volume. 58, Issue 24, 2415002(2021)

Exposing DeepFake Video Detection Based on Convolutional Long Short-Term Memory Network

Bowen Zheng, Huawei Xia*, Ruidong Chen**, and Qiankun Han***
Author Affiliations
  • School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
  • show less
    References(28)

    [1] Afchar D, Nozick V, Yamagishi J et al. MesoNet: a compact facial video forgery detection network[C]. //2018 IEEE International Workshop on Information Forensics and Security (WIFS), December 11-13, 2018, Hong Kong, China(2018).

    [2] Wang J X, Lei Z C. A convolutional neural network based on feature fusion for face recognition[J]. Laser & Optoelectronics Progress, 57, 101508(2020).

    [3] Zhang H, Goodfellow I, Metaxas D et al. Self-attention generative adversarial networks[C]. //Proceedings of the 36th International Conference on Machine Learning, June 9-15, 2019, Long Beach, California, USA, 7354-7363(2019).

    [4] Zhu J Y, Park T, Isola P et al. Unpaired image-to-image translation using cycle-consistent adversarial networks[C]. //2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy., 2242-2251(2017).

    [6] Sabir E, Cheng J, Jaiswal A et al. Recurrent convolutional strategies for face manipulation detection in videos[C]. //IEEE Conference on Computer Vision and Pattern Recognition Workshops, June 16-20, 2019, Long Beach, California, USA., 80-87(2019).

    [7] Zhang Y X, Li G, Cao Y et al. A method for detecting human-face-tampered videos based on interframe difference[J]. Journal of Cyber Security, 5, 49-72(2020).

    [8] Zhu M K, Lu X L. Human action recognition algorithm based on Bi-LSTM-Attention model[J]. Laser & Optoelectronics Progress, 56, 151503(2019).

    [9] Zhang K P, Zhang Z P, Li Z F et al. Joint face detection and alignment using multitask cascaded convolutional networks[J]. IEEE Signal Processing Letters, 23, 1499-1503(2016).

    [10] He K M, Zhang X Y, Ren S Q et al. Deep residual learning for image recognition[C]. //2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA., 770-778(2016).

    [11] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[C]. //3rd International Conference on Learning Representations, May 7-9, 2015, San Diego, California, USA. [S.l.: s.n.](2015).

    [12] Tan M X, Le Q V. Efficientnet: rethinking model scaling for convolutional neural networks[C]. //Proceedings of the 36th International Conference on Machine Learning, June 9-15, 2019, Long Beach, California, USA. [S.l.: s.n.], 6105-6114(2019).

    [13] Sutskever I, Vinyals O, Le Q V. Sequence to sequence learning with neural networks[C]. //Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8-13, 2014, Montreal, Quebec, Canada, 3104-3112(2014).

    [14] Yang Y, Zhou J, Ai J B et al. Video captioning by adversarial LSTM[J]. IEEE Transactions on Image Processing, 27, 5600-5611(2018).

    [15] Cornia M, Baraldi L, Serra G et al. Predicting human eye fixations via an LSTM-based saliency attentive model[J]. IEEE Transactions on Image Processing, 27, 5142-5154(2018).

    [16] Xu L H, Li Z, Jiang J J et al. High-precision and lightweight facial landmark detection algorithm[J]. Laser & Optoelectronics Progress, 57, 241026(2020).

    [17] Ma Y K, Peng H Y, Cambria E et al. Targeted aspect-based sentiment analysis via embedding commonsense knowledge into an attentive LSTM[C]. //Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, February 2-7, 2018, New Orleans, Louisiana, USA., 5876-5883(2018).

    [18] Xu K, Ba J, Kiros R et al. Show, attend and tell: neural image caption generation with visual attention[C]. //Proceedings of the 32nd International Conference on Machine Learning, July 6-11, 2015, Lille, France, 2048-2057(2015).

    [19] Shi X J, Chen Z R, Wang H et al. Convolutional LSTM network: a machine learning approach for precipitation nowcasting[C]. //Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, December 7-12, 2015, Montreal, Quebec, Canada, 802-810(2015).

    [20] Hochreiter S, Schmidhuber J. Long short-term memory[J]. Neural Computation, 9, 1735-1780(1997).

    [21] Rössler A, Cozzolino D, Verdoliva L et al. FaceForensics++: learning to detect manipulated facial images[C]. //2019 IEEE/CVF International Conference on Computer Vision (ICCV), October 27-November 2, 2019, Seoul, Korea (South)., 1-11(2019).

    [22] Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks[C]. //Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012, December 3-6, 2012, Nevada, United States, 1106-1114(2012).

    [23] Amerini I, Li C T, Caldelli R. Social network identification through image classification with CNN[J]. IEEE Access, 7, 35264-35273(2019).

    [26] Vaswani A, Shazeer N, Parmar N et al. Attention is all you need[C]. //Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, California, USA., 5998-6008(2017).

    [27] Amerini I, Galteri L, Caldelli R et al. Deepfake video detection through optical flow based CNN[C]. //2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), October 27-28, 2019, Seoul, Korea (South)., 1205-1207(2019).

    [28] Wang Y H, Bilinski P, Bremond F et al. G3AN: disentangling appearance and motion for video generation[C]. //2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 13-19, 2020, Seattle, WA, USA, 5263-5272(2020).

    Tools

    Get Citation

    Copy Citation Text

    Bowen Zheng, Huawei Xia, Ruidong Chen, Qiankun Han. Exposing DeepFake Video Detection Based on Convolutional Long Short-Term Memory Network[J]. Laser & Optoelectronics Progress, 2021, 58(24): 2415002

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Machine Vision

    Received: Jan. 5, 2021

    Accepted: Mar. 2, 2021

    Published Online: Nov. 29, 2021

    The Author Email: Xia Huawei (xiahuawei@tju.edu.cn), Chen Ruidong (20517610@qq.com), Han Qiankun (15822563807@163.com)

    DOI:10.3788/LOP202158.2415002

    Topics