Journal of Applied Optics, Volume. 46, Issue 2, 327(2025)

PSMNet algorithm based on dual three-pooling attention mechanism

Tengfei LIU, Dongyun LIN*, Weiyao LAN, and Yuehang CHEN
Author Affiliations
  • School of Aerospace Engineering, Xiamen University, Xiamen 361102, China
  • show less
    References(30)

    [1] KAKADE A, DESHPANDE M, SARDESHPANDE S et al. 3D modelling using sequential and convolutional generative adversarial networks[C], 1-4(2021).

    [2] MIN K, HAN S, LEE D et al. SAE Level 3 Autonomous driving technology of the ETRI[C], 464-466(2019).

    [3] YANG J L, REN P R, ZHANG D Q et al. Neural aggregation network for video face recognition[C], 5216-5225(2017).

    [4] ZENATI N, ZERHOUNI N. Dense stereo matching with application to augmented reality[C], 1503-1506(2007).

    [5] HIRSCHMULLER H. Accurate and efficient stereo processing by semi-global matching and mutual information[J]. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 807-814(2005).

    [6] LIU T F, LIN D Y, LAN W Y. PatchMatch stereo - cross dynamic windows based on textured bithreshold rule[C], 7382-7387(2023).

    [7] ZAGORUYKO S, KOMODAKIS N. Learning to compare image patches via convolutional neural networks[C], 4353-4361(2015).

    [8] ZBONTAR J, LECUN Y. Stereo matching by training a convolutional neural network to compare image patches[J]. Journal of Machine Learning Research, 17, 1-32(2016).

    [9] YE X, LI J, WANG H et al. Efficient stereo matching leveraging deep local and context information[J]. IEEE Access, 18745-18755(2017).

    [10] DOSOVITSKIY A, FISCHER P, ILG E et al. FlowNet: learning optical flow with convolutional networks.[C], 2758-2766(2015).

    [11] MAYER N, ILG E, HAUSSER P et al. A large dataset to train convolutional networks for disparity, optical flow, and scene flow estimation[C], 4040-4048(2016).

    [12] CHANG J, CHEN Y. Pyramid stereo matching network[C], 5410-5418(2018).

    [13] YANG G, MANELA J, HAPPOLD M et al. Hierarchical deep stereo matching on high-resolution images[C], 5515-5524(2019).

    [14] KENDALL A, MARTIROSYAN H, DASGUPTA S et al. End-to-end learning of geometry and context for deep stereo regression[C], 66-75(2017).

    [15] CHENG X, ZHONG Y, HARAKEH A et al. Learning stereo matching network with convolutional spatial propagation network[C], 156-165(2020).

    [16] TANKOVICH V, KAR A, HANE C et al. Hitnet: hierarchical iterative tile refinement network for real-time stereo matching[C], 14362-14372(2021).

    [18] GREGOR K, DANIHELKA I, GRAVES A et al. DRAW: a recurrent neural network for image generation[C], 1462-1471(2015).

    [19] FU J, LIU J, TIAN H J et al. Dual attention network for scene segmentation[C], 3141-3149(2019).

    [20] CHU X, YANG W, OUYANG W L et al. Multi-context attention for human pose estimation[C], 5669-5678(2017).

    [21] XU T, ZHANG P C, HUANG Q Y et al. AttnGAN: fine-grained text to image generation with attentional generative adversarial networks[C], 1316-1324(2018).

    [22] MNIH V, HEESS N, GRAVES A et al. Recurrent models of visual attention[J]. 27th International Conference on Neural Information Processing Systems, 2204-2212(2014).

    [23] JADERBERG M, SIMONYAN K, ZISSERMAN A et al. Spatial transformer networks[J]. 28th International Conference on Neural Information Processing Systems, 2017-2025(2015).

    [25] MENZE M, GEIGER A. Object scene flow for autonomous vehicles[C], 3061-3070(2015).

    [26] HE K, ZHANG X, REN S et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[C], 346-361(2014).

    [27] GOODFELLOW I, BENGIO Y, COURVILLE A et al[M]. Deep Learning(2016).

    [28] GIRSHICK R. Fast R-CNN[C], 1440-1448(2015).

    [29] LAGA H, JOSPIN L V, BOUSSAID F et al. A survey on deep learning techniques for stereo-based depth estimation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 1738-1764(2022).

    [30] DUGGAL S, WANG S, MA W C et al. DeepPruner: Learning efficient stereo matching via differentiable PatchMatch[C], 4383-4392(2019).

    Tools

    Get Citation

    Copy Citation Text

    Tengfei LIU, Dongyun LIN, Weiyao LAN, Yuehang CHEN. PSMNet algorithm based on dual three-pooling attention mechanism[J]. Journal of Applied Optics, 2025, 46(2): 327

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Feb. 1, 2024

    Accepted: --

    Published Online: May. 13, 2025

    The Author Email: Dongyun LIN (林冬云)

    DOI:10.5768/JAO202546.0202005

    Topics