PSMNet algorithm based on dual three-pooling attention mechanism

Tengfei LIU; Dongyun LIN; Weiyao LAN; Yuehang CHEN

doi:10.5768/JAO202546.0202005

Journal of Applied Optics, Volume. 46, Issue 2, 327(2025)

PSMNet algorithm based on dual three-pooling attention mechanism

Tengfei LIU, Dongyun LIN^*, Weiyao LAN, and Yuehang CHEN

Author Affiliations

School of Aerospace Engineering, Xiamen University, Xiamen 361102, China

show less

Abstract Get PDF(in Chinese)

References(30)

[1] KAKADE A, DESHPANDE M, SARDESHPANDE S et al. 3D modelling using sequential and convolutional generative adversarial networks[C], 1-4(2021).

[2] MIN K, HAN S, LEE D et al. SAE Level 3 Autonomous driving technology of the ETRI[C], 464-466(2019).

[3] YANG J L, REN P R, ZHANG D Q et al. Neural aggregation network for video face recognition[C], 5216-5225(2017).

[4] ZENATI N, ZERHOUNI N. Dense stereo matching with application to augmented reality[C], 1503-1506(2007).

[5] HIRSCHMULLER H. Accurate and efficient stereo processing by semi-global matching and mutual information[J]. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 807-814(2005).

[6] LIU T F, LIN D Y, LAN W Y. PatchMatch stereo - cross dynamic windows based on textured bithreshold rule[C], 7382-7387(2023).

[7] ZAGORUYKO S, KOMODAKIS N. Learning to compare image patches via convolutional neural networks[C], 4353-4361(2015).

[8] ZBONTAR J, LECUN Y. Stereo matching by training a convolutional neural network to compare image patches[J]. Journal of Machine Learning Research, 17, 1-32(2016).

[9] YE X, LI J, WANG H et al. Efficient stereo matching leveraging deep local and context information[J]. IEEE Access, 18745-18755(2017).

[10] DOSOVITSKIY A, FISCHER P, ILG E et al. FlowNet: learning optical flow with convolutional networks.[C], 2758-2766(2015).

[11] MAYER N, ILG E, HAUSSER P et al. A large dataset to train convolutional networks for disparity, optical flow, and scene flow estimation[C], 4040-4048(2016).

[12] CHANG J, CHEN Y. Pyramid stereo matching network[C], 5410-5418(2018).

[13] YANG G, MANELA J, HAPPOLD M et al. Hierarchical deep stereo matching on high-resolution images[C], 5515-5524(2019).

[14] KENDALL A, MARTIROSYAN H, DASGUPTA S et al. End-to-end learning of geometry and context for deep stereo regression[C], 66-75(2017).

[15] CHENG X, ZHONG Y, HARAKEH A et al. Learning stereo matching network with convolutional spatial propagation network[C], 156-165(2020).

[16] TANKOVICH V, KAR A, HANE C et al. Hitnet: hierarchical iterative tile refinement network for real-time stereo matching[C], 14362-14372(2021).

[17] GUO M H, XU T X, LIU J J et al. Attention mechanisms in computer vision: A survey[J]. Computational Visual Media, 8, 331-368(2022).

[18] GREGOR K, DANIHELKA I, GRAVES A et al. DRAW: a recurrent neural network for image generation[C], 1462-1471(2015).

[19] FU J, LIU J, TIAN H J et al. Dual attention network for scene segmentation[C], 3141-3149(2019).

[20] CHU X, YANG W, OUYANG W L et al. Multi-context attention for human pose estimation[C], 5669-5678(2017).

[21] XU T, ZHANG P C, HUANG Q Y et al. AttnGAN: fine-grained text to image generation with attentional generative adversarial networks[C], 1316-1324(2018).

[22] MNIH V, HEESS N, GRAVES A et al. Recurrent models of visual attention[J]. 27th International Conference on Neural Information Processing Systems, 2204-2212(2014).

[23] JADERBERG M, SIMONYAN K, ZISSERMAN A et al. Spatial transformer networks[J]. 28th International Conference on Neural Information Processing Systems, 2017-2025(2015).

[24] HU J, SHEN L, ALBANIE S et al. Squeeze-and-excitation networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42, 2011-2023(2020).

[25] MENZE M, GEIGER A. Object scene flow for autonomous vehicles[C], 3061-3070(2015).

[26] HE K, ZHANG X, REN S et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[C], 346-361(2014).

[27] GOODFELLOW I, BENGIO Y, COURVILLE A et al[M]. Deep Learning(2016).

[28] GIRSHICK R. Fast R-CNN[C], 1440-1448(2015).

[29] LAGA H, JOSPIN L V, BOUSSAID F et al. A survey on deep learning techniques for stereo-based depth estimation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 1738-1764(2022).

[30] DUGGAL S, WANG S, MA W C et al. DeepPruner: Learning efficient stereo matching via differentiable PatchMatch[C], 4383-4392(2019).

Tools

Get Citation

Copy Citation Text

Tengfei LIU, Dongyun LIN, Weiyao LAN, Yuehang CHEN. PSMNet algorithm based on dual three-pooling attention mechanism[J]. Journal of Applied Optics, 2025, 46(2): 327

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category:

Received: Feb. 1, 2024

Accepted: --

Published Online: May. 13, 2025

The Author Email: Dongyun LIN (林冬云)

DOI:10.5768/JAO202546.0202005

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology