Acta Photonica Sinica, Volume. 51, Issue 11, 1101001(2022)

Application of Deep Learning in Underwater Imaging(Invited)

Jun XIE, Jianglei DI*, and Yuwen QIN**
Author Affiliations
  • Institute of Advanced Photonics Technology,School of Information Engineering,Guangdong Provincial Key Laboratory of Information Photonics Technology,Guangdong University of Technology,Guangzhou 510006,China
  • show less
    Figures & Tables(55)
    Principle of underwater image degradation
    Classification of underwater imaging
    CNN structure
    Effects of different algorithms before and after processing[23]
    The applications of deep learning in image enhancement
    Image enhancement effect by neural networks
    WaterGAN model structure[32]
    Image restoration results[41]
    Neural network for parameter estimation[56]
    Neural network for image restoration
    Diagram of our two-stage learning[59]
    Computational polarization difference imaging systems based on Stokes vector[63]
    The relationship between K(x,y)and ∆D(x,y)[66]
    Passive under water polarization imaging detection method in neritic area[4]
    Recovery results of different underwater objects[61]
    Recovery results of different underwater objects[70]
    Neural network for polarimetric underwater image recovery
    Four kinds of polarization-intensity information confluence models and its comparative versions[73]
    Comparison between raw images and restoration results of eight models[73]
    Schematic diagram of ghost imaging
    Structure of CGI[77]
    Reconstruction results of CSGI and GIDL at different sampling rates[88]
    Reconstruction results based on DL and CS methods at different concentrations[87]
    Comparison of simulation results of UGI-GAN,UDLGI,and PDLGI at different sampling rates[84]
    Hyperspectral image data cube
    HyperDiver UHI system and its components[95]
    A multi-faced dataset from HyperDiver[95]
    Color image of the seabed from UHI and SAM classification[97]
    Underwater spectral imaging with filterwheel[89]
    A tunable LED-based underwater multispectral imaging system[98]
    Staring underwater spectral imaging system with optimal waveband subset[100]
    Self-supervised hyperspectral and multispectral image fusion network[110]
    The structure of single pixel camera[115]
    Single-pixel imaging system[116]
    Reconstruction results by traditional FSI and FSPI[129]
    Reconstruction results of GAN-FSI and FSI at different sampling rates[130]
    CS-SRCNN network structure[133]
    LLS structure
    Principle of streak tubeimaging[147]
    Results of streak tube 3D imaging[151-153]
    The target imaging with the distance of 20 m in clear water was recorded by the lidar-radar[158]
    The principle of underwater range-gated imaging system
    Images of underwater target[169]
    Holographic imaging structure diagram
    Robot-driven DIHM[198]
    Rapidly extract focused targets from underwater digital holograms[212]
    • Table 1. Summary of traditional underwater image enhancement methods and deep learning methods

      View table
      View in Article

      Table 1. Summary of traditional underwater image enhancement methods and deep learning methods

      MethodsPrincipleAdvantagesDisadvantagesApplication
      Spatial domainmethodAdjust the gray scale and RGB channels of spatial pixelsEasy to implementandobvious effectsEasy to cause oversaturation and loss of details;Has a certain blindnessAdjust the overall or local over bright(dark)problem;Increase image contrast
      Frequency domainmethodTransform images to the corresponding domain for filteringSeparate high and low frequency information;Enhance edge information;suppress interference noise;High processing efficient in the frequency domainLimited effect on processing color distortion and low contrastDenoising;Deblurring
      Color constancy methodAccording to the relationship between the environment and the target pixel,the environment information is estimated and the raw image is restored according to the hypothesisGreat color restoration effectRely on the accuracy of assumptions;Limited effect on image denoisingColor correction
      Method based on deep learningThe degraded image is restored by using the mapping between degraded image and restored image learned by neural networkNoise removal,color correction and contrast increase can be performed at the same time;No prior information is requiredNetwork training takes time;Heavily dependence on datasets;Poor generalization ability

      Denoising;

      Color correction;Improving contrast

    • Table 2. Summary of image restoration methods based on priori and deep learning

      View table
      View in Article

      Table 2. Summary of image restoration methods based on priori and deep learning

      MethodsPrincipleAdvantagesDisadvantagesApplication
      Restoration methods based on prioriThe water features and related parameters are estimated by a priori hypothesis,and the images before degradation are restored by physical modelIt is targeted and directional,and avoids blind recovery;Results recovered by physical model are naturalThe choice of a priori hypothesis is subject to subjective influence;The model deviation and other restrictive factors make it difficult to apply in complex water environment

      Deblurring;

      Color correction;Contrast enhancement

      Restoration methods based on deep learningNeural network is used to learn the mapping between degraded image and related parameters to estimate model parameters,and restore the degraded imageIt avoids subjective error caused by artificial selection of prior conditions and has certain generalizationIt heavily relies on datasets;Artificial datasets differ from the real environment;It takes longer time compared with prior method

      Deblurring;

      Color correction;Contrast enhancement

    • Table 3. Summary of underwater polarization imaging methods and deep learning-based methods

      View table
      View in Article

      Table 3. Summary of underwater polarization imaging methods and deep learning-based methods

      MethodsPrincipleAdvantagesDisadvantagesApplication
      Polarization difference imagingIt uses the difference of the light vibration between the target and the background to remove the background scattering noiseSimple and effectiveThe restoration results of objects with various polarization and details are poorDeblurring;Imaging in scattering media
      Passive polarization imagingAccording to the difference of polarization characteristics between background scattered light and target light under natural light,the clear scene image is reconstructed by using underwater light transmission modelDistance information is added to the physical model,which has a significant restoration effect on complex scenesThe background area needs to be selected manually;The model is only applicable to objects with low degree of polarization;The recovery effect is poor under high scattering concentration;Uniform light field conditions are requiredDeblurring;Imaging in scattering media
      Active polarization imagingThe active complete polarized light source is introduced,and the background scattering noise is removed by using the polarization characteristics difference between the background and the target reflected lightIt is suitable for low illumination environment;Imaging quality is better than underwater passive polarization imagingThe restoration effect is limited when the difference between the target and the background polarization degree is small or the target contains multiple polarization degrees;The assumption that the polarization direction of the target light and the background scattered light in the model is the same is different from the realityDeblurring;Imaging in scattering media
      Polarization imaging based on deep learningIt uses the additional information of polarization on light intensity to improve the effect of traditional intensity image restoration,recognition,fusion and reconstructionIt has better imaging quality and complete details than conventional imagingIt is heavily dependent on datasets and still in preliminary explorationDeblurring;Imaging in scattering media
    • Table 4. Summary of different ghost imaging methods and methods based on deep learning

      View table
      View in Article

      Table 4. Summary of different ghost imaging methods and methods based on deep learning

      MethodsPrincipleAdvantagesDisadvantagesApplication
      TGIIt calculates the correlation of light field intensity fluctuation to reconstruct the targetStrong anti-interference ability;Lensless imaging;Wide scope of actionIt needs two optical paths,which is complicated in experiment;A large amount of data needs to be collected,and the relevant calculation takes a long time;Low signal-to-noise ratioDenoising;Imaging in scattering media
      CGIThe target image is obtained by calculating the intensity distribution and the second-order correlation of the intensity collected by the detectorThe controllable light field is obtained by SLM or DMD,and the experiment is simplified to a single light path;Greater imaging perspectiveIt still needs to collect a large amount of data,and the relevant calculation takes a long timeDenoising;Imaging in scattering media
      CSGICompressed sensing is used for sparse sampling reconstruction of ghost imageIt can reconstruct high-quality images at low sampling rate and shorten the sampling time;It hashigh signal to noise ratioIt needs mass computing,and signal processing takes long timeImaging at a low sampling rate;Super resolution imaging
      DIGLThe neural network is used to learn the mapping between blurred image and clear image,or signal collected by bucket detector and reconstructed imaging to reconstruct the imageIt avoids using illumination mode and acquires high quality images at a low sampling rate;The reconstruction from barrel detector avoids the complex calculation of CS reconstruction and has better resultsItstill needs mass computing,and heavily relies on datasetsImaging at a low sampling rate
    • Table 5. Summary of traditional MS and HS fusion fusion method and deep learning-based method

      View table
      View in Article

      Table 5. Summary of traditional MS and HS fusion fusion method and deep learning-based method

      MethodsPrincipleAdvantagesDisadvantagesApplication
      Matrix factorizationBased on the linear spectral hybrid model,the end element spectral matrix with high spectral resolution and the abundance matrix with high spatial resolution are obtained by alternating non negative matrix decomposition of HS and MS data,and then the fused image with high spatial resolution and high spectral resolution are obtained by multiplicationThe model theory is simple,easy to implement and close to the actual situationIt requires iterative solution and mass computing;Model parameters are sensitive and difficult to set;It relies on observation modelHS and MS fusion
      Tensor decompositionHS is regarded as a three-dimensional tensor,which is decomposed into a three-mode factor matrix and a three-dimensional core tensor by Tucker decomposition. The core tensor is extracted from the high-resolution MS block set by tensor sparse coding,and is multiplied with the factor matrix to obtain images with high spatial resolution and high spectral resolutionThe reconstruction quality is better than that based on matrix factorizationModel parameters are sensitive and difficult to set;It requires mass computingHS and MS fusion
      Deep learning basedThe mapping between HS and MS and hyperspectral images is established by using neural network for fusionIt has high reconstruction accuracy,high efficiency and good robustness without iterationIt relies heavily on datasets and has poor generalizationHS and MS fusion
    • Table 6. Summary of different SPI reconstruction methods and methods based on deep learning

      View table
      View in Article

      Table 6. Summary of different SPI reconstruction methods and methods based on deep learning

      MethodsPrincipleAdvantagesDisadvantagesApplication
      Conventional SPIThe object image is reconstructed by cross-correlation between the illumination field modulated by random pattern and the value obtained by single pixel cameraIt has great interference immunity,high single pixel detection frequency and great weak light detection capabilityBetter image quality requires far more sampling times than the number of reconstructed image pixelsImaging in scattering media
      FSI/HSIThe Hadamard/ Fourier basis spectrum of the target image is obtained by modulating the light field with the Hadamard/ Fourier basis mask,and then the target image is reconstructed by applying the inverse Hadamard/ Fourier transformIt has great interference immunity,and reconstruct the object image without distortionHigh frequency details are easy to be lost;Image artifacts exist;High quality reconstruction requires more sampling timesImaging in scattering media
      Deeplearning basedNeural networks are used to learning the mapping ofimage or one-dimensional signalto reconstructed imagefor image reconstructionIt has high reconstruction efficiency,good reconstruction quality and certain de-noising abilityThe network is prone to over fitting and takes time to train;It requires high adaptability and robustness of neural networkImaging in scattering media
    • Table 7. Summary of different underwater laser imaging methods

      View table
      View in Article

      Table 7. Summary of different underwater laser imaging methods

      MethodsPrincipleAdvantagesDisadvantagesApplication
      LLSAccording to the characteristic that the backscattered light of waterdecreases rapidly relative to the central axis of illumination,the target light and scattered light are separated in spaceIt reduces the influence of scattered light on imagingImaging equipment has large volume;It is impossible to avoid the influence of scattering medium on the transmission optical path;Lengthy imaging time leads to accuracy degradationImaging in scattering media
      STILThe deflection module in the streak tube is used to convert the time information into the distance information to obtain the three-dimensional imageIt has high imaging accuracy,fast imaging speed and large field of viewIt is not suitable for moving target imaging;The system has a short imaging time,which cannot meet the needs of long-time photography3D imaging
      Range-gated imagingThe backscattered light in the process of light transmission is reduced by adjust the open time of laser and cameraIt reduces the influence of scattered light on imaging,and has fast imaging speedLaser energy is scattered,and only small field of view imaging can be performed;The system is costly with limited resolution,and the operation is complexImaging in scattering media;3D imaging
    • Table 8. Summary of Fourier transform reconstruction and reconstruction based on deep learning

      View table
      View in Article

      Table 8. Summary of Fourier transform reconstruction and reconstruction based on deep learning

      MethodsPrincipleAdvantagesDisadvantagesApplication
      Fourier transform reconstructionAfter the hologram is transformed into frequency domain by Fourier transform,the angle difference between the target light wave and other holographic components is used for separation,and then the spatial carrier is removed by inverse Fourier transform. The reconstructed image is obtained by calculating the diffraction integralIt can obtain the amplitude and phase information of objects in real time and quantitativelyIt needs mass computing and prior knowledge;Only a single hologram can be processed each time,so the efficiency is low3D microscopic imaging
      Holographic reconstruction based on deep learningNeural network is used to establish the mapping between hologram and reconstructed image for holographic reconstructionIt has high imaging efficiency and higher imaging quality;No prior knowledge is requiredIt relies heavily on data sets,requires a large number of different sample data and a wide range of reconstructed distance quantization modelsMicrobial 3D image reconstruction;3D particle field reconstruction;Microbial identification classification
    • Table 9. Application of deep learning in underwater imaging

      View table
      View in Article

      Table 9. Application of deep learning in underwater imaging

      Application fieldNetwork structureInput-outputdetailsLoss functionApplication problems
      Underwater Image EnhancementCNN,GANImage-imageResidual connection,Dense connection,Inception,Fusion,L1,LSE,MSE,SSIM,GAN LossDeblurring[24,26-27],Color Correction[25,29],Dehazing[28-30],Image Generation[32-39]
      Underwater Image RestorationCNN,GANImage-image,Image-parametersDense connection,Residual connection,Skip connection,Fusion,InceptionL1,Perpetual loss,MSE,GANColor Correction[54-56,57,58],Deblurring[59],Dehazing[57],Image Generation[60]
      Underwater Polarization ImagingCNNImage-imageResidual connection,Dense connection,Skip connection,Fusion,MSE,Perpetual lossDeblurring[71,73]
      Underwater Ghost ImagingMLP,CNN,GAN1D signal-image,Image-imageResidual Connection,Dense Connection,Fusion,InceptionMSE,Perpetual loss,self-designedLowSampling Rate Imaging[84-88],Deblurring[86,88]
      Underwater Spectral ImagingCNNMS image-ImageSkip ConnectionL1Spectral Fusion[108-110]
      Underwater Compressed Sensing ImagingCNN,GANImage-Image,1D signal-imageSkip ConnectionMSE,GANLow Sampling Rate Reconstruction[85,130,132-133],Deblurring[130]
      Underwater Laser Imaging
      Underwater Holographic ImagingCNNImage-3D particle field,Image-classification resultSkip connection,Residual connection,Fusion,Cross Entropy,MSE,L1,Huber loss[213]Improve Efficiency[204,207,212],3D Particle Field Reconstruction[207],Classification[210-212]
    Tools

    Get Citation

    Copy Citation Text

    Jun XIE, Jianglei DI, Yuwen QIN. Application of Deep Learning in Underwater Imaging(Invited)[J]. Acta Photonica Sinica, 2022, 51(11): 1101001

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Atmospheric and Oceanic Optics

    Received: Apr. 26, 2022

    Accepted: Jun. 27, 2022

    Published Online: Dec. 13, 2022

    The Author Email: Jianglei DI (jiangleidi@gdut.edu.cn), Yuwen QIN (qinyw@gdut.edu.cn)

    DOI:10.3788/gzxb20225111.1101001

    Topics