CNN(convolutional neural network)‑based underwater image enhancement methods neglect global visual perception, leading to color distortion and contrast degradation.
A Transformer‑based multi‑scale underwater image enhancement network (MTransNet) is proposed. To address the problem of lacking global visual perception, a position encoding module is designed based on underwater image priors and a Swin Transformer module which is applicable to underwater scenes is constructed. Furthermore, self‑attention mechanism is built to improve global perception performance. As for the detail blurring that exists in current methods, a CNN module is developed to capture local features such as textures or edges, to improve local perception performance. The transfer fusion module is built to transfer global attention of Swin Transformer to local convolutional feature, achieving full fusion and utilization of global feature and local feature. The PSNR value on subsets of EUVP can reach up to 23.47 dB, which demonstrates the method can significantly enhance global visual perception and increase image visual quality.
3) 对比方法:从主观视觉效果和客观评价指标两方面与先进的水下图像增强算法进行对比,包括基于扩散的方法(fusion‑based method)[21],基于Retinex的方法(Retinex‑based)[22],最小色彩损失和局部自适应对比度增强方法(minimal color loss and locally adaptive contrast enhancement method for underwater image enhancement,MMLE)[23]等;基于CNN的Water-Net[10],Ucolor[11];基于GAN的FUnIE-GAN[9],水下生成对抗网络(underwater GAN,UGAN)[24].
JiangQ P, GuY S, LiC Y,et al.Underwater image enhancement quality evaluation:benchmark dataset and objective metric[J].IEEE Transactions on Circuits and Systems for Video Technology,2022,32(9):5959-5974.
HummelR.Image enhancement by histogram transformation[J].Computer Graphics and Image Processing,1977,6(2):184-195.
[4]
RezaA M.Realization of the contrast limited adaptive histogram equalization (CLAHE) for real‑time image enhancement[J].The Journal of VLSI Signal Processing Systems for Signal,Image,and Video Technology,2004,38(1):35-44.
[5]
HitamM S, YussofW N J H W, AwalludinE A,et al.Mixture contrast limited adaptive histogram equalization for underwater image enhancement[C]//2013 International Conference on Computer Applications Technology (ICCAT).Sousse,2013:1-5.
[6]
McGlameryB L.A computer model for underwater camera systems[J].Proceedings of the SPIE,1980,208:221-231.
[7]
JaffeJ S.Computer modeling and the design of optimal underwater imaging systems[J].IEEE Journal of Oceanic Engineering,1990,15(2):101-111.
[8]
WangY, ZhangJ, CaoY,et al.A deep CNN method for underwater image enhancement[C]//2017 IEEE International Conference on Image Processing (ICIP).Beijing,2017:1382-1386.
[9]
IslamM J, XiaY Y, SattarJ.Fast underwater image enhancement for improved visual perception[J].IEEE Robotics and Automation Letters,2020,5(2):3227-3234.
[10]
LiC Y, GuoC L, RenW Q,et al.An underwater image enhancement benchmark dataset and beyond[J].IEEE Transactions on Image Processing,2020,29:4376-4389.
[11]
LiC Y, AnwarS, HouJ H,et al.Underwater image enhancement via medium transmission‑guided multi‑color space embedding[J].IEEE Transactions on Image Processing,2021,30:4985-5000.
[12]
QiQ, ZhangY C, TianF,et al.Underwater image co‑enhancement with correlation feature matching and joint learning[J].IEEE Transactions on Circuits and Systems for Video Technology,2022,32(3):1133-1147.
[13]
VaswaniA, ShazeerN, ParmarN,et al. Attention is all you need[J].Advances in Neural Information Processing Systems,2017,30:6000-6010.
[14]
LiuZ, MaoH Z, WuC Y,et al. A ConvNet for the 2020s[C]// 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition.New Orleans,2022:11976-11986.
[15]
FanS J, LiangW, DingD R,et al.LACN:a lightweight attention‑guided ConvNeXt network for low‑light image enhancement[J].Engineering Applications of Artificial Intelligence,2023,117:105632.
[16]
LiuZ, LinY T, CaoY,et al.Swin Transformer:hierarchical vision Transformer using shifted windows[C]//2021 IEEE/CVF International Conference on Computer Vision.Montreal,2021:9992-10002.
[17]
GuoC L, YanQ X, AnwarS,et al.Image dehazing Transformer with transmission‑aware 3D position embedding[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition.New Orleans,2022:5802-5810.
[18]
JiangL M, ZhangC X, HuangM Y,et al.TSIT:a simple and versatile framework for image‑to‑image translation[C]//Computer Vision-ECCV 2020:16th European Conference Proceedings,Part III 16.Glasgow,2020:206-222.
[19]
WangX T, YuK, DongC,et al.Recovering realistic texture in image super‑resolution by deep spatial feature transform[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Salt Lake City,2018:606-615.
[20]
LiuR S, FanX, ZhuM,et al.Real‑world underwater enhancement:challenges,benchmarks,and solutions under natural light[J].IEEE Transactions on Circuits and Systems for Video Technology,2020,30(12):4861-4875.
[21]
AncutiC O, AncutiC, De VleeschouwerC,et al.Color balance and fusion for underwater image enhancement[J].IEEE Transactions on Image Processing,2018,27(1):379-393.
[22]
FuX Y, ZhuangP X, HuangY,et al.A retinex‑based enhancing approach for single underwater image[C]//2014 IEEE International Conference on Image Processing (ICIP).Paris,2014:4572-4576.
[23]
ZhangW D, ZhuangP X, SunH H,et al.Underwater image enhancement via minimal color loss and locally adaptive contrast enhancement[J].IEEE Transactions on Image Processing,2022,31:3997-4010.
[24]
FabbriC, IslamM J, SattarJ.Enhancing underwater imagery using generative adversarial networks[C]//2018 IEEE International Conference on Robotics and Automation (ICRA).Brisbane,2018:7159-7165.