In order to further improve the compression efficiency of Efficient Video Coding (HEVC) and make it more suitable for high-definition video compression. By utilizing the powerful mining ability of deep learning for video features, this paper proposes a multi input multi-scale residual convolutional neural network and network iterative training method, which significantly improves the performance of HEVC loop filtering. And a novel pixel based interpolation filtering method was proposed to further improve the compression efficiency of the encoding. The experimental results show that the algorithm proposed in this paper can reduce BD rate by an average of 7.47% in RA encoding mode. Compared with the two existing encoding optimization algorithms, the optimization algorithm proposed in this paper effectively improves compression efficiency while enhancing video quality.
根据其形式视频可划分为模拟视频和数字视频两种[1],前者由模型相机逐行或隔行扫描生成,主要用于模拟电视系统;后者由数字相机拍摄生成或由模拟视频生成,日常生活所涉及视频多为数字视频。传统编码技术已经无法满足当下数字视频压缩、存储、传输等方面的要求,由此,高效视频编码(high efficiency video coding,HEVC)应运而生[2],HEVC是为满足数字视频有线和无线传输需求而开发的视频编码标准。
HanLi, WangHua-dong. Research on deformation feature reconstruction of dynamic video multi-frame continuous image[J]. Computer Simulation, 2022, 39(12): 245-248
[3]
SullivanG J, OhmJ R, HanW J, et al. Overview of the high efficiency video coding standard[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2012, 22(12): 1649-1668.
HuiChao, JiangLin, ZhuJun, et al. Dynamic reconfigurable implementation of pixel interpolation algorithm in HEVC[J]. Computer Engineering and Design, 2022, 43(3): 764-770.
[6]
PanZ, YiX, ZhangY, et al. Efficient in-loop filtering based on enhanced deep convolutional neural networks for HEVC[J]. IEEE Transactions on Image Processing, 2020, 29: 5352-5366.
[7]
SunW, HeX, ChenH, et al. A nonlocal HEVC in-loop filter using CNN-based compression noise estimation[J]. Applied Intelligence, 2022, 52(15): 17810-17828.
WangGang, ChenHe-xin, ChenMian-shu. Adaptive interpolation filtering algorithm based on HEVC[J]. Journal of Jilin University (Science Edition), 2018, 56(2): 320-328.
[10]
LuY, HuangX, LiuH, et al. Fast SHVC inter-coding based on bayesian decision with coding depth estimation[J]. Journal of Real-Time Image Processing, 2021, 18(6): 2269-2285.
[11]
HeK, ZhangX, RenS, et al. Deep residual learning for image recognition[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 770-778.
[12]
LiJ C, FangF M, MeiK F, et al. Multi-scale residual network for image super-resolution[C]∥Proceedings of the European Conference on Computer Vision, Munichi, Germany, 2018: 527-542.
[13]
AgustssonE, TimofteR. Ntire 2017 challenge on single image super-resolution: dataset and studyp[C]∥IEEE Conference on Computer Vision and Pattern Recognition Workshops, Honolulu, USA, 2017: 1122-1131.
[14]
MaD, ZhangF, BullD. BVI-DVC: a training database for deep video compression[J]. IEEE Transactions on Multimedia, 2021, 24: 3847-3858.
[15]
KingmaD P, BaJ. Adam: a method for stochastic optimization[J]. Arxiv Preprint, 2014, 9: 14126980.
[16]
BjøntegaardG. Calculation of average PSNR differences between RD-curves[J]. ITU-T VCEG-M33, 2001, 4: 2-4.