基于改进YOLOv8的水下垃圾检测研究

李艳灵; 赵添宇; 李佳漫; 梁庆琪; 杨志鹏; 陈重阳

doi:10.3969/j.issn.2097-583X.2026.01.007

信阳师范大学学报（自然科学版） ›› 2026, Vol. 39 ›› Issue (01) : 52 -57. DOI: 10.3969/j.issn.2097-583X.2026.01.007

计算机视觉∙无损检测

基于改进YOLOv8的水下垃圾检测研究

作者信息 +

Research on underwater garbage detection based on improved YOLOv8

Author information +

文章历史 +

PDF (5745K)

摘要

针对水下环境光线昏暗、图像分辨率偏低和小目标密集等问题导致现有目标检测算法性能受限的核心痛点，提出一种改进的YOLOv8水下垃圾目标检测算法。该算法在YOLOv8的特征融合模块中引入了自校准卷积，采用分组卷积的方式进行多尺度特征提取，通过下采样操作来扩大网络的感受野，旨在提升模型的多尺度特征融合与检测能力，更准确地识别水下垃圾。在Seaclear Marine Debris数据集与TrashCan数据集上开展实验，相较于YOLOv8，改进后的模型在Seaclear Marine Debris数据集上的检测精度提升了1.5个百分点，mAP值提升了1.1个百分点；在TrashCan数据集上，检测精度提升了2.4个百分点，mAP值提升了0.7个百分点。实验结果表明，所提方法在水下复杂环境中仍能保持较高检测精度，满足水下垃圾检测的实际需求。

Abstract

To address the core challenge that the existing object detection algorithms were constrained by issues such as low light， low image resolution， and dense small objects in underwater environments， an improved YOLOv8 algorithm for underwater garbage object detection was proposed. The algorithm introduced self‑calibrated convolution into the feature fusion module of YOLOv8， used the method of grouping convolution for multi scale feature extraction， and expanded the receptive field of the network through down-sampling operation， to improve the multi scale feature fusion and detection ability of the model， and more accurately identify underwater garbage. The experiments were carried out on the Seaclear Marine Debris data set and TrashCan data set. Compared with YOLOv8， the detection accuracy of the improved model on the Seaclear Marine Debris data set was improved by 1.5 percent point， mAP values increased by 1.1 percent point. On the TrashCan dataset， the detection accuracy was increased by 2.4 percent point， and the mAP value was increased by 0.7 percent point. Experimental results showed that the proposed method could maintain high detection accuracy in complex underwater environment， and could meet the actual needs of underwater garbage detection.

Graphical abstract

关键词

水下垃圾检测 / 多目标检测 / 特征融合 / 多尺度特征检测

Key words

underwater waste detection / multi object detection / feature fusion / multi scale feature detection

引用本文

引用格式 ▾

[Author(id=1251842929707229187, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=lyl75@163.com, emailSecond=null, emailThird=null, correspondingAuthor=1, authorType=1, ext={EN=AuthorExt(id=1251842929765949448, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, authorId=1251842929707229187, language=EN, stringName=Yanling LI, firstName=Yanling, middleName=null, lastName=LI, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=College of Computer and Information Technology，Xinyang Normal University，Xinyang 464000，China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1251842929816281100, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, authorId=1251842929707229187, language=CN, stringName=李艳灵, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=信阳师范大学计算机与信息技术学院，河南信阳 464000, bio={"content":"

李艳灵（1975—），女，河南新乡人，教授，博士，硕士生导师，主要从事模式识别及图像处理研究。E-mail ： lyl75@163.com

"}, bioImg=null, bioContent=

李艳灵（1975—），女，河南新乡人，教授，博士，硕士生导师，主要从事模式识别及图像处理研究。E-mail ： lyl75@163.com

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1251842929635927037, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, xref=null, ext=[AuthorCompanyExt(id=1251842929648509951, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, companyId=1251842929635927037, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=College of Computer and Information Technology，Xinyang Normal University，Xinyang 464000，China), AuthorCompanyExt(id=1251842929665287168, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, companyId=1251842929635927037, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=信阳师范大学计算机与信息技术学院，河南信阳 464000)])]), Author(id=1251842929866612752, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1251842929925333012, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, authorId=1251842929866612752, language=EN, stringName=Tianyu ZHAO, firstName=Tianyu, middleName=null, lastName=ZHAO, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=College of Computer and Information Technology，Xinyang Normal University，Xinyang 464000，China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1251842929975664663, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, authorId=1251842929866612752, language=CN, stringName=赵添宇, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=信阳师范大学计算机与信息技术学院，河南信阳 464000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1251842929635927037, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, xref=null, ext=[AuthorCompanyExt(id=1251842929648509951, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, companyId=1251842929635927037, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=College of Computer and Information Technology，Xinyang Normal University，Xinyang 464000，China), AuthorCompanyExt(id=1251842929665287168, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, companyId=1251842929635927037, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=信阳师范大学计算机与信息技术学院，河南信阳 464000)])]), Author(id=1251842930025996316, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1251842930093105186, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, authorId=1251842930025996316, language=EN, stringName=Jiaman LI, firstName=Jiaman, middleName=null, lastName=LI, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=College of Computer and Information Technology，Xinyang Normal University，Xinyang 464000，China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1251842930143436838, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, authorId=1251842930025996316, language=CN, stringName=李佳漫, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=信阳师范大学计算机与信息技术学院，河南信阳 464000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1251842929635927037, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, xref=null, ext=[AuthorCompanyExt(id=1251842929648509951, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, companyId=1251842929635927037, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=College of Computer and Information Technology，Xinyang Normal University，Xinyang 464000，China), AuthorCompanyExt(id=1251842929665287168, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, companyId=1251842929635927037, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=信阳师范大学计算机与信息技术学院，河南信阳 464000)])]), Author(id=1251842930193768490, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1251842930256683056, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, authorId=1251842930193768490, language=EN, stringName=Qingqi LIANG, firstName=Qingqi, middleName=null, lastName=LIANG, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=College of Computer and Information Technology，Xinyang Normal University，Xinyang 464000，China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1251842930307014708, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, authorId=1251842930193768490, language=CN, stringName=梁庆琪, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=信阳师范大学计算机与信息技术学院，河南信阳 464000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1251842929635927037, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, xref=null, ext=[AuthorCompanyExt(id=1251842929648509951, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, companyId=1251842929635927037, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=College of Computer and Information Technology，Xinyang Normal University，Xinyang 464000，China), AuthorCompanyExt(id=1251842929665287168, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, companyId=1251842929635927037, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=信阳师范大学计算机与信息技术学院，河南信阳 464000)])]), Author(id=1251842930361540666, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, orderNo=4, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1251842930424455233, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, authorId=1251842930361540666, language=EN, stringName=Zhipeng YANG, firstName=Zhipeng, middleName=null, lastName=YANG, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=College of Computer and Information Technology，Xinyang Normal University，Xinyang 464000，China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1251842930470592581, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, authorId=1251842930361540666, language=CN, stringName=杨志鹏, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=信阳师范大学计算机与信息技术学院，河南信阳 464000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1251842929635927037, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, xref=null, ext=[AuthorCompanyExt(id=1251842929648509951, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, companyId=1251842929635927037, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=College of Computer and Information Technology，Xinyang Normal University，Xinyang 464000，China), AuthorCompanyExt(id=1251842929665287168, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, companyId=1251842929635927037, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=信阳师范大学计算机与信息技术学院，河南信阳 464000)])]), Author(id=1251842930516729928, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, orderNo=5, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1251842930579644491, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, authorId=1251842930516729928, language=EN, stringName=Chongyang CHEN, firstName=Chongyang, middleName=null, lastName=CHEN, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=College of Computer and Information Technology，Xinyang Normal University，Xinyang 464000，China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1251842930625781838, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, authorId=1251842930516729928, language=CN, stringName=陈重阳, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=信阳师范大学计算机与信息技术学院，河南信阳 464000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1251842929635927037, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, xref=null, ext=[AuthorCompanyExt(id=1251842929648509951, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, companyId=1251842929635927037, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=College of Computer and Information Technology，Xinyang Normal University，Xinyang 464000，China), AuthorCompanyExt(id=1251842929665287168, tenantId=1045748351789510663, journalId=1179462592835063850, articleId=1251841233565143971, companyId=1251842929635927037, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=信阳师范大学计算机与信息技术学院，河南信阳 464000)])])] 李艳灵,赵添宇,李佳漫,梁庆琪,杨志鹏,陈重阳. 基于改进YOLOv8的水下垃圾检测研究[J]. 信阳师范大学学报（自然科学版）, 2026, 39(01): 52-57 DOI:10.3969/j.issn.2097-583X.2026.01.007

登录浏览全文

4963

注册一个新账户忘记密码

0 引言

过去的数十年间，人类活动产生的各类垃圾不仅严重污染宝贵的水资源，更深刻破坏水生生物的生存环境与生态平衡。尽管全球范围内已陆续出台了一系列严格的环境法规，力图遏制这一恶化趋势，但以塑料为代表的大量废弃物仍被肆意丢弃至水体之中，不仅对水域生态系统造成不可逆转的损害，同时也对水生生物的进化历程构成了前所未有的严峻挑战^［1］。

联合国《2024年全球废物管理展望》报告数据显示，2023年全球城市固体废物产量已突破了23亿吨的大关，这一数字背后是海量被丢弃的塑料制品、纸张、食物残渣等废弃物，它们共同汇聚成庞大的垃圾洪流，持续对环境造成沉重负担。报告进一步预测，城市固体废弃物产生量将从2023年的23亿吨增长到2050年的38亿吨。其中，城市固体废物中塑料废物的混合增长率为2%~3%，预计到2050年将产生约5.1亿吨塑料废物。

随着塑料使用量的持续攀升，这类难以降解的物质通过多种途径侵入河流与海洋，包括人为活动、垃圾填埋场的渗透、船舶的非法倾倒以及废水处理厂处理不彻底的污水排放等^［2］。水下垃圾不仅严重威胁着河流、海洋及沿海地区的野生动物生存，导致它们受伤甚至死亡^［3］，还破坏了这些区域的自然栖息地，使其逐渐退化^［4］。此外，水下垃圾还会干扰航行安全，对渔业和海运业造成巨大的经济损失，降低沿海社区居民的生活质量^［5］。更长远地，它还对人类健康与安全构成了潜在威胁^［6］。鉴于水下垃圾污染引发的社会、经济与生态负面影响广泛且深远，近年来，致力于构建系统监测与自动收集水下垃圾框架的研究活动显著增加^［7］。

用于目标识别与定位的深度学习算法均以卷积神经网络为核心框架^［8］。例如，CHEN等^［9］在URPC2017数据集上提出SWIPE‑Net网络架构，但检测精度较低。随着URPC2018中目标类别的扩展，JIANG等^［10］设计的增强型SSD、HAN等^［11］将平均精度均值（Mean Average Precision，mAP）分别提升至66.9%和91.2%，但检测速度较慢，无法满足多类别检测需求。ZHU等^［12］提出一种基于YOLOv8的改进算法，重点强化对小型水下碎片的检测能力，该方法在TrashCan数据集^［13］上实现了63.6%的精度和47.1%的mAP值。ĐURAŠ等^［14］在Seaclear Marine Debris数据集上采用YOLOv6模型开展实验，取得了68.9%的mAP值，虽保障了一定的检测速度，但精度较低。

针对复杂的水下环境导致目前检测精度不高的问题，提出一种改进YOLOv8的水下垃圾检测算法（Self‑Calibrated Convolutions YOLO，SC‑YOLO），通过引入自校准卷积模块（Self‑Calibrated Convolutions，SCConv）^［15］，将原有的C2f模块的标准卷积替换为SCConv，进一步提升YOLOv8的性能。C2f模块增强了模型的非线性表示能力与特征提取能力，而自校准卷积通过扩大模型的感受野来提高检测性能，两者的有机结合，使 C2f_SCConv 模块既保留了多尺度特征融合的特性，又通过自校准特征提取强化了目标细节信息的捕捉能力，最终实现 YOLOv8 特征表示能力的显著提升，有效解决了复杂水下环境带来的检测精度瓶颈问题。

1 网络模块设计

由于最新的YOLO模型在水下垃圾相关数据集上的适配性欠佳，下面以YOLOv8算法为基础框架进行模型设计。

1.1 YOLOv8模型结构

YOLOv8模型的整体架构分为三大核心模块：特征提取主干（Backbone）网络、特征融合（Neck）网络及检测头（Head）网络。它延续了前代YOLO系列的梯度分叉设计理念，通过卷积操作完成降采样后，采用C2f模块替代传统的C3模块，强化了模型对图像深层特征的挖掘与非线性表示能力。在特征融合阶段，Backbone 网络输出的多尺度特征图经上采样处理后，在 Neck 模块中完成深度融合，构建形成三级特征金字塔。这一结构借鉴了 FPN‑PANet 的经典设计思路，既充分保留了浅层特征的细节信息与深层特征的语义信息，又实现了不同尺度特征的高效互补，最终让模型在检测精度与推理速度之间达成了优异的平衡，适配多种检测场景的需求。

1.2 自校准卷积模块

采用自校准卷积对YOLOv8特征融合网络中的C2f模块进行重构优化，通过增强C2f模块的感受野，使其能够整合多空间尺度的特征信息，实现对目标特征的自校准优化。如图1所示，SCConv的核心思想是在不改变原有模型架构的前提下，通过优化CNN的基础卷积特征转换过程来提升性能。采用分组卷积策略进行多尺度特征提取，将输入特征沿通道维度划分为两个独立分支，其中一个分支执行常规卷积操作，另一个分支则利用下采样操作来扩大网络的感受野。这使得特征图上的每个空间位置都能整合两个不同尺度的特征信息，进而完成自适应的特征校准。

该模块的输入与输出通道均为C，核心运算基于一组形状为

C, C, k h, k w

的卷积核K，其中

k h

和

k w

分别代表卷积核的空间高度与宽度。首先，对原始卷积核K进行拆分处理，得到4组子卷积核

K i i = 1 4

，每组的形状统一为

C / 2, C / 2, k h, k w

。随后，将输入特征

X

均匀划分为两个分支

X 1, X 2

。在第一分支中，利用子卷积核

K 2, K 3, K 4

对

X 1

执行自校准运算，得到

Y 1

。在第二分支中，仅执行一次简单的卷积操作：

Y 2 = F 1 (X 2) = X 2 × K 1

，旨在保留原始的空间上下文信息。然后，将

两个 分支 输出 的特 征图 Y 1, Y 2

沿通道维度拼接融合，最终输出

Y

。

具体来说，自校准操作首先对给定的输入

X 1

应用平均池化，得到中间特征图

T 1

。池化核大小为

r × r

，滑动步幅为

r

，可表示为：

T 1 = A v g P o o l r X 1 。

(1)

接着，

T 1

使用子卷积核

{K 2}

进行特征转换，可表示为：

X 1' = U p (F 2 (T 1)) = U p (T 1 × K 2),

(2)

式中：

U p

表示线性插值的运算，将中间量从较小比例空间映射回原始特征空间。然后，自校准操作表示为：

Y 1' = F 3 (X 1) ⊗ σ (X 1 + X 1'),

(3)

式中：

F 3 (X 1) = X 1 × X 3

，

σ

表示激活函数，符号“

⊗

”表示元素乘法。

X 1'

用于建立自校准权重的残差项。自校准后的最终输出可表示为：

Y 1 = F 4 (Y 1') = Y 1' × K 4 。

(4)

1.3 SC‑YOLO模型结构

SC‑YOLO模型的整体结构如图2所示，该模型在YOLOv8基础框架上进行了关键性的优化。将YOLOv8特征融合网络中原有的C2f模块替换为C2f_SCConv模块。通过模块级的深度融合，实现两种结构优势的互补增效。C2f模块以其强大的特征提取与融合能力著称，而SCConv模块则在特征表示与细节捕捉方面表现出色。因此，C2f_SCConv模块不仅继承了各自的核心优势，更通过协同互补效应，显著提升了模型的特征表达能力与检测性能，为YOLOv8适配复杂多变的检测场景提供了关键支撑。

在实际应用中，多目标检测和复杂背景中的遮挡问题一直是目标检测领域的难点，而SC‑YOLO模型通过引入C2f_SCConv模块，有效提升了模型在这类场景下的检测精度和鲁棒性。同时，该结构还进一步增强了特征的表达能力，使得模型在后续的检测和识别任务中能够更加精准地利用特征信息，提高整体任务的执行效率和准确性。

2 实验结果与分析

2.1 数据集

所用数据集如表1所示。Seaclear Marine Debris数据集涵盖40个对象类别，不仅包含各类典型的海洋垃圾，还广泛纳入了在实地观测中发现的动物、植物及机器人部件等干扰元素，旨在满足目标检测和实例分割任务的需求。该数据集包含8610张海洋垃圾图像，按训练与验证用途划分为7610张训练集和1000张验证集图像，所有图像均来源于Seaclear项目的现场实测，通过专业遥控潜水器（ROV）在多个不同地理位置、多种海洋环境条件下采集获取，数据分布均衡且场景覆盖全面，为模型的训练、验证及性能评估提供了真实、丰富且具有代表性的高质量数据支撑。

TrashCan数据集是面向水下垃圾检测任务的专用数据集，它源自日本海洋地球科学技术机构（JAMSTEC）的J‑EDI（JAMSTEC深海图像电子图书馆）数据集，包括来自日本海数十年积累的ROV视频数据，共计7212张图像，其中训练集6065张、验证集1147张。数据集不仅囊括了塑料袋、塑料瓶、渔网等典型水下垃圾类型，还包括潜水器及其他水下相关物体图像。所有图像均经过专业人员的精细化标注，以确保类别标签的准确性和完整性。

2.2 实验设置

实验训练所采用的硬件平台与软件环境参数详情，如表2所示。模型训练过程中的一些核心参数设置如表3所示。

2.3 评价指标

精确度（Precision，P）、召回率（Recall，R）、平均准确率（Average Precision，AP）、各类别平均精度均值（Mean Average Precision，mAP）、参数量（Params）和浮点运算数（FLOPs）是目标检测领域常用的核心评估指标，采用P、R、mAP、Params和FLOPs作为评估指标来全面评估所提方法的综合性能。P、R、AP和mAP的计算公式如下所示：

P = T P T P + F P × 100 %,

(5)

R = T P T P + F N × 100 %,

(6)

A P = ∫ 01 P (R) d R,

(7)

m A P = 1 n ∑ i = 1 n A P i,

(8)

式中：TP为真阳性，FP为假阳性，FN为假阴性。

2.4 对比性分析

为验证SC‑YOLO模型的检测性能与泛化能力，将其与YOLO系列经典模型进行对照实验，在Seaclear Marine Debris和TrashCan数据集上的实验结果如表4和表5所示。在Seaclear Marine Debris数据集上，改进的模型精确度相较于YOLOv8s提高了1.5个百分点，mAP值提高了1.1个百分点；在TrasCan数据集上，改进的模型精确度相较于YOLOv8s提高了2.4个百分点，mAP值提高了0.7个百分点。

为直观验证SC‑YOLO模型的有效性，其在Seaclear Marine Debris数据集上的检测效果如图3所示。从图3的检测效果对比可见，YOLOv5s、YOLOv8s 与 SC‑YOLO 模型均能实现对目标的基本检测，但在核心性能表现上存在显著差异：相较于 YOLOv5s 与 YOLOv8s 基准模型，SC‑YOLO 模型输出的检测框置信度明显更高。图 4 展示了各模型在 TrashCan 专用水下垃圾数据集上的检测效果：所有模型均能不同程度地识别出图像中的塑料垃圾等核心目标，但YOLOv5s与YOLOv8s模型存在较突出的误检、漏检问题。部分小型垃圾目标被遗漏，或非垃圾物体被错误标注为垃圾，而SC‑YOLO模型则表现出更优的检测完整性与准确性，不仅全面捕捉到图像中的各类水下垃圾，且有效规避了误检、漏检情况。

2.5 mAP曲线结果对比分析

为进一步验证SC‑YOLO算法在训练收敛过程中的性能优越性，在保持硬件环境、训练超参数等条件完全一致的前提下，基于Seaclear Marine Debris与TrashCan两个数据集，对SC‐YOLO模型与YOLO系列经典模型（含YOLOv8基准模型）的训练收敛过程进行对比分析，重点追踪各类模型的平均精度均值（mAP）随训练迭代次数的变化趋势，结果分别如图5和图6所示。

3 结束语

提出了一种基于改进YOLOv8网络的水下垃圾检测算法，通过引入自校准卷积（SCConv）模块，扩大网络的感受野，增强对目标的特征提取能力，提高检测精度。SC‑YOLO模型已在多个数据集上验证了其水下垃圾检测性能，其未来的发展潜力与应用前景依然广阔，但仍有诸多方面值得深入探索与优化，特别是模型的轻量化和实时性提升，将成为未来研究工作的一个关键方向。

参考文献

原文顺序 | 出版日期 | 本文引用

[1]	WU Chunming， SUN Yiqian， WANG Tiaojun， et al. Underwater trash detection algorithm based on improved YOLOv5s［J］. Journal of Real-Time Image Processing， 2022，19（5）： 911‑920.

[2]	AGAMUTHU P， MEHRAN S B， NORKHAIRAH A， et al. Marine debris： A review of impacts and global initiatives［J］. Waste Management & Research， 2019， 37（10）： 987‑1002.

[3]	韩丽，马春海，林志浩，等. 一种用于低分辨率小目标的水下垃圾检测算法［J］. 科学技术与工程， 2024， 24（35）： 15126‑15136.

[4]	HAN Li， MA Chunhai， LIN Zhihao， et al. Underwater trash detection algorithm for low-resolution small targets［J］. Science Technology and Engineering， 2024， 24（35）： 15126‑15136.

[5]	SIMON H. Marine debris： Understanding， preventing and mitigating the significant adverse impacts on marine and coastal biodiversity［M］. Montreal， Canada： Secretariat of the Convention on Biological Diversity， 2016.

[6]

SHINDE A， SWATI S. Computer vision‑based autonomous underwater vehicle with robotic arm for garbage detection and cleaning［M］//CHOUDHURY T， KOLEY B， NAT A， et al. Geo‑Environmental Hazards using AI‑enabled Geospatial Techniques and Earth Observation Systems. Advances in Geographic Information Science， Cham： Springer， 2024： 265‑288.

[7]	ZHANG Yanghai， HUANG Zan， CHEN Changlin， et al. A spiral‑propulsion amphibious intelligent robot for land garbage cleaning and sea garbage cleaning［J］. Journal of Marine Science and Engineering， 2023， 11（8）： 1482.

[8]	KONG Shihan， TIAN Manjun， QIU Changlin， et al. IWSCR： An intelligent water surface cleaner robot for collecting floating garbage［J］. IEEE Transactions on Systems， Man， and Cybernetics： Systems， 2020， 10： 6358‑6368.

[9]	李国进，姚冬宜，艾矫燕，等. 基于改进Faster R‑CNN的水面漂浮物识别与定位［J］. 信阳师范学院学报（自然科学版）， 2021， 34（2）： 292‑299.

[10]	LI Guojin， YAO Dongyi， AI Jiaoyan， et al. Detection and localization of floating objects via improved Faster R‑CNN［J］. Journal of Xinyang Normal University（Natural Science Edition）， 2021， 34（2）： 292‑299.

[11]	CHEN Long， LIU Zhihua， TONG Lei， et al. Underwater object detection using Invert Multi‑Class Adaboost with deep learning［C］//2020 International Joint Conference on Neural Networks （IJCNN）， Glasgow， UK， 2020： 1‑8.

[12]	JIANG Zhongyun， WANG Rongrong. Underwater object detection based on improved single shot multibox detector［C］//Proceedings of the 2020 3rd International Conference on Algorithms， Computing and Artificial Intelligence， New York， NY， USA， 2020 ： 1‑7.

[13]	HAN Fenglei， YAO Jingzheng， ZHU Haitao， et al. Marine organism detection and classification from underwater vision based on the deep CNN method［J］. Mathematical Problems in Engineering， 2020， 2020： 3937580.

[14]	ZHU Jin， HU Tao， ZHENG Linhan， et al. YOLOv8‑C2f‑Faster‑EMA： An improved underwater trash detection model based on YOLOv8［J］. Sensors， 2024， 24（8）： 2483.

[15]	HONG J， FULTON M， SATTAR J. TrashCan： A semantically‑segmented dataset towards visual detection of Marine debris［EB/OL］.（2020‑07‑16）［2025‑01-04］.

[16]	ĐURAŠ A， WOLF B J， ILIOUDI A， et al.A dataset for detection and segmentation of underwater Marine debris in shallow waters［J］. Scientific Data， 2024， 11（1）： 921.

[17]	LIU Jiangjiang， HOU Qibin， CHENG Mingming， et al. Improving convolutional networks with self-calibrated convolutions［C］//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition （CVPR）， Seattle， WA， USA， 2020： 10096‑10105.