基于细节增强与多尺度特征融合的水下目标检测
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家自然科学基金(U21A2013); 湖北省自然科学基金(2021CFB506)


Underwater Object Detection Based on Detail Enhancement and Multi-scale Feature Fusion
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对水下目标检测任务中图像分辨率低、目标尺寸变化大、水体浑浊及目标遮挡而导致检测精度低的问题, 提出一种基于细节增强与多尺度特征融合的水下目标检测模型DEMF-YOLO11n (YOLO11n underwater object detection model based on detail enhancement and multi-scale feature fusion). 模型以YOLO11n为主体, 为改善普通下采样对于低分辨率图像及小目标所造成的特征损失问题, 使用浅层与深层鲁棒下采样方法RFD (robust feature downsampling)分别替换模型浅层及深层步长卷积. 同时, 使用多核并行卷积模块PKIModule (poly kernel inception module)及上下文锚点注意力CAA (context anchor attention)对模型骨干中的C3k2进行重新设计, 以增强模型对遮挡目标及不同尺度目标的特征提取效果, 以及对复杂背景下目标的感知能力. 最后, 为解决水体浑浊等因素所导致的目标边缘细节模糊问题, 在头部网络中使用CGAFusion (content guide attention fusion)模块将深层特征与经EEM (edge enhance module)进行边缘增强后的浅层纹理特征进行自适应融合. 在RUOD数据集上的实验结果表明, DEMF-YOLO11n较基准模型mAP50提升2.8%, mAP50-95提升4.1%, 而参数量仅增加0.53M.

    Abstract:

    To address the challenges of low image resolution, significant variation in target size, water turbidity, and occlusion that lead to poor detection accuracy in underwater object detection, a YOLO11n underwater object detection model based on detail enhancement and multi-scale feature fusion (DEMF-YOLO11n) is proposed. The model is built upon the YOLO11n architecture. To mitigate feature loss caused by standard downsampling in low-resolution images and small targets, the robust feature downsampling (RFD) method is adopted to replace the shallow and deep strided convolution layers. In addition, a module in the backbone is redesigned using the poly kernel inception module (PKIModule) and context anchor attention (CAA), enhancing feature extraction for occluded and multi-scale targets, as well as improving perception in complex underwater environments. To address the blurring of target edges caused by water turbidity and similar factors, shallow texture features are first refined using the edge enhancer module (EEM), and are subsequently fused with deep semantic features in the head network via the content guide attention fusion (CGAFusion) module. Experimental results on the RUOD dataset demonstrate that the proposed DEMF-YOLO11n achieves a 2.8% improvement in mAP50 and a 4.1% improvement in mAP50-95 compared to the baseline model, with an increase of only 0.53M in the number of parameters.

    参考文献
    相似文献
    引证文献
引用本文

胡远江,杨翼,吴湘宁,王梦雪,潘志鹏.基于细节增强与多尺度特征融合的水下目标检测.计算机系统应用,2025,34(9):69-78

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-01-09
  • 最后修改日期:2025-02-17
  • 录用日期:
  • 在线发布日期: 2025-07-14
  • 出版日期:
文章二维码
您是第位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62661041 传真: Email:csa@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号