###
计算机系统应用英文版:2023,32(2):379-386
本文二维码信息
码上扫一扫!
融合空洞卷积的轻量化目标检测
(贵州大学 计算机科学与技术学院 公共大数据国家重点实验室, 贵阳 550025)
Lightweight Target Detection Based on Dilated Convolution
(State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang 550025, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 440次   下载 1069
Received:July 13, 2022    Revised:September 07, 2022
中文摘要: 为了轻量化模型, 便于移动端设备的嵌入, 对YOLOv4网络进行了改进. 首先, 用MobileNetV3作为主干网络, 并使用深度可分离卷积替换加强特征提取网络的普通卷积, 降低模型参数量; 其次, 在104×104特征图输出时融合空洞率为2的空洞卷积, 与52×52的特征层进行特征融合, 获取更多的语义信息和位置信息, 细化特征提取能力, 提升模型对极小目标的检测性能; 最后, 将原来的池化层使用3个5×5的Maxpool进行串联, 减少计算量, 提升检测速度. 实验结果表明, 在华为云2020数据集上, 改进算法的mAP比YM算法提高了2.33%, 在公共数据集VOC07+12上, mAP提高了3.12%, FPS比原来的YOLOv4算法提高了一倍多, 参数量降低至原来的18%, 证明了改进算法的有效性.
Abstract:In order to make the model lightweight and facilitate the embedding of mobile devices, the YOLOv4 network is improved. Firstly, MobileNetV3 is used as the backbone network, and a deep separable convolution is adopted to replace the ordinary convolution of an enhanced feature extraction network, so as to reduce the number of model parameters. Secondly, when the feature map with a size of 104×104 is output, the dilated convolution with a dilated rate of 2 is fused, and it is then fused with a feature layer with a size of 52×52, so as to obtain more semantic and location information, which can refine the feature extraction ability and improve the detection performance of the model for minimal targets. Finally, the original pooling layer is connected in series with three Maxpools with a size of 5×5 to reduce the computational load and improve the detection speed. The experimental results show that on Huawei Cloud 2020 dataset, the mAP of the improved algorithm is improved by 2.33% compared with the YM algorithm, and on the public dataset VOC07 + 12, the mAP is improved by 3.12%, and the FPS has more than doubled compared with the original YOLOv4 algorithm, with the number of parameters reduced to 18% of the original one. As a result, the effectiveness of the improved algorithm is verified.
文章编号:     中图分类号:    文献标志码:
基金项目:国家自然科学基金(62162010); 贵州省科技支撑计划 (黔科合支撑[2022]一般267)
引用文本:
李洋,苟刚.融合空洞卷积的轻量化目标检测.计算机系统应用,2023,32(2):379-386
LI Yang,GOU Gang.Lightweight Target Detection Based on Dilated Convolution.COMPUTER SYSTEMS APPLICATIONS,2023,32(2):379-386