首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于改进YOLOv4-tiny的轻量化室内人员目标检测算法
引用本文:赵凤,李永恒,李晶,刘汉强.基于改进YOLOv4-tiny的轻量化室内人员目标检测算法[J].电子与信息学报,2022,44(11):3815-3824.
作者姓名:赵凤  李永恒  李晶  刘汉强
作者单位:1.西安邮电大学通信与信息工程学院 西安 7101212.陕西师范大学计算机科学学院 西安 710119
基金项目:国家自然科学基金(62071379, 62071378, 61901365, 62106196),陕西省自然科学基础研究计划 (2021JM-461, 2020JM-299),西安邮电大学西邮新星团队资助项目(xyt2016-01)
摘    要:深度学习在室内人员检测领域应用广泛,但是传统的卷积神经网络复杂度大且需要高算力GPU的支持,很难实现在嵌入式设备上的部署。针对上述问题,该文提出一种基于改进YOLOv4-tiny的轻量化室内人员目标检测算法。首先,设计一种改进的Ghost卷积特征提取模块,有效减少了模型的复杂度;同时,该文通过采用带有通道混洗机制的深度可分离卷积进一步减少网络参数;其次,该文构建了一种多尺度空洞卷积模块以获得更多具有判别性的特征信息,并结合改进的空洞空间金字塔池化结构和具有位置信息的注意力机制进行有效的特征融合,在提升准确率的同时提高推理速度。在多个数据集和多种硬件平台上的实验表明,该文算法在精度、速度、模型参数和体积等方面优于原YOLOv4-tiny网络,更适合部署于资源有限的嵌入式设备。

关 键 词:室内人员检测  深度学习  YOLOv4-tiny  Ghost卷积
收稿时间:2022-03-08

Lightweight Indoor Personnel Detection Algorithm Based on Improved YOLOv4-tiny
ZHAO Feng,LI Yongheng,LI Jing,LIU Hanqiang.Lightweight Indoor Personnel Detection Algorithm Based on Improved YOLOv4-tiny[J].Journal of Electronics & Information Technology,2022,44(11):3815-3824.
Authors:ZHAO Feng  LI Yongheng  LI Jing  LIU Hanqiang
Institution:1.School of Communications and Information Engineering, Xi’an University of Posts and Telecommunications, Xi’an 710121, China2.School of Computer Science, Shaanxi Normal University, Xi’an 710119, China
Abstract:Deep learning has been widely applied to the field of indoor personnel detection. However, the traditional convolutional neural networks have a high complexity and require the support of highly computational GPU. It is difficult to accomplish the implementation in the embedded devices. For the above problems, a lightweight network model based on improved YOLOv4-tiny network is proposed for indoor personnel detection. Firstly, an improved Ghost convolution feature extraction module is designed to reduce effectively the model complexity. Simultaneously, to reduce network parameters, a depth-wise separable convolution with channel shuffle mechanism is adopted in this paper. Secondly, a multi-scale dilated convolution module is developed in this paper to obtain more discriminative feature information, which combines the improved dilated space pyramid pooling module and the attention mechanism with location information for effective feature fusion, thereby improving inference accuracy and inference speed, simultaneously. The experiments on multiple datasets and hardware platforms show that the proposed model is superior to the original YOLOv4-tiny network in terms of accuracy, speed, model parameters and volume. Therefore, the proposed model is more suitable for deployment in resource-limited embedded devices.
Keywords:
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号