首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于深度学习的YOLO目标检测综述
引用本文:邵延华,张铎,楚红雨,张晓强,饶云波.基于深度学习的YOLO目标检测综述[J].电子与信息学报,2022,44(10):3697-3708.
作者姓名:邵延华  张铎  楚红雨  张晓强  饶云波
作者单位:1.西南科技大学信息工程学院 绵阳 6210102.电子科技大学 成都 610054
基金项目:国家自然科学基金(61601382),四川省科技计划(2019YJ0325, 2020YFG0148, 2021YFG0314)
摘    要:目标检测是计算机视觉领域的一个基础任务和研究热点。YOLO将目标检测概括为一个回归问题,实现端到端的训练和检测,由于其良好的速度-精度平衡,近几年一直处于目标检测领域的领先地位,被成功地研究、改进和应用到众多不同领域。该文对YOLO系列算法及其重要改进、应用进行了详细调研。首先,系统地梳理了YOLO家族及重要改进,包含YOLOv1-v4, YOLOv5, Scaled-YOLOv4, YOLOR和最新的YOLOX。然后,对YOLO中重要的基础网络,损失函数进行了详细的分析和总结。其次,依据不同的改进思路或应用场景对YOLO算法进行了系统的分类归纳。例如,注意力机制、3D、航拍场景、边缘计算等。最后,总结了YOLO的特点,并结合最新的文献分析可能的改进思路和研究趋势。

关 键 词:目标检测    YOLO    深度学习    卷积神经网络
收稿时间:2021-08-06

A Review of YOLO Object Detection Based on Deep Learning
SHAO Yanhua,ZHANG Duo,CHU Hongyu,ZHANG Xiaoqiang,RAO Yunbo.A Review of YOLO Object Detection Based on Deep Learning[J].Journal of Electronics & Information Technology,2022,44(10):3697-3708.
Authors:SHAO Yanhua  ZHANG Duo  CHU Hongyu  ZHANG Xiaoqiang  RAO Yunbo
Institution:1.School of Information Engineering, Southwest University of Science and Technology, Mianyang 621010, China2.University of Electronic Science & Technology of China, Chengdu 610054, China
Abstract:Object detection is one of the basic tasks and research hotspots in the field of computer vision. The YOLO (You Only Look Once) frames object detection is a regression problem to implement end-to-end training and detection. YOLO becomes the leading object detector due to its good speed-accuracy balance, which has been successfully studied, improved, and applied to many different fields. YOLO series and its important improvements and applications are investigated in detail. Firstly, the YOLO family and important improvements are systematically summarized, including YOLOv1-v4, YOLOv5, Scaled-YOLOv4, YOLOR, and the latest YOLOX. Then, important backbone and loss functions in YOLO are analyzed and summarized in detail. Next, the application of YOLO is systematically classified and summarized according to different improvement ideas or scenarios, such as attention mechanisms, three-dimensional scenes, aerial scenes, edge computing, etc. Finally, the characteristics of the YOLO series are summarized and the possible improvement ideas and research trends are analyzed in combination with the latest literature.
Keywords:
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号