基于VGGNet改进网络结构的多尺度大熊猫面部检测 Multi-scale giant panda face detection based on the improved VGGNet architecture期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于VGGNet改进网络结构的多尺度大熊猫面部检测

引用本文：	何育欣,郑伯川,谭代伦,刘丹,蔡前舟.基于VGGNet改进网络结构的多尺度大熊猫面部检测[J].重庆大学学报(自然科学版),2020,43(11):63-71.

作者姓名：	何育欣郑伯川谭代伦刘丹蔡前舟

作者单位：	西华师范大学数学与信息学院, 四川南充 637002;西华师范大学数学与信息学院, 四川南充 637002;西华师范大学计算方法及应用软件研究所, 四川南充 637002;西华师范大学计算机学院, 四川南充 637002

基金项目：	四川省科技计划资助项目（2019YFG0299）；四川省科技创新苗子工程（2019027）；西华师范大学基本科研项目（19B045）。

摘要：	大熊猫个体识别对研究大熊猫的种群数量非常重要，大熊猫面部检测是基于面部图像的大熊猫个体识别方法中的首要关键步骤。针对现有的大熊猫面部检测方法精确度不高的问题，提出基于VGGNet-16改进网络结构的多尺度大熊猫面部检测方法。首先，以VGGNet-16网络结构为基础，通过增加残差结构与BN层，降低卷积层通道数，并采用LeakyRelu激活函数等改进，构建一个新的特征提取主干网络。其次，将一个3尺度的特征金字塔网络结构与SPP结构结合用于目标检测。最后，使用深度分离卷积结构替代常规卷积结构。实验结果表明，提出的大熊猫面部检测方法在测试集上能够达到99.48%的mAP，检测性能优于YOLOv4。
关键词：	VGGNet网络结构大熊猫面部检测目标检测
收稿时间：	2020/7/11 0:00:00
Multi-scale giant panda face detection based on the improved VGGNet architecture

HE Yuxin,ZHENG Bochuan,TAN Dailun,LIU Dan,CAI Qianzhou.Multi-scale giant panda face detection based on the improved VGGNet architecture[J].Journal of Chongqing University(Natural Science Edition),2020,43(11):63-71.

Authors:	HE Yuxin ZHENG Bochuan TAN Dailun LIU Dan CAI Qianzhou

Institution:	School of Mathematics and Information, China West Normal University, Nanchong, Sichuan 637002, P. R. China;School of Mathematics and Information, China West Normal University, Nanchong, Sichuan 637002, P. R. China;Institute of Computing Method and Application Software, China West Normal University, Nanchong, Sichuan 637002, P. R. China;School of Computer Science, China West Normal University, Nanchong, Sichuan 637002, P. R. China

Abstract:	Individual identification of giant pandas is very important for studying their population of them.. Giant panda face detection is the first key step of giant panda individual identification method based on facial images. To solve the problem that the precision of the existing giant panda face detection methods are low, a multi-scale giant panda face detection method based on improved VGGNet-16 architecture was proposed in this paper. Firstly, based on the VGGNet-16 network architecture, a new feature extraction backbone network was constructed through certain improvements such as adding the residual block and BN(Batch Normalization) layer, reducing the channel dimensionality of convolution layer and adopting LeakyRelu active function as well. Secondly, a 3-scale feature pyramid network structure was combined with SPP(Spatial Pyramid Pooling) structure for object detection. Finally, the conventional convolution architecture was replaced with the depwise separation convolution architecture. Experimental results show that the proposed method can achieve 99.48% mAP(mean average recision) in the test dataset, and the detection performance is better than YOLOv4(You Only Look Once Version 4).

Keywords:	VGGNet network structure giant panda face detection object detection

	点击此处可从《重庆大学学报(自然科学版)》浏览原始摘要信息
	点击此处可从《重庆大学学报(自然科学版)》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏