首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于时空双流卷积神经网络的红外行为识别
引用本文:吴雪平,孙韶媛,李佳豪,李大威.基于时空双流卷积神经网络的红外行为识别[J].应用光学,2018,39(5):743-750.
作者姓名:吴雪平  孙韶媛  李佳豪  李大威
作者单位:1.东华大学 信息科学与技术学院,上海 201620
基金项目:上海市科委基础研究项目15JC1400600国家青年自然科学61603089上海市青年科技英才扬帆计划16YF1400100
摘    要:针对红外视频人体行为识别问题,提出了一种基于时空双流卷积神经网络的红外人体行为识别方法。通过将整个红外视频进行平均分段,然后将每一段视频中随机抽取的红外图像和对应的光流图像输入空间卷积神经网络,空间卷积神经网络通过融合光流信息可以有效地学习到红外图像中真正发生运动的空间信息,再将每一小段的识别结果进行融合得到空间网络结果。同时将每一段视频中随机抽取的光流图像序列输入时间卷积神经网络,融合每一小段的结果后得到时间网络结果。最后再将空间网络结果和时间网络结果进行加权求和,从而得到最终的视频分类结果。实验中,采用此方法对包含23种红外行为动作类别的红外视频数据集上的动作进行识别,正确识别率为92.0%。结果表明,该算法可以有效地对红外视频行为进行准确识别。

关 键 词:人体行为识别    卷积神经网络    信息融合    红外视频    视频分段
收稿时间:2018-05-11

Infrared behavior recognition based on spatio-temporal two-stream convolutional neural networks
Institution:1.School of Information Science and Technology, Donghua University, Shanghai 201620, China2.Engineering Research Center of Digitized Textile & Fashion Technology(Ministry of Education), Donghua University, Shanghai 201620, China
Abstract:Aiming at the recognition of human behavior in infrared video, an infrared human behavior recognition method based on spatio-temporal two-flow convolutional neural network was proposed. In this method, first the entire infrared video is equally segmented, and then the infrared image extracted randomly and the corresponding optical flow image in each video segment are input into the spatial convolutional neural network, and the spatial network can effectively learn which part of the infrared image is actually the action by merging the optical flow information. Next the recognition results of each small segment are merged to get the spatial network results. At the same time, the randomly selected optical stream image sequence in each segment of the video is input into the temporal convolutional neural network, and the result of the temporal network can be obtained by fusing the result of each small segment. Finally, the results of spatial network and the temporal network are weighted and summed to obtain the final video classification results.In the experiment, the action on the infrared video data set containing 23 kinds of infrared behavior action categories was identified by this method, and the correct recognition rate was 92.0%. The results show that the algorithm can effectively identify the infrared video behavior.
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《应用光学》浏览原始摘要信息
点击此处可从《应用光学》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号