一种视频动作分类的处理方法及装置

发明授权

CN107463949B 一种视频动作分类的处理方法及装置失效 - 权利终止

请登陆查看更多内容

专利标题： 一种视频动作分类的处理方法及装置
申请号： CN201710573692.2

申请日： 2017-07-14
公开(公告)号： CN107463949B

公开(公告)日： 2020-02-21
发明人: 陈雯婕 , 伏文龙 , 曹立宏
申请人： 北京协同创新研究院 , 中国传媒大学
申请人地址： 北京市海淀区苏家坨镇翠湖南环路13号院1号楼
专利权人： 北京协同创新研究院,中国传媒大学
当前专利权人： 北京协同创新研究院,中国传媒大学
当前专利权人地址： 北京市海淀区苏家坨镇翠湖南环路13号院1号楼
代理机构： 北京路浩知识产权代理有限公司
代理商 王庆龙; 曹杰
主分类号： G06K9/62
IPC分类号： G06K9/62 ; G06K9/00 ; G06N3/04

摘要：

本发明实施例提供一种视频动作分类的处理方法及装置，方法包括：读取待识别的视频帧，并提取视频帧的光流图像；选择一帧视频帧作为起始帧，提取起始帧后的连续m帧视频帧x方向和y方向的光流图像，并与起始帧的RGB图像作为一个样本；将每一个样本中的光流图像和起始帧的RGB图像同时输入SCNN和TCNN，以分别获得SCNN和TCNN的最高卷积层计算出的卷积投影；根据卷积投影和多尺度卷积核的融合模块，获取视频动作的时空融合特征投影；将时空融合特征投影依次通过卷积层、最大池化层和全连接层进行计算，并根据计算结果和分类器获得视频动作所属分类。装置执行上述方法。本发明实施例提供的视频动作分类的处理方法及装置，能够提高复杂场景下人物动作的识别准确率。

摘要（英）：

Embodiments of the invention provide a processing method and device video movement classification. The method comprises the steps of reading to-be-identified video frames and extracting light stream images of the video frames; selecting one video frame as the start frame, extracting light stream images in the x direction and the y direction of m video frames following the start frame, and forming the images and an RGB image of the start frame into a sample; inputting the light stream images and the RGB image of the start frame of each sample into an SCNN and a TCNN at the same time to separately obtain convolution projections calculated by the highest convolution layers of the SCNN and the TCNN; according to the convolution projections and a fusion module of multi-scale convolution kernels, acquiring a space-time fusion feature projection of video movement; performing calculation on the space-time fusion feature projection through convolution layers, the maximum pooling layer and a full connection layer, and obtain the classification of the video movement according to the calculation results and a classifier. The device executes the method. The processing method and device for video movement classification can increase accuracy of identification of people movement in complicated scenes.

公开/授权文献

CN107463949A 一种视频动作分类的处理方法及装置公开/授权日：2017-12-12

信息查询

中国专利公布公告 Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06K	图形数据读取（图像或视频识别或理解G06V）；数据的呈现；记录载体；处理记录载体
G06K9/00	识别模式的方法或装置（图形读取或将机械参数模式（例如力或存在）转换为电信号的方法或装置 G06K11/00）（图像或视频识别或理解 G06V）（语音识别 G10L15/00 )
G06K9/62	.应用电子设备进行识别的方法或装置