VIDEO ACTION SEGMENTATION BY MIXED TEMPORAL DOMAIN ADAPTION

Patent application

US20210174093A1 VIDEO ACTION SEGMENTATION BY MIXED TEMPORAL DOMAIN ADAPTION (in force)

Patent title: VIDEO ACTION SEGMENTATION BY MIXED TEMPORAL DOMAIN ADAPTION
Application number: US16706590

Filing date: 2019-12-06
Publication number: US20210174093A1

Publication date: 2021-06-10
Inventors: Baopu LI, Min-Hung CHEN, Yingze BAO
Applicant: Baidu USA, LLC
Applicant address: US CA Sunnyvale
Patentee: Baidu USA, LLC
Current patentee: Baidu USA, LLC
Current patentee address: US CA Sunnyvale
Main classification: G06K9/00
IPC classifications: G06K9/00; G06K9/62; G06N3/08; G06N3/04

Abstract:

Embodiments herein treat action segmentation as a domain adaptation (DA) problem and reduce domain discrepancy by performing unsupervised DA with auxiliary unlabeled videos. In one or more embodiments, a Mixed Temporal Domain Adaptation (MTDA) approach reduces domain discrepancy in both the spatial and temporal directions by jointly aligning frame-level and video-level embedded feature spaces across domains. In one or more embodiments, MTDA is further integrated with a domain attention mechanism that focuses on aligning the frame-level features with higher domain discrepancy, leading to more effective domain adaptation. Comprehensive experimental results validate that embodiments outperform previous state-of-the-art methods. Embodiments can adapt models effectively by using auxiliary unlabeled videos, enabling further applications to large-scale problems such as video surveillance and human activity analysis.
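
The domain attention idea in the abstract can be illustrated with a small sketch. The abstract does not say how attention weights are computed, so the entropy-based weighting below is an assumption chosen for illustration, not the claimed method. The function names (`binary_entropy`, `domain_attention_weights`, `attentive_video_feature`) and the residual `w + 1` term are likewise hypothetical. The sketch assumes a frame-level domain classifier has already produced, for each frame, a probability that the frame comes from the target domain.

```python
import math


def binary_entropy(p, eps=1e-12):
    """Entropy in bits of a two-way domain prediction; lies in [0, 1]."""
    p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


def domain_attention_weights(target_probs):
    """Assumed weighting: 1 - H(p) per frame.

    A confident domain prediction (low entropy) suggests the frame's
    features still differ strongly between domains, so it gets a weight
    near 1; an ambiguous prediction gets a weight near 0.
    """
    return [1.0 - binary_entropy(p) for p in target_probs]


def attentive_video_feature(frame_features, target_probs):
    """Pool frame-level features into a single video-level feature.

    The residual term (w + 1) is an assumption that keeps every frame
    contributing even when its attention weight is close to 0.
    """
    weights = [w + 1.0 for w in domain_attention_weights(target_probs)]
    total = sum(weights)
    dim = len(frame_features[0])
    return [
        sum(w * f[d] for w, f in zip(weights, frame_features)) / total
        for d in range(dim)
    ]


# Two toy frames: the first is domain-ambiguous, the second is not.
features = [[1.0, 0.0], [0.0, 1.0]]
probs = [0.5, 1.0]
video_feature = attentive_video_feature(features, probs)
```

In this toy case the second frame, whose domain is predicted with certainty, contributes about twice as much to the pooled video-level feature as the ambiguous first frame.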

Published/granted documents

US11138441B2 Video action segmentation by mixed temporal domain adaption (publication/grant date: 2021-10-05)

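IPC classification:

G	Physics
G06	Computing; calculating or counting
G06K	Graphical data reading (image or video recognition or understanding G06V); presentation of data; record carriers; handling record carriers
G06K9/00	Methods or arrangements for recognising patterns (methods or arrangements for graph-reading or for converting the pattern of mechanical parameters, e.g. force or presence, into electrical signals G06K11/00) (image or video recognition or understanding G06V) (speech recognition G10L15/00)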