- 专利标题: VIDEO ACTION SEGMENTATION BY MIXED TEMPORAL DOMAIN ADAPTION
-
申请号: US16706590申请日: 2019-12-06
-
公开(公告)号: US20210174093A1公开(公告)日: 2021-06-10
- 发明人: Baopu LI , Min-Hung CHEN , Yingze BAO
- 申请人: Baidu USA, LLC
- 申请人地址: US CA Sunnyvale
- 专利权人: Baidu USA, LLC
- 当前专利权人: Baidu USA, LLC
- 当前专利权人地址: US CA Sunnyvale
- 主分类号: G06K9/00
- IPC分类号: G06K9/00 ; G06K9/62 ; G06N3/08 ; G06N3/04
摘要:
Embodiments herein treat the action segmentation as a domain adaption (DA) problem and reduce the domain discrepancy by performing unsupervised DA with auxiliary unlabeled videos. In one or more embodiments, to reduce domain discrepancy for both the spatial and temporal directions, embodiments of a Mixed Temporal Domain Adaptation (MTDA) approach are presented to jointly align frame-level and video-level embedded feature spaces across domains, and, in one or more embodiments, further integrate with a domain attention mechanism to focus on aligning the frame-level features with higher domain discrepancy, leading to more effective domain adaptation. Comprehensive experiment results validate that embodiments outperform previous state-of-the-art methods. Embodiments can adapt models effectively by using auxiliary unlabeled videos, leading to further applications of large-scale problems, such as video surveillance and human activity analysis.
公开/授权文献
- US11138441B2 Video action segmentation by mixed temporal domain adaption 公开/授权日:2021-10-05
信息查询