Contrastive learning of scene representation guided by video similarities

发明授权

US12067779B1 Contrastive learning of scene representation guided by video similarities 有权

请登陆查看更多内容

专利标题： Contrastive learning of scene representation guided by video similarities
申请号： US17668014

申请日： 2022-02-09
公开(公告)号： US12067779B1

公开(公告)日： 2024-08-20
发明人: Shixing Chen , Xiang Hao , Xiaohan Nie , Muhammad Raffay Hamid
申请人： Amazon Technologies, Inc.
申请人地址： US WA Seattle
专利权人： Amazon Technologies, Inc.
当前专利权人： Amazon Technologies, Inc.
当前专利权人地址： US WA Seattle
代理机构： BakerHostetler
主分类号： G06V20/40
IPC分类号： G06V20/40 ; G06V10/774

Contrastive learning of scene representation guided by video similarities

摘要：

A plurality of similar video pairs may be determined based on one or more similarity information types. Each video pair of the plurality of similar video pairs may include a first respective video and a second respective video. For each video pair, one or more similar scene pairs may be determined. Each of the one or more similar scene pairs may include a respective first scene from the first respective video and a second respective scene from the second respective video. An encoder may be trained using a contrastive learning model that contrasts a plurality of similar scene pairs with a plurality of random scenes. The plurality of similar scene pairs may include the one or more scene pairs for each video pair. One or more scene features of one or more other scenes of one or more other videos may be determined using the encoder.

信息查询

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06V	图像或视频识别或理解
G06V20/00	场景；特定场景元素（控制数码相机 H04N5/232）
G06V20/40	.在视频内容中（提取叠加文本 G06V20/62）（视频检索 G06F16/70）（在视频服务器中处理视频基本流H04N21/234）（在视频客户端中处理视频基本流H04N21/44）