专利检索 ap:("MICROSOFT TECHNOLOGY LICENSING, LLC") AND inv:"Zvi Figov" 第 1 页

1.

发明授权
Textless material scene matching in videos 有权

公开(公告)号：US11995892B2

公开(公告)日：2024-05-28

申请号：US17804188

申请日：2022-05-26

申请人： Microsoft Technology Licensing, LLC

发明人： Mattan Serry , Zvi Figov , Yonit Hoffman , Maayan Yedidia

IPC分类号： G06V20/40 , G06V20/62 , G11B27/036

CPC分类号： G06V20/48 , G06V20/46 , G06V20/635 , G11B27/036

摘要： Systems, methods, and a computer-readable medium are provided for matching textless elements to texted elements in video content. A video processing system including a textless matching system may divide a video into shots, identify shots having similar durations, identify sequences of shots having similar durations, and compare image content in representative frames of the sequences to determine whether the sequences match. When the sequences are determined to match, the sequences may be paired, wherein the first sequence may include shots with overlaid text and the second sequence may include textless version of corresponding texted shots included in the first sequence. In some examples, the video processing system may further replace the determined corresponding texted shots.

2.

发明授权
Detecting prominence of objects in video information 有权

公开(公告)号：US12026200B2

公开(公告)日：2024-07-02

申请号：US17865117

申请日：2022-07-14

申请人： Microsoft Technology Licensing, LLC

发明人： Yonit Hoffman , Tom Hirshberg , Maayan Yedidia , Zvi Figov

IPC分类号： G06F16/78 , G06F3/04847 , G06F16/738 , G06F16/783 , G06V20/40

CPC分类号： G06F16/739 , G06F3/04847 , G06F16/784 , G06V20/46

摘要： A video-processing technique uses machine-trained logic to detect and track people that appear in video information. The technique then ranks the prominence of these people in the video information, to produce ranking information. The prominence of a person reflects a level of importance of the person in the video information, corresponding to the capacity of the person to draw the attention of a viewer. For instance, the prominence of the person reflects, at least in part, an extent to which the person appears in the video information. The technique performs its ranking based on person-specific feature information. The technique produces each instance of person-specific feature information by accumulating features pertaining to a particular person. One or more application systems make use of the ranking information to control the presentation of the video information.

3.

发明授权
Metadata generation for video indexing 有权

公开(公告)号：US11755643B2

公开(公告)日：2023-09-12

申请号：US16921248

申请日：2020-07-06

申请人： Microsoft Technology Licensing, LLC

发明人： Zvi Figov , Irit Ofer

IPC分类号： G06F16/00 , G06F16/71 , G06F16/783 , G06V10/25 , G06V20/40

CPC分类号： G06F16/71 , G06F16/7837 , G06V10/25 , G06V20/41 , G06V20/48

摘要： A video indexing system identifies groups of frames within a video frame sequence captured by a static camera during a same scene. Context metadata is generated for each frame in each group based on an analysis of fewer than all frames in the group. The frames are indexed in a database in association with the generated context metadata.

4.

发明授权
Head position extrapolation based on a 3D model and image data 有权

公开(公告)号：US11386609B2

公开(公告)日：2022-07-12

申请号：US17138207

申请日：2020-12-30

申请人： MICROSOFT TECHNOLOGY LICENSING, LLC

发明人： Mattan Serry , Zvi Figov , Irit Ofer

IPC分类号： G06T15/20 , G06T19/20 , G06T7/60 , G06T3/20 , G06T3/40 , G06V40/16

摘要： An approach using 3D algorithms to solve 2D head localization problems is disclosed. A system can extrapolate aspects of one part of an object, e.g., extract characteristics of a person's head, using a 2D input image of another part of the object, e.g., a 2D image of the person's face. The system then selects an appropriate 3D model by the use of facial features detected in an image of a person's face. Using the selected 3D model and the 3D rotation angles provided by a face detector, the system rotates the model and then projects the model to a 2D shape. The system then scales and translates, e.g., transforms, the 2D shape to match the 2D face bounding box. Then, using the transformed 2D shape, the system extracts a bounding box for the extracted portion of an object, e.g., the head of the person depicted in the 2D input image.