-
公开(公告)号:US20170083770A1
公开(公告)日:2017-03-23
申请号:US15255978
申请日:2016-09-02
Applicant: Amazon Technologies, Inc.
Inventor: Adam Carlson , Douglas Ryan Gray , Ashutosh Vishwas Kulkarni , Colin Jon Taylor
CPC classification number: G06K9/00765 , G06K9/00147 , G06K9/00744 , G06K9/46 , G06K9/469 , G06K9/6212 , G06K9/6224 , G06T2207/10016
Abstract: A video segmentation system can be utilized to automate segmentation of digital video content. Features corresponding to visual, audio, and/or textual content of the video can be extracted from frames of the video. The extracted features of adjacent frames are compared according to a similarity measure to determine boundaries of a first set of shots or video segments distinguished by abrupt transitions. The first set of shots is analyzed according to certain heuristics to recognize a second set of shots distinguished by gradual transitions. Key frames can be extracted from the first and second set of shots, and the key frames can be used by the video segmentation system to group the first and second set of shots by scene. Additional processing can be performed to associate metadata, such as names of actors or titles of songs, with the detected scenes.
-
公开(公告)号:US10664140B2
公开(公告)日:2020-05-26
申请号:US15452201
申请日:2017-03-07
Applicant: Amazon Technologies, Inc.
Inventor: Charles Benjamin Franklin Waggoner , Colin Jon Taylor , Jeffrey P. Bezos , Douglas Ryan Gray
IPC: G06F3/0484 , G06F3/0488 , G06F3/01 , H04N5/262 , H04N21/4728 , H04N21/4402 , H04N21/472 , G06F3/16
Abstract: A user can select an object represented in video content in order to set a magnification level with respect to that object. A portion of the video frames containing a representation of the object is selected to maintain a presentation size of the representation corresponding to the magnification level. The selection provides for a “smart zoom” feature enabling an object of interest, such as a face of an actor, to be used in selecting an appropriate portion of each frame to magnify, such that the magnification results in a portion of the frame being selected that includes the one or more objects of interest to the user. Pre-generated tracking data can be provided for some objects, which can enable a user to select an object and then have predetermined portion selections and magnifications applied that can provide for a smoother user experience than for dynamically-determined data.
-
公开(公告)号:US20170177197A1
公开(公告)日:2017-06-22
申请号:US15452201
申请日:2017-03-07
Applicant: Amazon Technologies, Inc.
Inventor: Charles Benjamin Franklin Waggoner , Colin Jon Taylor , Jeffrey P. Bezos , Douglas Ryan Gray
IPC: G06F3/0484 , H04N5/262 , G06F3/16 , G06F3/0488 , G06F3/01
CPC classification number: G06F3/04842 , G06F3/013 , G06F3/017 , G06F3/04845 , G06F3/0488 , G06F3/04883 , G06F3/167 , G06F2203/0381 , G06F2203/04806 , H04N5/2628 , H04N21/440263 , H04N21/47205 , H04N21/4728
Abstract: A user can select an object represented in video content in order to set a magnification level with respect to that object. A portion of the video frames containing a representation of the object is selected to maintain a presentation size of the representation corresponding to the magnification level. The selection provides for a “smart zoom” feature enabling an object of interest, such as a face of an actor, to be used in selecting an appropriate portion of each frame to magnify, such that the magnification results in a portion of the frame being selected that includes the one or more objects of interest to the user. Pre-generated tracking data can be provided for some objects, which can enable a user to select an object and then have predetermined portion selections and magnifications applied that can provide for a smoother user experience than for dynamically-determined data.
-
公开(公告)号:US09436876B1
公开(公告)日:2016-09-06
申请号:US14577277
申请日:2014-12-19
Applicant: Amazon Technologies, Inc.
Inventor: Adam Carlson , Douglas Ryan Gray , Ashutosh Vishwas Kulkarni , Colin Jon Taylor
CPC classification number: G06K9/00765 , G06K9/00147 , G06K9/00744 , G06K9/46 , G06K9/469 , G06K9/6212 , G06K9/6224 , G06T2207/10016
Abstract: A video segmentation system can be utilized to automate segmentation of digital video content. Features corresponding to visual, audio, and/or textual content of the video can be extracted from frames of the video. The extracted features of adjacent frames are compared according to a similarity measure to determine boundaries of a first set of shots or video segments distinguished by abrupt transitions. The first set of shots is analyzed according to certain heuristics to recognize a second set of shots distinguished by gradual transitions. Key frames can be extracted from the first and second set of shots, and the key frames can be used by the video segmentation system to group the first and second set of shots by scene. Additional processing can be performed to associate metadata, such as names of actors or titles of songs, with the detected scenes.
Abstract translation: 可以利用视频分割系统来自动分割数字视频内容。 可以从视频的帧中提取与视频的视觉,音频和/或文本内容相对应的特征。 根据相似性度量来比较相邻帧的提取特征,以确定通过突变转换区分的第一组镜头或视频片段的边界。 根据某些启发式分析第一组镜头,以识别通过逐渐转换区分的第二组镜头。 可以从第一和第二组拍摄中提取关键帧,并且关键帧可以被视频分割系统用于按照场景对第一组和第二组拍摄进行分组。 可以执行附加处理以将元数据(例如演员的名称或歌曲的名称)与检测到的场景相关联。
-
公开(公告)号:US20150268822A1
公开(公告)日:2015-09-24
申请号:US14283554
申请日:2014-05-21
Applicant: Amazon Technologies, Inc.
Inventor: Charles Benjamin Franklin Waggoner , Colin Jon Taylor , Jeffrey P. Bezos , Douglas Ryan Gray
IPC: G06F3/0484 , G06F3/0488
CPC classification number: G06F3/04842 , G06F3/013 , G06F3/017 , G06F3/04845 , G06F3/0488 , G06F3/04883 , G06F3/167 , G06F2203/0381 , G06F2203/04806 , H04N5/2628 , H04N21/440263 , H04N21/47205 , H04N21/4728
Abstract: A user can select an object represented in video content in order to set a magnification level with respect to that object. A portion of the video frames containing a representation of the object is selected to maintain a presentation size of the representation corresponding to the magnification level. The selection provides for a “smart zoom” feature enabling an object of interest, such as a face of an actor, to be used in selecting an appropriate portion of each frame to magnify, such that the magnification results in a portion of the frame being selected that includes the one or more objects of interest to the user. Pre-generated tracking data can be provided for some objects, which can enable a user to select an object and then have predetermined portion selections and magnifications applied that can provide for a smoother user experience than for dynamically-determined data.
Abstract translation: 用户可以选择在视频内容中表示的对象,以便相对于该对象设置放大级别。 选择包含对象的表示的视频帧的一部分以维持与放大级别相对应的表示的呈现大小。 该选择提供“智能缩放”功能,使得诸如演员的脸部等感兴趣的对象能够用于选择每个帧的适当部分以放大,使得放大率导致帧的一部分为 被选择,其包括用户感兴趣的一个或多个对象。 可以为一些对象提供预生成的跟踪数据,这些对象可以使用户能够选择一个对象,然后具有应用的预定部分选择和放大倍数,这可以提供比动态确定的数据更平滑的用户体验。
-
公开(公告)号:US20180082127A1
公开(公告)日:2018-03-22
申请号:US15689193
申请日:2017-08-29
Applicant: Amazon Technologies, Inc.
Inventor: Adam Carlson , Douglas Ryan Gray , Ashutosh Vishwas Kulkarni , Colin Jon Taylor
CPC classification number: G06K9/00765 , G06K9/00147 , G06K9/00744 , G06K9/46 , G06K9/469 , G06K9/6212 , G06K9/6224 , G06T2207/10016
Abstract: A video segmentation system can be utilized to automate segmentation of digital video content. Features corresponding to visual, audio, and/or textual content of the video can be extracted from frames of the video. The extracted features of adjacent frames are compared according to a similarity measure to determine boundaries of a first set of shots or video segments distinguished by abrupt transitions. The first set of shots is analyzed according to certain heuristics to recognize a second set of shots distinguished by gradual transitions. Key frames can be extracted from the first and second set of shots, and the key frames can be used by the video segmentation system to group the first and second set of shots by scene. Additional processing can be performed to associate metadata, such as names of actors or titles of songs, with the detected scenes.
-
公开(公告)号:US09564177B1
公开(公告)日:2017-02-07
申请号:US14667645
申请日:2015-03-24
Applicant: Amazon Technologies, Inc.
Inventor: Douglas Ryan Gray , Adam Carlson , Ashutosh Vishwas Kulkarni , Anna Makris , Colin Jon Taylor
CPC classification number: G11B27/3081 , G11B27/007 , G11B27/105 , H04N5/783 , H04N5/85 , H04N9/8205
Abstract: Automatic replay or skip ahead functionality can be configured to intelligently navigate to a portion of a video a user desires to view. The context at which a user selects intelligent navigation can be analyzed to determine where to initiate automatic replay or skip ahead. The context for intelligent navigation can be based on scene or shot segmentation data, closed captioning, aggregate video navigation data from a community of users of shared demographic traits and/or interest, and/or other metadata. In the case of automatic replay, playback of a portion of a video can include enhancements for that portion, such as providing closed captioning, display at a decreased frame rate (“slow motion”), zooming in/out on a portion of the frames of a video segment, among other enhancements.
Abstract translation: 自动重播或跳过功能可以配置为智能地导航到用户希望查看的视频的一部分。 可以分析用户选择智能导航的上下文以确定在何处启动自动重放或跳过。 智能导航的背景可以基于场景或拍摄分割数据,隐藏字幕,来自共享人口特征和/或兴趣的用户社区的聚合视频导航数据和/或其他元数据。 在自动重放的情况下,视频的一部分的回放可以包括对该部分的增强,例如提供隐藏字幕,以降低的帧速率(“慢动作”)显示,在帧的一部分上放大/缩小 的视频片段,以及其他增强功能。
-
公开(公告)号:US09558784B1
公开(公告)日:2017-01-31
申请号:US14667652
申请日:2015-03-24
Applicant: Amazon Technologies, Inc.
Inventor: Douglas Ryan Gray , Adam Carlson , Ashutosh Vishwas Kulkarni , Anna Makris , Colin Jon Taylor
CPC classification number: G11B27/005 , H04N5/783 , H04N9/8205
Abstract: Automatic replay or skip ahead functionality can be configured to intelligently navigate to a portion of a video a user desires to view. The context at which a user selects intelligent navigation can be analyzed to determine where to initiate automatic replay or skip ahead. The context for intelligent navigation can be based on scene or shot segmentation data, closed captioning, aggregate video navigation data from a community of users of shared demographic traits and/or interest, and/or other metadata. In the case of automatic replay, playback of a portion of a video can include enhancements for that portion, such as providing closed captioning, display at a decreased frame rate (“slow motion”), zooming in/out on a portion of the frames of a video segment, among other enhancements.
-
公开(公告)号:US10528821B2
公开(公告)日:2020-01-07
申请号:US15689193
申请日:2017-08-29
Applicant: Amazon Technologies, Inc.
Inventor: Adam Carlson , Douglas Ryan Gray , Ashutosh Vishwas Kulkarni , Colin Jon Taylor
Abstract: A video segmentation system can be utilized to automate segmentation of digital video content. Features corresponding to visual, audio, and/or textual content of the video can be extracted from frames of the video. The extracted features of adjacent frames are compared according to a similarity measure to determine boundaries of a first set of shots or video segments distinguished by abrupt transitions. The first set of shots is analyzed according to certain heuristics to recognize a second set of shots distinguished by gradual transitions. Key frames can be extracted from the first and second set of shots, and the key frames can be used by the video segmentation system to group the first and second set of shots by scene. Additional processing can be performed to associate metadata, such as names of actors or titles of songs, with the detected scenes.
-
公开(公告)号:US09626084B2
公开(公告)日:2017-04-18
申请号:US14283554
申请日:2014-05-21
Applicant: Amazon Technologies, Inc.
Inventor: Charles Benjamin Franklin Waggoner , Colin Jon Taylor , Jeffrey P. Bezos , Douglas Ryan Gray
IPC: G06F3/0484 , G06F3/0488
CPC classification number: G06F3/04842 , G06F3/013 , G06F3/017 , G06F3/04845 , G06F3/0488 , G06F3/04883 , G06F3/167 , G06F2203/0381 , G06F2203/04806 , H04N5/2628 , H04N21/440263 , H04N21/47205 , H04N21/4728
Abstract: A user can select an object represented in video content in order to set a magnification level with respect to that object. A portion of the video frames containing a representation of the object is selected to maintain a presentation size of the representation corresponding to the magnification level. The selection provides for a “smart zoom” feature enabling an object of interest, such as a face of an actor, to be used in selecting an appropriate portion of each frame to magnify, such that the magnification results in a portion of the frame being selected that includes the one or more objects of interest to the user. Pre-generated tracking data can be provided for some objects, which can enable a user to select an object and then have predetermined portion selections and magnifications applied that can provide for a smoother user experience than for dynamically-determined data.
-
-
-
-
-
-
-
-
-