-
公开(公告)号:US10924629B1
公开(公告)日:2021-02-16
申请号:US16712330
申请日:2019-12-12
Applicant: Amazon Technologies, Inc.
Inventor: Christian Garcia Siagian , Christian Ciabattoni , Yang Yu , Yik Pui Suen , Ryan Barlow Dall , Ritesh Pase , Ramakanth Mudumba
Abstract: Techniques for automated content validation are provided. In some examples, a media file and a metadata file associated with a title of the media file may be received. One or more scene paragraphs may be identified based at least in part on information in the metadata file. Scenes may be identified at least in part by using a transformer model implemented in a neural network. One or more scene files may be generated from the media file. One or more characters in a scene file of the one or more scene files may be identified. A match score may be determined, based at least in part on an association of the scene file to a scene paragraph of the plurality of scene paragraphs. A validity criterion may be determined for the title associated with the media file based at least in part on the match score.
-
公开(公告)号:US11342003B1
公开(公告)日:2022-05-24
申请号:US16711797
申请日:2019-12-12
Applicant: Amazon Technologies, Inc.
Inventor: Christian Garcia Siagian , Christian Ciabattoni , David Niu , Lawrence Kyuil Chang , Gordon Zheng , Ritesh Pase , Shiva Krishnamurthy , Ramakanth Mudumba
Abstract: Disclosed are various embodiments for segmenting and classifying video content using sounds. In one embodiment, a plurality of segments of a video content item are generated by analyzing audio accompanying the video content item. A subset of the plurality of segments that correspond to music segments is selected based at least in part on an audio characteristic of the subset of the plurality of segments. Individual segments of the subset of the plurality of segments are processed to determine whether a classification applies to the individual segments. A list of segments of the video content item to which the classification applies is generated.
-
公开(公告)号:US11070891B1
公开(公告)日:2021-07-20
申请号:US16708996
申请日:2019-12-10
Applicant: Amazon Technologies, Inc.
Inventor: Charles Effinger , Ryan Barlow Dall , Christian Garcia Siagian , Ramakanth Mudumba , Lawrence Kyuil Chang
IPC: H04N21/488 , H04N21/43 , H04N21/442 , H04N21/4223 , G10L15/26
Abstract: A subtitle management system is provided that analyzes and adjusts subtitles for video content to improve the experience of viewers. Subtitles may be optimized or otherwise adjusted to display in particular regions of the video content, to display in synchronization with audio presentation of the spoken dialogue represented by the subtitles, to display in particular colors, and the like. Subtitles that are permanently integrated into the video content may be identified and addressed. These and other adjustments may be applied to address any of a variety of subtitle issues and shortcomings with conventional methods of generating subtitles.
-
公开(公告)号:US10904476B1
公开(公告)日:2021-01-26
申请号:US16712294
申请日:2019-12-12
Applicant: Amazon Technologies, Inc.
Inventor: Christian Garcia Siagian , Charles Effinger , David Niu , Yang Yu , Narayan Sundaram , Arjun Cholkar , Ramakanth Mudumba
Abstract: Techniques for automated up-sampling of media files are provided. In some examples, a title associated with a media file, a metadata file associated with the title, and the media file may be received. The media file may be partitioned into one or more scene files, each scene file including a plurality of frame images in a sequence. One or more up-sampled scene files may be generated, each corresponding to a scene file of the one or more scene files. An up-sampled media file may be generated by combining at least a subset of the one or more up-sampled scene files. Generating one or more up-sampled scene files may include identifying one or more characters in a frame image of the plurality of frame images, based at least in part on implementation of a facial recognition algorithm including deep learning features in a neural network.
-
公开(公告)号:US11871068B1
公开(公告)日:2024-01-09
申请号:US16712470
申请日:2019-12-12
Applicant: Amazon Technologies, Inc.
Inventor: Christian Garcia Siagian , Ryan Barlow Dall , Charles Effinger , Ramakanth Mudumba
IPC: H04N21/43 , H04N5/06 , G06V20/40 , G06V40/16 , H04N21/233
CPC classification number: H04N21/4307 , G06V20/46 , G06V40/165 , G06V40/171 , H04N5/06 , H04N21/233 , H04N21/4302
Abstract: Techniques for identifying synchronization errors between audio and video are described herein. Audio portions in audio for media content may be identified based at least in part on a sound level associated with first respective segments of the audio portions. A subset of the audio portions may be selected based at least in part on a duration associated with the audio portions. For a segment of the subset a first number of frames in the audio and a second number of frames in the video for the segment may be determined. A determination may be made that the segment includes a conversation segment based at least in part on the first number of frames, the second number of frames, and a first threshold. A synchronization error may be identified in the conversation segment based on a difference between the audio and the video of the conversation segment.
-
公开(公告)号:US11120839B1
公开(公告)日:2021-09-14
申请号:US16711841
申请日:2019-12-12
Applicant: Amazon Technologies, Inc.
Inventor: Christian Garcia Siagian , Christian Ciabattoni , David Niu , Lawrence Kyuil Chang , Gordon Zheng , Ritesh Pase , Shiva Krishnamurthy , Ramakanth Mudumba
Abstract: Disclosed are various embodiments for segmenting and classifying video content using conversation. In one embodiment, a plurality of segments of a video content item are generated by analyzing audio accompanying the video content item. A subset of the plurality of segments that correspond to conversation segments are selected. Individual segments of the subset of the plurality of segments are processed to determine whether a classification applies to the individual segments. A list of segments of the video content item to which the classification applies is generated.
-
-
-
-
-