-
Publication No.: US11216684B1
Publication date: 2022-01-04
Application No.: US16781456
Application date: 2020-02-04
Applicant: Amazon Technologies, Inc.
Inventor: Hooman Mahyar, Vimal Bhat, Harshal Dilip Wanjari, Sushanta Das, Neha Aggarwal
Abstract: Techniques are described for detecting and replacing burned-in subtitles in image and video content.
-
Publication No.: US10930263B1
Publication date: 2021-02-23
Application No.: US16367814
Application date: 2019-03-28
Applicant: Amazon Technologies, Inc.
Inventor: Hooman Mahyar
IPC: G10L13/08, G10L13/033, H04N21/485, G10L15/00, G10L15/16, G10L13/00
Abstract: This disclosure describes techniques for replicating characteristics of an actor's or actress's voice across different languages. The disclosed techniques have the practical application of enabling automatic generation of dubbed video content for multiple languages, with particular speakers in each dubbing having the same voice characteristics as the corresponding speakers in the original version of the video content.
-
Publication No.: US10671854B1
Publication date: 2020-06-02
Application No.: US15948567
Application date: 2018-04-09
Applicant: Amazon Technologies, Inc.
Inventor: Hooman Mahyar, Muhammad Yahia, Harshal Dilip Wanjari
IPC: G06K9/00, G06K9/62, H04N21/234, G06N20/00
Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for intelligent content rating determination. Example methods include determining presence of a first feature in a first frame of a video using an object recognition algorithm, determining presence of a second feature in an audio file associated with the video using an audio processing algorithm, and determining presence of a third feature in a text file associated with the video using a natural language processing algorithm. Certain embodiments may include generating a predicted content rating for the video using a machine learning model, where the predicted content rating is based at least in part on the first feature, the second feature, and the third feature, and using feedback data for the predicted content rating to retrain the machine learning model.
-
Publication No.: US11816153B1
Publication date: 2023-11-14
Application No.: US17861057
Application date: 2022-07-08
Applicant: Amazon Technologies, Inc.
Inventor: Jatin Jain, Hooman Mahyar, Abhinav Misra
IPC: G06F16/783, G06Q30/0601, H04N21/858, G06V20/40, G06F16/738, H04N21/84, G06N3/02, G06F16/78, H04N21/478, G06V40/16
CPC classification number: G06F16/784, G06F16/738, G06F16/7867, G06N3/02, G06Q30/0601, G06V20/40, H04N21/84, H04N21/858, G06V40/172, H04N21/47815
Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for automated identification and mapping of objects in video content. Example methods may include determining a first set of frames in video content, determining, using one or more object recognition algorithms, a first object present in the first set of frames, determining that a first product corresponding to the first object is present in a product catalog comprising a set of product images, associating a first product identifier of the first product with a video identifier of the video content, and causing presentation of a set of product identifiers associated with the video identifier.
-
Publication No.: US11659217B1
Publication date: 2023-05-23
Application No.: US17301212
Application date: 2021-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Hooman Mahyar, Avijit Vajpayee, Abhinav Jain, Arjun Cholkar, Vimal Bhat
IPC: H04N21/242, H04N21/234, H04N21/233
CPC classification number: H04N21/242, H04N21/233, H04N21/234
Abstract: Techniques are described for detecting desynchronization between an audio component and a video component of a media presentation. Feature sets may be determined for portions of the audio component and portions of the video component, which may then be used to generate correlations between portions of the audio component and portions of the video component. Synchronization may then be assessed based on the correlations.
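
The correlation step in this abstract can be pictured with a minimal sketch. It assumes each portion of the audio and video components has already been reduced to a single numeric feature, then slides one sequence against the other and keeps the lag with the highest Pearson correlation. The function names, the scalar features, and the lag search are illustrative assumptions, not the method claimed in the patent.

```python
from math import sqrt


def correlation(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / sqrt(var_a * var_b)


def estimate_offset(audio_feats, video_feats, max_lag):
    """Return (lag, score) for the best-correlated alignment.

    A positive lag means the audio features trail the video features
    by that many portions; a lag of 0 suggests the two are in sync.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        if lag >= 0:
            a = audio_feats[lag:]
            v = video_feats[:len(video_feats) - lag]
        else:
            a = audio_feats[:len(audio_feats) + lag]
            v = video_feats[-lag:]
        n = min(len(a), len(v))
        if n < 2:
            continue  # too little overlap to correlate
        score = correlation(a[:n], v[:n])
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag, best_score
```

Under these assumptions, a nonzero best lag with a high score would be one possible signal of desynchronization; a real system would likely use richer feature sets per portion.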
-
Publication No.: US11528525B1
Publication date: 2022-12-13
Application No.: US16052483
Application date: 2018-08-01
Applicant: Amazon Technologies, Inc.
Inventor: Hooman Mahyar, Ryan Barlow Dall, Moussa El Chater
IPC: G06K9/62, G06N3/02, H04N21/845, G06F16/70, H04N21/432, H04N21/472, H04N21/6587
Abstract: This disclosure is directed to a system and method that automatically detects repeated content within multiple media items. Content providers often include content, such as an introduction, near the beginning of a media item. In some circumstances, such as in the case of a series of television episodes, the content providers use the same content in each episode of the series. By dividing the media items into portions and analyzing the portions, the systems and methods described can automatically detect the repeated content. Using the detection of the repeated content, a user interface can then allow a user to bypass the repeated content during playback.
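
The "divide into portions and analyze" idea can be sketched as follows. This is a simplified illustration that assumes each media item is a list of per-frame feature values, that repeated content starts on a portion boundary, and that repeats are byte-identical so an exact hash suffices. The function names and the hashing choice are assumptions; the patent abstract does not specify them.

```python
import hashlib


def portion_fingerprints(samples, portion_size):
    """Split a sample sequence into fixed-size portions and hash each one."""
    prints = []
    for start in range(0, len(samples) - portion_size + 1, portion_size):
        chunk = samples[start:start + portion_size]
        digest = hashlib.sha256(repr(chunk).encode("utf-8")).hexdigest()
        prints.append((start, digest))
    return prints


def repeated_portions(items, portion_size):
    """Map each item index to the start offsets of portions found in every item.

    Assumes `items` contains at least one media item.
    """
    per_item = [portion_fingerprints(samples, portion_size) for samples in items]
    # A portion counts as repeated only if its fingerprint occurs in all items.
    shared = set.intersection(*({d for _, d in prints} for prints in per_item))
    return {
        i: [start for start, d in prints if d in shared]
        for i, prints in enumerate(per_item)
    }
```

The returned offsets could then drive a "skip" control during playback. Exact hashing is a deliberate simplification; matching re-encoded video would need a more tolerant fingerprint.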
-
Publication No.: US11308332B1
Publication date: 2022-04-19
Application No.: US16856744
Application date: 2020-04-23
Applicant: Amazon Technologies, Inc.
Inventor: Hooman Mahyar, Muhammad Yahia, Harshal Dilip Wanjari
IPC: G06K9/00, G06N20/00, H04N21/234, G06K9/62
Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for intelligent content rating determination. Example methods include determining presence of a first feature in a first frame of a video using an object recognition algorithm, determining presence of a second feature in an audio file associated with the video using an audio processing algorithm, and determining presence of a third feature in a text file associated with the video using a natural language processing algorithm. Certain embodiments may include generating a predicted content rating for the video using a machine learning model, where the predicted content rating is based at least in part on the first feature, the second feature, and the third feature, and using feedback data for the predicted content rating to retrain the machine learning model.
-
Publication No.: US10455297B1
Publication date: 2019-10-22
Application No.: US16116618
Application date: 2018-08-29
Applicant: Amazon Technologies, Inc.
Inventor: Hooman Mahyar, Harshal Dilip Wanjari, Vimal Bhat
Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for customized video content summary generation. Example methods may include determining a first segment of digital content including a first set of frames, first textual content, and first audio content. Example methods may include determining a first event that occurs in the first set of frames, determining a first theme of the first event, generating first metadata indicative of the first theme, and determining a meaning of a first sentence that occurs in the first textual content. Some methods may include determining a second theme of the first sentence, generating second metadata indicative of the second theme, determining that user preference data associated with an active user profile includes the first theme and the second theme, generating a video summary that includes a portion of the first segment of digital content, and presenting the video summary.
-
Publication No.: US11645249B1
Publication date: 2023-05-09
Application No.: US16188239
Application date: 2018-11-12
Applicant: Amazon Technologies, Inc.
Inventor: Hooman Mahyar, Ryan Dall, Shubham Kansal
IPC: G06F16/215, G06F16/783, G06V20/40, G06F18/22
CPC classification number: G06F16/215, G06F16/783, G06F18/22, G06V20/46, G06V20/48, G06V20/49
Abstract: This disclosure is directed to a system and method that detects duplicated content and/or media items. A media item can be split into media item portions. Based on the media item portions, features can be determined. Using the features, media item portion signatures can be determined to generate a media item signature. The media item signature can be compared with a different media item signature to determine duplicated content within the media items.
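
The pipeline in this abstract (portions, then features, then portion signatures, then a media item signature, then comparison) can be illustrated with a small sketch. It assumes a media item is a list of numeric samples, uses the portion mean as the feature, and quantizes that mean into a coarse portion signature so that small encoding noise does not change it. All names, the `step` parameter, and the similarity measure are hypothetical choices for illustration only.

```python
def media_signature(samples, portion_size, step=10):
    """Build a media item signature as a tuple of coarse portion signatures."""
    signature = []
    for start in range(0, len(samples), portion_size):
        portion = samples[start:start + portion_size]
        feature = sum(portion) / len(portion)    # per-portion feature: mean value
        signature.append(int(feature // step))   # portion signature: quantized feature
    return tuple(signature)


def signature_similarity(sig_a, sig_b):
    """Fraction of aligned portion signatures that match, from 0.0 to 1.0."""
    if not sig_a or not sig_b:
        return 0.0
    matches = sum(1 for a, b in zip(sig_a, sig_b) if a == b)
    return matches / max(len(sig_a), len(sig_b))
```

In this sketch, two items would be flagged as likely duplicates when the similarity exceeds a chosen threshold; the threshold itself is left to the caller.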
-
Publication No.: US11321877B1
Publication date: 2022-05-03
Application No.: US17000585
Application date: 2020-08-24
Applicant: Amazon Technologies, Inc.
Inventor: Hooman Mahyar, Arjun Cholkar, Harshal Dilip Wanjari
Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for automated selection of color palettes for video content. Example methods may include determining, by one or more computer processors coupled to memory, a first segment of video content, the first segment comprising a first set of frames, determining, using a first video processing algorithm, a first object that is present in the first set of frames, and determining, using a second video processing algorithm, a first semantic characteristic of the first segment. Some example methods may include generating a first vector representing the first object and the first semantic characteristic, and generating, using a first neural network and the first vector, a first color palette recommendation for the first segment. Selection of the first color palette recommendation may cause a color filter to be applied to the first set of frames.
-