-
Publication No.: US20230280972A1
Publication date: 2023-09-07
Application No.: US18316990
Filing date: 2023-05-12
Abstract: Embodiments are related to processing of one or more input audio feeds for generation of a target audio stream that includes at least one object of interest to a listener. In some embodiments, the target audio stream may exclusively or primarily include the sound of the object of interest to the listener, without including other persons. This allows a listener to focus on an object of his or her interest and not necessarily have to listen to the performances of other objects in the input audio feed. Some embodiments contemplate multiple audio feeds and/or with multiple objects of interest.
-
Publication No.: US12242771B2
Publication date: 2025-03-04
Application No.: US18316990
Filing date: 2023-05-12
Abstract: Embodiments are related to processing of one or more input audio feeds for generation of a target audio stream that includes at least one object of interest to a listener. In some embodiments, the target audio stream may exclusively or primarily include the sound of the object of interest to the listener, without including other persons. This allows a listener to focus on an object of his or her interest and not necessarily have to listen to the performances of other objects in the input audio feed. Some embodiments contemplate multiple audio feeds and/or with multiple objects of interest.
-
Publication No.: US11972770B2
Publication date: 2024-04-30
Application No.: US17668347
Filing date: 2022-02-09
IPC: G10L21/043 , G10L21/055 , G10L25/48 , G11B27/00 , G11B27/28 , H04N21/432
CPC classification number: G10L21/043 , G10L21/055 , G10L25/48 , H04N21/4325
Abstract: Systems and methods for intelligent playback of media content may include an intelligent media playback system that, in response to determining the speech tempo in audio content by measuring syllable density of speech in the audio content, automatically adjusts a playback speed of the audio content as the audio content is being played based on the determined speech tempo. In some embodiments, the system may automatically and dynamically adjust the playback speed to result in a desired target speech tempo. In addition, the system may determine whether to automatically adjust playback speed of the audio content, as the media is being played, based on the detected speech tempo of the speech in the audio content and the determined type of content of media. Such automatic adjustments in playback speed result in more efficient playback of the audio content.
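The tempo-driven speed adjustment described in this abstract can be illustrated with a minimal sketch. It assumes the syllable count for a window of audio is already available from some upstream detector, and it computes a playback rate that would bring the measured tempo to a target tempo. The function name, the parameters, and the clamping bounds are hypothetical and are not taken from the patent.

```python
def playback_rate(syllable_count: int,
                  duration_s: float,
                  target_tempo_sps: float,
                  min_rate: float = 0.5,
                  max_rate: float = 2.0) -> float:
    """Return a playback-rate multiplier for one window of audio.

    Speech tempo is measured as syllable density (syllables per second).
    The rate is the ratio of the target tempo to the measured tempo,
    clamped to [min_rate, max_rate] (illustrative bounds, an assumption).
    """
    # With no detected speech or an empty window, leave the speed unchanged.
    if syllable_count <= 0 or duration_s <= 0:
        return 1.0
    measured_tempo = syllable_count / duration_s
    rate = target_tempo_sps / measured_tempo
    return max(min_rate, min(max_rate, rate))


if __name__ == "__main__":
    # 30 syllables in 10 s is 3.0 syllables/s; a 4.5 syllables/s target
    # calls for a 1.5x playback rate.
    print(playback_rate(30, 10.0, 4.5))
```

Calling the function once per window as playback proceeds would give the kind of dynamic adjustment the abstract mentions. The decision whether to adjust at all for a given type of content is not modeled here.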
-
Publication No.: US20240265933A1
Publication date: 2024-08-08
Application No.: US18618636
Filing date: 2024-03-27
IPC: G10L21/043 , G10L21/055 , G10L25/48 , H04N21/432
CPC classification number: G10L21/043 , G10L21/055 , G10L25/48 , H04N21/4325
Abstract: Systems and methods for intelligent playback of media content may include an intelligent media playback system that, in response to determining the speech tempo in audio content by measuring syllable density of speech in the audio content, automatically adjusts a playback speed of the audio content as the audio content is being played based on the determined speech tempo. In some embodiments, the system may automatically and dynamically adjust the playback speed to result in a desired target speech tempo. In addition, the system may determine whether to automatically adjust playback speed of the audio content, as the media is being played, based on the detected speech tempo of the speech in the audio content and the determined type of content of media. Such automatic adjustments in playback speed result in more efficient playback of the audio content.
-
Publication No.: US12022092B2
Publication date: 2024-06-25
Application No.: US18320158
Filing date: 2023-05-18
Inventor: Abhilash Magan Vensiyani , Yatish Jayant Naik Raikar , Varunkumar B. Tripathi , Vivek Devaraj
IPC: H04N19/159 , H04N19/12 , H04N19/16 , H04N19/169 , H04N19/172 , H04N19/423
CPC classification number: H04N19/159 , H04N19/12 , H04N19/16 , H04N19/172 , H04N19/188 , H04N19/423
Abstract: An example method includes the steps of receiving an encoded video packet including a packet header and generating a modified packet header. The modified packet header is generated by setting a first value in the packet header to indicate zero reference frames, and by setting a second value in the packet header designating an i-frame as unused for reference. The i-frame is decoded in response to the modified packet header to extract the i-frame without caching the i-frame in a decoded picture buffer. A thumbnail image is generated and includes an image from the i-frame. The thumbnail image is stored directly in memory.
-
Publication No.: US20230334082A1
Publication date: 2023-10-19
Application No.: US18336682
Filing date: 2023-06-16
Inventor: Yatish Jayant Naik Raikar , Soham Sahabhaumik
IPC: G06F16/435 , H04N21/472 , G06F16/74 , G06F16/9535 , H04N21/2668
CPC classification number: G06F16/437 , G06F16/748 , G06F16/9535 , H04N21/2668 , H04N21/47205 , H04N21/8405
Abstract: A distributed computing system for artificial intelligence in relating a second multimedia program content with a first multimedia program content based on a key reference. A user terminal is set up locally in a user’s environment to monitor a first multimedia program content consumed by the user. The user’s reaction to a portion of the first multimedia program content is detected by the user terminal. The relevant portion of the first multimedia program content is identified and parsed to obtain a reference portion. The reference portion is related to a second multimedia portion using database mapping and machine learning.
-
Publication No.: US11662972B2
Publication date: 2023-05-30
Application No.: US17148471
Filing date: 2021-01-13
IPC: G06F3/0481 , G06F3/16 , G10L25/51
Abstract: Embodiments are related to processing of one or more input audio feeds for generation of a target audio stream that includes at least one object of interest to a listener. In some embodiments, the target audio stream may exclusively or primarily include the sound of the object of interest to the listener, without including other persons. This allows a listener to focus on an object of his or her interest and not necessarily have to listen to the performances of other objects in the input audio feed. Some embodiments contemplate multiple audio feeds and/or with multiple objects of interest.
-
Publication No.: US20230291912A1
Publication date: 2023-09-14
Application No.: US18320158
Filing date: 2023-05-18
Inventor: Abhilash Magan Vensiyani , Yatish Jayant Naik Raikar , Varunkumar B. Tripathi , Vivek Devaraj
IPC: H04N19/159 , H04N19/12 , H04N19/423 , H04N19/16 , H04N19/172 , H04N19/169
CPC classification number: H04N19/159 , H04N19/12 , H04N19/423 , H04N19/16 , H04N19/172 , H04N19/188
Abstract: An example method includes the steps of receiving an encoded video packet including a packet header and generating a modified packet header. The modified packet header is generated by setting a first value in the packet header to indicate zero reference frames, and by setting a second value in the packet header designating an i-frame as unused for reference. The i-frame is decoded in response to the modified packet header to extract the i-frame without caching the i-frame in a decoded picture buffer. A thumbnail image is generated and includes an image from the i-frame. The thumbnail image is stored directly in memory.