-
公开(公告)号:US11042353B2
公开(公告)日:2021-06-22
申请号:US16684514
申请日:2019-11-14
Applicant: Roku, Inc.
Inventor: Markus K. Cremer , Shashank Merchant , Robert Coover , Todd J. Hodges , Joshua Ernest Morris
Abstract: In one aspect, an example method includes (i) determining, by a playback device, a first loudness level of a first portion of first media content from a first source while the playback device presents the first media content, with the first portion having a first length; (ii) switching, by the playback device, from presenting the first media content from the first source to presenting second media content from a second source; (iii) based on the switching, determining, by the playback device, second loudness levels of second portions of the first media content while the playback device presents the second media content, with the second portions having a second length that is shorter than the first length; and (iv) while the playback device presents the second media content, adjusting, by the playback device, a volume of the playback device based on one or more of the second loudness levels.
-
公开(公告)号:US12244885B2
公开(公告)日:2025-03-04
申请号:US18364169
申请日:2023-08-02
Applicant: Roku, Inc.
Inventor: Shashank Merchant
IPC: H04N21/2668 , H04N21/234 , H04N21/25 , H04N21/258 , H04N21/431
Abstract: Provided herein are system, apparatus, article of manufacture, method and/or computer program product, and/or combinations and sub-combinations thereof, for utilizing a personalized content system to enhance an effectiveness of content being shown to the user via personalized content associated with the content. This method allows the platform to present the most relevant personalized in-channel content to the users, in a user interface, which then correspondingly helps drive user interest. This method scans content to identify objects of interest located within the content, generates a query based on the identified objects of interest and a user profile, searches content sources for related content, associates the related content with an occurrence of a corresponding identified object of interest within the content and communicates, to the user device, the content and the related content for instantiation through a user interface of the user device.
-
公开(公告)号:US20230353822A1
公开(公告)日:2023-11-02
申请号:US18218865
申请日:2023-07-06
Applicant: ROKU, INC.
Inventor: KURT R. THIELEN , Peter Dunker , Markus K. Cremer , Steven D. Scherf , Shashank Merchant
IPC: H04N21/44 , H04N21/431 , H04N21/234 , H04N21/488 , H04N21/4415 , H04N21/262 , H04N21/432 , H04N21/442 , H04N21/462
CPC classification number: H04N21/44016 , H04N21/4316 , H04N21/23424 , H04N21/4882 , H04N21/4415 , H04N21/26241 , H04N21/4325 , H04N21/44231 , H04N21/4622
Abstract: In one aspect, an example method includes (i) identifying, by a playback device, a media device that provides media content to the playback device; (ii) providing, by the playback device, replacement media content for display; (iii) determining, by the playback device, that while the playback device is displaying the replacement media content a remote control transmitted an instruction to the identified media device; (iv) determining, by the playback device, a playback-modification action corresponding to the instruction and the identified media device; and (v) modifying, by the playback device, playback of the replacement media content in accordance with the playback-modification action.
-
公开(公告)号:US11740860B2
公开(公告)日:2023-08-29
申请号:US17730515
申请日:2022-04-27
Applicant: Roku, Inc.
Inventor: Markus K. Cremer , Shashank Merchant , Aneesh Vartakavi
CPC classification number: G06F3/165 , G06N3/08 , H03G3/24 , H03G3/3005 , H03G3/3089 , H04N5/147 , H04N21/439 , H04N21/812
Abstract: In one aspect, an example method includes (i) presenting first media content from a first source; (ii) encountering a trigger to switch from presenting the first media content from the first source to presenting second media content from a second source; (iii) determining a first loudness level of the first media content; (iv) determining a second loudness level of the second media content; (v) based on a difference between the first loudness level and the second loudness level, adjusting a loudness level of the second media content so as to generate modified media content having a third loudness level that is different from the second loudness level; and (vi) responsive to encountering the trigger, presenting the modified media content having the third loudness level.
-
公开(公告)号:US11736762B2
公开(公告)日:2023-08-22
申请号:US17722523
申请日:2022-04-18
Applicant: Roku, Inc.
Inventor: Mihailo M. Stojancic , Jose Pio Pereira , Peter Wendt , Shashank Merchant , Sunil Suresh Kulkarni
IPC: H04N21/439 , H04N21/8352 , G10H1/00 , H04N21/414 , H04N21/422 , H04N21/4223 , H04N21/433 , H04N21/84 , H04N21/845 , H04N1/32 , H04N21/43
CPC classification number: H04N21/4394 , G10H1/0008 , H04N1/32 , H04N21/41407 , H04N21/4223 , H04N21/42203 , H04N21/439 , H04N21/4332 , H04N21/43072 , H04N21/8352 , H04N21/84 , H04N21/8456 , G10H2210/041 , G10H2210/051 , G10H2240/141 , G10H2250/235
Abstract: A mobile device responds in real time to media content presented on a media device, such as a television. The mobile device captures temporal fragments of audio-video content on its microphone, camera, or both and generates corresponding audio-video query fingerprints. The query fingerprints are transmitted to a search server located remotely or used with a search function on the mobile device for content search and identification. Audio features are extracted and audio signal global onset detection is used for input audio frame alignment. Additional audio feature signatures are generated from local audio frame onsets, audio frame frequency domain entropy, and maximum change in the spectral coefficients. Video frames are analyzed to find a television screen in the frames, and a detected active television quadrilateral is used to generate video fingerprints to be combined with audio fingerprints for more reliable content identification.
-
公开(公告)号:US11729458B2
公开(公告)日:2023-08-15
申请号:US16441949
申请日:2019-06-14
Applicant: Roku, Inc.
Inventor: Mihailo M. Stojancic , Sunil Suresh Kulkarni , Shashank Merchant , Jose Pio Pereira , Oleksiy Bolgarov
IPC: H04N21/439 , H04N21/84 , H04N21/845 , H04N21/8352 , G10H1/00 , H04N21/414 , H04N21/422 , H04N21/4223 , H04N21/433 , H04N1/32 , H04N21/43
CPC classification number: H04N21/4394 , G10H1/0008 , H04N1/32 , H04N21/41407 , H04N21/4223 , H04N21/42203 , H04N21/439 , H04N21/4332 , H04N21/43072 , H04N21/8352 , H04N21/84 , H04N21/8456 , G10H2210/041 , G10H2210/051 , G10H2240/141 , G10H2250/235
Abstract: A mobile device responds in real time to media content presented on a media device, such as a television. The mobile device captures temporal fragments of audio-video content on its microphone, camera, or both and generates corresponding audio-video query fingerprints. The query fingerprints are transmitted to a search server located remotely or used with a search function on the mobile device for content search and identification. Audio features are extracted and audio signal global onset detection is used for input audio frame alignment. Additional audio feature signatures are generated from local audio frame onsets, audio frame frequency domain entropy, and maximum change in the spectral coefficients. Video frames are analyzed to find a television screen in the frames, and a detected active television quadrilateral is used to generate video fingerprints to be combined with audio fingerprints for more reliable content identification.
-
公开(公告)号:US11631404B2
公开(公告)日:2023-04-18
申请号:US17444946
申请日:2021-08-12
Applicant: Roku, Inc.
Inventor: Jose Pio Pereira , Sunil Suresh Kulkarni , Mihailo M. Stojancic , Shashank Merchant , Peter Wendt
IPC: G10L15/16 , G10L15/20 , G06T7/246 , G06T7/215 , G06K9/62 , G06F16/00 , G06T7/254 , G06F16/45 , G06F16/48 , G06V10/50 , G06V20/40 , G10L15/02 , G10L15/06 , G10L15/10 , G10L15/14 , G10L21/0232 , G10L25/81 , G06F16/906 , G06F16/44 , G10L21/0216
Abstract: Audio distortion compensation methods to improve accuracy and efficiency of audio content identification are described. The method is also applicable to speech recognition. Methods to detect the interference from speakers and sources, and distortion to audio from environment and devices, are discussed. Additional methods to detect distortion to the content after performing search and correlation are illustrated. The causes of actual distortion at each client are measured and registered and learnt to generate rules for determining likely distortion and interference sources. The learnt rules are applied at the client, and likely distortions that are detected are compensated or heavily distorted sections are ignored at audio level or signature and feature level based on compute resources available. Further methods to subtract the likely distortions in the query at both audio level and after processing at signature and feature level are described.
-
公开(公告)号:US11490154B2
公开(公告)日:2022-11-01
申请号:US17180263
申请日:2021-02-19
Applicant: Roku, Inc.
Inventor: Shashank Merchant , Bruce William Randall , Brian Johnson , Chungwon Seo , Seunghyeong Lee , Paul Jeffrey Holmquist
IPC: H04N21/43 , H04N21/4402 , H04N7/01 , H04N21/8547 , H04N21/438 , G06V20/40
Abstract: One high-level aspect of a content-modification system and related methods may involve facilitating content modification by a content-presentation device of received broadcast content in a controlled manner, while under circumstances in which some events that can impact timing may be unpredictable. In particular, certain operations by a content-presentation device may involve matching received content with specific expected content as determined by one or another component of the content-modification system, in order to confirm proper conditions are met for the content-presentation device to proceed with, or continue, content-modification operations. It can happen the matching procedure becomes subject or susceptible to timing irregularities, or jitter. In some instances, jitter may impact the ability to derive the benefits of content modification. Accordingly, example embodiments herein are directed to systems and method for compensation and/or mitigating the effects of jitter.
-
公开(公告)号:US20220239977A1
公开(公告)日:2022-07-28
申请号:US17722523
申请日:2022-04-18
Applicant: Roku, Inc.
Inventor: Mihailo M. Stojancic , Jose Pio Pereira , Peter Wendt , Shashank Merchant , Sunil Suresh Kulkarni
IPC: H04N21/439 , H04N1/32 , H04N21/8352 , G10H1/00 , H04N21/414 , H04N21/422 , H04N21/4223 , H04N21/433 , H04N21/84 , H04N21/845
Abstract: A mobile device responds in real time to media content presented on a media device, such as a television. The mobile device captures temporal fragments of audio-video content on its microphone, camera, or both and generates corresponding audio-video query fingerprints. The query fingerprints are transmitted to a search server located remotely or used with a search function on the mobile device for content search and identification. Audio features are extracted and audio signal global onset detection is used for input audio frame alignment. Additional audio feature signatures are generated from local audio frame onsets, audio frame frequency domain entropy, and maximum change in the spectral coefficients. Video frames are analyzed to find a television screen in the frames, and a detected active television quadrilateral is used to generate video fingerprints to be combined with audio fingerprints for more reliable content identification.
-
40.
公开(公告)号:US11361017B1
公开(公告)日:2022-06-14
申请号:US16929858
申请日:2020-07-15
Applicant: Roku, Inc.
Inventor: Sunil Suresh Kulkarni , Pradipkumar Dineshbhai Gajjar , Jose Pio Pereira , Preshant Ramanathan , Mihailo M. Stojancic , Shashank Merchant
IPC: G06F16/45 , G06F16/48 , G06F16/906 , G06F16/44
Abstract: Techniques are described that exclude use of “stop-fingerprints” from media database formation and search query to an automatic content recognition (ACR) systems based on media content fingerprints updated by stop-fingerprint analysis. A classification process is presented which takes in fingerprints from reference media files as an input and produces a modified set of fingerprints as an output by applying a novel stop-fingerprint classification algorithm. Architecture for the distributed stop-fingerprint generation is presented. Various cases, as stop-fingerprints generation for the entire reference database, stop-fingerprints generation for the individual reference fingerprint files, and temporal fingerprint classification obtained through intermediate steps of the temporal fingerprint classification algorithm are presented. A hash-based signature classification algorithm is also described.
-
-
-
-
-
-
-
-
-