-
Publication No.: US20250104715A1
Publication date: 2025-03-27
Application No.: US18975589
Filing date: 2024-12-10
Applicant: Roku, Inc.
Inventor: Snehal Karia, Greg Garner, Sunil Ramesh
IPC: G10L15/26, G10L15/25, G10L25/57, G10L25/78, H04N21/488
Abstract: In one aspect, an example method includes (i) obtaining media, wherein the obtained media includes (a) audio representing speech and (b) video; (ii) using at least the audio representing speech as a basis to generate speech text; (iii) using at least the audio representing speech to determine starting and ending time points of the speech; and (iv) using at least the generated speech text and the determined starting and ending time points of the speech to (a) generate closed-captioning or subtitle data that includes closed-captioning or subtitle text based on the generated speech text and (b) associate the generated closed-captioning or subtitle data with the obtained media, such that the closed-captioning or subtitle text is time-aligned with the video based on the determined starting and ending time points of the speech.
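As a minimal sketch of step (iv) of the abstract above, the following shows one way speech text and its starting and ending time points could be turned into time-aligned subtitle data. The abstract names no output format; the SubRip (SRT) layout, the `SpeechSegment` type, and the function names are assumptions made for illustration only, and the speech recognition and time-point detection of steps (ii) and (iii) are taken as already done.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class SpeechSegment:
    """Hypothetical container: one run of speech with its time points in seconds."""
    start: float
    end: float
    text: str


def format_timestamp(seconds: float) -> str:
    """Render seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: Iterable[SpeechSegment]) -> str:
    """Build SRT-style subtitle data, one numbered cue per speech segment."""
    blocks = []
    ordered = sorted(segments, key=lambda s: s.start)
    for index, seg in enumerate(ordered, start=1):
        timing = f"{format_timestamp(seg.start)} --> {format_timestamp(seg.end)}"
        blocks.append(f"{index}\n{timing}\n{seg.text}\n")
    # Cues are separated by a blank line.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [
        SpeechSegment(0.0, 1.25, "Hello"),
        SpeechSegment(2.0, 3.5, "Welcome back"),
    ]
    print(to_srt(demo))
```

Each cue carries the start and end time of the speech it transcribes, which is what lets a player display the text in step with the video.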
-
Publication No.: US12260882B2
Publication date: 2025-03-25
Application No.: US18666243
Filing date: 2024-05-16
Applicant: Roku, Inc.
Inventor: Sunil Ramesh, Michael Cutter, Karina Levitian
IPC: G11B27/00, G06T7/00, G11B27/036, G06T7/70
Abstract: In one aspect, an example method includes (i) estimating, using a skeletal detection model, a pose of an original actor for each of multiple frames of a video; (ii) obtaining, for each of a plurality of the estimated poses, a respective image of a replacement actor; (iii) obtaining replacement speech in the replacement actor's voice that corresponds to speech of the original actor in the video; (iv) generating, using the estimated poses, the images of the replacement actor, and the replacement speech, synthetic frames corresponding to the multiple frames of the video that depict the replacement actor in place of the original actor, with the synthetic frames including facial expressions for the replacement actor that temporally align with the replacement speech; and (v) combining the synthetic frames and the replacement speech so as to obtain a synthetic video that replaces the original actor with the replacement actor.
-
Publication No.: US12256175B2
Publication date: 2025-03-18
Application No.: US18065026
Filing date: 2022-12-13
Applicant: Roku, Inc.
Inventor: Sunil Ramesh, Michael Cutter, Karina Levitian
Abstract: In one aspect, an example method includes (i) receiving audio from a first client device engaged in a communication session with a second client device, the audio comprising one or more words spoken by a user of the first client device; (ii) using the audio and a facial feature model to estimate facial movement that corresponds to the one or more words spoken by the user; (iii) generating a synthetic video depicting an avatar of the user moving according to the estimated facial movement; and (iv) in response to generating the synthetic video, causing the second client device to present the synthetic video synchronized with the audio.
-
Publication No.: US20250037742A1
Publication date: 2025-01-30
Application No.: US18913452
Filing date: 2024-10-11
Applicant: Roku, Inc.
Inventor: Michael Cutter, Sunil Ramesh, Karina Levitian
IPC: G11B27/036, G06V20/40
Abstract: In one aspect, an example method includes (i) obtaining, by a computing system, video data representing video content; (ii) analyzing, by the computing system, the video data to identify an event that is a subject of the video content; (iii) using, by the computing system, the identified event as a basis to select audio content; and (iv) performing, by the computing system, an operation that facilitates editing the video content to include the selected audio content.
-
Publication No.: US12198700B2
Publication date: 2025-01-14
Application No.: US18328358
Filing date: 2023-06-02
Applicant: Roku, Inc.
Inventor: Snehal Karia, Greg Garner, Sunil Ramesh
IPC: G10L15/26, G10L15/25, G10L25/57, G10L25/78, H04N21/488
Abstract: In one aspect, an example method includes (i) obtaining media, wherein the obtained media includes (a) audio representing speech and (b) video; (ii) using at least the audio representing speech as a basis to generate speech text; (iii) using at least the audio representing speech to determine starting and ending time points of the speech; and (iv) using at least the generated speech text and the determined starting and ending time points of the speech to (a) generate closed-captioning or subtitle data that includes closed-captioning or subtitle text based on the generated speech text and (b) associate the generated closed-captioning or subtitle data with the obtained media, such that the closed-captioning or subtitle text is time-aligned with the video based on the determined starting and ending time points of the speech.
-
Publication No.: US20240431008A1
Publication date: 2024-12-26
Application No.: US18662309
Filing date: 2024-05-13
Applicant: Roku, Inc.
Inventor: Adil Jagmag, Jonan Gerard Willem Maria Janssen, Sharada Palasamudram Ashok Kumar, Sunil Ramesh, Michael Cutter, Shashank C. Merchant, Yu-Cheng Lin
IPC: H05B47/175
Abstract: In one aspect, an example method includes (i) causing a content-presentation device to output for presentation reference visual content; (ii) causing a lighting device to emit light in accordance with a lighting parameter, wherein initially the light emitted in accordance with the lighting parameter does not correspond to the reference visual content; (iii) receiving first input indicating an adjustment to the lighting parameter; (iv) responsive to receiving the first input, causing the lighting device to adjust the lighting parameter, thereby causing the lighting device to adjust the emitted light; (v) receiving second input indicating that the adjusted emitted light corresponds to the reference visual content; and (vi) responsive to receiving the second input, calibrating the lighting device at least by (i) causing storage of the adjusted lighting parameter and (ii) causing the lighting device to be configured to emit light in accordance with the stored lighting parameter.
-
Publication No.: US20240430543A1
Publication date: 2024-12-26
Application No.: US18586269
Filing date: 2024-02-23
Applicant: Roku, Inc.
Inventor: Sunil Ramesh, Michael Cutter
IPC: H04N21/845, H04N21/422, H04N21/442, H04N21/81
Abstract: In one aspect, disclosed is an example method for use in connection with a media-presentation device having an associated presentation area, the method including: (i) obtaining presentation area data associated with the presentation area; (ii) determining that the obtained presentation area data satisfies each and every condition of a condition set; (iii) responsive to at least determining that the obtained presentation area data satisfies each and every condition of the condition set, identifying an upcoming media segment insertion point within a media program; and (iv) facilitating the media-presentation device outputting for presentation a media segment starting at the identified media segment insertion point within the media program.
-
Publication No.: US20240273236A1
Publication date: 2024-08-15
Application No.: US18167307
Filing date: 2023-02-10
Applicant: Roku, Inc.
Inventor: Soren Riise, Sunil Ramesh, Michael Cutter, Karina Levitian
CPC classification number: G06F21/6254, G06V40/172, G06F2221/2141
Abstract: A method and system for use of obfuscation coding. An example method includes a computing system receiving an image that depicts a human face. The method then includes the computing system generating a modified image based on the received image, with the generating of the modified image involving inserting into the received image an obfuscation marker that (i) obfuscates the human face in a manner that prevents a person observing the modified image from identifying the human face and (ii) includes predefined symbology that is interpretable by a machine to identify the obfuscated human face. Further, the method includes using the predefined symbology in the modified image as a basis to take action based on an identity of the human face.
-
Publication No.: US20240195940A1
Publication date: 2024-06-13
Application No.: US18065026
Filing date: 2022-12-13
Applicant: Roku, Inc.
Inventor: Sunil Ramesh, Michael Cutter, Karina Levitian
CPC classification number: H04N7/157, G06T13/205, G06T13/40, G06V10/774, G06V10/811, G06V10/82, G06V40/168, G06V40/172
Abstract: In one aspect, an example method includes (i) receiving audio from a first client device engaged in a communication session with a second client device, the audio comprising one or more words spoken by a user of the first client device; (ii) using the audio and a facial feature model to estimate facial movement that corresponds to the one or more words spoken by the user; (iii) generating a synthetic video depicting an avatar of the user moving according to the estimated facial movement; and (iv) in response to generating the synthetic video, causing the second client device to present the synthetic video synchronized with the audio.
-
Publication No.: US20240155174A1
Publication date: 2024-05-09
Application No.: US18319024
Filing date: 2023-05-17
Applicant: Roku, Inc.
Inventor: Hugo du Plooy, Kenny Wu, Erwin Bellers, Sunil Ramesh, Michael Cutter
IPC: H04N21/41, H04N21/422, H05B47/165
CPC classification number: H04N21/4131, H04N21/42203, H05B47/165
Abstract: In one aspect, a method includes receiving video content comprising a first data portion comprising parameters that control a visual appearance of frames of video content presented by a content-presentation device. The content-presentation device is communicatively coupled to a light unit that is (i) present in a viewing area of the content-presentation device and (ii) configured to provide ambient light in the viewing area. The method also includes extracting, from the received video content, a second data portion, separate from the first data portion, specifying one or more light control parameters that a controller associated with the light unit is configured to interpret as instructions for controlling the light unit. The method also includes transmitting the extracted second data portion to the controller to facilitate the controller controlling the light unit according to the specified one or more light control parameters.