Patent search ap:("Google LLC") AND inv:"Michael Rubinstein" Page 2

11.

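Granted invention patent
Taking photos through visual obstructions (In force)

Publication (announcement) No.: US10412316B2

Publication (announcement) date: 2019-09-10

Application No.: US15392452

Application date: 2016-12-28

Applicant: Google LLC

Inventor: Michael Rubinstein , William Freeman , Ce Liu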

IPC: H04N5/262 , H04N5/222 , H04N5/232 , G06T7/30 , G06T7/90 , G06T5/00 , G06T5/20 , G06T5/50

Abstract: The present disclosure relates to systems and methods for image capture. Namely, an image capture system may include a camera configured to capture images of a field of view, a display, and a controller. An initial image of the field of view from an initial camera pose may be captured. An obstruction may be determined to be observable in the field of view. Based on the obstruction, at least one desired camera pose may be determined. The at least one desired camera pose includes at least one desired position of the camera. A capture interface may be displayed, which may include instructions for moving the camera to the at least one desired camera pose. At least one further image of the field of view from the at least one desired camera pose may be captured. Captured images may be processed to remove the obstruction from a background image.

12.

Invention patent application
AUDIO-VISUAL HEARING AID (In force)

Publication (announcement) No.: US20240428816A1

Publication (announcement) date: 2024-12-26

Application No.: US18797400

Application date: 2024-08-07

Applicant: Google LLC

Inventor: Anatoly Efros , Noam Etzion-Rosenberg , Tal Remez , Oran Lang , Inbar Mosseri , Israel Or Weinstein , Benjamin Schlesinger , Michael Rubinstein , Ariel Ephrat , Yukun Zhu , Stella Laurenzo , Amit Pitaru , Yossi Matias

IPC: G10L21/0208 , G10L17/00 , G10L21/0272 , G10L25/57

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device; in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device; receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device; and, in response, generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.

13.

Invention patent application
DIFFUSION-GUIDED THREE-DIMENSIONAL RECONSTRUCTION (In force)

Publication (announcement) No.: US20240412458A1

Publication (announcement) date: 2024-12-12

Application No.: US18741680

Application date: 2024-06-12

Applicant: Google LLC

Inventor: Varun Jampani , Chun-Han Yao , Amit Raj , Wei-Chih Hung , Ming-Hsuan Yang , Michael Rubinstein , Yuanzhen Li

IPC: G06T17/20 , G06T5/70 , G06T11/00

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for editing images based on decoder-based accumulative score sampling (DASS) losses.

14.

Published invention application
Systems and Methods for Identifying and Extracting Object-Related Effects in Videos (Under examination, published)

Publication (announcement) No.: US20240249523A1

Publication (announcement) date: 2024-07-25

Application No.: US18560609

Application date: 2022-05-11

Applicant: Google LLC

Inventor: Forrester H. Cole , Andrew Zisserman , Tali Dekel , William Tafel Freeman , Erika Lu , Michael Rubinstein

IPC: G06V20/40 , G06T7/194 , G06T7/246 , G06T7/73 , G06V10/26 , G06V10/776 , G06V10/82

CPC classification number: G06V20/46 , G06T7/194 , G06T7/246 , G06T7/73 , G06V10/26 , G06V10/776 , G06V10/82 , G06T2207/10016 , G06T2207/10024 , G06T2207/20081 , G06T2207/20084

Abstract: The present disclosure provides systems and methods for identifying and extracting object-related effects in videos. Given an ordinary video and a rough segmentation mask over time of one or more subjects of interest, example systems proposed herein can estimate an omnimatte for each subject: an alpha matte and color image that includes the subject along with all its related time-varying scene elements. Example implementations of the proposed models can be trained only on the input video in a self-supervised manner, without any manual labels, and are generic. For example, the models can produce omnimattes automatically for arbitrary objects and a variety of effects.

15.

Granted invention patent
Re-timing objects in video via layered neural rendering (In force)

Publication (announcement) No.: US12243145B2

Publication (announcement) date: 2025-03-04

Application No.: US17927101

Application date: 2020-05-22

Applicant: Google LLC

Inventor: Forrester H. Cole , Erika Lu , Tali Dekel , William T. Freeman , David Henry Salesin , Michael Rubinstein

IPC: G06T13/80 , G06V10/44 , G06V10/82 , G06V20/40 , G11B27/00 , G11B27/031

Abstract: A computer-implemented method for decomposing videos into multiple layers (212, 213) that can be re-combined with modified relative timings includes obtaining video data including a plurality of image frames (201) depicting one or more objects. For each of the plurality of frames, the computer-implemented method includes generating one or more object maps descriptive of a respective location of at least one object of the one or more objects within the image frame. For each of the plurality of frames, the computer-implemented method includes inputting the image frame and the one or more object maps into a machine-learned layer renderer model (220). For each of the plurality of frames, the computer-implemented method includes receiving, as output from the machine-learned layer renderer model, a background layer illustrative of a background of the video data and one or more object layers respectively associated with one of the one or more object maps. The object layers include image data illustrative of the at least one object and one or more trace effects at least partially attributable to the at least one object such that the one or more object layers and the background layer can be re-combined with modified relative timings.

16.

Granted invention patent
Audio-visual speech separation (In force)

Publication (announcement) No.: US11894014B2

Publication (announcement) date: 2024-02-06

Application No.: US17951002

Application date: 2022-09-22

Applicant: Google LLC

Inventor: Inbar Mosseri , Michael Rubinstein , Ariel Ephrat , William Freeman , Oran Lang , Kevin William Wilson , Tali Dekel , Avinatan Hassidim

IPC: G10L25/57 , G10L15/16 , G10L21/10 , G10L21/18 , G06V20/40 , G06V40/16 , G10L15/25 , G06F18/214 , G10L17/18

CPC classification number: G10L25/57 , G06F18/214 , G06V20/41 , G06V40/161 , G10L15/16 , G10L15/25 , G10L17/18 , G10L21/10 , G10L21/18

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.
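Illustrative note: the last two steps of the abstract above (a per-speaker spectrogram mask, then a per-speaker isolated spectrogram) can be sketched as follows. The abstract does not say how the mask is combined with the mixture; element-wise multiplication is an assumption made here for illustration, and the names `apply_mask` and `separate` are hypothetical.

```python
from typing import Dict, List

# Rows are time frames, columns are frequency bins (magnitudes only).
Spectrogram = List[List[float]]


def apply_mask(mixture: Spectrogram, mask: Spectrogram) -> Spectrogram:
    """Scale each time-frequency bin of the mixture by the matching mask value.

    Assumption: the mask is applied by element-wise multiplication.
    """
    same_rows = len(mixture) == len(mask)
    if not same_rows or any(len(m) != len(k) for m, k in zip(mixture, mask)):
        raise ValueError("mixture and mask must have the same shape")
    return [[m * k for m, k in zip(m_row, k_row)]
            for m_row, k_row in zip(mixture, mask)]


def separate(mixture: Spectrogram,
             masks: Dict[str, Spectrogram]) -> Dict[str, Spectrogram]:
    """Return one isolated spectrogram per speaker, keyed by speaker id."""
    return {speaker: apply_mask(mixture, mask) for speaker, mask in masks.items()}


if __name__ == "__main__":
    mixture = [[1.0, 2.0], [3.0, 4.0]]
    masks = {
        "speaker_a": [[1.0, 0.0], [0.5, 0.0]],
        "speaker_b": [[0.0, 1.0], [0.5, 1.0]],
    }
    for speaker, isolated in separate(mixture, masks).items():
        print(speaker, isolated)
```

In this toy input the two masks sum to one in every bin, so the isolated spectrograms add back up to the mixture; nothing in the abstract requires that property.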

17.

Published invention application
Heart Rate and Respiratory Rate Measurements from Imagery (Under examination, published)

Publication (announcement) No.: US20230277069A1

Publication (announcement) date: 2023-09-07

Application No.: US18011899

Application date: 2022-03-03

Applicant: Google LLC

Inventor: Jiening Zhan , Sean Kyungmok Bae , Silviu Borac , Yunus Emre , Jonathan Wesor Wang , Jiang Wu , Mehr Kashyap , Ming Jack Po , Liwen Chen , Melissa Chung , John Cannon , Eric Steven Teasley , James Alexander Taylor, Jr. , Michael Vincent McConnell , Alejandra Maciel , Allen KC Chai , Shwetak Patel , Gregory Sean Corrado , Si-Hyuck Kang , Yun Liu , Michael Rubinstein , Michael Spencer Krainin , Neal Wadhwa

IPC: A61B5/0205 , A61B5/00

CPC classification number: A61B5/0205 , A61B5/0077 , A61B5/725 , A61B5/6898 , A61B5/7257 , A61B5/7278 , A61B5/7485 , A61B5/0816

Abstract: Generally, the present disclosure is directed to systems and methods for measuring heart rate and respiratory rate using a camera such as, for example, a smartphone camera or other consumer-grade camera. Specifically, the present disclosure presents and validates two algorithms that make use of smartphone cameras (or the like) for measuring heart rate (HR) and respiratory rate (RR) for consumer wellness use. As an example, HR can be measured by placing the finger of a subject over the rear-facing camera. As another example, RR can be measured via a video of the subject sitting still in front of the front-facing camera.
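Illustrative note: the abstract above does not describe the two algorithms themselves. As a rough sketch of how a rate could be read from camera frames, the code below assumes each frame has already been reduced to one mean-brightness value and picks the strongest periodic component within a plausible heart-rate band. The spectral-peak approach, the function name `dominant_rate_bpm`, and the 40 to 200 bpm band are all assumptions for illustration, not the method disclosed in the application.

```python
import math
from typing import Sequence


def dominant_rate_bpm(samples: Sequence[float], fps: float,
                      lo_bpm: int = 40, hi_bpm: int = 200) -> int:
    """Return the whole-number rate (beats per minute) with the most spectral power.

    samples: one mean-brightness value per video frame (hypothetical input).
    fps:     frames per second of the recording.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    mean = sum(samples) / len(samples)
    centered = [s - mean for s in samples]  # remove the constant brightness level

    best_bpm, best_power = lo_bpm, -1.0
    for bpm in range(lo_bpm, hi_bpm + 1):
        # Angular step per frame for a candidate rate of `bpm` beats per minute.
        omega = 2.0 * math.pi * (bpm / 60.0) / fps
        re = sum(x * math.cos(omega * n) for n, x in enumerate(centered))
        im = sum(x * math.sin(omega * n) for n, x in enumerate(centered))
        power = re * re + im * im
        if power > best_power:
            best_bpm, best_power = bpm, power
    return best_bpm


if __name__ == "__main__":
    fps = 30.0
    # Synthetic 10-second signal pulsing at 1.2 Hz (72 beats per minute).
    signal = [100.0 + 5.0 * math.sin(2.0 * math.pi * 1.2 * n / fps)
              for n in range(300)]
    print(dominant_rate_bpm(signal, fps))
```

A respiratory-rate variant would use the same search over a lower band (for example 6 to 40 breaths per minute, again an assumed range) and a longer recording, since slower rhythms need more samples to resolve.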

18.

Invention patent application
AUDIO-VISUAL SPEECH SEPARATION (In force)

Publication (announcement) No.: US20230122905A1

Publication (announcement) date: 2023-04-20

Application No.: US17951002

Application date: 2022-09-22

Applicant: Google LLC

Inventor: Inbar Mosseri , Michael Rubinstein , Ariel Ephrat , William Freeman , Oran Lang , Kevin William Wilson , Tali Dekel , Avinatan Hassidim

IPC: G10L21/10 , G10L15/16 , G10L21/18 , G06V20/40 , G06V40/16 , G10L15/25 , G06F18/214

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.

19.

Granted invention patent
Adaptive glare removal and/or color correction (In force)

Publication (announcement) No.: US11483463B2

Publication (announcement) date: 2022-10-25

Application No.: US16883587

Application date: 2020-05-26

Applicant: Google LLC

Inventor: Julia Winn , Abraham Stephens , Daniel Pettigrew , Aaron Maschinot , Ce Liu , Michael Krainin , Michael Rubinstein , Jingyu Cui

IPC: G06K9/34 , H04N5/232 , B60J3/04 , G02F1/1333 , G02F1/137 , H04N5/00 , H04N1/60 , G06T5/00 , H04N1/38 , G06T5/50 , H05B45/20

Abstract: Some implementations relate to determining whether glare is present in captured image(s) of an object (e.g., a photo) and/or to determining one or more attributes of any present glare. Some of those implementations further relate to adapting one or more parameters for a glare removal process based on whether the glare is determined to be present and/or based on one or more of the determined attributes of any glare determined to be present. Some additional and/or alternative implementations disclosed herein relate to correcting color of a flash image of an object (e.g., a photo). The flash image is based on one or more images captured by a camera of a client device with a flash component of the client device activated. In various implementations, correcting the color of the flash image is based on a determined color space of an ambient image of the object.

20.

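Granted invention patent
Taking photos through visual obstructions (In force)

Publication (announcement) No.: US11050948B2

Publication (announcement) date: 2021-06-29

Application No.: US16526343

Application date: 2019-10-21

Applicant: Google LLC

Inventor: Michael Rubinstein , William Freeman , Ce Liu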

IPC: H04N5/262 , G06T5/00 , H04N5/222 , H04N5/232 , G06T5/50 , G06T7/30 , G06T7/90 , G06T5/20

Abstract: The present disclosure relates to systems and methods for image capture. Namely, an image capture system may include a camera configured to capture images of a field of view, a display, and a controller. An initial image of the field of view from an initial camera pose may be captured. An obstruction may be determined to be observable in the field of view. Based on the obstruction, at least one desired camera pose may be determined. The at least one desired camera pose includes at least one desired position of the camera. A capture interface may be displayed, which may include instructions for moving the camera to the at least one desired camera pose. At least one further image of the field of view from the at least one desired camera pose may be captured. Captured images may be processed to remove the obstruction from a background image.
