-
公开(公告)号:US11145075B2
公开(公告)日:2021-10-12
申请号:US16767401
申请日:2019-10-04
Applicant: Google LLC
Inventor: Julien Valentin , Onur G. Guleryuz , Mira Leung , Maksym Dzitsiuk , Jose Pascoal , Mirko Schmidt , Christoph Rhemann , Neal Wadhwa , Eric Turner , Sameh Khamis , Adarsh Prakash Murthy Kowdle , Ambrus Csaszar , João Manuel Castro Afonso , Jonathan T. Barron , Michael Schoenberg , Ivan Dryanovski , Vivek Verma , Vladimir Tankovich , Shahram Izadi , Sean Ryan Francesco Fanello , Konstantine Nicholas John Tsotsos
Abstract: A handheld user device includes a monocular camera to capture a feed of images of a local scene and a processor to select, from the feed, a keyframe and perform, for a first image from the feed, stereo matching using the first image, the keyframe, and a relative pose based on a pose associated with the first image and a pose associated with the keyframe to generate a sparse disparity map representing disparities between the first image and the keyframe. The processor further is to determine a dense depth map from the disparity map using a bilateral solver algorithm, and process a viewfinder image generated from a second image of the feed with occlusion rendering based on the depth map to incorporate one or more virtual objects into the viewfinder image to generate an AR viewfinder image. Further, the processor is to provide the AR viewfinder image for display.
-
12.
公开(公告)号:US20200372284A1
公开(公告)日:2020-11-26
申请号:US16616235
申请日:2019-10-16
Applicant: Google LLC
Inventor: Christoph Rhemann , Abhimitra Meka , Matthew Whalen , Jessica Lynn Busch , Sofien Bouaziz , Geoffrey Douglas Harvey , Andrea Tagliasacchi , Jonathan Taylor , Paul Debevec , Peter Joseph Denny , Sean Ryan Francesco Fanello , Graham Fyffe , Jason Angelo Dourgarian , Xueming Yu , Adarsh Prakash Murthy Kowdle , Julien Pascal Christophe Valentin , Peter Christopher Lincoln , Rohit Kumar Pandey , Christian Häne , Shahram Izadi
Abstract: Methods, systems, and media for relighting images using predicted deep reflectance fields are provided. In some embodiments, the method comprises: identifying a group of training samples, wherein each training sample includes (i) a group of one-light-at-a-time (OLAT) images that have each been captured when one light of a plurality of lights arranged on a lighting structure has been activated, (ii) a group of spherical color gradient images that have each been captured when the plurality of lights arranged on the lighting structure have been activated to each emit a particular color, and (iii) a lighting direction, wherein each image in the group of OLAT images and each of the spherical color gradient images are an image of a subject, and wherein the lighting direction indicates a relative orientation of a light to the subject; training a convolutional neural network using the group of training samples, wherein training the convolutional neural network comprises: for each training iteration in a series of training iterations and for each training sample in the group of training samples: generating an output predicted image, wherein the output predicted image is a representation of the subject associated with the training sample with lighting from the lighting direction associated with the training sample; identifying a ground-truth OLAT image included in the group of OLAT images for the training sample that corresponds to the lighting direction for the training sample; calculating a loss that indicates a perceptual difference between the output predicted image and the identified ground-truth OLAT image; and updating parameters of the convolutional neural network based on the calculated loss; identifying a test sample that includes a second group of spherical color gradient images and a second lighting direction; and generating a relit image of the subject included in each of the second group of spherical color gradient images with lighting from the second lighting direction using the trained convolutional neural network.
-
公开(公告)号:US20250045968A1
公开(公告)日:2025-02-06
申请号:US18570562
申请日:2021-06-16
Applicant: Google LLC
Inventor: Onur G. Guleryuz , Ruofei Du , Hugues H. Hoppe , Sean Ryan Francesco Fanello , Philip Andrew Chou , Danhang Tang , Philip Davidson
Abstract: Nonlinear peri-codec optimization for image and video coding includes obtaining a source image including pixel values expressed in a first defined image sample space, generating a neuralized image representing the source image, the neuralized image including pixel values that are expressed as neural latent space values, encoding the input image wherein the neural latent space values are used as pixel values in a second defined image sample space and the input image is in an operative image format of the encoder, such that a decoder decodes the encoded image to obtain a reconstructed image in the second defined image sample space, wherein the reconstructed image is a reconstructed neuralized image including reconstructed neural latent space values, such that a deneuralized reconstructed image corresponding to the source image is obtained by a nonlinear post-codec image processor in the first defined image sample space.
-
公开(公告)号:US20240290025A1
公开(公告)日:2024-08-29
申请号:US18588948
申请日:2024-02-27
Applicant: GOOGLE LLC
Inventor: Yinda Zhang , Sean Ryan Francesco Fanello , Ziqian Bai , Feitong Tan , Zeng Huang , Kripasindhu Sarkar , Danhang Tang , Di Qiu , Abhimitra Meka , Ruofei Du , Mingsong Dou , Sergio Orts Escolano , Rohit Kumar Pandey , Thabo Beeler
CPC classification number: G06T13/40 , G06T7/90 , G06T17/20 , G06V10/44 , G06T2207/10024 , G06T2207/20084
Abstract: A method comprises receiving a first sequence of images of a portion of a user, the first sequence of images being monocular images; generating an avatar based on the first sequence of images, the avatar being based on a model including a feature vector associated with a vertex; receiving a second sequence of images of the portion of the user; and based on the second sequence of images, modifying the avatar with a displacement of the vertex to represent a gesture of the avatar.
-
15.
公开(公告)号:US20240212325A1
公开(公告)日:2024-06-27
申请号:US18596822
申请日:2024-03-06
Applicant: Google LLC
Inventor: Yinda Zhang , Feitong Tan , Danhang Tang , Mingsong Dou , Kaiwen Guo , Sean Ryan Francesco Fanello , Sofien Bouaziz , Cem Keskin , Ruofei Du , Rohit Kumar Pandey , Deqing Sun
IPC: G06V10/771 , G06T7/70 , G06T17/00 , G06V10/44 , G06V10/75
CPC classification number: G06V10/771 , G06T7/70 , G06T17/00 , G06V10/44 , G06V10/751 , G06T2207/20081 , G06T2207/20084
Abstract: Systems and methods for training models to predict dense correspondences across images such as human images. A model may be trained using synthetic training data created from one or more 3D computer models of a subject. In addition, one or more geodesic distances derived from the surfaces of one or more of the 3D models may be used to generate one or more loss values, which may in turn be used in modifying the model's parameters during training.
-
公开(公告)号:US20220027659A1
公开(公告)日:2022-01-27
申请号:US17309171
申请日:2020-09-21
Applicant: Google LLC
Inventor: Chloe LeGendre , Paul Debevec , Wan-Chun Ma , Rohit Pandey , Sean Ryan Francesco Fanello , Christina Tong
Abstract: Techniques of estimating lighting from portraits includes generating a lighting estimate from a single image of a face based on a machine learning (ML) system using multiple bidirectional reflection distribution functions (BRDFs) as a loss function. In some implementations, the ML system is trained using images of faces formed with HDR illumination computed from LDR imagery. The technical solution includes training a lighting estimation model in a supervised manner using a dataset of portraits and their corresponding ground truth illumination.
-
公开(公告)号:US12254406B2
公开(公告)日:2025-03-18
申请号:US17304505
申请日:2021-06-22
Applicant: Google LLC
Inventor: Onur G. Guleryuz , Sean Ryan Francesco Fanello
Abstract: A method including, in a training phase, training a gaze prediction model including a first model and a second model, the first model and the second model being configured in conjunction to predict segmentation data based on training data, training a third model together with the first model and the second model, the third model being configured to predict a training characteristic using an output of the first model based on the training data, and in an operational phase, receiving operational data and predicting an operational characteristic using the trained first model and the trained third model.
-
公开(公告)号:US12066282B2
公开(公告)日:2024-08-20
申请号:US17413847
申请日:2020-11-11
Applicant: GOOGLE LLC
Inventor: Sean Ryan Francesco Fanello , Kaiwen Guo , Peter Christopher Lincoln , Philip Lindsley Davidson , Jessica L. Busch , Xueming Yu , Geoffrey Harvey , Sergio Orts Escolano , Rohit Kumar Pandey , Jason Dourgarian , Danhang Tang , Adarsh Prakash Murthy Kowdle , Emily B. Cooper , Mingsong Dou , Graham Fyffe , Christoph Rhemann , Jonathan James Taylor , Shahram Izadi , Paul Ernest Debevec
IPC: G01B11/25 , G01B11/245 , G06T15/50 , G06T17/20
CPC classification number: G01B11/2513 , G01B11/245 , G06T15/506 , G06T17/205
Abstract: A lighting stage includes a plurality of lights that project alternating spherical color gradient illumination patterns onto an object or human performer at a predetermined frequency. The lighting stage also includes a plurality of cameras that capture images of an object or human performer corresponding to the alternating spherical color gradient illumination patterns. The lighting stage also includes a plurality of depth sensors that capture depth maps of the object or human performer at the predetermined frequency. The lighting stage also includes (or is associated with) one or more processors that implement a machine learning algorithm to produce a three-dimensional (3D) model of the object or human performer. The 3D model includes relighting parameters used to relight the 3D model under different lighting conditions.
-
公开(公告)号:US20240020915A1
公开(公告)日:2024-01-18
申请号:US18353213
申请日:2023-07-17
Applicant: Google LLC
Inventor: Yinda Zhang , Feitong Tan , Sean Ryan Francesco Fanello , Abhimitra Meka , Sergio Orts Escolano , Danhang Tang , Rohit Kumar Pandey , Jonathan James Taylor
Abstract: Techniques include introducing a neural generator configured to produce novel faces that can be rendered at free camera viewpoints (e.g., at any angle with respect to the camera) and relit under an arbitrary high dynamic range (HDR) light map. A neural implicit intrinsic field takes a randomly sampled latent vector as input and produces as output per-point albedo, volume density, and reflectance properties for any queried 3D location. These outputs are aggregated via a volumetric rendering to produce low resolution albedo, diffuse shading, specular shading, and neural feature maps. The low resolution maps are then upsampled to produce high resolution maps and input into a neural renderer to produce relit images.
-
公开(公告)号:US11868523B2
公开(公告)日:2024-01-09
申请号:US17305219
申请日:2021-07-01
Applicant: GOOGLE LLC
Inventor: Ivana Tosic Rodgers , Sean Ryan Francesco Fanello , Sofien Bouaziz , Rohit Kumar Pandey , Eric Aboussouan , Adarsh Prakash Murthy Kowdle
CPC classification number: G06F3/013 , G02B27/0093 , G02B27/0101 , G02B27/0172 , G06F18/23 , G06N3/08 , G02B2027/0138 , G02B2027/0178
Abstract: Techniques of tracking a user's gaze includes identifying a region of a display at which a gaze of a user is directed, the region including a plurality of pixels. By determining a region rather than a point, when the regions correspond to elements of a user interface, the improved technique enables a system to activate the element to which a determined region is selected. In some implementations, the system makes the determination using a classification engine including a convolutional neural network; such an engine takes as input images of the user's eye and outputs a list of probabilities that the gaze is directed to each of the regions.
-
-
-
-
-
-
-
-
-