Patent search ap:("NVIDIA CORPORATION") AND inv:"Jan Kautz" Page 4

31.

发明申请
EQUIVARIANT LANDMARK TRANSFORMATION FOR LANDMARK LOCALIZATION 审中-公开

公开(公告)号：US20180365512A1

公开(公告)日：2018-12-20

申请号：US16006728

申请日：2018-06-12

Applicant: NVIDIA Corporation

Inventor： Pavlo Molchanov , Stephen Walter Tyree , Jan Kautz , Sina Honari

IPC: G06K9/46 , G06K9/66 , G06K9/62 , G06N3/08

Abstract: A method, computer readable medium, and system are disclosed to generate coordinates of landmarks within images. The landmark locations may be identified on an image of a human face and used for emotion recognition, face identity verification, eye gaze tracking, pose estimation, etc. A transform is applied to input image data to produce transformed input image data. The transform is also applied to predicted coordinates for landmarks of the input image data to produce transformed predicted coordinates. A neural network model processes the transformed input image data to generate additional landmarks of the transformed input image data and additional predicted coordinates for each one of the additional landmarks. Parameters of the neural network model are updated to reduce differences between the transformed predicted coordinates and the additional predicted coordinates.

32.

发明授权
Model-based three-dimensional head pose estimation 有权

公开(公告)号：US09830703B2

公开(公告)日：2017-11-28

申请号：US14825129

申请日：2015-08-12

Applicant: NVIDIA CORPORATION

Inventor： Gregory P. Meyer , Shalini Gupta , Iuri Frosio , Nagilla Dikpal Reddy , Jan Kautz

IPC: G06K9/00 , G06T7/00

CPC classification number: G06T7/507 , G06K9/6276 , G06T7/251 , G06T7/277 , G06T7/70 , G06T7/75 , G06T7/77 , G06T2200/28 , G06T2207/10016 , G06T2207/10028 , G06T2207/30201

Abstract: One embodiment of the present invention sets forth a technique for estimating a head pose of a user. The technique includes acquiring depth data associated with a head of the user and initializing each particle included in a set of particles with a different candidate head pose. The technique further includes performing one or more optimization passes that include performing at least one iterative closest point (ICP) iteration for each particle and performing at least one particle swarm optimization (PSO) iteration. Each ICP iteration includes rendering the three-dimensional reference model based on the candidate head pose associated with the particle and comparing the three-dimensional reference model to the depth data. Each PSO iteration comprises updating a global best head pose associated with the set of particles and modifying at least one candidate head pose. The technique further includes modifying a shape of the three-dimensional reference model based on depth data.

33.

发明申请
SYSTEM, METHOD AND COMPUTER PROGRAM PRODUCT FOR GENERATING ONE OR MORE VALUES FOR A SIGNAL PATCH USING NEIGHBORING PATCHES COLLECTED BASED ON A DISTANCE DYNAMICALLY COMPUTED FROM A NOISE DISTRIBUTION OF THE SIGNAL PATCH 审中-公开

公开(公告)号：US20170263041A1

公开(公告)日：2017-09-14

申请号：US15421364

申请日：2017-01-31

Applicant: NVIDIA Corporation

Inventor： Iuri Frosio , Jan Kautz

IPC: G06T15/00 , G09G5/36

CPC classification number: G09G5/363 , G06T5/002

Abstract: A system, method and computer program product are provided for generating one or more values for a signal patch using neighboring patches collected based on a distance dynamically computed from a noise distribution of the signal patch. In use, a reference patch is identified from a signal, and a reference distance is computed based on a noise distribution in the reference patch. Neighbor patches are then collected from the signal based on the computed reference distance from the reference patch. Further, the collected neighbor patches are processed with the reference patch to generate one or more values for the reference patch.

34.

发明申请
MIXED PRIMARY DISPLAY WITH SPATIALLY MODULATED BACKLIGHT 审中-公开
Title translation: 具有空调调制背光的混合主显示

公开(公告)号：US20160307482A1

公开(公告)日：2016-10-20

申请号：US15130886

申请日：2016-04-15

Applicant: NVIDIA Corporation

Inventor： Fu-Chung Huang , David Patrick Luebke , Jan Kautz , Dawid Stanislaw Pajak

IPC: G09G3/00 , G06T15/04 , G09G3/36 , G06T15/80 , G06T15/00 , G09G3/34

CPC classification number: G09G3/002 , G09G3/001 , G09G3/2003 , G09G3/2074 , G09G3/3406 , G09G3/3433 , G09G3/36 , G09G5/363 , G09G5/397 , G09G2300/023 , G09G2300/0426 , G09G2340/0407 , G09G2360/08

Abstract: A method, computer readable medium, and system are disclosed for generating mixed-primary data for display. The method includes the steps of receiving a source image that includes a plurality of pixels, dividing the source image into a plurality of blocks, analyzing the source image based on an image decomposition algorithm, encoding chroma information and modulation information to generate a video signal, and transmitting the video signal to a mixed-primary display. The chroma information and modulation information correspond with two or more mixed-primary color components and are generated by the image decomposition algorithm to minimize error between a reproduced image and the source image. The two or more mixed-primary colors selected for each block of the source image are not limited to any particular set of colors and each mixed-primary color component may be selected from any color capable of being reproduced by the mixed-primary display.

Abstract translation: 公开了一种用于生成用于显示的混合主数据的方法，计算机可读介质和系统。该方法包括以下步骤：接收包括多个像素的源图像，将源图像划分为多个块，基于图像分解算法分析源图像，对色度信息和调制信息进行编码以产生视频信号，并将视频信号发送到混合主显示器。色度信息和调制信息与两个或更多个混合原色分量相对应，并且由图像分解算法产生，以最小化再现图像与源图像之间的误差。为源图像的每个块选择的两个或多个混合原色不限于任何特定的颜色集合，并且可以从能够由混合主显示器再现的任何颜色中选择每个混合原色分量。

35.

发明申请
UNIFIED OPTIMIZATION METHOD FOR END-TO-END CAMERA IMAGE PROCESSING FOR TRANSLATING A SENSOR CAPTURED IMAGE TO A DISPLAY IMAGE 有权
Title translation: 用于将传感器捕获的图像转换为显示图像的端到端相机图像处理的统一优化方法

公开(公告)号：US20150206504A1

公开(公告)日：2015-07-23

申请号：US14600507

申请日：2015-01-20

Applicant: NVIDIA Corporation

Inventor： Dawid Stanislaw Pajak , Felix Heide , Nagilla Dikpal Reddy , Mushfiqur Rouf , Jan Kautz , Kari Pulli , Orazio Gallo

IPC: G09G5/02 , G06T11/00

CPC classification number: G09G5/02 , G06T3/4015 , G06T5/001 , G06T5/002 , G09G5/026 , G09G5/363 , G09G2320/0238 , G09G2320/0242 , G09G2320/0247 , G09G2320/066 , G09G2360/08

Abstract: A computer implemented method of determining a latent image from an observed image is disclosed. The method comprises implementing a plurality of image processing operations within a single optimization framework, wherein the single optimization framework comprises solving a linear minimization expression. The method further comprises mapping the linear minimization expression onto at least one non-linear solver. Further, the method comprises using the non-linear solver, iteratively solving the linear minimization expression in order to extract the latent image from the observed image, wherein the linear minimization expression comprises: a data term, and a regularization term, and wherein the regularization term comprises a plurality of non-linear image priors.

Abstract translation: 公开了一种从观察图像确定潜像的计算机实现方法。该方法包括在单个优化框架内实现多个图像处理操作，其中单个优化框架包括求解线性最小化表达式。该方法还包括将线性最小化表达映射到至少一个非线性求解器上。此外，该方法包括使用非线性求解器，迭代地求解线性最小化表达以从观察图像中提取潜像，其中线性最小化表达式包括：数据项和正则化项，其中正则化术语包括多个非线性图像先验。

36.

发明授权
Performing occlusion-aware global 3D pose and shape estimation of articulated objects 有权

公开(公告)号：US12100113B2

公开(公告)日：2024-09-24

申请号：US17584213

申请日：2022-01-25

Applicant: NVIDIA Corporation

Inventor： Ye Yuan , Umar Iqbal , Pavlo Molchanov , Jan Kautz

IPC: G06T19/20 , G06T7/00 , G06T7/20

CPC classification number: G06T19/20 , G06T7/0002 , G06T7/20 , G06T2207/10016 , G06T2207/20084 , G06T2207/30241 , G06T2219/2016

Abstract: In order to determine accurate three-dimensional (3D) models for objects within a video, the objects are first identified and tracked within the video, and a pose and shape are estimated for these tracked objects. A translation and global orientation are removed from the tracked objects to determine local motion for the objects, and motion infilling is performed to fill in any missing portions for the object within the video. A global trajectory is then determined for the objects within the video, and the infilled motion and global trajectory are then used to determine infilled global motion for the object within the video. This enables the accurate depiction of each object as a 3D pose sequence for that model that accounts for occlusions and global factors within the video.

37.

发明公开
PHYSICS-GUIDED MOTION DIFFUSION MODEL 审中-公开

公开(公告)号：US20240169636A1

公开(公告)日：2024-05-23

申请号：US18317378

申请日：2023-05-15

Applicant: NVIDIA Corporation

Inventor： Ye Yuan , Jiaming Song , Umar Iqbal , Arash Vahdat , Jan Kautz

IPC: G06T13/40 , G06T5/00 , G06T13/80

CPC classification number: G06T13/40 , G06T5/002 , G06T13/80 , G06T2207/20081 , G06T2207/20084

Abstract: Systems and methods are disclosed that improve performance of synthesized motion generated by a diffusion neural network model. A physics-guided motion diffusion model incorporates physical constraints into the diffusion process to model the complex dynamics induced by forces and contact. Specifically, a physics-based motion projection module uses motion imitation in a physics simulator to project the denoised motion of a diffusion step to a physically plausible motion. The projected motion is further used in the next diffusion iteration to guide the denoising diffusion process. The use of physical constraints in the physics-guided motion diffusion model iteratively pulls the motion toward a physically-plausible space, reducing artifacts such as floating, foot sliding, and ground penetration.

38.

发明授权
Future object trajectory predictions for autonomous machine applications 有权

公开(公告)号：US11989642B2

公开(公告)日：2024-05-21

申请号：US17952866

申请日：2022-09-26

Applicant: NVIDIA Corporation

Inventor： Ruben Villegas , Alejandro Troccoli , Iuri Frosio , Stephen Tyree , Wonmin Byeon , Jan Kautz

IPC: G06N3/044 , B60W40/02 , G06N3/045 , G06N3/08

CPC classification number: G06N3/044 , B60W40/02 , G06N3/08 , G06N3/045

Abstract: In various examples, historical trajectory information of objects in an environment may be tracked by an ego-vehicle and encoded into a state feature. The encoded state features for each of the objects observed by the ego-vehicle may be used—e.g., by a bi-directional long short-term memory (LSTM) network—to encode a spatial feature. The encoded spatial feature and the encoded state feature for an object may be used to predict lateral and/or longitudinal maneuvers for the object, and the combination of this information may be used to determine future locations of the object. The future locations may be used by the ego-vehicle to determine a path through the environment, or may be used by a simulation system to control virtual objects—according to trajectories determined from the future locations—through a simulation environment.

39.

发明公开
POSE TRANSFER FOR THREE-DIMENSIONAL CHARACTERS USING A LEARNED SHAPE CODE 审中-公开

公开(公告)号：US20240070987A1

公开(公告)日：2024-02-29

申请号：US18110287

申请日：2023-02-15

Applicant: NVIDIA Corporation

Inventor： Xueting Li , Sifei Liu , Shalini De Mello , Orazio Gallo , Jiashun Wang , Jan Kautz

IPC: G06T19/00 , G06T7/10 , G06T17/20

CPC classification number: G06T19/00 , G06T7/10 , G06T17/20

Abstract: Transferring pose to three-dimensional characters is a common computer graphics task that typically involves transferring the pose of a reference avatar to a (stylized) three-dimensional character. Since three-dimensional characters are created by professional artists through imagination and exaggeration, and therefore, unlike human or animal avatars, have distinct shape and features, matching the pose of a three-dimensional character to that of a reference avatar generally requires manually creating shape information for the three-dimensional character that is required for pose transfer. The present disclosure provides for the automated transfer of a reference pose to a three-dimensional character, based specifically on a learned shape code for the three-dimensional character.

40.

发明授权
Three-dimensional object reconstruction from a video 有权

公开(公告)号：US11880927B2

公开(公告)日：2024-01-23

申请号：US18320446

申请日：2023-05-19

Applicant: NVIDIA Corporation

Inventor： Xueting Li , Sifei Liu , Kihwan Kim , Shalini De Mello , Jan Kautz

IPC: G06T15/04 , G06T7/579 , G06T7/70 , G06T17/20 , G06T15/20

CPC classification number: G06T15/04 , G06T7/579 , G06T7/70 , G06T15/20 , G06T17/20 , G06T2207/10016 , G06T2207/20084 , G06T2207/30244

Abstract: A three-dimensional (3D) object reconstruction neural network system learns to predict a 3D shape representation of an object from a video that includes the object. The 3D reconstruction technique may be used for content creation, such as generation of 3D characters for games, movies, and 3D printing. When 3D characters are generated from video, the content may also include motion of the character, as predicted based on the video. The 3D object construction technique exploits temporal consistency to reconstruct a dynamic 3D representation of the object from an unlabeled video. Specifically, an object in a video has a consistent shape and consistent texture across multiple frames. Texture, base shape, and part correspondence invariance constraints may be applied to fine-tune the neural network system. The reconstruction technique generalizes well—particularly for non-rigid objects.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification