-
Publication Number: US11514293B2
Publication Date: 2022-11-29
Application Number: US16564978
Filing Date: 2019-09-09
Applicant: NVIDIA Corporation
Inventor: Ruben Villegas , Alejandro Troccoli , Iuri Frosio , Stephen Tyree , Wonmin Byeon , Jan Kautz
Abstract: In various examples, historical trajectory information of objects in an environment may be tracked by an ego-vehicle and encoded into a state feature. The encoded state features for each of the objects observed by the ego-vehicle may be used—e.g., by a bi-directional long short-term memory (LSTM) network—to encode a spatial feature. The encoded spatial feature and the encoded state feature for an object may be used to predict lateral and/or longitudinal maneuvers for the object, and the combination of this information may be used to determine future locations of the object. The future locations may be used by the ego-vehicle to determine a path through the environment, or may be used by a simulation system to control virtual objects—according to trajectories determined from the future locations—through a simulation environment.
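A minimal sketch of the encoding scheme the abstract describes, in PyTorch: per-object state histories are encoded by an LSTM, a bi-directional LSTM over the set of observed objects produces a spatial feature, and two heads predict lateral and longitudinal maneuvers. Layer sizes, head definitions, and module names are illustrative assumptions, not the patented implementation.

    import torch
    import torch.nn as nn

    class ManeuverPredictor(nn.Module):
        """Sketch: encode per-object state histories, share context via a
        bi-directional LSTM, and predict lateral/longitudinal maneuvers."""

        def __init__(self, state_dim=4, hidden=64, lat_classes=3, lon_classes=2):
            super().__init__()
            # Per-object temporal encoder over its trajectory history.
            self.state_encoder = nn.LSTM(state_dim, hidden, batch_first=True)
            # Bi-directional LSTM over the observed objects to form a spatial
            # feature that mixes information across neighbors.
            self.spatial_encoder = nn.LSTM(hidden, hidden, batch_first=True,
                                           bidirectional=True)
            # Maneuver heads take the concatenated state + spatial features.
            self.lateral_head = nn.Linear(hidden + 2 * hidden, lat_classes)
            self.longitudinal_head = nn.Linear(hidden + 2 * hidden, lon_classes)

        def forward(self, histories):
            # histories: (num_objects, time_steps, state_dim)
            _, (h_state, _) = self.state_encoder(histories)
            state_feat = h_state[-1]                      # (num_objects, hidden)
            # Treat the object axis as the "sequence" for the bi-LSTM.
            spatial_feat, _ = self.spatial_encoder(state_feat.unsqueeze(0))
            spatial_feat = spatial_feat.squeeze(0)        # (num_objects, 2*hidden)
            feat = torch.cat([state_feat, spatial_feat], dim=-1)
            return self.lateral_head(feat), self.longitudinal_head(feat)

    # Example: 5 observed objects, 20 past time steps of (x, y, vx, vy).
    lat_logits, lon_logits = ManeuverPredictor()(torch.randn(5, 20, 4))
    print(lat_logits.shape, lon_logits.shape)  # torch.Size([5, 3]) torch.Size([5, 2])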
-
Publication Number: US11417011B2
Publication Date: 2022-08-16
Application Number: US16897057
Filing Date: 2020-06-09
Applicant: NVIDIA Corporation
Inventor: Umar Iqbal , Pavlo Molchanov , Jan Kautz
Abstract: Learning to estimate a 3D body pose, and likewise the pose of any type of object, from a single 2D image is of great interest for many practical graphics applications and generally relies on neural networks that have been trained with sample data that annotates (labels) each sample 2D image with a known 3D pose. Requiring this labeled training data, however, has various drawbacks: traditionally used training data sets lack diversity and therefore limit the extent to which neural networks are able to estimate 3D pose, and expanding these data sets is difficult because it requires manual annotation of 2D images, which is time-consuming and error-prone. The present disclosure overcomes these and other limitations of existing techniques by providing a model that is trained from unlabeled multi-view data for use in 3D pose estimation.
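A hedged sketch of how multi-view data can supply a training signal without 3D labels: poses predicted independently from two views of the same subject are penalized when they disagree after being rotated into a common frame. The toy lifting network, the use of a known relative rotation, and all shapes are illustrative assumptions, not the disclosed method.

    import torch
    import torch.nn as nn

    # Toy pose regressor: 2D keypoints (J joints) -> 3D keypoints.
    class PoseLifter(nn.Module):
        def __init__(self, joints=17, hidden=256):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(joints * 2, hidden), nn.ReLU(),
                nn.Linear(hidden, joints * 3))
            self.joints = joints

        def forward(self, kp2d):                 # (batch, J, 2)
            out = self.net(kp2d.flatten(1))
            return out.view(-1, self.joints, 3)  # (batch, J, 3)

    def multiview_consistency_loss(pose_a, pose_b, rot_a_to_b):
        """Penalize disagreement between 3D poses predicted from two views,
        after rotating view-A predictions into view-B coordinates."""
        pose_a_in_b = torch.einsum('ij,bkj->bki', rot_a_to_b, pose_a)
        return ((pose_a_in_b - pose_b) ** 2).mean()

    lifter = PoseLifter()
    kp_a, kp_b = torch.randn(8, 17, 2), torch.randn(8, 17, 2)   # two views of each subject
    rot = torch.eye(3)                                          # placeholder relative rotation
    loss = multiview_consistency_loss(lifter(kp_a), lifter(kp_b), rot)
    loss.backward()  # gradients flow without any 3D ground-truth labels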
-
Publication Number: US11295514B2
Publication Date: 2022-04-05
Application Number: US16685538
Filing Date: 2019-11-15
Applicant: NVIDIA Corporation
Inventor: Jinwei Gu , Kihwan Kim , Jan Kautz , Guilin Liu , Soumyadip Sengupta
Abstract: Inverse rendering estimates physical scene attributes (e.g., reflectance, geometry, and lighting) from image(s) and is used for gaming, virtual reality, augmented reality, and robotics. An inverse rendering network (IRN) receives a single input image of a 3D scene and generates the physical scene attributes for the image. The IRN is trained by using the estimated physical scene attributes generated by the IRN to reproduce the input image and updating parameters of the IRN to reduce differences between the reproduced input image and the input image. A direct renderer and a residual appearance renderer (RAR) reproduce the input image. The RAR predicts a residual image representing complex appearance effects of the real (not synthetic) image based on features extracted from the image and the reflectance and geometry properties. The residual image represents near-field illumination, cast shadows, inter-reflections, and realistic shading that are not provided by the direct renderer.
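A minimal self-supervised training loop in the spirit of the abstract: an inverse rendering network predicts scene attributes, a differentiable direct renderer reproduces the image, a residual appearance renderer adds what the direct model misses, and the reconstruction error updates the networks. All modules below are toy stand-ins, not the patented architecture.

    import torch
    import torch.nn as nn

    class ToyIRN(nn.Module):
        """Predicts per-pixel albedo (3), normals (3), and a global light code."""
        def __init__(self, light_dim=9):
            super().__init__()
            self.backbone = nn.Conv2d(3, 6, kernel_size=3, padding=1)
            self.light = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                                       nn.Linear(3, light_dim))
        def forward(self, img):
            maps = self.backbone(img)
            albedo, normals = maps[:, :3], maps[:, 3:]
            return albedo, normals, self.light(img)

    def direct_render(albedo, normals, light):
        # Crude Lambertian-style shading stand-in: albedo scaled by a light scalar.
        shade = light.mean(dim=1).view(-1, 1, 1, 1)
        return albedo * torch.sigmoid(shade)

    class ToyRAR(nn.Module):
        """Predicts a residual image for effects the direct renderer cannot model."""
        def __init__(self):
            super().__init__()
            self.net = nn.Conv2d(9, 3, kernel_size=3, padding=1)
        def forward(self, img, albedo, normals):
            return self.net(torch.cat([img, albedo, normals], dim=1))

    irn, rar = ToyIRN(), ToyRAR()
    img = torch.rand(2, 3, 64, 64)
    albedo, normals, light = irn(img)
    recon = direct_render(albedo, normals, light) + rar(img, albedo, normals)
    loss = ((recon - img) ** 2).mean()   # self-supervised reconstruction loss
    loss.backward()                      # updates IRN (and RAR) parameters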
-
Publication Number: US20210326694A1
Publication Date: 2021-10-21
Application Number: US16852944
Filing Date: 2020-04-20
Applicant: Nvidia Corporation
Inventor: Jialiang Wang , Varun Jampani , Stan Birchfield , Charles Loop , Jan Kautz
Abstract: Apparatuses, systems, and techniques are presented to determine distance for one or more objects. In at least one embodiment, a disparity network is trained to determine distance data from input stereoscopic images using a loss function that includes at least one of a gradient loss term and an occlusion loss term.
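The abstract names a gradient loss term and an occlusion loss term; below is a hedged sketch of plausible forms for such terms (an edge-aware disparity smoothness penalty and a regularizer on the predicted occlusion mask). The exact formulations in the application may differ.

    import torch

    def gradient_loss(disparity, image):
        """Edge-aware smoothness: penalize disparity gradients, downweighted
        where the image itself has strong gradients (likely true depth edges)."""
        d_dx = (disparity[:, :, :, 1:] - disparity[:, :, :, :-1]).abs()
        d_dy = (disparity[:, :, 1:, :] - disparity[:, :, :-1, :]).abs()
        i_dx = (image[:, :, :, 1:] - image[:, :, :, :-1]).abs().mean(1, keepdim=True)
        i_dy = (image[:, :, 1:, :] - image[:, :, :-1, :]).abs().mean(1, keepdim=True)
        return (d_dx * torch.exp(-i_dx)).mean() + (d_dy * torch.exp(-i_dy)).mean()

    def occlusion_loss(occlusion_logits):
        """Regularize the predicted occlusion mask so the network cannot
        trivially mark everything occluded to dodge the matching loss."""
        return torch.sigmoid(occlusion_logits).mean()

    disp = torch.rand(1, 1, 32, 32, requires_grad=True)   # predicted disparity map
    left = torch.rand(1, 3, 32, 32)                       # left stereo image
    occ = torch.zeros(1, 1, 32, 32, requires_grad=True)   # predicted occlusion logits
    total = gradient_loss(disp, left) + 0.1 * occlusion_loss(occ)
    total.backward()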
-
Publication Number: US11049018B2
Publication Date: 2021-06-29
Application Number: US15880472
Filing Date: 2018-01-25
Applicant: NVIDIA Corporation
Inventor: Xiaodong Yang , Pavlo Molchanov , Jan Kautz
Abstract: A method, computer readable medium, and system are disclosed for visual sequence learning using neural networks. The method includes the steps of replacing a non-recurrent layer within a trained convolutional neural network model with a recurrent layer to produce a visual sequence learning neural network model and transforming feedforward weights for the non-recurrent layer into input-to-hidden weights of the recurrent layer to produce a transformed recurrent layer. The method also includes the steps of setting hidden-to-hidden weights of the recurrent layer to initial values and processing video image data by the visual sequence learning neural network model to generate classification or regression output data.
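A hedged sketch of the weight-transfer step described above, using a plain RNN for brevity: the fully connected layer's feedforward weights become the recurrent layer's input-to-hidden weights, and the hidden-to-hidden weights start from an initial value (identity here). The choice of recurrent cell and initialization is an assumption for illustration.

    import torch
    import torch.nn as nn

    def fc_to_recurrent(fc: nn.Linear) -> nn.RNN:
        """Turn a trained fully connected layer into a recurrent layer whose
        input-to-hidden weights are the FC layer's feedforward weights."""
        rnn = nn.RNN(fc.in_features, fc.out_features, batch_first=True)
        with torch.no_grad():
            rnn.weight_ih_l0.copy_(fc.weight)                    # feedforward -> input-to-hidden
            rnn.bias_ih_l0.copy_(fc.bias)
            rnn.weight_hh_l0.copy_(torch.eye(fc.out_features))   # initial hidden-to-hidden weights
            rnn.bias_hh_l0.zero_()
        return rnn

    fc = nn.Linear(512, 256)                    # e.g., a layer from a trained CNN
    rnn = fc_to_recurrent(fc)
    frame_features = torch.randn(4, 16, 512)    # (batch, video frames, CNN features)
    outputs, last_hidden = rnn(frame_features)  # per-frame outputs for sequence learning
    print(outputs.shape)                        # torch.Size([4, 16, 256])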
-
Publication Number: US11017556B2
Publication Date: 2021-05-25
Application Number: US16152303
Filing Date: 2018-10-04
Applicant: NVIDIA Corporation
Inventor: Xiaodong Yang , Xitong Yang , Fanyi Xiao , Ming-Yu Liu , Jan Kautz
Abstract: Iterative prediction systems and methods for the task of action detection process an input sequence of video frames to generate both action tubes and respective action labels, wherein the action tubes comprise a sequence of bounding boxes on each video frame. An iterative predictor processes large offsets between the bounding boxes and the ground truth.
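A minimal illustration of iterative box refinement, one way an iterative predictor can close large offsets between initial boxes and the ground truth: the same small regressor is applied repeatedly, each pass predicting a correction to the current box. The regressor and the pooled features are placeholders, not the claimed system.

    import torch
    import torch.nn as nn

    class IterativeBoxRefiner(nn.Module):
        """Applies the same small regressor several times, each step predicting
        a delta (dx, dy, dw, dh) that moves the box closer to its target."""
        def __init__(self, feat_dim=128, steps=3):
            super().__init__()
            self.steps = steps
            self.regressor = nn.Sequential(
                nn.Linear(feat_dim + 4, 64), nn.ReLU(), nn.Linear(64, 4))

        def forward(self, features, boxes):
            # features: (num_boxes, feat_dim); boxes: (num_boxes, 4) as (x, y, w, h)
            for _ in range(self.steps):
                delta = self.regressor(torch.cat([features, boxes], dim=-1))
                boxes = boxes + delta        # iterative correction of large offsets
            return boxes

    refiner = IterativeBoxRefiner()
    per_frame_feats = torch.randn(10, 128)          # e.g., pooled features per proposal
    initial_boxes = torch.rand(10, 4) * 100
    refined = refiner(per_frame_feats, initial_boxes)
    print(refined.shape)                            # torch.Size([10, 4])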
-
Publication Number: US20210150736A1
Publication Date: 2021-05-20
Application Number: US17156406
Filing Date: 2021-01-22
Applicant: NVIDIA Corporation
Inventor: Zhaoyang Lv , Kihwan Kim , Deqing Sun , Alejandro Jose Troccoli , Jan Kautz
IPC: G06T7/254 , G06T7/90 , G06T7/50 , G06N3/08 , G06T7/194 , G06T3/00 , G06T7/70 , G06T7/60 , G06T7/11 , G06N5/04 , G06T7/285 , G06T7/215
Abstract: A neural network model receives color data for a sequence of images corresponding to a dynamic scene in three-dimensional (3D) space. Motion of objects in the image sequence results from a combination of the camera's changing orientation and position and of motion or a change in the shape of objects in the 3D space. The neural network model generates two components that are used to produce a 3D motion field representing the dynamic (non-rigid) part of the scene: information identifying the dynamic and static portions of each image, and the camera orientation. The dynamic portions of each image contain motion in the 3D space that is independent of the camera orientation. In other words, the motion in the 3D space (estimated 3D scene flow data) is separated from the motion of the camera.
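A hedged sketch of the separation the abstract describes: the camera-induced motion of each 3D point is computed from the estimated camera motion and subtracted from the observed motion, and the result is kept only where a dynamic/static mask marks independent motion. The point-based formulation and all shapes are illustrative assumptions.

    import torch

    def dynamic_scene_flow(points_t0, points_t1, rot, trans, dynamic_mask):
        """points_t0, points_t1: (N, 3) 3D points at consecutive frames.
        rot (3, 3), trans (3,): estimated rigid transform a static point undergoes
        between the frames due to camera motion.
        dynamic_mask: (N,) 1 where the network marks a point as independently moving.
        Returns the 3D motion field with camera-induced (rigid) motion removed."""
        camera_induced = points_t0 @ rot.T + trans - points_t0   # motion if the scene were static
        total_motion = points_t1 - points_t0
        return (total_motion - camera_induced) * dynamic_mask.unsqueeze(-1)

    # Toy example: 5 points, camera translates along x, one point also moves on its own.
    p0 = torch.zeros(5, 3)
    rot, trans = torch.eye(3), torch.tensor([1.0, 0.0, 0.0])
    p1 = p0 + trans                            # static points only move because the camera moved
    p1[2] += torch.tensor([0.0, 2.0, 0.0])     # point 2 also moves independently along y
    mask = torch.tensor([0., 0., 1., 0., 0.])
    print(dynamic_scene_flow(p0, p1, rot, trans, mask))
    # Non-zero flow only at the dynamic point: its own 2.0 motion along y.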
-
Publication Number: US20210142177A1
Publication Date: 2021-05-13
Application Number: US16682967
Filing Date: 2019-11-13
Applicant: Nvidia Corporation
Inventor: Arun Mallya , Jan Kautz , Zhizhong Li , Pavlo Molchanov , Hongxu Danny Yin
Abstract: Apparatuses, systems, and techniques are presented to generate data useful for further training of a neural network. In at least one embodiment, one or more neural networks can be re-trained based, at least in part, on data generated by the one or more neural networks including data used to previously train the one or more neural networks.
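A simplified sketch of generating training data from a trained network itself: random inputs are optimized until the frozen network assigns them target labels, and the resulting synthetic examples can be replayed during later re-training. The actual technique may involve additional regularization (e.g., matching internal statistics); this shows only the core idea under assumed shapes.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))   # stand-in trained classifier
    model.eval()

    def synthesize(model, target_labels, steps=200, lr=0.1):
        """Optimize random inputs until the frozen model labels them as desired."""
        images = torch.randn(len(target_labels), 1, 28, 28, requires_grad=True)
        optimizer = torch.optim.Adam([images], lr=lr)
        for _ in range(steps):
            optimizer.zero_grad()
            loss = F.cross_entropy(model(images), target_labels)
            loss.backward()
            optimizer.step()
        return images.detach()

    targets = torch.arange(10)                # one synthetic example per class
    synthetic_images = synthesize(model, targets)
    # These (image, label) pairs can be mixed into later training batches so the
    # network retains its earlier behavior while learning from new data.
    print(synthetic_images.shape)             # torch.Size([10, 1, 28, 28])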
-
Publication Number: US20210117661A1
Publication Date: 2021-04-22
Application Number: US17135697
Filing Date: 2020-12-28
Applicant: NVIDIA Corporation
Inventor: Umar Iqbal , Pavlo Molchanov , Thomas Michael Breuel , Jan Kautz
Abstract: Estimating a three-dimensional (3D) pose of an object, such as a hand or body (human, animal, robot, etc.), from a 2D image is necessary for human-computer interaction. A hand pose can be represented by a set of points in 3D space, called keypoints. Two coordinates (x,y) represent spatial displacement and a third coordinate represents a depth of every point with respect to the camera. A monocular camera is used to capture an image of the 3D pose, but does not capture depth information. A neural network architecture is configured to generate a depth value for each keypoint in the captured image, even when portions of the pose are occluded, or the orientation of the object is ambiguous. Generation of the depth values enables estimation of the 3D pose of the object.
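A small sketch of the final step the abstract implies: once the network produces a depth value per keypoint, the 2D keypoints can be back-projected through the camera intrinsics to recover the 3D pose. The depth-prediction head and the intrinsic parameters below are assumed for illustration.

    import torch
    import torch.nn as nn

    class KeypointDepthHead(nn.Module):
        """Toy head: maps per-keypoint image features to one depth value each."""
        def __init__(self, feat_dim=64):
            super().__init__()
            self.net = nn.Sequential(nn.Linear(feat_dim, 32), nn.ReLU(), nn.Linear(32, 1))
        def forward(self, keypoint_features):             # (J, feat_dim)
            return self.net(keypoint_features).squeeze(-1)   # (J,) depths

    def backproject(kp2d, depth, fx, fy, cx, cy):
        """Lift 2D keypoints (pixels) plus per-keypoint depth to 3D camera coordinates."""
        x = (kp2d[:, 0] - cx) * depth / fx
        y = (kp2d[:, 1] - cy) * depth / fy
        return torch.stack([x, y, depth], dim=-1)         # (J, 3)

    J = 21                                            # e.g., 21 hand keypoints
    kp2d = torch.rand(J, 2) * 256                     # detected 2D keypoints in a 256x256 image
    depth = KeypointDepthHead()(torch.randn(J, 64))   # predicted depth per keypoint
    pose3d = backproject(kp2d, depth, fx=500., fy=500., cx=128., cy=128.)
    print(pose3d.shape)                               # torch.Size([21, 3])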
-
Publication Number: US10964061B2
Publication Date: 2021-03-30
Application Number: US16872752
Filing Date: 2020-05-12
Applicant: NVIDIA Corporation
Inventor: Jinwei Gu , Samarth Manoj Brahmbhatt , Kihwan Kim , Jan Kautz
Abstract: A deep neural network (DNN) system learns a map representation for estimating a camera position and orientation (pose). The DNN is trained to learn a map representation corresponding to the environment, defining positions and attributes of structures, trees, walls, vehicles, etc. The DNN system learns a map representation that is versatile and performs well for many different environments (indoor, outdoor, natural, synthetic, etc.). The DNN system receives images of an environment captured by a camera (observations) and outputs an estimated camera pose within the environment. The estimated camera pose is used to perform camera localization, i.e., recover the three-dimensional (3D) position and orientation of a moving camera, which is a fundamental task in computer vision with a wide variety of applications in robot navigation, car localization for autonomous driving, device localization for mobile navigation, and augmented/virtual reality.
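A bare-bones illustration of learned camera localization as a regression problem: a CNN maps an observed image to a 3D position and an orientation (unit quaternion), trained against known poses. The backbone, loss weighting, and quaternion parameterization are assumptions for the sketch, not the disclosed map representation.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class PoseRegressor(nn.Module):
        """Image in, camera pose out: 3D translation plus unit-quaternion rotation."""
        def __init__(self):
            super().__init__()
            self.backbone = nn.Sequential(
                nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
                nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool2d(1), nn.Flatten())
            self.fc_translation = nn.Linear(32, 3)
            self.fc_rotation = nn.Linear(32, 4)

        def forward(self, img):
            feat = self.backbone(img)
            t = self.fc_translation(feat)
            q = F.normalize(self.fc_rotation(feat), dim=-1)   # keep quaternion unit-length
            return t, q

    model = PoseRegressor()
    images = torch.rand(8, 3, 128, 128)                       # camera observations
    gt_t, gt_q = torch.randn(8, 3), F.normalize(torch.randn(8, 4), dim=-1)
    pred_t, pred_q = model(images)
    loss = F.l1_loss(pred_t, gt_t) + 0.5 * F.l1_loss(pred_q, gt_q)  # weighted pose loss
    loss.backward()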