Patent search ap:("NVIDIA CORPORATION") AND inv:"Jan Kautz" Page 7

61.

Invention application
FUTURE OBJECT TRAJECTORY PREDICTIONS FOR AUTONOMOUS MACHINE APPLICATIONS (under examination, published)

Publication (announcement) No.: US20200082248A1

Publication (announcement) date: 2020-03-12

Application No.: US16564978

Application date: 2019-09-09

Applicant: NVIDIA Corporation

Inventor: Ruben Villegas, Alejandro Troccoli, Iuri Frosio, Stephen Tyree, Wonmin Byeon, Jan Kautz

IPC: G06N3/04, G06N3/08, G05D1/02

Abstract: In various examples, historical trajectory information of objects in an environment may be tracked by an ego-vehicle and encoded into a state feature. The encoded state features for each of the objects observed by the ego-vehicle may be used—e.g., by a bi-directional long short-term memory (LSTM) network—to encode a spatial feature. The encoded spatial feature and the encoded state feature for an object may be used to predict lateral and/or longitudinal maneuvers for the object, and the combination of this information may be used to determine future locations of the object. The future locations may be used by the ego-vehicle to determine a path through the environment, or may be used by a simulation system to control virtual objects—according to trajectories determined from the future locations—through a simulation environment.

62.

Invention application
FAST MULTI-SCALE POINT CLOUD REGISTRATION WITH A HIERARCHICAL GAUSSIAN MIXTURE (under examination, published)

Publication (announcement) No.: US20190319851A1

Publication (announcement) date: 2019-10-17

Application No.: US16351312

Application date: 2019-03-12

Applicant: NVIDIA Corporation

Inventor: Benjamin David Eckart, Kihwan Kim, Jan Kautz

IPC: H04L12/24, H04L29/08, G06F16/22

Abstract: Point cloud registration sits at the core of many important and challenging 3D perception problems including autonomous navigation, object/scene recognition, and augmented reality (AR). A new registration algorithm is presented that achieves speed and accuracy by registering a point cloud to a representation of a reference point cloud. A target point cloud is registered to the reference point cloud by iterating through a number of cycles of an EM algorithm where, during an Expectation step, each point in the target point cloud is associated with a node of a hierarchical tree data structure and, during a Maximization step, an estimated transformation is determined based on the association of the points with corresponding nodes of the hierarchical tree data structure. The estimated transformation is determined by solving a minimization problem associated with a sum, over a number of mixture components, of terms related to a Mahalanobis distance.
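
The Expectation step and the Mahalanobis-based objective described in this abstract can be illustrated with a small sketch. It is not the patented algorithm: it assumes 2D points, diagonal covariances, a flat list of mixture components instead of a hierarchical tree, and a hard nearest-component assignment. It only evaluates the cost that a Maximization step would minimize. All function and field names are hypothetical.

```python
import math

def mahalanobis_sq(point, mean, var):
    """Squared Mahalanobis distance for a diagonal covariance (per-axis variances)."""
    return sum((p - m) ** 2 / v for p, m, v in zip(point, mean, var))

def e_step(points, components):
    """Hard assignment (an assumption): index of the closest component per point."""
    return [
        min(range(len(components)),
            key=lambda j: mahalanobis_sq(pt, components[j]["mean"], components[j]["var"]))
        for pt in points
    ]

def apply_transform(points, theta, t):
    """Rotate 2D points by angle theta (radians), then translate by t."""
    c, s = math.cos(theta), math.sin(theta)
    return [(c * x - s * y + t[0], s * x + c * y + t[1]) for x, y in points]

def registration_cost(points, components, assignment):
    """Sum of squared Mahalanobis distances between points and assigned components."""
    return sum(mahalanobis_sq(pt, components[j]["mean"], components[j]["var"])
               for pt, j in zip(points, assignment))

# Reference cloud summarized by two components; target cloud shifted by 0.5 in x.
components = [{"mean": (0.0, 0.0), "var": (1.0, 1.0)},
              {"mean": (10.0, 0.0), "var": (1.0, 4.0)}]
target = [(0.5, 0.0), (10.5, 0.0)]
assignment = e_step(target, components)
cost_before = registration_cost(target, components, assignment)
aligned = apply_transform(target, 0.0, (-0.5, 0.0))
cost_after = registration_cost(aligned, components, assignment)
```

In this toy case the candidate translation that undoes the shift lowers the cost, which is the quantity an M-step would drive down when searching over transformations.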

63.

Granted invention patent
System and method for optical flow estimation (in force)

Publication (announcement) No.: US10424069B2

Publication (announcement) date: 2019-09-24

Application No.: US15942213

Application date: 2018-03-30

Applicant: NVIDIA Corporation

Inventor: Deqing Sun, Xiaodong Yang, Ming-Yu Liu, Jan Kautz

IPC: G06T7/207, G06N5/04, G06T3/00, G06T7/00, G06T7/246, G06N3/04, G06N3/08

Abstract: A method, computer readable medium, and system are disclosed for estimating optical flow between two images. A first pyramidal set of features is generated for a first image and a partial cost volume for a level of the first pyramidal set of features is computed, by a neural network, using features at the level of the first pyramidal set of features and warped features extracted from a second image, where the partial cost volume is computed across a limited range of pixels that is less than a full resolution of the first image, in pixels, at the level. The neural network processes the features and the partial cost volume to produce a refined optical flow estimate for the first image and the second image.
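
The idea of a partial cost volume, computed over a limited search range rather than the full image extent, can be sketched as follows. This is a plain correlation over hand-written feature grids, assuming features stored as nested lists of shape [H][W][C] and zero cost for out-of-bounds offsets. In the patent the features are learned and the computation happens inside a neural network; `partial_cost_volume` and `max_disp` are hypothetical names.

```python
def partial_cost_volume(feat1, feat2_warped, max_disp):
    """Correlate each feature in feat1 with feat2_warped over a (2d+1)^2 window.

    feat1, feat2_warped: nested lists of shape [H][W][C].
    Returns a nested list of shape [H][W][(2d+1)^2]; out-of-bounds offsets score 0.
    """
    h, w = len(feat1), len(feat1[0])
    c = len(feat1[0][0])
    # Offsets are enumerated row-major: dy is the outer loop, dx the inner loop.
    offsets = [(dy, dx)
               for dy in range(-max_disp, max_disp + 1)
               for dx in range(-max_disp, max_disp + 1)]
    volume = []
    for y in range(h):
        row = []
        for x in range(w):
            costs = []
            for dy, dx in offsets:
                yy, xx = y + dy, x + dx
                if 0 <= yy < h and 0 <= xx < w:
                    dot = sum(a * b for a, b in zip(feat1[y][x], feat2_warped[yy][xx]))
                    costs.append(dot / c)  # normalize by channel count
                else:
                    costs.append(0.0)
            row.append(costs)
        volume.append(row)
    return volume

# A single active feature at (1, 1) in image 1 that moved to (1, 2) in image 2.
f1 = [[[0.0] for _ in range(3)] for _ in range(3)]
f2 = [[[0.0] for _ in range(3)] for _ in range(3)]
f1[1][1] = [1.0]
f2[1][2] = [1.0]
vol = partial_cost_volume(f1, f2, max_disp=1)
```

With `max_disp=1` each pixel gets 9 matching costs instead of one per pixel of the other image, which is where the saving comes from.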

64.

Invention application
Domain Stylization Using a Neural Network Model (under examination, published)

Publication (announcement) No.: US20190244060A1

Publication (announcement) date: 2019-08-08

Application No.: US16265725

Application date: 2019-02-01

Applicant: NVIDIA Corporation

Inventor: Aysegul Dundar, Ming-Yu Liu, Ting-Chun Wang, John Zedlewski, Jan Kautz

IPC: G06K9/62, G06N3/08, G06N3/04, G06K9/32, G06T3/00, G06T7/10

CPC classification number: G06K9/6256, G06K9/3233, G06K9/6267, G06N3/0454, G06N3/08, G06T3/0056, G06T7/10

Abstract: A style transfer neural network may be used to generate stylized synthetic images, where real images provide the style (e.g., seasons, weather, lighting) for transfer to synthetic images. The stylized synthetic images may then be used to train a recognition neural network. In turn, the trained neural network may be used to predict semantic labels for the real images, providing recognition data for the real images. Finally, the real training dataset (real images and predicted recognition data) and the synthetic training dataset are used by the style transfer neural network to generate stylized synthetic images. The training of the neural network, prediction of recognition data for the real images, and stylizing of the synthetic images may be repeated for a number of iterations. The stylization operation more closely aligns a covariate of the synthetic images to the covariate of the real images, improving accuracy of the recognition neural network.

65.

Invention application
SWITCHABLE PROPAGATION NEURAL NETWORK (under examination, published)

Publication (announcement) No.: US20190213439A1

Publication (announcement) date: 2019-07-11

Application No.: US16353835

Application date: 2019-03-14

Applicant: NVIDIA Corporation

Inventor: Sifei Liu, Shalini De Mello, Jinwei Gu, Varun Jampani, Jan Kautz

IPC: G06K9/62, G06K9/00, G06T7/90, G06T5/00, G06T7/10, G06T5/50, G06N3/08, G06N3/04

CPC classification number: G06K9/6215, G06K9/00744, G06K9/6256, G06N3/04, G06N3/08, G06N3/084, G06T5/009, G06T5/50, G06T7/10, G06T7/90, G06T2207/10016, G06T2207/20081, G06T2207/20084, G06T2207/20208

Abstract: A temporal propagation network (TPN) system learns the affinity matrix for video image processing tasks. An affinity matrix is a generic matrix that defines the similarity of two points in space. The TPN system includes a guidance neural network model and a temporal propagation module and is trained for a particular computer vision task to propagate visual properties from a key-frame represented by dense data (color), to another frame that is represented by coarse data (grey-scale). The guidance neural network model generates an affinity matrix referred to as a global transformation matrix from task-specific data for the key-frame and the other frame. The temporal propagation module applies the global transformation matrix to the key-frame property data to produce propagated property data (color) for the other frame. For example, the TPN system may be used to colorize several frames of greyscale video using a single manually colorized key-frame.
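
Applying a transformation (affinity) matrix to key-frame property data amounts to a matrix-vector product per property channel. The sketch below assumes a tiny hand-written affinity matrix and a single color channel; in the patent the matrix is produced by the guidance neural network. The row normalization is an added assumption, and `propagate` and `row_normalize` are hypothetical names.

```python
def row_normalize(matrix):
    """Scale each row to sum to 1 so propagated values stay within the key-frame's range."""
    out = []
    for row in matrix:
        total = sum(row)
        out.append([w / total for w in row] if total else list(row))
    return out

def propagate(transform, key_values):
    """Apply an (n_target x n_key) matrix to per-pixel key-frame values."""
    return [sum(w * v for w, v in zip(row, key_values)) for row in transform]

# Two target-frame pixels, two key-frame pixels.
# Entry [i][j] is the similarity between target pixel i and key-frame pixel j.
affinity = [[3.0, 1.0],
            [0.0, 2.0]]
key_color = [0.0, 1.0]  # one color channel of the key-frame
propagated = propagate(row_normalize(affinity), key_color)
```

Each target pixel receives a weighted average of key-frame values, with the weights given by its row of the matrix.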

66.

Invention application
SYSTEMS AND METHODS FOR DYNAMIC FACIAL ANALYSIS USING A RECURRENT NEURAL NETWORK (under examination, published)

Publication (announcement) No.: US20190180469A1

Publication (announcement) date: 2019-06-13

Application No.: US15836549

Application date: 2017-12-08

Applicant: NVIDIA Corporation

Inventor: Jinwei Gu, Xiaodong Yang, Shalini De Mello, Jan Kautz

IPC: G06T7/73, G06N3/08

CPC classification number: G06T7/73, G06N3/08, G06T3/4046, G06T13/40, G06T2207/10016, G06T2207/20081, G06T2207/20084, G06T2207/30201, G06T2207/30204

Abstract: A method, computer readable medium, and system are disclosed for dynamic facial analysis. The method includes the steps of receiving video data representing a sequence of image frames including at least one head and extracting, by a neural network, spatial features comprising pitch, yaw, and roll angles of the at least one head from the video data. The method also includes the step of processing, by a recurrent neural network, the spatial features for two or more image frames in the sequence of image frames to produce head pose estimates for the at least one head.

67.

Invention application
CREATING AN IMAGE UTILIZING A MAP REPRESENTING DIFFERENT CLASSES OF PIXELS (under examination, published)

Publication (announcement) No.: US20190147296A1

Publication (announcement) date: 2019-05-16

Application No.: US16188920

Application date: 2018-11-13

Applicant: NVIDIA Corporation

Inventor: Ting-Chun Wang, Ming-Yu Liu, Bryan Christopher Catanzaro, Jan Kautz, Andrew J. Tao

IPC: G06K9/62, G06K9/68, G06K9/72

Abstract: A method, computer readable medium, and system are disclosed for creating an image utilizing a map representing different classes of specific pixels within a scene. One or more computing systems use the map to create a preliminary image. This preliminary image is then compared to an original image that was used to create the map. A determination is made whether the preliminary image matches the original image, and results of the determination are used to adjust the computing systems that created the preliminary image, which improves a performance of such computing systems. The adjusted computing systems are then used to create images based on different input maps representing various object classes of specific pixels within a scene.

68.

Invention application
Learning-Based Camera Pose Estimation From Images of an Environment (under examination, published)

Publication (announcement) No.: US20190108651A1

Publication (announcement) date: 2019-04-11

Application No.: US16137064

Application date: 2018-09-20

Applicant: NVIDIA Corporation

Inventor: Jinwei Gu, Samarth Manoj Brahmbhatt, Kihwan Kim, Jan Kautz

IPC: G06T7/80, G06T7/00

Abstract: A deep neural network (DNN) system learns a map representation for estimating a camera position and orientation (pose). The DNN is trained to learn a map representation corresponding to the environment, defining positions and attributes of structures, trees, walls, vehicles, etc. The DNN system learns a map representation that is versatile and performs well for many different environments (indoor, outdoor, natural, synthetic, etc.). The DNN system receives images of an environment captured by a camera (observations) and outputs an estimated camera pose within the environment. The estimated camera pose is used to perform camera localization, i.e., recover the three-dimensional (3D) position and orientation of a moving camera, which is a fundamental task in computer vision with a wide variety of applications in robot navigation, car localization for autonomous driving, device localization for mobile navigation, and augmented/virtual reality.

69.

Invention application
UNIFIED OPTIMIZATION METHOD FOR END-TO-END CAMERA IMAGE PROCESSING FOR TRANSLATING A SENSOR CAPTURED IMAGE TO A DISPLAY IMAGE (under examination, published)

Publication (announcement) No.: US20170011710A1

Publication (announcement) date: 2017-01-12

Application No.: US15276626

Application date: 2016-09-26

Applicant: NVIDIA Corporation

Inventor: Dawid Stanislaw Pajak, Felix Heide, Nagilla Dikpal Reddy, Mushfiqur Rouf, Jan Kautz, Kari Pulli, Orazio Gallo

IPC: G09G5/02, G09G5/36, G06T3/40, G06T5/00

CPC classification number: G09G5/02, G06T3/4015, G06T5/001, G06T5/002, G09G5/026, G09G5/363, G09G2320/0238, G09G2320/0242, G09G2320/0247, G09G2320/066, G09G2360/08

Abstract: A computer implemented method of determining a latent image from an observed image is disclosed. The method comprises implementing a plurality of image processing operations within a single optimization framework, wherein the single optimization framework comprises solving a linear minimization expression. The method further comprises mapping the linear minimization expression onto at least one non-linear solver. Further, the method comprises using the non-linear solver, iteratively solving the linear minimization expression in order to extract the latent image from the observed image, wherein the linear minimization expression comprises: a data term, and a regularization term, and wherein the regularization term comprises a plurality of non-linear image priors.

70.

Invention application
DUAL FORMULATION FOR A COMPUTER VISION RETENTION MODEL (in force)

Publication (announcement) No.: US20250111661A1

Publication (announcement) date: 2025-04-03

Application No.: US18882629

Application date: 2024-09-11

Applicant: NVIDIA Corporation

Inventor: Ali Hatamizadeh, Michael Ranzinger, Jan Kautz

IPC: G06V10/82, G06V10/26, G06V10/774, G06V10/776, G06V10/94

Abstract: Transformers are neural networks that learn context and thus meaning by tracking relationships in sequential data. The main building block of transformers is self-attention, which allows interaction among all input sequence tokens. This scheme effectively captures short- and long-range spatial dependencies, which enables the use of transformers in Natural Language Processing (NLP) and computer vision tasks, but it imposes quadratic time and space complexity in the input sequence length. While the training parallelism of transformers allows for competitive performance, inference is slow and expensive due to this computational complexity. The present disclosure provides a computer vision retention model that is configured for both parallel training and recurrent inference, which can enable competitive performance during training and fast and memory-efficient inference during deployment.
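
The claim that one model can support both parallel training and recurrent inference rests on two formulations producing the same outputs. The sketch below shows this for a generic 1D causal retention with a scalar decay `gamma`, in the style of published retention networks. This is an assumption: the patent's computer vision formulation may differ (for example, in how it handles 2D inputs). Function names are hypothetical, and the parallel form is written with explicit loops for readability.

```python
def retention_parallel(q, k, v, gamma):
    """Parallel form: out[n] = sum over m <= n of gamma^(n-m) * (q[n] . k[m]) * v[m]."""
    n_tokens, d_v = len(q), len(v[0])
    out = []
    for n in range(n_tokens):
        acc = [0.0] * d_v
        for m in range(n + 1):
            score = gamma ** (n - m) * sum(a * b for a, b in zip(q[n], k[m]))
            for j in range(d_v):
                acc[j] += score * v[m][j]
        out.append(acc)
    return out

def retention_recurrent(q, k, v, gamma):
    """Recurrent form: state = gamma * state + outer(k[n], v[n]); out[n] = q[n] @ state."""
    d_k, d_v = len(k[0]), len(v[0])
    state = [[0.0] * d_v for _ in range(d_k)]  # fixed-size memory, independent of length
    out = []
    for qn, kn, vn in zip(q, k, v):
        for i in range(d_k):
            for j in range(d_v):
                state[i][j] = gamma * state[i][j] + kn[i] * vn[j]
        out.append([sum(qn[i] * state[i][j] for i in range(d_k)) for j in range(d_v)])
    return out

q = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]
k = [[1.0, 2.0], [0.0, 1.0], [1.0, 1.0]]
v = [[1.0, 0.0, 2.0], [0.0, 1.0, 1.0], [2.0, 2.0, 0.0]]
par = retention_parallel(q, k, v, gamma=0.9)
rec = retention_recurrent(q, k, v, gamma=0.9)
max_diff = max(abs(a - b) for pr, rr in zip(par, rec) for a, b in zip(pr, rr))
```

The recurrent form keeps only a `d_k x d_v` state per step, so its memory use does not grow with sequence length, while the parallel form processes all token pairs at once.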
