SHARPNESS-AWARE MINIMIZATION FOR ROBUSTNESS IN SPARSE NEURAL NETWORKS

    Publication number: US20240127067A1

    Publication date: 2024-04-18

    Application number: US18459083

    Filing date: 2023-08-31

    IPC classification: G06N3/082

    CPC classification: G06N3/082

    Abstract: Systems and methods are disclosed for improving the natural robustness of sparse neural networks. Pruning a dense neural network may improve inference speed and reduce the memory footprint and energy consumption of the resulting sparse neural network while maintaining a desired level of accuracy. In real-world scenarios in which sparse neural networks deployed in autonomous vehicles perform tasks such as object detection and classification on acquired inputs (images), the neural networks need to be robust to new environments, weather conditions, camera effects, etc. Applying sharpness-aware minimization (SAM) optimization during training of the sparse neural network improves performance for out-of-distribution (OOD) images compared with conventional stochastic gradient descent (SGD) optimization. SAM optimizes a neural network to find a flat minimum: a point that not only has a small loss value but also lies within a neighborhood of uniformly low loss.
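    The two-step SAM update described in the abstract can be illustrated with a minimal, self-contained sketch. This is not the patent's implementation: the toy quadratic loss, the step size `lr`, and the perturbation radius `rho` are illustrative assumptions; the structure (ascend to a worst-case nearby point, then apply that point's gradient to the original weights) is the core of SAM.

```python
import numpy as np

def loss(w):
    # Toy quadratic loss standing in for the network's training loss.
    return 0.5 * np.sum(w ** 2)

def grad(w):
    # Analytic gradient of the toy loss above.
    return w

def sam_step(w, lr=0.1, rho=0.05):
    """One sharpness-aware minimization (SAM) update.

    SAM first perturbs the weights toward the worst-case nearby point
    (steepest ascent, scaled to radius rho), then applies the gradient
    computed at that perturbed point to the original weights, steering
    the optimizer toward flat minima rather than sharp ones.
    """
    g = grad(w)
    eps = rho * g / (np.linalg.norm(g) + 1e-12)  # ascent perturbation
    return w - lr * grad(w + eps)                # descend with perturbed gradient

w = np.array([1.0, -2.0])
for _ in range(20):
    w = sam_step(w)
```

In a real training loop the two gradient evaluations per step are the price paid for the flatter minimum, which is what improves OOD robustness here.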

    PRUNING A VISION TRANSFORMER
    Invention Application

    Publication number: US20230080247A1

    Publication date: 2023-03-16

    Application number: US17551005

    Filing date: 2021-12-14

    IPC classification: G06V10/94 G06V10/70

    Abstract: A vision transformer is a deep learning model used to perform vision processing tasks such as image recognition. Vision transformers are currently designed with a plurality of same-size blocks that perform the vision processing tasks. However, some portions of these blocks are unnecessary and not only slow down the vision transformer but also use more memory than required. In response, the parameters of these blocks are analyzed to determine a score for each parameter, and if the score falls below a threshold, the parameter is removed from the associated block. This reduces the size of the resulting vision transformer, which reduces unnecessary memory usage and increases performance.
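    The score-and-threshold step described above can be sketched as follows. The abstract does not specify the scoring function, so weight magnitude is assumed here purely for illustration; the pattern (score each parameter, drop those below a threshold) is what matters.

```python
import numpy as np

def prune_by_score(weights, threshold):
    """Zero out parameters whose score falls below the threshold.

    Weight magnitude serves as the per-parameter score for
    illustration; the surviving mask shrinks the effective block.
    """
    scores = np.abs(weights)        # score each parameter
    mask = scores >= threshold      # keep only high-scoring parameters
    return weights * mask, mask

# A tiny stand-in for one transformer block's weight matrix.
w = np.array([[0.9, -0.01, 0.3],
              [0.02, -0.7, 0.05]])
pruned, mask = prune_by_score(w, threshold=0.1)
print(int(mask.sum()))  # number of surviving parameters → 3
```

With a sparse storage format, the zeroed parameters need not be stored or multiplied at all, which is the source of the memory and speed gains.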

    Articulated body mesh estimation using three-dimensional (3D) body keypoints

    Publication number: US11361507B1

    Publication date: 2022-06-14

    Application number: US17315060

    Filing date: 2021-05-07

    IPC classification: G06T17/20 G06T19/20

    Abstract: Estimating the three-dimensional (3D) pose and shape of an articulated body mesh is useful for many different applications, including health and fitness, entertainment, and computer graphics. A set of estimated 3D keypoint positions for a human body structure is processed to compute parameters defining the pose and shape of a parametric human body mesh using a set of geometric operations. During processing, 3D keypoints are extracted from the parametric human body mesh and a set of rotations is computed to align the extracted 3D keypoints with the estimated 3D keypoints. The set of rotations may correctly position a particular 3D keypoint at a "joint", but an arbitrary rotation of the "joint" keypoint may produce a twist in the connection to a child keypoint. Rules are applied to the set of rotations to resolve ambiguous twists and articulate the parametric human body mesh according to the computed parameters.
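    The twist ambiguity mentioned in the abstract can be sketched with a small geometric example. A rotation that aligns an extracted bone direction with an estimated one fixes the "swing" of a joint, but any extra rotation about the bone axis (the twist) would align the keypoints equally well. This minimal NumPy sketch (Rodrigues' rotation formula, assuming the directions are not anti-parallel) computes only the swing part; it is an illustration of the ambiguity, not the patent's rules for resolving it.

```python
import numpy as np

def swing_rotation(src, dst):
    """Rotation matrix aligning direction src to direction dst.

    This fixes the joint's swing; any further rotation about dst
    (the twist) would leave the keypoint alignment unchanged, which
    is the ambiguity the disclosed rules must resolve.
    """
    a = src / np.linalg.norm(src)
    b = dst / np.linalg.norm(dst)
    v = np.cross(a, b)                 # rotation axis (unnormalized)
    c = np.dot(a, b)                   # cosine of the rotation angle
    K = np.array([[0.0, -v[2], v[1]],
                  [v[2], 0.0, -v[0]],
                  [-v[1], v[0], 0.0]])  # skew-symmetric cross-product matrix
    # Rodrigues' formula in simplified form; assumes c != -1.
    return np.eye(3) + K + K @ K / (1.0 + c)

R = swing_rotation(np.array([1.0, 0.0, 0.0]), np.array([0.0, 1.0, 0.0]))
print(np.allclose(R @ np.array([1.0, 0.0, 0.0]), [0.0, 1.0, 0.0]))  # → True
```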

    3D HUMAN BODY POSE ESTIMATION USING A MODEL TRAINED FROM UNLABELED MULTI-VIEW DATA

    Publication number: US20210248772A1

    Publication date: 2021-08-12

    Application number: US16897057

    Filing date: 2020-06-09

    Abstract: Learning to estimate a 3D body pose (and, likewise, the pose of any type of object) from a single 2D image is of great interest for many practical graphics applications, and generally relies on neural networks that have been trained with sample data annotating (labeling) each sample 2D image with a known 3D pose. Requiring this labeled training data, however, has various drawbacks, including that traditionally used training data sets lack diversity and therefore limit the extent to which neural networks are able to estimate 3D pose. Expanding these training data sets is also difficult, since it requires manually provided annotations for 2D images, which is time-consuming and prone to errors. The present disclosure overcomes these and other limitations of existing techniques by providing a model that is trained from unlabeled multi-view data for use in 3D pose estimation.

    NEURAL NETWORK BASED FACIAL ANALYSIS USING FACIAL LANDMARKS AND ASSOCIATED CONFIDENCE VALUES

    Publication number: US20210182625A1

    Publication date: 2021-06-17

    Application number: US17004252

    Filing date: 2020-08-27

    IPC classification: G06K9/62 G06K9/00

    Abstract: Systems and methods are disclosed for more accurate and robust determination of subject characteristics from an image of the subject. One or more machine learning models receive as input an image of a subject and output both facial landmarks and associated confidence values. Confidence values represent the degrees to which the portions of the subject's face corresponding to those landmarks are occluded, i.e., the amount of uncertainty in each landmark's position. These landmark points and their associated confidence values, and/or associated information, may then be input to another set of one or more machine learning models, which may output any facial analysis quantity or quantities, such as the subject's gaze direction, head pose, drowsiness state, cognitive load, or distraction state.
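    A minimal sketch of how downstream processing can exploit the per-landmark confidence values: occluded (low-confidence) landmarks are down-weighted so they contribute little to the aggregate. The weighted mean used here is an illustrative aggregation, not the patent's actual downstream machine learning models.

```python
import numpy as np

def confidence_weighted_center(landmarks, confidences):
    """Confidence-weighted mean of 2D landmark positions.

    Landmarks flagged as occluded (low confidence) are down-weighted,
    so an unreliable detection barely affects the aggregate estimate.
    """
    w = np.asarray(confidences, dtype=float)
    w = w / w.sum()                              # normalize the weights
    return (np.asarray(landmarks) * w[:, None]).sum(axis=0)

lms = [[0.0, 0.0], [2.0, 0.0], [100.0, 100.0]]   # third landmark occluded/noisy
conf = [1.0, 1.0, 0.0]                           # zero confidence for the occlusion
center = confidence_weighted_center(lms, conf)
print(center)  # the outlier is ignored → [1. 0.]
```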

    3D human body pose estimation using a model trained from unlabeled multi-view data

    Publication number: US11417011B2

    Publication date: 2022-08-16

    Application number: US16897057

    Filing date: 2020-06-09

    Abstract: Learning to estimate a 3D body pose (and, likewise, the pose of any type of object) from a single 2D image is of great interest for many practical graphics applications, and generally relies on neural networks that have been trained with sample data annotating (labeling) each sample 2D image with a known 3D pose. Requiring this labeled training data, however, has various drawbacks, including that traditionally used training data sets lack diversity and therefore limit the extent to which neural networks are able to estimate 3D pose. Expanding these training data sets is also difficult, since it requires manually provided annotations for 2D images, which is time-consuming and prone to errors. The present disclosure overcomes these and other limitations of existing techniques by providing a model that is trained from unlabeled multi-view data for use in 3D pose estimation.