专利检索 ap:("NVIDIA Corporation") AND inv:"Maying Shen" 第 1 页

1.

发明公开
SHARPNESS-AWARE MINIMIZATION FOR ROBUSTNESS IN SPARSE NEURAL NETWORKS 审中-公开

公开(公告)号：US20240127067A1

公开(公告)日：2024-04-18

申请号：US18459083

申请日：2023-08-31

申请人： NVIDIA Corporation

发明人： Annamarie Bair , Hongxu Yin , Pavlo Molchanov , Maying Shen , Jose Manuel Alvarez Lopez

IPC分类号： G06N3/082

CPC分类号： G06N3/082

摘要： Systems and methods are disclosed for improving natural robustness of sparse neural networks. Pruning a dense neural network may improve inference speed and reduces the memory footprint and energy consumption of the resulting sparse neural network while maintaining a desired level of accuracy. In real-world scenarios in which sparse neural networks deployed in autonomous vehicles perform tasks such as object detection and classification for acquired inputs (images), the neural networks need to be robust to new environments, weather conditions, camera effects, etc. Applying sharpness-aware minimization (SAM) optimization during training of the sparse neural network improves performance for out of distribution (OOD) images compared with using conventional stochastic gradient descent (SGD) optimization. SAM optimizes a neural network to find a flat minimum: a region that both has a small loss value, but that also lies within a region of low loss.

2.

发明申请
PRUNING NEURAL NETWORKS 有权

公开(公告)号：US20220292360A1

公开(公告)日：2022-09-15

申请号：US17201768

申请日：2021-03-15

申请人： NVIDIA Corporation

发明人： Maying Shen , Pavlo Molchanov , Hongxu Yin , Jose Manuel Alvarez Lopez

IPC分类号： G06N3/08 , G06N3/04 , G06F11/30 , G06N3/063

摘要： Apparatuses, systems, and techniques to remove one or more nodes of a neural network. In at least one embodiment, one or more nodes of a neural network are removed, based on, for example, whether the one or more nodes are likely to affect performance of the neural network.

3.

发明公开
AUGMENTING LEGACY NEURAL NETWORKS FOR FLEXIBLE INFERENCE 审中-公开

公开(公告)号：US20230325670A1

公开(公告)日：2023-10-12

申请号：US17820780

申请日：2022-08-18

申请人： NVIDIA Corporation

发明人： Jason Lavar Clemons , Stephen W. Keckler , Iuri Frosio , Jose Manuel Alvarez Lopez , Maying Shen

IPC分类号： G06N3/08

CPC分类号： G06N3/082

摘要： A technique for dynamically configuring and executing an augmented neural network in real-time according to performance constraints also maintains the legacy neural network execution path. A neural network model that has been trained for a task is augmented with low-compute “shallow” phases paired with each legacy phase and the legacy phases of the neural network model are held constant (e.g., unchanged) while the shallow phases are trained. During inference, one or more of the shallow phases can be selectively executed in place of the corresponding legacy phase. Compared with the legacy phases, the shallow phases are typically less accurate, but have reduced latency and consume less power. Therefore, processing using one or more of the shallow phases in place of one or more of the legacy phases enables the augmented neural network to dynamically adapt to changes in the execution environment (e.g., processing load or performance requirement).

4.

发明申请
PERFORMANCE-AWARE SIZE REDUCTION FOR NEURAL NETWORKS 有权

公开(公告)号：US20230077258A1

公开(公告)日：2023-03-09

申请号：US17398673

申请日：2021-08-10

申请人： Nvidia Corporation

发明人： Maying Shen , Pavlo Molchanov , Hongxu Yin , Lei Mao , Jianna Liu , Jose Manuel Alvarez Lopez

IPC分类号： G06N3/08 , G06N3/04 , G06F11/34

摘要： Apparatuses, systems, and techniques are presented to simplify neural networks. In at least one embodiment, one or more portions of one or more neural networks are cause to be removed based, at least in part, on one or more performance metrics of the one or more neural networks.

5.

发明公开
DYNAMIC NEURAL NETWORK MODEL SPARSIFICATION 审中-公开

公开(公告)号：US20240119291A1

公开(公告)日：2024-04-11

申请号：US18203552

申请日：2023-05-30

申请人： NVIDIA Corporation

发明人： Jose M. Alvarez Lopez , Pavlo Molchanov , Hongxu Yin , Maying Shen , Lei Mao , Xinglong Sun

IPC分类号： G06N3/082 , G06N3/0495

CPC分类号： G06N3/082 , G06N3/0495

摘要： Machine learning is a process that learns a neural network model from a given dataset, where the model can then be used to make a prediction about new data. In order to reduce the size, computation, and latency of a neural network model, a compression technique can be employed which includes model sparsification. To avoid the negative consequences of pruning a fully pretrained neural network model and on the other hand of training a sparse model in the first place without any recovery option, the present disclosure provides a dynamic neural network model sparsification process which allows for recovery of previously pruned parts to improve the quality of the sparse neural network model.

6.

发明申请
CALCULATING DATA COMPRESSION PARAMETERS 有权

公开(公告)号：US20220156982A1

公开(公告)日：2022-05-19

申请号：US16952893

申请日：2020-11-19

申请人： NVIDIA Corporation

发明人： Yerlan Idelbayev , Pavlo Molchanov , Hongxu Danny Yin , Maying Shen , Jose Manuel Alvarez Lopez

IPC分类号： G06T9/00 , G06T1/20

摘要： Apparatuses, systems, and techniques for calculating data compression parameters using codebook entry values. In at least one embodiment, one or more circuits is to calculate one or more data compression parameters based, at least in part, on at least on one or more values of the data to be compressed in relation to at least two codebook entry values.