专利检索 ap:("NVIDIA Corporation") AND inv:"Mark Hummel" 第 1 页

1.

发明授权
Self-synchronizing remote memory operations in a multiprocessor system 有权

公开(公告)号：US12105960B2

公开(公告)日：2024-10-01

申请号：US17900808

申请日：2022-08-31

申请人： NVIDIA CORPORATION

发明人： Srinivas Santosh Kumar Madugula , Olivier Giroux , Wishwesh Anil Gandhi , Michael Allen Parker , Raghuram L , Ivan Tanasic , Manan Patel , Mark Hummel , Alexander L. Minkin

IPC分类号： G06F3/06

CPC分类号： G06F3/0611 , G06F3/0659 , G06F3/0673

摘要： Various embodiments include techniques for performing self-synchronizing remote memory operations in a multiprocessor computing system. During a remote memory operation in the multiprocessor computing system, a source processing unit transmits multiple segments of data to a destination processing. For each segment of data, the source processing unit transmits a remote memory operation to the destination processing unit that includes associated metadata that identifies the memory location of a corresponding synchronization object. The remote memory operation along with the metadata is transmitted as a single unit to the destination processing unit. The destination processing unit splits the operation into the remote memory operation and the memory synchronization operation. As a result, the source processing unit avoids the need to perform a separate memory synchronization operation, thereby reducing inter-processor communications and increasing performance of remote memory operations.

2.

发明公开
MULTICAST-REDUCTION ASSISTED BY NETWORK DEVICES 审中-公开

公开(公告)号：US20240137410A1

公开(公告)日：2024-04-25

申请号：US18545339

申请日：2023-12-19

申请人： NVIDIA Corporation

发明人： Glenn Dearth , Mark Hummel , Nan Jiang , Gregory Thorson

IPC分类号： H04L67/1008 , H04L47/70 , H04L47/80 , H04L67/1014

CPC分类号： H04L67/1008 , H04L47/806 , H04L47/827 , H04L67/1014

摘要： Systems and techniques for performing multicast-reduction operations. In at least one embodiment, a network device receives first network data associated with a multicast operation to be collectively performed by at least a plurality of endpoints. The network device reserves resources to process second network data to be received from the endpoints, and sends the first network data to a plurality of additional network devices. The network device receives the second network data, and processes the second network data using the reserved resources.

3.

发明授权
Techniques for an efficient fabric attached memory 有权

公开(公告)号：US11822491B2

公开(公告)日：2023-11-21

申请号：US17506438

申请日：2021-10-20

申请人： NVIDIA Corporation

发明人： John Feehrer , Denis Foley , Mark Hummel , Vyas Venkataraman , Ram Gummadi , Samuel H. Duncan , Glenn Dearth , Brian Kelleher

IPC分类号： G06F13/16 , G06F13/40 , G06F9/455 , G06N20/00 , G06F17/16 , G06F12/1027

CPC分类号： G06F13/1652 , G06F9/45558 , G06F12/1027 , G06F13/1668 , G06F13/4022 , G06F17/16 , G06N20/00 , G06F2009/45583

摘要： Fabric Attached Memory (FAM) provides a pool of memory that can be accessed by one or more processors, such as a graphics processing unit(s) (GPU)(s), over a network fabric. In one instance, a technique is disclosed for using imperfect processors as memory controllers to allow memory, which is local to the imperfect processors, to be accessed by other processors as fabric attached memory. In another instance, memory address compaction is used within the fabric elements to fully utilize the available memory space.

4.

发明申请
NETWORK ROUTING USING AGGREGATED LINKS 有权

公开(公告)号：US20210014156A1

公开(公告)日：2021-01-14

申请号：US16700611

申请日：2019-12-02

申请人： Nvidia Corporation

发明人： Glenn Dearth , Mark Hummel

IPC分类号： H04L12/709 , H04L12/721 , H04L12/751 , H04L12/707

摘要： Introduced herein is a routing technique that, for example, routes a transaction to a destination port over a network that supports link aggregation and multi-port connection. In one embodiment, two tables that can be searched based on the target and supplemental routing IDs of the transaction are utilized to route the transaction to the proper port of the destination endpoint. In an embodiment, the first table provides a list of available ports at each hop/route point that can route the transaction to the destination endpoint, and the second table provides a supplemental routing ID that can select a specific group of ports from the first table that can correctly route the transaction to the proper port.

5.

发明授权
Multicast-reduction assisted by network devices 有权

公开(公告)号：US11956306B1

公开(公告)日：2024-04-09

申请号：US17709111

申请日：2022-03-30

申请人： NVIDIA Corporation

发明人： Glenn Dearth , Mark Hummel , Nan Jiang , Gregory Thorson

IPC分类号： H04L47/70 , H04L47/80 , H04L67/1008 , H04L67/1014

CPC分类号： H04L67/1008 , H04L47/806 , H04L47/827 , H04L67/1014

摘要： Systems and techniques for performing multicast-reduction operations. In at least one embodiment, a network device receives first network data associated with a multicast operation to be collectively performed by at least a plurality of endpoints. The network device reserves resources to process second network data to be received from the endpoints, and sends the first network data to a plurality of additional network devices. The network device receives the second network data, and processes the second network data using the reserved resources.

6.

发明公开
MULTICAST AND REFLECTIVE MEMORY BEHAVIOR FOR MEMORY MODEL CONSISTENCY 审中-公开

公开(公告)号：US20230229599A1

公开(公告)日：2023-07-20

申请号：US17578266

申请日：2022-01-18

申请人： NVIDIA Corporation

发明人： Glenn Alan Dearth , Mark Hummel , Daniel Joseph Lustig

IPC分类号： G06F12/1045 , G06F12/02 , G06F13/16

CPC分类号： G06F12/1063 , G06F12/1054 , G06F12/0238 , G06F13/1668

摘要： In various examples, a memory model may support multicasting where a single request for a memory access operation may be propagated to multiple physical addresses associated with multiple processing elements (e.g., corresponding to respective local memory). Thus, the request may cause data to be read from and/or written to memory for each of the processing elements. In some examples, a memory model exposes multicasting to processes. This may include providing for separate multicast and unicast instructions or shared instructions with one or more parameters (e.g., indicating a virtual address) being used to indicate multicasting or unicasting. Additionally or alternatively, whether a request(s) is processed using multicasting or unicasting may be opaque to a process and/or application or may otherwise be determined by the system. One or more constraints may be imposed on processing requests using multicasting to maintain a coherent memory interface.

7.

发明授权
Techniques for efficiently synchronizing data transmissions on a network 有权

公开(公告)号：US10789194B2

公开(公告)日：2020-09-29

申请号：US16364565

申请日：2019-03-26

申请人： NVIDIA Corporation

发明人： Larry R. Dennison , Mark Hummel , Glenn Dearth

IPC分类号： G06F13/40 , G06F3/06 , G06F13/00 , G06F9/38 , G06F12/0891

摘要： Systems and techniques for synchronizing transactions between processing devices on an interconnection network are provided. Upon receiving a stream of posted transactions followed by a flush transaction from a source processing device connected to the interconnection network, the flush transaction is trapped before it enters the interconnecting network. Subsequently, based on monitoring for responses received from a destination processing device for transactions corresponding to the posted transactions, a flush response is generated and returned to the source processing device. The described techniques enable efficient synchronizing posted writes, posted atomics and the like over complex interconnection fabrics such that a first GPU can write data to a second GPU so that a third GPU can safely consume the data written to the second GPU.

8.

发明授权
System and method for early packet header verification 有权

公开(公告)号：US09720768B2

公开(公告)日：2017-08-01

申请号：US14875880

申请日：2015-10-06

申请人： Nvidia Corporation

发明人： Stephen D. Glaser , Eric Tyson , Mark Hummel , Michael Osborn , Jonathan Owen , Marvin Denman , Dennis Ma , Denis Foley

IPC分类号： H03M13/00 , G06F11/10 , H03M13/09

CPC分类号： H04L1/0061 , G06F11/1004 , H03M13/09 , H04L1/0045 , H04L1/008

摘要： A receiver, transmitter and method for early packet header verification are provided. In one embodiment, the method includes: (1) receiving a payload flit of a preceding packet and a header flit of a current packet; and (2) using a Cyclic Redundancy Check (CRC) in the header flit to verify the payload flit of the preceding packet and the header flit of the current packet.

9.

发明申请
SYSTEM AND METHOD FOR EARLY PACKET HEADER VERIFICATION 有权

公开(公告)号：US20170097867A1

公开(公告)日：2017-04-06

申请号：US14875880

申请日：2015-10-06

申请人： Nvidia Corporation

发明人： Stephen D. Glaser , Eric Tyson , Mark Hummel , Michael Osborn , Jonathan Owen , Marvin Denman , Dennis Ma , Denis Foley

IPC分类号： G06F11/10 , H03M13/09

CPC分类号： H04L1/0061 , G06F11/1004 , H03M13/09 , H04L1/0045 , H04L1/008

摘要： A receiver, transmitter and method for early packet header verification are provided. In one embodiment, the method includes: (1) receiving a payload flit of a preceding packet and a header flit of a current packet; and (2) using a Cyclic Redundancy Check (CRC) in the header flit to verify the payload flit of the preceding packet and the header flit of the current packet.

10.

发明公开
MULTICAST-REDUCTION ASSISTED BY NETWORK DEVICES 审中-公开

公开(公告)号：US20240098139A1

公开(公告)日：2024-03-21

申请号：US17709111

申请日：2022-03-30

申请人： NVIDIA Corporation

发明人： Glenn Dearth , Mark Hummel , Nan Jiang , Gregory Thorson

IPC分类号： H04L67/1008 , H04L67/1012 , H04L67/1014

CPC分类号： H04L67/1008 , H04L67/1012 , H04L67/1014

摘要： Systems and techniques for performing multicast-reduction operations. In at least one embodiment, a network device receives first network data associated with a multicast operation to be collectively performed by at least a plurality of endpoints. The network device reserves resources to process second network data to be received from the endpoints, and sends the first network data to a plurality of additional network devices. The network device receives the second network data, and processes the second network data using the reserved resources.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类