    1. Network switch with integrated gradient aggregation for distributed machine learning

    Publication No.: US12236323B1

    Publication Date: 2025-02-25

    Application No.: US18217483

    Filing Date: 2023-06-30

    Applicant: Innovium, Inc.

    Abstract: Distributed machine learning systems and other distributed computing systems are improved by embedding compute logic at the network switch level to perform collective actions, such as reduction operations, on gradients or other data processed by the nodes of the system. The switch is configured to recognize data units that carry data associated with a collective action that needs to be performed by the distributed system, referred to herein as “compute data,” and process that data using a compute subsystem within the switch. The compute subsystem includes a compute engine that is configured to perform various operations on the compute data, such as “reduction” operations, and forward the results back to the compute nodes. The reduction operations may include, for instance, summation, averaging, bitwise operations, and so forth. In this manner, the network switch may take over some or all of the processing of the distributed system during the collective phase.
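
To make the collective step concrete, here is a minimal Python sketch of the kind of reduction a switch-resident compute engine performs on per-node gradients. The function name, the list-of-lists gradient layout, and the `op` parameter are illustrative assumptions, not the patented design:

```python
def reduce_gradients(node_gradients, op="sum"):
    """Combine per-node gradient vectors element-wise, as a compute
    engine embedded in the switch might during the collective phase.
    op="sum" adds gradients; op="mean" additionally averages them."""
    result = list(node_gradients[0])
    for grads in node_gradients[1:]:
        for i, g in enumerate(grads):
            result[i] += g
    if op == "mean":
        result = [g / len(node_gradients) for g in result]
    return result
```

The result would then be forwarded back to the compute nodes, sparing them the all-to-all exchange.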

    2. AUTOMATIC FLOW MANAGEMENT (Patent Application)

    Publication No.: US20240422104A1

    Publication Date: 2024-12-19

    Application No.: US18823281

    Filing Date: 2024-09-03

    Applicant: Innovium, Inc.

    Abstract: Packet-switching operations in a network device are managed based on the detection of excessive-rate traffic flows. A network device receives a data unit, determines the traffic flow to which the data unit belongs, and updates flow tracking information for that flow. The network device utilizes the tracking information to determine when a rate at which the network device is receiving data belonging to the flow exceeds an excessive-rate threshold and is thus an excessive-rate flow. The network device may enable one or more excessive-rate policies on an excessive-rate traffic flow. Such a policy may include any number of features that affect how the device handles data units belonging to the flow, such as excessive-rate notification, differentiated discard, differentiated congestion notification, and reprioritization. Memory and other resource optimizations for such flow tracking and management are also described.
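
The detection step can be sketched as a per-flow byte counter over a time window; a minimal Python model, where the flow-ID scheme, window length, and threshold are assumptions rather than the device's actual tracking structures:

```python
class FlowTracker:
    """Tracks per-flow byte counts in a fixed window and flags flows
    whose receive rate exceeds an excessive-rate threshold."""

    def __init__(self, threshold_bps, window_s=1.0):
        self.threshold = threshold_bps
        self.window = window_s
        self.flows = {}  # flow_id -> (window_start_time, bytes_in_window)

    def on_data_unit(self, flow_id, size_bytes, now):
        start, count = self.flows.get(flow_id, (now, 0))
        if now - start >= self.window:      # window expired: start a new one
            start, count = now, 0
        count += size_bytes
        self.flows[flow_id] = (start, count)
        # Excessive-rate if the windowed bit rate exceeds the threshold
        return count * 8 / self.window > self.threshold
```

A device would react to a `True` result by enabling a policy on the flow (notification, differentiated discard, reprioritization, and so on).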

    3. Shared traffic manager (Granted Patent)

    Publication No.: US12068972B1

    Publication Date: 2024-08-20

    Application No.: US18208648

    Filing Date: 2023-06-12

    Applicant: Innovium, Inc.

    CPC classification number: H04L47/6255 H04L49/901 H04L49/9084

    Abstract: A traffic manager is shared amongst two or more egress blocks of a network device, thereby allowing traffic management resources to be shared between the egress blocks. Schedulers within a traffic manager may generate and queue read instructions for reading buffered portions of data units that are ready to be sent to the egress blocks. The traffic manager may be configured to select a read instruction for a given buffer bank from the read instruction queues based on a scoring mechanism or other selection logic. To avoid sending too much data to an egress block during a given time slot, once a data unit portion has been read from the buffer, it may be temporarily stored in a shallow read data cache. Alternatively, a single, non-bank specific controller may determine all of the read instructions and write operations that should be executed in a given time slot.

    4. Efficient buffer utilization for network data units

    Publication No.: US11949601B1

    Publication Date: 2024-04-02

    Application No.: US17942676

    Filing Date: 2022-09-12

    Applicant: Innovium, Inc.

    CPC classification number: H04L47/786 H04L45/74 H04L47/30 H04L47/41

    Abstract: Approaches, techniques, and mechanisms are disclosed for efficiently buffering data units within a network device. A traffic manager or other network device component receives Transport Data Units (“TDUs”), which are sub-portions of Protocol Data Units (“PDUs”). Rather than buffer an entire TDU together, the component divides the TDU into multiple Storage Data Units (“SDUs”) that can fit in SDU buffer entries within physical memory banks. A TDU-to-SDU Mapping (“TSM”) memory stores TSM lists that indicate which SDU entries store SDUs for a given TDU. Physical memory banks in which the SDUs are stored may be grouped together into logical SDU banks that are accessed together as if a single bank. The TSM memory may include a number of distinct TSM banks, with each logical SDU bank having a corresponding TSM bank. Techniques for maintaining inter-packet and intra-packet linking data compatible with such buffers are also disclosed.
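
The TDU-to-SDU split and the TSM bookkeeping can be modeled in a few lines of Python. Bank grouping is omitted, and the free-entry allocation is simplified to appending; the names mirror the abstract's terminology but the data layout is an assumption:

```python
def buffer_tdu(tdu_bytes, sdu_size, sdu_buffer, tsm):
    """Split a TDU into SDU-sized chunks, store each chunk in an SDU
    buffer entry, and record the entry indices as a TSM list."""
    entries = []
    for off in range(0, len(tdu_bytes), sdu_size):
        sdu = tdu_bytes[off:off + sdu_size]
        entry = len(sdu_buffer)      # next free entry (simplified allocator)
        sdu_buffer.append(sdu)
        entries.append(entry)
    tsm.append(entries)              # TSM list: which SDU entries hold this TDU
    return len(tsm) - 1              # handle: index of the TDU's TSM list
```

Reassembly simply walks the TDU's TSM list and concatenates the referenced SDU entries.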

    5. Reconfigurable circuit devices (Granted Patent)

    Publication No.: US11924966B1

    Publication Date: 2024-03-05

    Application No.: US17402425

    Filing Date: 2021-08-13

    Applicant: Innovium, Inc.

    Abstract: Loss reduction methods are described. A first transmission loss associated with signal transmission through a trace in a first circuit board design is determined. The trace is routed from an integrated circuit disposed on a circuit board to a circuit element disposed on the circuit board. It is determined that the first transmission loss is greater than a threshold transmission loss. The first circuit board design is altered to obtain a second circuit board design. In the second circuit board design, the trace is routed from the integrated circuit to a connector disposed on the circuit board, and the connector is electrically coupled to the circuit element by a cable. A second transmission loss associated with signal transmission between the integrated circuit and the circuit element in the second circuit board design is less than the threshold transmission loss.
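
The decision flow in the abstract reduces to a loss-budget comparison; a minimal sketch, where the dB figures and route names are purely illustrative:

```python
def choose_routing(trace_loss_db, threshold_db, cable_loss_db):
    """Choose between a direct PCB trace and a connector-plus-cable
    route, following the loss-reduction flow described above."""
    if trace_loss_db <= threshold_db:
        return "trace"               # first design already meets the budget
    if cable_loss_db < threshold_db:
        return "cable"               # alter design: reroute via connector/cable
    raise ValueError("no route meets the loss budget")
```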

    6. DELAY-BASED AUTOMATIC QUEUE MANAGEMENT AND TAIL DROP

    Publication No.: US20240039852A1

    Publication Date: 2024-02-01

    Application No.: US18378522

    Filing Date: 2023-10-10

    Applicant: Innovium, Inc.

    CPC classification number: H04L47/20 H04L43/0858 H04L47/568 H04L47/32

    Abstract: Approaches, techniques, and mechanisms are disclosed for improving operations of a network switching device and/or network-at-large by utilizing queue delay as a basis for measuring congestion for the purposes of Automated Queue Management (“AQM”) and/or other congestion-based policies. Queue delay is an exact or approximate measure of the amount of time a data unit waits at a network device as a consequence of queuing, such as the amount of time the data unit spends in an egress queue while the data unit is being buffered by a traffic manager. Queue delay may be used as a substitute for queue size in existing AQM, Weighted Random Early Detection (“WRED”), Tail Drop, Explicit Congestion Notification (“ECN”), reflection, and/or other congestion management or notification algorithms. Or, a congestion score calculated based on the queue delay and one or more other metrics, such as queue size, may be used as a substitute.
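
Substituting queue delay for queue size in WRED looks roughly like the sketch below: the classic linear drop-probability ramp, but driven by delay. The delay thresholds and `max_p` value are illustrative assumptions:

```python
import random

def wred_drop(queue_delay_us, min_delay_us, max_delay_us,
              max_p=0.1, rng=random.random):
    """WRED-style early drop keyed on queue delay rather than queue
    size, as the abstract suggests. Returns True to drop the packet."""
    if queue_delay_us <= min_delay_us:
        return False                     # no congestion: never drop
    if queue_delay_us >= max_delay_us:
        return True                      # severe congestion: always drop
    # Drop probability ramps linearly between the two delay thresholds
    p = max_p * (queue_delay_us - min_delay_us) / (max_delay_us - min_delay_us)
    return rng() < p
```

The same substitution applies to ECN marking: mark instead of drop when the probabilistic test fires.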

    7. Foldable ingress buffer for network apparatuses

    Publication No.: US11888691B1

    Publication Date: 2024-01-30

    Application No.: US16933264

    Filing Date: 2020-07-20

    Applicant: Innovium, Inc.

    Inventor: Ajit Kumar Jain

    CPC classification number: H04L41/0823 G11C7/1006 H04L41/12

    Abstract: A network device implements a foldable ingress buffer for buffering data units as they are being received. The buffer is organized into a grid of memory banks, having different columns and rows. A Transport Data Unit (“TDU”) is stored interleaved across entries in multiple banks. As each portion of a TDU is received, the portion is written to a different bank of the buffer. In each column of the buffer, a full-sized TDU has portions in a number of rows equal to the number of folds in the buffer. The sum of the bank widths for each row thus need be no larger than half the maximum TDU size, which further means that the number of columns in the grid of banks may be reduced by at least half compared to non-folded approaches, with little increase in the number of rows, if any, depending on blocking and reading requirements.
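
The folding rule can be modeled as a simple placement function: successive TDU portions fill a row across the columns, then wrap into the next row (the next "fold"), so each column ends up holding one portion per fold. This is a simplified model of the interleaving, not the actual bank-addressing logic:

```python
def fold_placement(num_portions, columns, folds):
    """Map successive TDU portions to (row, column) bank coordinates
    in a folded ingress buffer. Each column receives at most `folds`
    portions of a single TDU, one per row."""
    assert num_portions <= columns * folds, "TDU exceeds buffer capacity"
    coords = []
    for i in range(num_portions):
        col = i % columns            # sweep across the columns
        row = i // columns           # wrap into the next fold/row
        coords.append((row, col))
    return coords
```

With 4 columns and 2 folds, an 8-portion TDU occupies every column in both rows, matching the abstract's claim that column count can be roughly halved versus a non-folded layout.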

    8. Distributed artificial intelligence extension modules for network switches

    Publication No.: US11516149B1

    Publication Date: 2022-11-29

    Application No.: US17367331

    Filing Date: 2021-07-03

    Applicant: Innovium, Inc.

    Abstract: Distributed machine learning systems and other distributed computing systems are improved by compute logic embedded in extension modules coupled directly to network switches. The compute logic performs collective actions, such as reduction operations, on gradients or other compute data processed by the nodes of the system. The reduction operations may include, for instance, summation, averaging, bitwise operations, and so forth. In this manner, the extension modules may take over some or all of the processing of the distributed system during the collective phase. An inline version of the module sits between a switch and the network. Data units carrying compute data are intercepted and processed using the compute logic, while other data units pass through the module transparently to or from the switch. Multiple modules may be connected to the switch, each coupled to a different group of nodes, and sharing intermediate results. A sidecar version is also described.
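
The inline module's intercept-or-pass-through behavior can be sketched in a few lines; the data-unit dictionary layout and the `is_compute` flag are assumptions standing in for whatever header classification the module actually performs:

```python
def process_data_unit(data_unit, compute_fn, forward):
    """Inline extension-module behavior: data units flagged as carrying
    compute data are processed by the compute logic; all other data
    units pass through to/from the switch unchanged."""
    if data_unit.get("is_compute"):
        data_unit["payload"] = compute_fn(data_unit["payload"])
    return forward(data_unit)
```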

    9. Distributed artificial intelligence extension modules for network switches

    Publication No.: US11057318B1

    Publication Date: 2021-07-06

    Application No.: US16552938

    Filing Date: 2019-08-27

    Applicant: Innovium, Inc.

    Abstract: Distributed machine learning systems and other distributed computing systems are improved by compute logic embedded in extension modules coupled directly to network switches. The compute logic performs collective actions, such as reduction operations, on gradients or other compute data processed by the nodes of the system. The reduction operations may include, for instance, summation, averaging, bitwise operations, and so forth. In this manner, the extension modules may take over some or all of the processing of the distributed system during the collective phase. An inline version of the module sits between a switch and the network. Data units carrying compute data are intercepted and processed using the compute logic, while other data units pass through the module transparently to or from the switch. Multiple modules may be connected to the switch, each coupled to a different group of nodes, and sharing intermediate results. A sidecar version is also described.
