-
公开(公告)号:US11829730B2
公开(公告)日:2023-11-28
申请号:US17940654
申请日:2022-09-08
Inventor: Nitin Chawla , Tanmoy Roy , Anuj Grover , Giuseppe Desoli
CPC classification number: G06F7/57 , G06F3/0604 , G06F3/0659 , G06F3/0673 , G06N3/063
Abstract: A memory array arranged in multiple columns and rows. Computation circuits that each calculate a computation value from cell values in a corresponding column. A column multiplexer cycles through multiple data lines that each corresponds to a computation circuit. Cluster cycle management circuitry determines a number of multiplexer cycles based on a number of columns storing data of a compute cluster. A sensing circuit obtains the computation values from the computation circuits via the column multiplexer as the column multiplexer cycles through the data lines. The sensing circuit combines the obtained computation values over the determined number of multiplexer cycles. A first clock may initiate the multiplexer to cycle through its data lines for the determined number of multiplexer cycles, and a second clock may initiate each individual cycle. The multiplexer or additional circuitry may be utilized to modify the order in which data is written to the columns.
-
公开(公告)号:US11687762B2
公开(公告)日:2023-06-27
申请号:US16280991
申请日:2019-02-20
Inventor: Surinder Pal Singh , Thomas Boesch , Giuseppe Desoli
CPC classification number: G06N3/063 , G06F1/26 , G06F17/16 , G06F17/175 , G06N3/045 , G06N3/08 , G06N20/00
Abstract: Embodiments of a device include an integrated circuit, a reconfigurable stream switch formed in the integrated circuit along with a plurality of convolution accelerators and an arithmetic unit coupled to the reconfigurable stream switch. The arithmetic unit has at least one input and at least one output. The at least one input is arranged to receive streaming data passed through the reconfigurable stream switch, and the at least one output is arranged to stream resultant data through the reconfigurable stream switch. The arithmetic unit also has a plurality of data paths. At least one of the plurality of data paths is solely dedicated to performance of operations that accelerate an activation function represented in the form of a piece-wise second order polynomial approximation.
-
公开(公告)号:US11593609B2
公开(公告)日:2023-02-28
申请号:US16794062
申请日:2020-02-18
Inventor: Giuseppe Desoli , Carmine Cappetta , Thomas Boesch , Surinder Pal Singh , Saumya Suneja
Abstract: Embodiments of an electronic device include an integrated circuit, a reconfigurable stream switch formed in the integrated circuit along with a plurality of convolution accelerators and a decompression unit coupled to the reconfigurable stream switch. The decompression unit decompresses encoded kernel data in real time during operation of convolutional neural network.
-
公开(公告)号:US10402527B2
公开(公告)日:2019-09-03
申请号:US15423289
申请日:2017-02-02
Inventor: Thomas Boesch , Giuseppe Desoli
Abstract: Embodiments are directed towards a reconfigurable stream switch formed in an integrated circuit. The stream switch includes a plurality of output ports, a plurality of input ports, and a plurality of selection circuits. The output ports each have an output port architectural composition, and each is arranged to unidirectionally pass output data and output control information. The input ports each have an input port architectural composition, and each is arranged to unidirectionally receive first input data and first input control information. Each one of the selection circuits is coupled to an associated one of the output ports. Each selection circuit is further coupled to all of the input ports such that each selection circuit is arranged to reconfigurably couple its associated output port to no more than one input port at any given time.
-
公开(公告)号:US11836346B2
公开(公告)日:2023-12-05
申请号:US17742987
申请日:2022-05-12
Inventor: Nitin Chawla , Giuseppe Desoli , Anuj Grover , Thomas Boesch , Surinder Pal Singh , Manuj Ayodhyawasi
CPC classification number: G06F3/0604 , G06F3/0619 , G06F3/0655 , G06F3/0679 , G06N3/08
Abstract: A memory array arranged as a plurality of memory cells. The memory cells are configured to operate at a determined voltage. A memory management circuitry coupled to the plurality of memory cells tags a first set of the plurality of memory cells as low-voltage cells and tags a second set of the plurality of memory cells as high-voltage cells. A power source provides a low voltage to the first set of memory cells and provides a high voltage to the second set of memory cells based on the tags.
-
公开(公告)号:US11823771B2
公开(公告)日:2023-11-21
申请号:US17158875
申请日:2021-01-26
Inventor: Nitin Chawla , Thomas Boesch , Anuj Grover , Surinder Pal Singh , Giuseppe Desoli
Abstract: A system includes a random access memory organized into individually addressable words. Streaming access control circuitry is coupled to word lines of the random access memory. The streaming access control circuitry responds to a request to access a plurality of individually addressable words of a determined region of the random access memory by generating control signals to drive the word lines to streamingly access the plurality of individually addressable words of the determined region. The request indicates an offset associated with the determined region and a pattern associated with the streaming access.
-
公开(公告)号:US11710032B2
公开(公告)日:2023-07-25
申请号:US18055245
申请日:2022-11-14
Inventor: Surinder Pal Singh , Thomas Boesch , Giuseppe Desoli
CPC classification number: G06N3/063 , G06F17/18 , G06F18/217 , G06N3/04 , G06N3/08
Abstract: A convolutional neural network includes a pooling unit. The pooling unit performs pooling operations between convolution layers of the convolutional neural network. The pooling unit includes hardware blocks that promote computational and area efficiency in the convolutional neural network.
-
公开(公告)号:US11562115B2
公开(公告)日:2023-01-24
申请号:US15423284
申请日:2017-02-02
Inventor: Thomas Boesch , Giuseppe Desoli
IPC: G06N3/04 , G06F30/327 , G06N20/10 , G06N3/08 , G06F30/34 , G06N20/00 , G06N7/00 , G06F115/08 , G06N3/063 , G06F9/445 , G06F13/40 , G06F15/78
Abstract: Embodiments are directed towards a configurable accelerator framework device that includes a stream switch and a plurality of convolution accelerators. The stream switch has a plurality of input ports and a plurality of output ports. Each of the input ports is configurable at run time to unidirectionally pass data to any one or more of the output ports via a stream link. Each one of the plurality of convolution accelerators is configurable at run time to unidirectionally receive input data via at least two of the plurality of stream switch output ports, and each one of the plurality of convolution accelerators is further configurable at run time to unidirectionally communicate output data via an input port of the stream switch.
-
公开(公告)号:US11531873B2
公开(公告)日:2022-12-20
申请号:US16909673
申请日:2020-06-23
Inventor: Thomas Boesch , Giuseppe Desoli , Surinder Pal Singh , Carmine Cappetta
Abstract: Techniques and systems are provided for implementing a convolutional neural network. One or more convolution accelerators are provided that each include a feature line buffer memory, a kernel buffer memory, and a plurality of multiply-accumulate (MAC) circuits arranged to multiply and accumulate data. In a first operational mode the convolutional accelerator stores feature data in the feature line buffer memory and stores kernel data in the kernel data buffer memory. In a second mode of operation, the convolutional accelerator stores kernel decompression tables in the feature line buffer memory.
-
公开(公告)号:US11507831B2
公开(公告)日:2022-11-22
申请号:US16799671
申请日:2020-02-24
Inventor: Surinder Pal Singh , Thomas Boesch , Giuseppe Desoli
Abstract: A convolutional neural network includes a pooling unit. The pooling unit performs pooling operations between convolution layers of the convolutional neural network. The pooling unit includes hardware blocks that promote computational and area efficiency in the convolutional neural network.
-
-
-
-
-
-
-
-
-