METHOD AND APPARATUS FOR PROCESSING ARTIFICIAL NEURAL NETWORK WITH EFFICIENT MATRIX MULTIPLICATION OPERATION

    公开(公告)号:US20250077406A1

    公开(公告)日:2025-03-06

    申请号:US18804853

    申请日:2024-08-14

    Abstract: Provided is an artificial neural network processing apparatus including: first to fourth submatrix multiplication operators configured to perform a first submatrix multiplication operation and then a second submatrix multiplication operation using eight pieces of input data; a memory mapping unit configured to map at least a portion of the eight pieces of input data to the first to fourth submatrix multiplication operators with a first mapping structure for the first submatrix multiplication operation, and map at least a portion of the eight pieces of input data to the first to fourth submatrix multiplication operators with a second mapping structure for the second submatrix multiplication operation, wherein the first mapping structure and the second mapping structure have different mapping structures; and a controlling unit configured to control the memory mapping unit to be formed with the first mapping structure or the second mapping structure.

    METHOD AND APPARATUS FOR FILTERING PIXEL BLOCKS
    2.
    发明申请
    METHOD AND APPARATUS FOR FILTERING PIXEL BLOCKS 有权
    滤波像素块的方法和装置

    公开(公告)号:US20140348250A1

    公开(公告)日:2014-11-27

    申请号:US14286822

    申请日:2014-05-23

    CPC classification number: H04N19/86 H04N19/423

    Abstract: Provided is a method for a plurality of processing elements to filter a plurality of pixel blocks in a plurality of picture partitions for a single frame image. The method for filtering pixel blocks includes: checking the status of a second boundary pixel block adjacent to a picture partition boundary, the second boundary pixel block being one of a plurality of pixel blocks in a second picture partition and neighboring a first boundary pixel block in a first picture partition, the first boundary pixel block neighboring the picture partition boundary; selecting a filtering area for the first boundary pixel block based on the status of the second boundary pixel block; and filtering the filtering area for the first boundary pixel block.

    Abstract translation: 提供了一种用于多个处理元件的滤波方法,用于对用于单帧图像的多个图像分区中的多个像素块进行滤波。 用于滤波像素块的方法包括:检查与图像分区边界相邻的第二边界像素块的状态,第二边界像素块是第二图像分区中的多个像素块中的一个,并且邻近第一边界像素块 第一图像分区,与图像分区边界相邻的第一边界像素块; 基于第二边界像素块的状态为第一边界像素块选择滤波区域; 并对第一边界像素块的滤波区域进行滤波。

    NEURAL NETWORK ACCELERATOR WITH SYSTOLIC ARRAY STRUCTURE

    公开(公告)号:US20200175355A1

    公开(公告)日:2020-06-04

    申请号:US16677835

    申请日:2019-11-08

    Abstract: A neural network accelerator in which processing elements are configured in a systolic array structure includes a memory to store a plurality of feature data including first and second feature data and a plurality of kernel data including first and second kernel data, a first processing element to perform an operation based on the first feature data and the first kernel data and output the first feature data, a selection circuit to select one of the first feature data and the second feature data, based on a control signal, and output the selected feature data, a second processing element to perform an operation based on the selected feature data and one of the first and the second kernel data, and a controller to generate the control signal, based on a neural network characteristic associated with the plurality of feature data and kernel data.

    APPARATUS FOR INSTRUCTION GENERATION FOR ARTIFICIAL INTELLIGENCE PROCESSOR AND OPTIMIZATION METHOD THEREOF

    公开(公告)号:US20210312281A1

    公开(公告)日:2021-10-07

    申请号:US17217777

    申请日:2021-03-30

    Inventor: Hyun Mi KIM

    Abstract: An apparatus for automatically generating instructions for an artificial intelligence processor and a method for optimizing the same are provided. The method includes: obtaining a combination of conditions for actions performed by the artificial intelligence processor in consideration of optimization condition information for the actions based on model optimization information that optimizes a neural network model to which the artificial intelligence processor is applied and configuration information of the artificial intelligence processor; generating hardware modeling based on the combination of conditions and predicting a performance value through the hardware modeling; and determining an optimal combination of conditions by comparing the predicted performance value and a preset optimal performance value.

    NEURAL NETWORK COMPUTING DEVICE AND OPERATION METHOD THEREOF

    公开(公告)号:US20190220739A1

    公开(公告)日:2019-07-18

    申请号:US16225729

    申请日:2018-12-19

    CPC classification number: G06N3/08 G06F7/523

    Abstract: Provided is a neural network computing device including a neural network memory configured to store input data, a kernel memory configured to store kernel data corresponding to the input data, a kernel data controller configured to determine whether or not a first part of the kernel data matches a predetermined bit string, and if the first part matches the predetermined bit string, configured to generate a plurality of specific data based on a second part of the kernel data, and a neural core configured to perform a first operation between one of the plurality of specific data and the input data.

Patent Agency Ranking