专利检索 ap:("Meta Platforms, Inc.") AND inv:"Thomas Mark Ulrich" 第 2 页

11.

发明申请
DEVICE AND METHOD FOR FLEXIBLY SUMMING MATRIX VALUES 有权

公开(公告)号：US20220374499A1

公开(公告)日：2022-11-24

申请号：US17834203

申请日：2022-06-07

申请人： Meta Platforms, Inc.

发明人： Krishnakumar Narayanan Nair , Thomas Mark Ulrich , Ehsan Khish Ardestani Zadeh

IPC分类号： G06F17/16 , G06F7/78

摘要： A device includes a matrix transpose component, a matrix processing component, a data alignment component, and a data reduction component. The matrix transpose component is configured to transpose an input matrix of elements to output an output matrix of the elements that have been transposed. The matrix processing component is configured to multiply a first multiplication input matrix with a second multiplication input matrix, wherein the output matrix of the matrix transpose component is utilized as the first multiplication input matrix and a mask vector is utilized as the second multiplication input matrix. The data alignment component is configured to modify at least a portion of elements of a result of the matrix processing component. The data reduction component is configured to sum at least the elements of the modified result of the matrix processing component to determine a sum of the group of values.

12.

发明授权
Bypassing zero-value multiplications in a hardware multiplier 有权

公开(公告)号：US11614920B2

公开(公告)日：2023-03-28

申请号：US16869288

申请日：2020-05-07

申请人： Meta Platforms, Inc.

发明人： Thomas Mark Ulrich , Abdulkadir Utku Diril , Zhao Wang

IPC分类号： G06F7/575 , G06F7/483 , G06F7/523 , G06F17/16 , G06N3/02

摘要： A device (e.g., integrated circuit chip) includes a first operand register, a second operand register, a multiplication unit, and a hardware logic component. The first operand register is configured to store a first operand value. The second operand register is configured to store a second operand value. The multiplication unit is configured to at least multiply the first operand value with the second operand value. The hardware logic component is configured to detect whether a zero value is provided and in response to a detection that the zero value is being provided: cause an update of at least the first operand register to be disabled, and cause a result of a multiplication of the first operand value with the second operand value to be a zero-value result.

13.

发明授权
Systems and methods for reducing power consumption of convolution operations of artificial neural networks 有权

公开(公告)号：US11599181B1

公开(公告)日：2023-03-07

申请号：US16725331

申请日：2019-12-23

申请人： Meta Platforms, Inc.

发明人： Krishnakumar Narayanan Nair , Abdulkadir Utku Diril , Yuchen Hao , Thomas Mark Ulrich , Rakesh Komuravelli , Ehsan Khish Ardestani Zadeh , Martin Schatz

IPC分类号： G06F1/3234 , G06F12/0875 , G06N3/063 , G06F17/16 , G06N3/04

摘要： A computer-implemented method may include (1) maintaining (a) a filter matrix in a filter cache included in a local memory device (LMD) included in a hardware accelerator, and (b) a plurality of activation matrices corresponding to different rows of an activation volume in an activation cache included in the LMD, (2) for each activation matrix, directing a matrix multiplication unit (MMU) included in the hardware accelerator to execute a matrix multiplication operation (MMU) using the filter matrix and the activation matrix, (3) loading an additional filter matrix into the filter cache, and (4) directing the MMU to execute a plurality of additional MMOs, each additional MMO using one filter matrix included in the filter cache and one activation matrix included in the activation cache, such that the MMU reuses the filter matrix for at least one additional MMO and uses the additional filter matrix for a different additional MMO.

14.

发明申请
HIGH THROUGHPUT MATRIX PROCESSOR WITH SUPPORT FOR CONCURRENTLY PROCESSING MULTIPLE MATRICES 有权

公开(公告)号：US20230004624A1

公开(公告)日：2023-01-05

申请号：US17855391

申请日：2022-06-30

申请人： Meta Platforms, Inc.

发明人： Krishnakumar Narayanan Nair , Olivia Wu , Ehsan Khish Ardestani Zadeh , Abdulkadir Utku Diril , Thomas Mark Ulrich , Yuchen Hao , Rakesh Komuravelli , Aravind Kalaiah

IPC分类号： G06F17/16 , G06F7/544 , G06F17/15

摘要： A system comprises a data input vector unit, a weight input vector unit, and a plurality of calculation units. The data input vector unit is configured to concurrently receive elements of different rows of a first and second data matrix. The weight input vector unit is configured to receive a combined weight vector and at least in part concurrently provide obtained weight elements of a first and second weight matrix to a corresponding first and second group of calculation units. At least one calculation unit of each group of the first and second group of calculation units is configured to multiply elements from the data input vector unit with corresponding elements of the corresponding weight matrix from the weight input vector unit and sum together multiplication results of the corresponding calculation unit to at least in part determine a corresponding element in a first or second convolution result matrix.

15.

发明授权
Mapping convolution to a channel convolution engine 有权

公开(公告)号：US11537865B2

公开(公告)日：2022-12-27

申请号：US16793961

申请日：2020-02-18

申请人： Meta Platforms, Inc.

发明人： Krishnakumar Narayanan Nair , Rakesh Komuravelli , Abdulkadir Utku Diril , Ehsan Khish Ardestani Zadeh , Yuchen Hao , Martin Schatz , Thomas Mark Ulrich , Olivia Wu , Anup Ramesh Kadkol , Amin Firoozshahian

IPC分类号： G06N3/063 , G06N3/08 , G06F17/16 , G06F9/30

摘要： A processor system comprises a first and second group of registers and a hardware channel convolution processor unit. The first group of registers is configured to store data elements of channels of a portion of a convolution data matrix. Each register stores at least one data element from each channel. The second group of registers is configured to store data elements of convolution weight matrices including a separate convolution weight matrix for each channel. Each register stores at least one data element from each convolution weight matrix. The hardware channel convolution processor unit is configured to multiply each data element in the first group of registers with a corresponding data element in the second group of registers and sum together the multiplication results for each specific channel to determine corresponding channel convolution result data elements in a corresponding channel convolution result matrix.

16.

发明授权
Device and method for flexibly summing matrix values 有权

公开(公告)号：US11379557B2

公开(公告)日：2022-07-05

申请号：US16869303

申请日：2020-05-07

申请人： Meta Platforms, Inc.

发明人： Krishnakumar Narayanan Nair , Thomas Mark Ulrich , Ehsan Khish Ardestani Zadeh

IPC分类号： G06F17/16 , G06F7/78

摘要： A device includes a matrix transpose component, a matrix processing component, a data alignment component, and a data reduction component. The matrix transpose component is configured to transpose an input matrix of elements to output an output matrix of the elements that have been transposed. The matrix processing component is configured to multiply a first multiplication input matrix with a second multiplication input matrix, wherein the output matrix of the matrix transpose component is utilized as the first multiplication input matrix and a mask vector is utilized as the second multiplication input matrix. The data alignment component is configured to modify at least a portion of elements of a result of the matrix processing component. The data reduction component is configured to sum at least the elements of the modified result of the matrix processing component to determine a sum of the group of values.

17.

发明申请
USING A LOW-BIT-WIDTH DOT PRODUCT ENGINE TO SUM HIGH-BIT-WIDTH NUMBERS 有权

公开(公告)号：US20230056304A1

公开(公告)日：2023-02-23

申请号：US17894431

申请日：2022-08-24

申请人： Meta Platforms, Inc.

发明人： Thomas Mark Ulrich , Krishnakumar Narayanan Nair , Ehsan Khish Ardestani Zadeh

IPC分类号： G06F7/544 , G06F7/483

摘要： A system includes a vector multiplier configured to multiply a first vector of integer elements with a second vector of integer elements to determine a resulting vector of integer elements, wherein integer elements of the first and second vectors of integer elements are represented using a first number of bits and an integer element of the first vector of integer elements represents a portion of a value of a group of values. The system further includes a vector adder configured to add together the integer elements of the resulting vector of integer elements to determine a summed result, a bit shifter configured to shift bits of the summed result leftward, and an accumulator configured to determine an accumulated output sum that includes the leftward-shifted summed result.

18.

发明授权
Support for different matrix multiplications by selecting adder tree intermediate results 有权

公开(公告)号：US11520854B2

公开(公告)日：2022-12-06

申请号：US16667700

申请日：2019-10-29

申请人： Meta Platforms, Inc.

发明人： Yuchen Hao , Krishnakumar Narayanan Nair , Ehsan Khish Ardestani Zadeh , Rakesh Komuravelli , Abdulkadir Utku Diril , Thomas Mark Ulrich

IPC分类号： G06F17/16

摘要： A first group of elements is element-wise multiplied with a second group of elements using a plurality of multipliers belonging to a matrix multiplication hardware unit. Results of the plurality of multipliers are added together using a hierarchical tree of adders belonging to the matrix multiplication hardware unit and a final result of the hierarchical tree of adders or any of a plurality of intermediate results of the hierarchical tree of adders is selectively provided for use in determining an output result matrix. A control unit is used to instruct the matrix multiplication hardware unit to perform a plurality of different matrix multiplications in parallel by using a combined matrix that includes elements of a plurality of different operand matrices and utilize one or more selected ones of the intermediate results of the hierarchical tree of adders for use in determining the output result matrix that includes different groups of elements representing different multiplication results corresponding to different ones of the different operand matrices.

19.

发明授权
Hardware for floating-point arithmetic in multiple formats 有权

公开(公告)号：US11275560B2

公开(公告)日：2022-03-15

申请号：US16795097

申请日：2020-02-19

申请人： Meta Platforms, Inc.

发明人： Thomas Mark Ulrich , Abdulkadir Utku Diril , Krishnakumar Narayanan Nair , Zhao Wang , Rakesh Komuravelli

IPC分类号： G06F7/487 , G06F7/485 , H03M7/24

摘要： A floating-point number in a first format representation is received. Based on an identification of a floating-point format type of the floating-point number, different components of the first format representation are identified. The different components of the first format representation are placed in corresponding components of a second format representation of the floating-point number, wherein a total number of bits of the second format representation is larger than a total number of bits of the first format representation. At least one of the components of the second format representation is padded with one or more zero bits. The floating-point number in the second format representation is stored in a register. A multiplication using the second format representation of the floating-point number is performed.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类