-
公开(公告)号:US20240354260A1
公开(公告)日:2024-10-24
申请号:US18762987
申请日:2024-07-03
IPC分类号: G06F12/1045 , G06F7/24 , G06F7/487 , G06F7/499 , G06F7/53 , G06F7/57 , G06F9/30 , G06F9/32 , G06F9/345 , G06F9/38 , G06F9/48 , G06F11/00 , G06F11/10 , G06F12/0862 , G06F12/0875 , G06F12/0897 , G06F12/1009 , G06F15/78 , G06F17/16 , H03H17/06
CPC分类号: G06F12/1045 , G06F7/24 , G06F7/487 , G06F7/4876 , G06F7/49915 , G06F7/53 , G06F7/57 , G06F9/3001 , G06F9/30014 , G06F9/30021 , G06F9/30032 , G06F9/30036 , G06F9/30065 , G06F9/30072 , G06F9/30098 , G06F9/30112 , G06F9/30145 , G06F9/30149 , G06F9/3016 , G06F9/32 , G06F9/345 , G06F9/3802 , G06F9/3818 , G06F9/383 , G06F9/3836 , G06F9/3851 , G06F9/3856 , G06F9/3867 , G06F9/3887 , G06F9/48 , G06F11/00 , G06F11/1048 , G06F12/0862 , G06F12/0875 , G06F12/0897 , G06F12/1009 , G06F17/16 , H03H17/0664 , G06F9/30018 , G06F9/325 , G06F9/381 , G06F9/3822 , G06F11/10 , G06F15/7807 , G06F15/781 , G06F2212/452 , G06F2212/60 , G06F2212/602 , G06F2212/68
摘要: A method for sorting of a vector in a processor is provided that includes performing, by the processor in response to a vector sort instruction, sorting of values stored in lanes of the vector to generate a sorted vector, wherein the values are sorted in an order indicated by the vector sort instruction, and storing the sorted vector in a storage location.
-
2.
公开(公告)号:US20240338178A1
公开(公告)日:2024-10-10
申请号:US18298315
申请日:2023-04-10
申请人: EDGECORTIX INC
CPC分类号: G06F7/5443 , G06F7/4983 , G06F7/49915
摘要: Multiple-precision multiply-and-accumulate operation is performed by a multiply-and-accumulate (MAC) unit configured to operate in an integer mode to perform computations on first data-width integer values to produce third data-width integer values and configured to operate in a floating point mode to perform computations on second data-width floating point values to produce third data-width floating point values, wherein the second data width is twice the first data width and the third data width is larger than the second data width. The MAC unit includes a first multiplier configured to multiply two integer values in the integer mode or multiply mantissa values extracted from each of two floating point values in the floating point mode. The MAC unit further includes a second multiplier, and is further configured to multiply two integer values in the integer mode or refrain from using the second multiplier in the floating point mode.
-
公开(公告)号:US20240296010A1
公开(公告)日:2024-09-05
申请号:US18591349
申请日:2024-02-29
申请人: Graphcore Limited
发明人: Thomas BROWN
CPC分类号: G06F7/49915 , G06F7/523 , G06F7/556
摘要: A processing unit is provided with circuitry enabling execution quick evaluation of an exponential function. A multiplier circuit is used to multiply the input operand by log2(e), such that a result for the exponential function may be determined by evaluating 2i+f, where i is an integer part of a fixed-point number and f is a fractional part of the fixed-point number. A lookup table is used for providing an estimate for 2f based on the l MSBs of f. The lookup entries are provided according to a function such that the estimates for 2f are provided without bias towards either zero or infinity in the result. In other words, the maximum multiplicative error for each entry of the lookup table is the same in both negative and positive directions. In this way, statistical errors in the evaluation of a large number of exponential functions may be avoided.
-
公开(公告)号:US12072812B2
公开(公告)日:2024-08-27
申请号:US17237391
申请日:2021-04-22
发明人: Timothy D. Anderson , Joseph Zbiciak , Duc Quang Bui , Abhijeet Chachad , Kai Chirca , Naveen Bhoria , Matthew D. Pierson , Daniel Wu , Ramakrishnan Venkatasubramanian
IPC分类号: G06F9/30 , G06F7/24 , G06F7/487 , G06F7/499 , G06F7/53 , G06F7/57 , G06F9/32 , G06F9/345 , G06F9/38 , G06F9/48 , G06F11/00 , G06F11/10 , G06F12/0862 , G06F12/0875 , G06F12/0897 , G06F12/1009 , G06F12/1045 , G06F17/16 , H03H17/06 , G06F15/78
CPC分类号: G06F12/1045 , G06F7/24 , G06F7/487 , G06F7/4876 , G06F7/49915 , G06F7/53 , G06F7/57 , G06F9/3001 , G06F9/30014 , G06F9/30021 , G06F9/30032 , G06F9/30036 , G06F9/30065 , G06F9/30072 , G06F9/30098 , G06F9/30112 , G06F9/30145 , G06F9/30149 , G06F9/3016 , G06F9/32 , G06F9/345 , G06F9/3802 , G06F9/3818 , G06F9/383 , G06F9/3836 , G06F9/3851 , G06F9/3856 , G06F9/3867 , G06F9/3887 , G06F9/48 , G06F11/00 , G06F11/1048 , G06F12/0862 , G06F12/0875 , G06F12/0897 , G06F12/1009 , G06F17/16 , H03H17/0664 , G06F9/30018 , G06F9/325 , G06F9/381 , G06F9/3822 , G06F11/10 , G06F15/7807 , G06F15/781 , G06F2212/452 , G06F2212/60 , G06F2212/602 , G06F2212/68
摘要: Disclosed embodiments include an electronic device having a processor core, a memory, a register, and a data load unit to receive a plurality of data elements stored in the memory in response to an instruction. All of the data elements hare the same data size, which is specified by one or more coding bits. The data load unit includes an address generator to generate addresses corresponding to locations in the memory at which the data elements are located, and a formatting unit to format the data elements. The register is configured to store the formatted data elements, and the processor core is configured to receive the formatted data elements from the register.
-
公开(公告)号:US12058714B2
公开(公告)日:2024-08-06
申请号:US18153997
申请日:2023-01-12
发明人: Aris Papasakellariou
CPC分类号: H04W72/535 , G06F7/49973 , H04W72/0453 , H04W72/21 , H04W72/23 , H04W72/53 , H04W72/54
摘要: Methods and apparatuses for PDCCH reception and transmission. A method for PDCCH reception includes transmitting a capability for receptions of PDCCHs on a downlink (DL) cell. PDCCH receptions on the DL cell are according to (X1, Y1) or (X2, Y2) when any two PDCCH receptions are within Y1 or Y2 symbols or have first symbols separated by at least X1 or X2 symbols, respectively. The method further includes receiving a configuration of search space sets for PDCCH receptions on the DL cell; determining, based on the configuration of the search space sets, whether PDCCH receptions are according to (X2, Y2); and receiving on the DL cell: a maximum number of MPDCCHmax,X1,μ PDCCHs within Y1 symbols when PDCCH receptions are not according to (X2, Y2), and a maximum number of MPDCCHmax,X2,μ PDCCHs within Y2 symbols when PDCCH receptions are according to (X2, Y2).
-
公开(公告)号:US12039482B2
公开(公告)日:2024-07-16
申请号:US17454581
申请日:2021-11-11
申请人: 7-Eleven, Inc.
CPC分类号: G06Q10/08 , G06F7/49947 , G06F17/18 , G06Q10/04
摘要: A data prediction subsystem receives event data indicating an amount of an item removed from each of a plurality of locations over a previous period of time. For each location, prediction data is determined using the event data. The prediction data includes, for each day over a future period of time, a non-integer value indicating an anticipated amount of the item that will be removed from the location. An improved rounding process is used to round the prediction value for each day. The resulting prediction data is used to proactively request items with improved communication and computational efficiency.
-
公开(公告)号:US20240143274A1
公开(公告)日:2024-05-02
申请号:US18311509
申请日:2023-05-03
发明人: Hyeonuk SIM , Jongeun LEE , Azat AZAMAT
CPC分类号: G06F5/01 , G06F7/49947
摘要: A neural network operation apparatus and method are disclosed. A neural network operation apparatus includes a receiver that receives data for a neural network operation, and a processor that performs a scaling operation by multiplying the data by a constant, performs a rounding operation by truncating bits forming a result of the scaling operation, performs a scaling back operation based on a result of the rounding operation, and generates a neural network operation result by accumulating results of the scaling back operation.
-
公开(公告)号:US20240126506A1
公开(公告)日:2024-04-18
申请号:US18399381
申请日:2023-12-28
申请人: Intel Corporation
发明人: Roberto DiCecco , Joshua Fender , Shane O'Connell
CPC分类号: G06F7/485 , G06F7/483 , G06F7/4876 , G06F7/49947 , G06F7/5443 , G06F17/16
摘要: Circuitry for decomposing block floating-point numbers into lower precision floating-point numbers is provided. The circuitry may include a high precision storage circuit configured to provide high precision floating-point numbers, input selectors configured to receive the high precision floating-point numbers from the high precision storage circuit and to generate corresponding lower precision floating-point components with adjusted exponents, and a low precision block floating-point vector circuit configured to combine the various lower precision floating-point components generated by the input selectors. The lower precision floating-point components may be processed spatially or over multiple iterations over time.
-
公开(公告)号:US20240061903A1
公开(公告)日:2024-02-22
申请号:US17892852
申请日:2022-08-22
申请人: Xilinx, Inc.
发明人: Wenzong Yang , Wang Xi , Yadong Li , Junbin Wang , Shaoxia Fang
CPC分类号: G06F17/16 , G06F7/5443 , G06F7/552 , G06F7/4991
摘要: Circuits and methods for determining a maximum bias for computing softmax on a tensor include a processor circuit configured to transform in parallel, elements of each group of a plurality of groups of elements of a tensor X into respective power-of-two elements. The respective power-of-two element from element xt of the tensor is pt, pt=(xt*log2e), and pt has an integer part and a fraction part. A first comparison circuit (204) is configured to determine respective group-level biases for the groups. The group-level bias of groupm is dm, and dm is an integer part of a maximum of the power-of-two elements of groupm. A second comparison circuit is configured to determine a greatest one of the respective group-level biases to be a tensor-level bias, dmax.
-
公开(公告)号:US20240012613A1
公开(公告)日:2024-01-11
申请号:US18372737
申请日:2023-09-26
发明人: Casper Van Benthem
CPC分类号: G06F5/01 , G06F7/49936
摘要: Apparatus includes hardware logic arranged to normalise an n-bit input number. The hardware logic comprises at least a first hardware logic stage, an intermediate hardware logic stage and a final hardware logic stage. Each stage comprises a left shifting logic element, the first and intermediate stages each also comprise a plurality of OR-reduction logic elements and the intermediate and final stages each also comprise one or more multiplexers. The OR-reduction logic elements operate on different subsets of bits from the number input to the particular stage. In the intermediate and final hardware logic stages, a first of the multiplexers selects an OR-reduction result received from a previous hardware logic stage and the left shifting logic element is arranged to perform left shifting on the updated binary number received from an immediately previous hardware logic stage dependent upon the selected OR-reduction result.
-
-
-
-
-
-
-
-
-