COMPUTING APPARATUS, METHOD, BOARD CARD AND COMPUTER-READABLE STORAGE MEDIUM

    公开(公告)号:US20220253280A1

    公开(公告)日:2022-08-11

    申请号:US17557669

    申请日:2021-12-21

    IPC分类号: G06F7/498 G06F7/499

    摘要: The present disclosure provides a computing device for processing a multi-bit width value, an integrated circuit board card, a method, and a computer readable storage medium. The computing device is included in the combined processing apparatus, and the combined processing apparatus further includes a general interconnection interface, and other processing devices. The computing device interacts with the other processing device to jointly complete a computing operation specified by a user. The combined processing apparatus further includes a storage device connected to an apparatus and the other processing devices and configured to store data of the apparatus and the other processing device. The solution of the present disclosure can split the multi-bit width value so that the processing capability of the processor is not influenced by the bit width.

    Arithmetic processing apparatus, control method, and non-transitory computer-readable recording medium having stored therein control program

    公开(公告)号:US11410036B2

    公开(公告)日:2022-08-09

    申请号:US16898493

    申请日:2020-06-11

    申请人: FUJITSU LIMITED

    发明人: Makiko Ito

    摘要: An arithmetic processing apparatus includes: a memory that stores, when a training of a given machine learning model is repeatedly performed in a plurality of iterations, an error of a decimal point position of each of a plurality of fixed-point number data obtained one in each of the plurality of iterations, the error being obtained based on statistical information related to a distribution of leftmost set bit positions for positive number and leftmost unset bit positions for negative number or a distribution of rightmost set bit positions of the plurality of fixed-point number data; and a processor coupled to the memory, the processor being configured to: determine, based on a tendency of the error in each of the plurality of iterations, an offset amount for correcting a decimal point position of fixed-point number data used in the training.

    MULTIPLICATION AND ACCUMULATION (MAC) OPERATOR

    公开(公告)号:US20220236949A1

    公开(公告)日:2022-07-28

    申请号:US17724253

    申请日:2022-04-19

    申请人: SK hynix Inc.

    发明人: Choung Ki SONG

    IPC分类号: G06F7/499 G06F7/523 G06F7/50

    摘要: A multiplication-accumulation (MAC) includes a multiplication circuit, a pre-processing circuit, and an adder tree. The multiplication circuit performs a multiplication operation on a plurality of weight data and a plurality of vector data each having a floating-point format to output a plurality of multiplication data. The pre-processing circuit performs shifting on mantissa data of the plurality of multiplication data by a difference between first maximum exponent data having a greatest value among the exponent data of the plurality of multiplication data and the remaining exponent data to output a plurality of pre-processed mantissa data. The adder tree adds the plurality of mantissa data to output mantissa addition bits.

    MULTIPLICATION AND ACCUMULATION(MAC) OPERATOR AND PROCESSING-IN-MEMORY (PIM) DEVICE INCLUDING THE MAC OPERATOR

    公开(公告)号:US20220229633A1

    公开(公告)日:2022-07-21

    申请号:US17703744

    申请日:2022-03-24

    申请人: SK hynix Inc.

    发明人: Choung Ki SONG

    IPC分类号: G06F7/499 G06F7/523 G06F7/50

    摘要: A multiplication-accumulation (MAC) includes a multiplication circuit, a pre-processing circuit, and an adder tree. The multiplication circuit performs a multiplication operation on a plurality of weight data and a plurality of vector data each having a floating-point format to output a plurality of multiplication data. The pre-processing circuit performs shifting on mantissa data of the plurality of multiplication data by a difference between first maximum exponent data having a greatest value among the exponent data of the plurality of multiplication data and the remaining exponent data to output a plurality of pre-processed mantissa data. The adder tree adds the plurality of mantissa data to output mantissa addition bits.

    SYSTEM AND METHOD FOR HANDLING FLOATING POINT HARDWARE EXCEPTION

    公开(公告)号:US20220188109A1

    公开(公告)日:2022-06-16

    申请号:US17686682

    申请日:2022-03-04

    摘要: A method includes receiving an input data at a floating point arithmetic operating unit, wherein the floating point operating unit is configured to perform a floating point arithmetic operation on the input data. The method includes determining whether the received input data is a qnan (quiet not-a-number) or whether the received input data is an snan (signaling not-a-number) prior to performing the floating point arithmetic operation. The method also includes converting a value of the received input data to a modified value prior to performing the floating point arithmetic operation if the received input data is either qnan or snan, wherein the converting eliminates special handling associated with the floating point arithmetic operation on the input data being either qnan or snan.

    COMPUTING APPARATUS AND METHOD, BOARD CARD, AND COMPUTER READABLE STORAGE MEDIUM

    公开(公告)号:US20220188071A1

    公开(公告)日:2022-06-16

    申请号:US17556389

    申请日:2021-12-20

    IPC分类号: G06F7/499 G06F7/48

    摘要: The present disclosure relates to a computing device for processing a multi-bit width value, an integrated circuit board card, a method, and a computer readable storage medium. The computing device may be included in a combined processing apparatus, and the combined processing apparatus may further include a general interconnection interface, and an other processing device. The computing device interacts with the other processing device to jointly complete a computing operation specified by a user. The combined processing apparatus may further include a storage device connected to an apparatus and the other processing device and configured to store data of the apparatus and the other processing device. The solution of the present disclosure can split the multi-bit width value so that the processing capability of the processor is not influenced by the bit width.