-
1.
Publication No.: US20240020094A1
Publication date: 2024-01-18
Application No.: US18222101
Filing date: 2023-07-14
Inventors: Yuqin YU, Yaohui ZENG, Renyu BIAN, Huaisheng ZHANG
IPC classification: G06F7/523
CPC classification: G06F7/523
Abstract: A multiplication-accumulation method and apparatus, a processor, and a computer program product are provided. The method includes: when a logical operation unit performs a single-precision floating-point multiplication-accumulation operation, combining the two half-precision multiplier-accumulators in each single-precision multiplication-accumulation unit to perform the operation on to-be-processed single-precision floating-point numbers, obtaining a total of N single-precision multiplication-accumulation results; and when the logical operation unit performs a half-precision floating-point multiplication-accumulation operation, performing the operation on to-be-processed half-precision floating-point numbers with each half-precision multiplier-accumulator individually, obtaining a total of 2N half-precision multiplication-accumulation results. Utilization of the multiplier-accumulators is thereby improved.
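The two operating modes in the abstract can be illustrated with a small behavioral model. This is only a sketch under stated assumptions, not the claimed circuit: the function names `round_to` and `mac_array`, the mode strings, and the way rounding is emulated (via Python's `struct` formats `e` and `f`) are all hypothetical choices made for illustration. It shows only the lane-count behavior: N results in single-precision mode, 2N in half-precision mode.

```python
import struct


def round_to(fmt, x):
    """Round a Python float to IEEE 754 half ('e') or single ('f') precision."""
    return struct.unpack(fmt, struct.pack(fmt, x))[0]


def mac_array(mode, a, b, c, n_units):
    """Behavioral model (assumed, not from the patent text) of n_units
    single-precision MAC units, each built from two half-precision MACs.

    mode "fp32": the two half-precision MACs of a unit are combined,
                 so the array produces n_units results.
    mode "fp16": every half-precision MAC works on its own,
                 so the array produces 2 * n_units results.
    Each lane computes a[i] * b[i] + c[i], rounded to the lane precision.
    """
    if mode == "fp32":
        lanes, fmt = n_units, "f"
    elif mode == "fp16":
        lanes, fmt = 2 * n_units, "e"
    else:
        raise ValueError("mode must be 'fp32' or 'fp16'")
    if not (len(a) == len(b) == len(c) == lanes):
        raise ValueError(f"expected {lanes} operands per input in {mode} mode")
    return [
        round_to(fmt, round_to(fmt, x) * round_to(fmt, y) + round_to(fmt, z))
        for x, y, z in zip(a, b, c)
    ]
```

With `n_units=2`, a call in `"fp32"` mode takes two operand triples, while a call in `"fp16"` mode takes four, so the same modeled hardware is fully occupied in both modes.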
-
Publication No.: US20240176585A1
Publication date: 2024-05-30
Application No.: US18225467
Filing date: 2023-07-24
Inventors: Yaohui ZENG, Renyu BIAN, Huaisheng ZHANG
Abstract: The present application relates to a data processing method, a computer device, and a storage medium. The method includes: acquiring the data formats of two pieces of input data, the two data formats being the same; determining, from a plurality of preset data conversion algorithms, a target data conversion algorithm matching the data formats, and performing data format conversion on the two pieces of input data by using the target data conversion algorithm to obtain at least two pieces of target input data; processing the at least two pieces of target input data by using a multiplier to obtain a preliminary operation result; and determining truncation bit widths corresponding to the two pieces of input data, and processing the preliminary operation result according to the truncation bit widths to obtain a multiplication operation result corresponding to the two pieces of input data.
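The steps in the abstract (format lookup, conversion, multiplication, truncation) can be sketched as follows. The abstract does not name the supported formats, the conversion algorithms, or how truncation widths are chosen, so everything concrete here is an assumption for illustration: the integer formats, the 32-bit multiplier operand width `MULT_WIDTH`, sign/zero extension as the "conversion algorithms", and a truncation width of twice the input width.

```python
MULT_WIDTH = 32  # hypothetical operand width of the shared multiplier


def sign_extend(raw, width):
    """Interpret `raw` as a `width`-bit two's-complement value and
    return its MULT_WIDTH-bit pattern."""
    raw &= (1 << width) - 1
    if raw >> (width - 1):
        raw -= 1 << width
    return raw & ((1 << MULT_WIDTH) - 1)


def zero_extend(raw, width):
    """Interpret `raw` as an unsigned `width`-bit value."""
    return raw & ((1 << width) - 1)


# Assumed table of preset conversion algorithms, keyed by data format.
CONVERTERS = {
    "int8": (sign_extend, 8),
    "uint8": (zero_extend, 8),
    "int16": (sign_extend, 16),
    "uint16": (zero_extend, 16),
}


def multiply(raw_a, fmt_a, raw_b, fmt_b):
    """Multiply two same-format bit patterns on one unsigned multiplier."""
    if fmt_a != fmt_b:
        raise ValueError("both inputs must share one data format")
    convert, width = CONVERTERS[fmt_a]       # pick the matching algorithm
    target_a = convert(raw_a, width)         # converted (target) inputs
    target_b = convert(raw_b, width)
    preliminary = target_a * target_b        # preliminary multiplier result
    trunc_width = 2 * width                  # assumed truncation bit width
    return preliminary & ((1 << trunc_width) - 1)
```

For example, `multiply(0xFE, "int8", 0x03, "int8")` treats `0xFE` as -2 and yields `0xFFFA`, the 16-bit pattern for -6, whereas the same bits in `"uint8"` format yield `0x02FA` (254 × 3 = 762). Truncation gives the correct low bits here because the assumed multiplier width is at least twice the widest input format.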
-