-
公开(公告)号:US20250156697A1
公开(公告)日:2025-05-15
申请号:US19019769
申请日:2025-01-14
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Xinghao CHEN , Zhijun TU , Yunhe WANG
IPC: G06N3/0495 , G06N3/084
Abstract: This application provides a binary quantization method, a neural network training method, a device, and a storage medium. The binary quantization method includes: determining to-be-quantized data in a neural network; determining a quantization parameter corresponding to the to-be-quantized data, where the quantization parameter includes a scaling factor and an offset; determining, based on the scaling factor and the offset, a binary upper limit and a binary lower limit corresponding to the to-be-quantized data; and performing binary quantization on the to-be-quantized data based on the scaling factor and the offset, to quantize the to-be-quantized data into the binary upper limit or the binary lower limit.