BINARY QUANTIZATION METHOD, NEURAL NETWORK TRAINING METHOD, DEVICE, AND STORAGE MEDIUM

    Publication No.: US20250156697A1

    Publication Date: 2025-05-15

    Application No.: US19019769

    Filing Date: 2025-01-14

    Abstract: This application provides a binary quantization method, a neural network training method, a device, and a storage medium. The binary quantization method includes: determining to-be-quantized data in a neural network; determining a quantization parameter corresponding to the to-be-quantized data, where the quantization parameter includes a scaling factor and an offset; determining, based on the scaling factor and the offset, a binary upper limit and a binary lower limit corresponding to the to-be-quantized data; and performing binary quantization on the to-be-quantized data based on the scaling factor and the offset, to quantize the to-be-quantized data into the binary upper limit or the binary lower limit.
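
    The steps named in the abstract can be illustrated with a minimal Python sketch. The abstract does not state how the binary limits are derived from the quantization parameter or how that parameter is determined, so the choices below are assumptions made only for illustration: the limits are taken as offset - scale and offset + scale, each value is mapped to the upper limit when it is at least the offset, and the hypothetical helper `estimate_params` uses the mean and population standard deviation of the data as offset and scaling factor. All function names are likewise hypothetical and are not taken from the application.

```python
from statistics import fmean, pstdev
from typing import List, Sequence, Tuple


def binary_limits(scale: float, offset: float) -> Tuple[float, float]:
    """Return (lower, upper) binary limits.

    Assumption: the limits sit symmetrically around the offset,
    one scaling factor away on each side.
    """
    if scale < 0:
        raise ValueError("scale must be non-negative")
    return offset - scale, offset + scale


def binary_quantize(data: Sequence[float], scale: float, offset: float) -> List[float]:
    """Map every value to either the lower or the upper binary limit.

    Assumption: values greater than or equal to the offset go to the
    upper limit, all others go to the lower limit.
    """
    lower, upper = binary_limits(scale, offset)
    return [upper if x >= offset else lower for x in data]


def estimate_params(data: Sequence[float]) -> Tuple[float, float]:
    """Hypothetical parameter choice: (scale, offset) = (population std, mean)."""
    return pstdev(data), fmean(data)


if __name__ == "__main__":
    weights = [-0.5, 1.0, 4.0, 0.9]
    scale, offset = estimate_params(weights)
    print(binary_limits(scale, offset))
    print(binary_quantize(weights, scale, offset))
```

    Under these assumptions, the quantized output contains at most two distinct values, and those values shift and stretch with the offset and scaling factor rather than being fixed at -1 and +1.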
