LAYER-LEVEL QUANTIZATION IN NEURAL NETWORKS
    Invention application

    Publication number: US20190171927A1

    Publication date: 2019-06-06

    Application number: US15833985

    Filing date: 2017-12-06

    Applicant: Facebook, Inc.

    Abstract: A method for performing layer-level quantization may include (1) performing an inference of an activation layer of a neural network, (2) storing a first limit value of the activation layer in a data storage system, (3) storing a second limit value of the activation layer in the data storage system, (4) determining a scaling factor based on the first and second limit values, and then (5) applying the scaling factor on a subsequent inference. Various other methods, systems, and devices are also disclosed.
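The five steps in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the claimed implementation: the abstract does not say what the two limit values are or how the scaling factor is computed, so the sketch assumes they are the minimum and maximum activation observed and that the scaling factor maps that range onto unsigned 8-bit integers. The names `LayerRange`, `observe`, `scale`, and `quantize` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class LayerRange:
    """Stored limit values for one activation layer (hypothetical structure)."""
    lo: float = float("inf")
    hi: float = float("-inf")

    def observe(self, activations):
        # Steps 2 and 3: store the first and second limit values,
        # assumed here to be the smallest and largest activation seen.
        self.lo = min(self.lo, min(activations))
        self.hi = max(self.hi, max(activations))

    def scale(self, num_bits=8):
        # Step 4: derive a scaling factor from the two limits
        # (assumed formula: real-valued width of one integer level).
        levels = (1 << num_bits) - 1
        span = self.hi - self.lo
        return span / levels if span > 0 else 1.0


def relu(xs):
    # Stand-in activation layer for the example.
    return [max(0.0, x) for x in xs]


def quantize(xs, rng, num_bits=8):
    # Step 5: apply the stored scaling factor to a later inference,
    # clamping values that fall outside the recorded limits.
    s = rng.scale(num_bits)
    levels = (1 << num_bits) - 1
    return [min(levels, max(0, round((x - rng.lo) / s))) for x in xs]


rng = LayerRange()
first = relu([-1.0, 0.5, 2.55])   # step 1: inference of the activation layer
rng.observe(first)                # limits become 0.0 and 2.55
second = relu([0.0, 1.0, 3.0])    # subsequent inference
codes = quantize(second, rng)     # integer codes in the range 0..255
```

Because the limits are recorded per layer, each layer gets its own scaling factor. In this sketch an activation outside the recorded range, such as 3.0 above, saturates at the top code instead of widening the range.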
