Granted invention patent
- Title: Systems and methods for compression and distribution of machine learning models
- Application No.: US16624497
- Filing date: 2017-07-06
- Publication No.: US11531932B2
- Publication date: 2022-12-20
- Inventors: Jyrki Alakuijala, Robert Obryk
- Applicant: Google LLC
- Applicant address: US CA Mountain View
- Patentee: Google LLC
- Current patentee: Google LLC
- Current patentee address: US CA Mountain View
- Agency: Dority & Manning, P.A.
- International application: PCT/US2017/040798 WO 20170706
- International publication: WO2019/009897 WO 20190110
- Main classification: G06N20/00
- IPC classification: G06N20/00; G06N3/04; G06N3/08
Abstract:
The present disclosure provides systems and methods for compressing and/or distributing machine learning models. In one example, a computer-implemented method is provided to compress machine-learned models, which includes obtaining, by one or more computing devices, a machine-learned model. The method includes selecting, by the one or more computing devices, a weight to be quantized and quantizing, by the one or more computing devices, the weight. The method includes propagating, by the one or more computing devices, at least a part of a quantization error to one or more non-quantized weights and quantizing, by the one or more computing devices, one or more of the non-quantized weights. The method includes providing, by the one or more computing devices, a quantized machine-learned model.
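The steps named in the abstract (select a weight, quantize it, propagate part of the quantization error to weights not yet quantized, then quantize those) can be illustrated with a minimal Python sketch. The abstract does not specify the quantization grid, the order in which weights are selected, or how the error is shared out, so everything concrete here is an assumption made for illustration: a uniform grid with spacing `step`, left-to-right selection, and passing a `fraction` of each error to the single next weight. The function name `quantize_with_error_propagation` is hypothetical and is not taken from the patent.

```python
def quantize_with_error_propagation(weights, step=0.25, fraction=1.0):
    """Quantize a flat list of weights, carrying each rounding error forward.

    Assumptions (not from the patent text): uniform grid of spacing `step`,
    weights processed in list order, and `fraction` of each quantization
    error added to the next not-yet-quantized weight.
    """
    pending = list(weights)  # working copy; later entries absorb earlier errors
    quantized = []
    for i, w in enumerate(pending):
        # Quantize the selected weight to the nearest grid point.
        q = round(w / step) * step
        quantized.append(q)
        # Propagate part of the quantization error to a non-quantized weight.
        error = w - q
        if i + 1 < len(pending):
            pending[i + 1] += fraction * error
    return quantized


if __name__ == "__main__":
    original = [0.1, 0.1, 0.1, 0.1]
    print(quantize_with_error_propagation(original))
    # Independent rounding of the same weights, for comparison.
    print([round(w / 0.25) * 0.25 for w in original])
```

With these inputs, rounding each weight independently maps every value to zero, while carrying the error forward lets some weights round up, so the sum of the quantized weights stays within half a grid step of the original sum. A real model would apply this per layer or per tensor, and the choice of which neighbours receive the error is a design decision this sketch leaves open.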