Publication No.: US10229356B1
Publication Date: 2019-03-12
Application No.: US14581969
Filing Date: 2014-12-23
Applicant: Amazon Technologies, Inc.
Inventor: Baiyang Liu, Michael Reese Bastian, Bjorn Hoffmeister, Sankaran Panchapagesan, Ariya Rastrow
IPC: G06N3/08
Abstract: Features are disclosed for error-tolerant model compression. Such features could be used to reduce the size of a deep neural network model that includes several hidden node layers. Performing the size reduction in an error-tolerant fashion ensures that predictive applications relying on the model do not experience performance degradation due to model compression. Such predictive applications include automatic speech recognition, image recognition, and recommendation engines. Partially quantized models are re-trained such that any degradation of accuracy is “trained out” of the model, providing improved error tolerance with compression.
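
The abstract does not state which quantization scheme or retraining schedule is used. As a minimal sketch only, assuming uniform per-layer quantization and a layer-by-layer schedule (both are assumptions, not details taken from the patent), retraining a partially quantized model might look like the following. The function names `quantize`, `forward`, `mse`, `train`, and `compress_layerwise`, the tiny tanh network, and the toy data are all hypothetical.

```python
import numpy as np

def quantize(w, bits=4):
    """Uniformly quantize w to at most 2**bits levels over its min/max range (assumed scheme)."""
    lo, hi = float(w.min()), float(w.max())
    if hi == lo:
        return w.copy()
    step = (hi - lo) / (2 ** bits - 1)
    return lo + np.round((w - lo) / step) * step

def forward(weights, x):
    """Return the activations of every layer: tanh hidden layers, linear output."""
    acts = [x]
    for i, w in enumerate(weights):
        z = acts[-1] @ w
        acts.append(z if i == len(weights) - 1 else np.tanh(z))
    return acts

def mse(weights, x, y):
    return float(np.mean((forward(weights, x)[-1] - y) ** 2))

def train(weights, x, y, frozen, steps=200, lr=0.05):
    """Gradient descent on MSE; layers whose index is in `frozen` are not updated."""
    for _ in range(steps):
        acts = forward(weights, x)
        grad = 2.0 * (acts[-1] - y) / len(x)       # dLoss/d(output pre-activation)
        for i in reversed(range(len(weights))):
            g_w = acts[i].T @ grad                  # gradient for layer i
            if i > 0:                               # back-propagate through tanh
                grad = (grad @ weights[i].T) * (1.0 - acts[i] ** 2)
            if i not in frozen:
                weights[i] = weights[i] - lr * g_w
    return weights

def compress_layerwise(weights, x, y, bits=4):
    """Quantize one layer at a time, retraining the still-unquantized layers after each step."""
    weights = [w.copy() for w in weights]
    frozen = set()
    for i in range(len(weights)):
        weights[i] = quantize(weights[i], bits)
        frozen.add(i)
        if len(frozen) < len(weights):              # model is only partially quantized
            train(weights, x, y, frozen)
    return weights

# Toy usage with synthetic data (hypothetical, for illustration only).
rng = np.random.default_rng(0)
x = rng.normal(size=(64, 3))
y = np.sin(x.sum(axis=1, keepdims=True))
model = [rng.normal(scale=0.5, size=s) for s in [(3, 8), (8, 8), (8, 1)]]
model = train(model, x, y, frozen=set())            # baseline full-precision model
compressed = compress_layerwise(model, x, y, bits=4)
print(mse(model, x, y), mse(compressed, x, y))
```

In this sketch, each quantized layer is frozen so that the remaining full-precision layers can adapt to the quantization error introduced so far. This is one plausible reading of "trained out", not a description of the claimed method.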