Publication No.: US20210012203A1
Publication Date: 2021-01-14
Application No.: US16508277
Filing Date: 2019-07-10
Applicant: Advanced Micro Devices, Inc.
Inventor: Abhinav Vishnu, Prakash Sathyanath Raghavendra, Tamer M. Elsharnouby, Rachida Kebichi, Walid Ali, Jonathan Charles Gallmeier
IPC: G06N3/08
Abstract: Systems, methods, and devices for increasing inference speed of a trained convolutional neural network (CNN). A first computation speed of first filters having a first filter size in a layer of the CNN is determined, and a second computation speed of second filters having a second filter size in the layer of the CNN is determined. The size of at least one of the first filters is changed to the second filter size if the second computation speed is faster than the first computation speed. In some implementations, the CNN is retrained, after changing the size of at least one of the first filters to the second filter size, to generate a retrained CNN. If a key performance indicator (KPI) loss of the retrained CNN exceeds a threshold, fewer of the first filters are changed to the second filter size.
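
The selection logic summarized in the abstract can be sketched roughly as follows. This is an illustrative reading, not the claimed implementation. The function name `resize_filters`, the `speed` table, the callback `kpi_loss_after_retrain`, and the policy of backing off by one filter at a time are all assumptions introduced here; the abstract says only that fewer filters are changed when the KPI loss exceeds the threshold. The speed figures and the toy loss function are made-up stand-ins for real measurement and retraining.

```python
from typing import Callable, Dict, List


def resize_filters(
    sizes: List[int],
    speed: Dict[int, float],
    first: int,
    second: int,
    kpi_loss_after_retrain: Callable[[List[int]], float],
    max_kpi_loss: float,
) -> List[int]:
    """Return proposed per-filter sizes for one layer (hypothetical helper).

    `speed` maps a filter size to a measured computation speed, where a
    higher value means faster. `kpi_loss_after_retrain` stands in for
    retraining the CNN with the proposed sizes and measuring the KPI loss.
    """
    # Leave the layer unchanged unless the second size computes faster.
    if speed[second] <= speed[first]:
        return list(sizes)

    # Indices of the filters that currently have the first size.
    candidates = [i for i, s in enumerate(sizes) if s == first]

    # Assumed policy: try changing all candidates, then progressively fewer,
    # until the KPI loss after retraining is within the threshold.
    for count in range(len(candidates), 0, -1):
        proposal = list(sizes)
        for i in candidates[:count]:
            proposal[i] = second
        if kpi_loss_after_retrain(proposal) <= max_kpi_loss:
            return proposal

    # No acceptable proposal was found; keep the original sizes.
    return list(sizes)


# Toy usage: three 1x1 filters and one 3x3 filter in the layer.
original = [1, 1, 1, 3]
measured_speed = {1: 100.0, 3: 140.0}  # made-up numbers, higher is faster


def toy_kpi_loss(proposal: List[int]) -> float:
    # Made-up stand-in for retraining: one unit of loss per changed filter.
    return float(sum(a != b for a, b in zip(original, proposal)))


result = resize_filters(original, measured_speed, 1, 3, toy_kpi_loss, 2.0)
print(result)  # [3, 3, 1, 3]
```

In this toy run, changing all three 1x1 filters gives a loss of 3.0, which exceeds the threshold of 2.0, so the sketch falls back to changing two of them.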