Adjustable precision for multi-stage compute processes

    Publication number: US11385863B2

    Publication date: 2022-07-12

    Application number: US16052218

    Filing date: 2018-08-01

    IPC classification: G06F7/483 G06N3/08 G06N3/063

    Abstract: Disclosed techniques provide for dynamically changing the precision of a multi-stage compute process, for example, changing neural network (NN) parameters on a per-layer basis depending on properties of incoming data streams and per-layer performance of the NN, among other considerations. NNs include multiple layers, each of which may be calculated with a different degree of accuracy and, therefore, a different compute resource overhead (e.g., memory, processor resources, etc.). NNs are usually trained with 32-bit or 16-bit floating-point numbers. Once trained, an NN may be deployed in production. One approach to reducing compute overhead is to reduce the parameter precision of NNs to 16 or 8 bits for deployment. The conversion to an acceptable lower precision is usually determined manually before deployment, and precision levels are fixed while deployed. Disclosed techniques and implementations address automatic, rather than manual, determination of precision levels for different stages and dynamic adjustment of precision for each stage at run-time.
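
The per-stage precision selection described in this abstract can be illustrated with a minimal sketch. It is not the patented method: the candidate bit widths, the uniform symmetric quantizer, the error tolerance, and every function name below are assumptions made for illustration. The sketch picks, for each layer, the lowest candidate precision whose quantization error stays within a tolerance. Re-running the selection as conditions change would approximate a run-time adjustment.

```python
# Hypothetical sketch: choose the lowest precision per layer that keeps the
# quantization error within a tolerance. All names and values are assumed.

CANDIDATE_BITS = (8, 16, 32)  # assumed precision levels, lowest first


def quantize(values, bits):
    """Uniform symmetric quantization of a list of floats to `bits` bits."""
    max_abs = max((abs(v) for v in values), default=0.0)
    if max_abs == 0.0:
        return list(values)  # nothing to quantize
    levels = 2 ** (bits - 1) - 1  # e.g. 127 representable steps for 8 bits
    scale = max_abs / levels
    return [round(v / scale) * scale for v in values]


def max_error(values, bits):
    """Largest absolute difference introduced by quantizing to `bits` bits."""
    quantized = quantize(values, bits)
    return max((abs(v - q) for v, q in zip(values, quantized)), default=0.0)


def choose_precision(layer_weights, tolerance):
    """Return the smallest candidate bit width whose error is within tolerance."""
    for bits in CANDIDATE_BITS:
        if max_error(layer_weights, bits) <= tolerance:
            return bits
    return CANDIDATE_BITS[-1]  # fall back to the highest precision


def plan_precisions(layers, tolerance):
    """Map each layer name to its chosen bit width."""
    return {name: choose_precision(w, tolerance) for name, w in layers.items()}


if __name__ == "__main__":
    layers = {"layer0": [0.0, 0.0], "layer1": [1.0, 0.3]}
    print(plan_precisions(layers, tolerance=1e-4))
```

A real system would more likely measure the effect on per-layer output accuracy or on properties of the incoming data, as the abstract suggests, rather than raw weight error. The weight-error criterion is used here only because it keeps the example self-contained.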

    Allocating coherent and non-coherent memories

    Publication number: US20180349051A1

    Publication date: 2018-12-06

    Application number: US15776473

    Filing date: 2016-02-05

    IPC classification: G06F3/06 G06F12/0815

    CPC classification: G06F13/1663 Y02D10/14

    Abstract: A computing device includes a coherence controller and a memory comprising a coherent memory region and a non-coherent memory region. The coherence controller may determine a coherent region of the memory, determine a non-coherent region of the memory, and, responsive to receiving a memory allocation request for a block of the memory, allocate the requested block in the non-coherent memory region or the coherent memory region based on whether the request indicates that the block is to be coherent or non-coherent.
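
The allocation behavior described in this abstract can be sketched as a small software model. The abstract describes a hardware coherence controller, so this is only an illustration: the bump-pointer allocation strategy, the address layout, and the class and parameter names (`Region`, `CoherenceController`, `coherent_size`) are assumptions, not details from the patent.

```python
# Hypothetical model: one memory split into a coherent and a non-coherent
# region, with each request routed by a flag carried in the request.


class Region:
    """A contiguous address range served by a simple bump-pointer allocator."""

    def __init__(self, name, base, size):
        self.name = name
        self.base = base
        self.size = size
        self.next_free = base  # next unallocated address in this region

    def allocate(self, nbytes):
        if nbytes <= 0:
            raise ValueError("allocation size must be positive")
        if self.next_free + nbytes > self.base + self.size:
            raise MemoryError(f"{self.name} region exhausted")
        addr = self.next_free
        self.next_free += nbytes
        return addr


class CoherenceController:
    """Determines both regions up front, then routes allocation requests."""

    def __init__(self, total_size, coherent_size):
        if not 0 <= coherent_size <= total_size:
            raise ValueError("coherent_size must fit within total_size")
        # Assumed layout: coherent region first, non-coherent region after it.
        self.coherent = Region("coherent", 0, coherent_size)
        self.non_coherent = Region(
            "non-coherent", coherent_size, total_size - coherent_size
        )

    def allocate(self, nbytes, coherent):
        """Allocate in the region selected by the request's `coherent` flag."""
        region = self.coherent if coherent else self.non_coherent
        return region.name, region.allocate(nbytes)


if __name__ == "__main__":
    ctrl = CoherenceController(total_size=1024, coherent_size=256)
    print(ctrl.allocate(64, coherent=True))
    print(ctrl.allocate(64, coherent=False))
```

In this model the request itself carries the coherence requirement, which mirrors the abstract's condition that the allocation depends on whether the request indicates a coherent or non-coherent block.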