METHODS AND APPARATUS TO PERFORM AUTOMATIC COMPILER OPTIMIZATION TO ENABLE STREAMING-STORE GENERATION FOR UNALIGNED CONTIGUOUS WRITE ACCESS

    公开(公告)号:US20220012028A1

    公开(公告)日:2022-01-13

    申请号:US17483459

    申请日:2021-09-23

    Abstract: Methods, apparatus, systems and articles of manufacture (e.g., computer readable storage media) to perform automatic compiler optimization to enable streaming-store generation for unaligned contiguous write access are disclosed. Example apparatus disclosed herein are to mark a store instruction in source program code as a transformation candidate when the store instruction is associated with a group of memory accesses that are unaligned with respect to a size of a cache line in a cache. Disclosed apparatus are also to transform the store instruction that is marked as the transformation candidate to form transformed program code when a non-temporal property is satisfied, the transformed program code to replace the store instruction with (i) a write to a buffer in the cache and (ii) a streaming-store instruction that is to write contents of the buffer to memory.

    Methods and apparatus to perform automatic compiler optimization to enable streaming-store generation for unaligned contiguous write access

    公开(公告)号:US12032934B2

    公开(公告)日:2024-07-09

    申请号:US17483459

    申请日:2021-09-23

    CPC classification number: G06F8/4434

    Abstract: Methods, apparatus, systems and articles of manufacture (e.g., computer readable storage media) to perform automatic compiler optimization to enable streaming-store generation for unaligned contiguous write access are disclosed. Example apparatus disclosed herein are to mark a store instruction in source program code as a transformation candidate when the store instruction is associated with a group of memory accesses that are unaligned with respect to a size of a cache line in a cache. Disclosed apparatus are also to transform the store instruction that is marked as the transformation candidate to form transformed program code when a non-temporal property is satisfied, the transformed program code to replace the store instruction with (i) a write to a buffer in the cache and (ii) a streaming-store instruction that is to write contents of the buffer to memory.

Patent Agency Ranking