SYSTEM, METHOD AND APPARATUS FOR CONDITIONALLY OFFLOADING INSTRUCTION EXECUTION

    公开(公告)号:US20240354107A1

    公开(公告)日:2024-10-24

    申请号:US18754447

    申请日:2024-06-26

    申请人: Intel Corporation

    IPC分类号: G06F9/30 G06F9/32 G06F9/38

    摘要: In one example, a processor includes: at least one core to execute instructions; and at least one cache memory coupled to the at least one core, the at least one cache memory to store data, at least some of the data a copy of data stored in a memory. The at least one core is to determine whether to conditionally offload a sequence of instructions for execution on a compute circuit associated with the memory, based at least in part on whether one or more first data is present in the at least one cache memory, the one or more first data for use during execution of the sequence of instructions. Other embodiments are described and claimed.

    Processor instructions for data compression and decompression

    公开(公告)号:US12106104B2

    公开(公告)日:2024-10-01

    申请号:US17133328

    申请日:2020-12-23

    申请人: Intel Corporation

    IPC分类号: G06F9/30 G06F12/0862 H03M7/30

    摘要: A processor that includes compression instructions to compress multiple adjacent data blocks of uncompressed read-only data stored in memory into one compressed read-only data block and store the compressed read-only data block in multiple adjacent blocks in the memory is provided. During execution of an application to operate on the read-only data, one of the multiple adjacent blocks storing the compressed read-only block is read from memory, stored in a prefetch buffer and decompressed in the memory controller. In response to a subsequent request during execution of the application for an adjacent data block in the compressed read-only data block, the uncompressed adjacent block is read directly from the prefetch buffer.

    Coalescing adjacent gather/scatter operations

    公开(公告)号:US11003455B2

    公开(公告)日:2021-05-11

    申请号:US16398183

    申请日:2019-04-29

    申请人: Intel Corporation

    摘要: According to one embodiment, a processor includes an instruction decoder to decode a first instruction to gather data elements from memory, the first instruction having a first operand specifying a first storage location and a second operand specifying a first memory address storing a plurality of data elements. The processor further includes an execution unit coupled to the instruction decoder, in response to the first instruction, to read contiguous a first and a second of the data elements from a memory location based on the first memory address indicated by the second operand, and to store the first data element in a first entry of the first storage location and a second data element in a second entry of a second storage location corresponding to the first entry of the first storage location.