INSTRUCTION AND LOGIC TO PROVIDE VECTOR SCATTER-OP AND GATHER-OP FUNCTIONALITY
    1.
    Patent application
    INSTRUCTION AND LOGIC TO PROVIDE VECTOR SCATTER-OP AND GATHER-OP FUNCTIONALITY (status: in force)

    Publication number: US20140201498A1

    Publication date: 2014-07-17

    Application number: US13977729

    Filing date: 2011-09-26

    IPC classification: G06F9/38 G06F9/30

    Abstract: Instructions and logic provide vector scatter-op and/or gather-op functionality. In some embodiments, responsive to an instruction specifying a gather and a second operation, a destination register, an operand register, and a memory address, execution units read values in a mask register, wherein fields in the mask register correspond to offset indices in an indices register for data elements in memory. A first mask value indicates that the element has not been gathered from memory, and a second value indicates that the element does not need to be gathered or has already been gathered. For each field having the first value, the data element is gathered from memory into the corresponding destination register location, and the corresponding value in the mask register is changed to the second value. When all mask register fields have the second value, the second operation is performed using corresponding data in the destination and operand registers to generate results.
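The mask-driven gather followed by a second operation, as described in the abstract, can be modeled in a few lines of scalar Python. This is only an illustrative sketch, not the patented hardware logic: the function name `gather_op`, the list-based "registers", and the choice of 1 for the first mask value and 0 for the second are all assumptions made for the example.

```python
import operator


def gather_op(memory, base, indices, mask, dest, operand, op):
    """Emulate a masked gather followed by a second operation (hypothetical model).

    mask[i] == 1: lane i still has to be gathered (the "first value").
    mask[i] == 0: lane i is not needed or was already gathered (the "second value").
    """
    for i, pending in enumerate(mask):
        if pending:
            dest[i] = memory[base + indices[i]]  # gather one element
            mask[i] = 0                          # record progress for this lane
    # Every mask field now holds the second value, so the second
    # operation is applied lane by lane to dest and operand.
    return [op(d, s) for d, s in zip(dest, operand)]


# Example: lane 1 is masked off and keeps its old destination value.
memory = [10, 20, 30, 40, 50, 60, 70, 80]
mask = [1, 0, 1, 1]
dest = [0, 99, 0, 0]
result = gather_op(memory, 1, [0, 2, 4, 6], mask, dest, [1, 1, 1, 1], operator.add)
```

Because each mask field is cleared only after its element has been written, a model like this could be interrupted partway through and called again with the same `mask` and `dest` without repeating completed lanes.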


    INSTRUCTION AND LOGIC TO PROVIDE VECTOR LOADS AND STORES WITH STRIDES AND MASKING FUNCTIONALITY
    2.
    Patent application
    INSTRUCTION AND LOGIC TO PROVIDE VECTOR LOADS AND STORES WITH STRIDES AND MASKING FUNCTIONALITY (status: in force)

    Publication number: US20140195775A1

    Publication date: 2014-07-10

    Application number: US13977730

    Filing date: 2011-09-26

    IPC classification: G06F9/30

    Abstract: Instructions and logic provide vector loads and/or stores with stride and mask functionality. In some embodiments, responsive to an instruction specifying a set of loads, a destination register, a mask register, a memory address, and a stride length, execution units read values in the mask register, wherein fields in the mask register correspond to stride-length multiples from the memory address to data elements in memory. A first mask value indicates that the element has not been loaded from memory, and a second value indicates that the element does not need to be loaded or has already been loaded. For each field having the first value, the corresponding multiple of said stride length is generated according to the data field's position in the mask register to load the data element from memory into the corresponding destination register location, and the corresponding value in the mask register is changed to the second value. These instructions can restart after faults.
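The restart-after-fault behavior mentioned in the abstract follows from clearing each mask field only after its load succeeds. The sketch below illustrates that idea under stated assumptions: `strided_masked_load` is a hypothetical name, memory is modeled as a Python dict, and a missing key (raising `KeyError`) stands in for a memory fault.

```python
def strided_masked_load(memory, base, stride, mask, dest):
    """Load dest[i] from memory[base + i * stride] for every lane whose mask field is 1.

    memory is a dict so that a missing address can stand in for a fault.
    """
    for i, pending in enumerate(mask):
        if pending:
            dest[i] = memory[base + i * stride]  # KeyError models a fault here
            mask[i] = 0                          # cleared only after the load succeeds


# Address 6 is missing at first, so the third lane "faults".
memory = {0: 5, 3: 6, 9: 8}
mask = [1, 1, 1, 1]
dest = [0, 0, 0, 0]
try:
    strided_masked_load(memory, 0, 3, mask, dest)
except KeyError:
    memory[6] = 7                                  # a "fault handler" supplies the data
    strided_masked_load(memory, 0, 3, mask, dest)  # restart resumes at the pending lanes
```

On the restart, the first two lanes are skipped because their mask fields already hold the second value, so only the remaining elements are loaded.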


    Efficient zero-based decompression
    5.
    Granted patent
    Efficient zero-based decompression (status: in force)

    Publication number: US09575757B2

    Publication date: 2017-02-21

    Application number: US13991858

    Filing date: 2011-12-30

    IPC classification: G06F9/30 H03M7/46

    Abstract: A processor core includes a hardware decode unit to decode vector instructions for decompressing a run length encoded (RLE) set of source data elements and an execution unit to execute the decoded instructions. The execution unit generates a first mask by comparing the set of source data elements with a set of zeros and then counts the trailing zeros in the mask. A second mask is made based on the count of trailing zeros. The execution unit then copies the set of source data elements to a buffer using the second mask and then reads the number of RLE zeros from the set of source data elements. The buffer is shifted and copied to a result, and the set of source data elements is shifted to the right. If more valid data elements remain in the set of source data elements, the process is repeated until all valid data is processed.
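The loop in the abstract can be approximated in scalar Python. This sketch loosely follows the described steps and rests on an assumption the abstract does not spell out: that the encoded stream stores non-zero elements literally and represents a run of zeros as a 0 marker followed by the run length. The name `rle_zero_decode` is hypothetical, and well-formed input (every 0 marker followed by a count) is assumed.

```python
def rle_zero_decode(src):
    """Decode a zero-based RLE stream (assumed format: literals, or 0 followed by a count)."""
    src = list(src)
    out = []
    while src:
        # "First mask": which source elements compare equal to zero.
        is_zero = [x == 0 for x in src]
        # Number of literal elements before the first zero marker; this plays
        # the role of the trailing-zero count of the first mask.
        n = is_zero.index(True) if True in is_zero else len(src)
        # "Second mask": select the first n elements and copy them to the output.
        out.extend(src[:n])
        if n < len(src):
            run = src[n + 1]        # read the number of RLE zeros
            out.extend([0] * run)   # expand the run
            src = src[n + 2:]       # shift the consumed elements out of the source
        else:
            src = []                # no marker left, all valid data processed
    return out


decoded = rle_zero_decode([7, 0, 3, 5, 9, 0, 1])  # [7, 0, 0, 0, 5, 9, 0]
```

A vector implementation would perform the compare, mask, and shift steps on whole registers at once; the list slicing here only mirrors the data movement.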


    VECTOR FREQUENCY COMPRESS INSTRUCTION
    8.
    Patent application
    VECTOR FREQUENCY COMPRESS INSTRUCTION (status: in force)

    Publication number: US20140317377A1

    Publication date: 2014-10-23

    Application number: US13993058

    Filing date: 2011-12-30

    IPC classification: G06F9/30

    Abstract: A processor core includes a hardware decode unit to decode a vector frequency compress instruction that includes a source operand and a destination operand. The source operand specifies a source vector register that includes a plurality of source data elements including one or more runs of identical data elements that are each to be compressed in a destination vector register as a value and run length pair. The destination operand identifies the destination vector register. The processor core also includes an execution engine unit to execute the decoded vector frequency compress instruction, which causes, for each source data element, a value to be copied into the destination vector register to indicate that source data element's value. One or more runs of the source data elements equal are encoded in the destination vector register as the predetermined compression value followed by a run length for that run.
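One reading of the abstract is that runs of a single predetermined value are replaced by that value followed by the run length, while all other elements are copied unchanged. The scalar sketch below illustrates that reading only; the name `frequency_compress` and the default compression value of 0 are assumptions, not details taken from the patent.

```python
def frequency_compress(src, compress_value=0):
    """Encode each run of compress_value as the pair (compress_value, run_length)."""
    out = []
    i = 0
    while i < len(src):
        if src[i] != compress_value:
            out.append(src[i])              # copy the element's value as-is
            i += 1
        else:
            j = i
            while j < len(src) and src[j] == compress_value:
                j += 1                      # find the end of the run
            out.extend([compress_value, j - i])
            i = j
    return out


compressed = frequency_compress([7, 0, 0, 0, 5, 9, 0])  # [7, 0, 3, 5, 9, 0, 1]
```

Note that a run of length one expands from one element to two under this scheme, so the encoding only saves space when runs of the chosen value are common.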


    INSTRUCTION AND LOGIC TO PROVIDE VECTOR HORIZONTAL COMPARE FUNCTIONALITY
    10.
    Patent application
    INSTRUCTION AND LOGIC TO PROVIDE VECTOR HORIZONTAL COMPARE FUNCTIONALITY (status: in force)

    Publication number: US20140258683A1

    Publication date: 2014-09-11

    Application number: US13977733

    Filing date: 2011-11-30

    IPC classification: G06F9/30

    Abstract: Instructions and logic provide vector horizontal compare functionality. Some embodiments, responsive to an instruction specifying a destination operand, a size of the vector elements, a source operand, and a mask corresponding to a portion of the vector element data fields in the source operand, read values from data fields of the specified size in the source operand that correspond to the mask and compare the values for equality. In some embodiments, responsive to a detection of inequality, a trap may be taken. In some alternative embodiments, a flag may be set. In other alternative embodiments, a mask field may be set to a masked state for the corresponding unequal value(s). In some embodiments, responsive to all unmasked data fields of the source operand being equal to a particular value, that value may be broadcast to all data fields of the specified size in the destination operand.
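The mask-update and broadcast alternatives in the abstract can be combined into one small scalar model. This is a sketch under assumptions: `horizontal_compare` is a hypothetical name, a mask value of 1 means the lane takes part in the comparison, and "unequal" is interpreted as "different from the first unmasked element", which the abstract does not specify.

```python
def horizontal_compare(src, mask):
    """Compare the unmasked lanes of src for equality.

    Returns (dest, new_mask): dest is the broadcast value in every lane when
    all unmasked lanes are equal, otherwise None; new_mask has the lanes that
    differ from the first unmasked lane set to the masked state (0).
    """
    active = [i for i, m in enumerate(mask) if m]
    if not active:
        return None, list(mask)             # nothing to compare
    ref = src[active[0]]                    # assumed reference: first unmasked lane
    new_mask = [1 if (m and src[i] == ref) else 0 for i, m in enumerate(mask)]
    if new_mask == list(mask):              # no unmasked lane was unequal
        return [ref] * len(src), new_mask   # broadcast to all destination lanes
    return None, new_mask                   # inequality detected


equal_case = horizontal_compare([4, 4, 9, 4], [1, 1, 0, 1])    # ([4, 4, 4, 4], [1, 1, 0, 1])
unequal_case = horizontal_compare([4, 5, 4, 4], [1, 1, 1, 1])  # (None, [1, 0, 1, 1])
```

Returning `None` for the destination stands in for the trap or flag alternatives; a caller could raise an exception at that point to model the trap embodiment.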
