DATA PROCESSING APPARATUS AND METHOD FOR PERFORMING SCAN OPERATIONS
    1.
    Type: Invention application
    Status: Granted

    Publication number: US20150212972A1

    Publication date: 2015-07-30

    Application number: US14165967

    Filing date: 2014-01-28

    Applicant: ARM LIMITED

    Abstract: A data processing apparatus and method are provided for executing a vector scan instruction. The data processing apparatus comprises a vector register store configured to store vector operands, and processing circuitry configured to perform operations on vector operands retrieved from said vector register store. Further, control circuitry is configured to control the processing circuitry to perform the operations required by one or more instructions, said one or more instructions including a vector scan instruction specifying a vector operand comprising N vector elements and defining a scan operation to be performed on a sequence of vector elements within the vector operand. The control circuitry is responsive to the vector scan instruction to partition the N vector elements of the specified vector operand into P groups of adjacent vector elements, where P is between 2 and N/2, and to control the processing circuitry to perform a partitioned scan operation yielding the same result as the defined scan operation. The processing circuitry is configured to perform the partitioned scan operation by performing separate scan operations on those vector elements of the sequence contained within each group to produce intermediate results for each group, and to perform a computation operation to combine the intermediate results into a final result vector operand containing a sequence of result vector elements. The partitioned scan operation approach of the present invention enables a balance to be achieved between energy consumption and performance.
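
    The partitioned scan described in the abstract can be illustrated in software. The following is a minimal sketch, not the patented circuitry: it assumes an inclusive scan, an associative operator (addition by default), and that P divides N evenly. The function name partitioned_scan and its parameters are hypothetical.

```python
from itertools import accumulate
import operator


def partitioned_scan(values, p, op=operator.add):
    """Inclusive scan computed group by group, then combined.

    Assumption for this sketch: P divides N evenly and N > 0.
    """
    n = len(values)
    if p < 1 or n == 0 or n % p != 0:
        raise ValueError("this sketch assumes a non-empty input and that P divides N")
    size = n // p

    # Step 1: independent inclusive scan inside each group of adjacent elements.
    partial = [list(accumulate(values[i:i + size], op)) for i in range(0, n, size)]

    # Step 2: combine the intermediate results by carrying the running total
    # of all earlier groups into every element of each later group.
    result = list(partial[0])
    carry = partial[0][-1]
    for group in partial[1:]:
        result.extend(op(carry, x) for x in group)
        carry = op(carry, group[-1])
    return result


print(partitioned_scan([1, 2, 3, 4, 5, 6, 7, 8], 2))
```

    The result matches a plain scan over the whole vector for any valid P. In this sketch, a smaller P means longer per-group scans and fewer carries to combine, and a larger P means the reverse.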

    DATA PROCESSING APPARATUS AND METHOD FOR PROCESSING VECTOR OPERANDS
    2.
    Type: Invention application
    Status: Under examination (published)

    Publication number: US20150254077A1

    Publication date: 2015-09-10

    Application number: US14601598

    Filing date: 2015-01-21

    Applicant: ARM Limited

    Abstract: A data processing apparatus has processing circuitry for processing vector operands from a vector register store in response to vector micro-operations, some of which have control information identifying which data elements of the vector operands are selected for processing. Control circuitry detects vector micro-operations for which the control information specifies that a portion of the vector operand to be processed has no selected elements. If this is the case, then the control circuitry controls the processing circuitry to process a lower latency replacement micro-operation instead of the original micro-operation. This provides better performance than if a branch instruction is used to bypass the micro-operation if there are no selected elements.
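
    The idea of swapping in a cheaper micro-operation when no elements are selected can be modelled in a few lines. This is a rough software analogy only: the abstract does not say what the replacement micro-operation does, so the sketch assumes it simply passes the destination's previous contents through. The function name execute_micro_op and the returned labels are hypothetical.

```python
def execute_micro_op(op, src, dest_old, mask):
    """Apply op to the selected lanes; unselected lanes keep dest_old.

    Returns (result, which), where which records the path that was taken.
    """
    if not any(mask):
        # No element is selected: take the assumed low-latency replacement
        # (a plain copy) instead of evaluating op on any lane.
        return list(dest_old), "replacement"
    result = [op(s) if m else d for s, d, m in zip(src, dest_old, mask)]
    return result, "original"


print(execute_micro_op(lambda x: x * x, [1, 2, 3, 4], [0, 0, 0, 0], [False] * 4))
print(execute_micro_op(lambda x: x * x, [1, 2, 3, 4], [0, 0, 0, 0], [True, False, True, False]))
```

    The check happens per micro-operation, so no branch instruction is needed in the program itself to skip the work.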

    DATA PROCESSING APPARATUS AND METHOD FOR PERFORMING VECTOR SCAN OPERATION
    3.
    Type: Invention application
    Status: Granted

    Publication number: US20150254076A1

    Publication date: 2015-09-10

    Application number: US14601431

    Filing date: 2015-01-21

    Applicant: ARM LIMITED

    CPC classification number: G06F9/30036 G06F9/3001 G06F9/30098

    Abstract: A vector scan operation is performed to generate M data elements of a result vector, where each result data element corresponds to a combination of an additional data element with at least some of the data elements of a source vector operand V. The vector scan operation is performed using a plurality of steps, each step comprising one or more combination operations for combining data elements. At least one of the steps includes two or more combination operations performed in parallel. At least two of the steps comprise a combination operation for combining a data element with the additional data element S. This approach enables the vector scan operation to be performed in fewer steps in the case where fewer than M data elements are active, so that the vector scan operation can be performed more quickly.
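
    One step schedule consistent with this abstract treats the additional element S as if it sat one position before the first lane of a step-doubling scan. The sketch below is an illustration under that assumption and is not necessarily the schedule claimed in the application. The name scan_with_extra_element, the active parameter (number of leading active lanes), and the step counter are hypothetical.

```python
import operator


def scan_with_extra_element(v, s, active=None, op=operator.add):
    """Return (result, steps) with result[i] = s op v[0] op ... op v[i].

    Only the first `active` lanes are processed (all lanes by default).
    """
    m = len(v) if active is None else active
    x = list(v[:m])
    steps = 0
    d = 1
    while d <= m:
        prev = list(x)  # every combination in a step reads the pre-step values
        for i in range(d - 1, m):  # these combinations are independent (parallel)
            # Lane d-1 has its whole prefix by now, so it absorbs S in this step;
            # later lanes combine with the lane d positions to their left.
            left = s if i == d - 1 else prev[i - d]
            x[i] = op(left, prev[i])
        steps += 1
        d *= 2
    return x, steps


print(scan_with_extra_element([1, 2, 3, 4], 10))            # all four lanes active
print(scan_with_extra_element([1, 2, 3, 4], 10, active=2))  # two lanes active
```

    Because S is folded in during the doubling steps rather than in a final pass, a vector with fewer active lanes finishes in fewer steps: three steps for four active lanes here, two steps for two.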

    DATA PROCESSING APPARATUS AND METHOD FOR PERFORMING SEGMENTED OPERATIONS
    4.
    Type: Invention application
    Status: Granted

    Publication number: US20150227367A1

    Publication date: 2015-08-13

    Application number: US14175268

    Filing date: 2014-02-07

    Applicant: ARM LIMITED

    Abstract: A data processing apparatus and method are provided for performing segmented operations. The data processing apparatus comprises a vector register store for storing vector operands, and vector processing circuitry providing N lanes of parallel processing, and arranged to perform a segmented operation on up to N data elements provided by a specified vector operand, each data element being allocated to one of the N lanes. The up to N data elements form a plurality of segments, and performance of the segmented operation comprises performing a separate operation on the data elements of each segment, the separate operation involving interaction between the lanes containing the data elements of the associated segment. Predicate generation circuitry is responsive to a compute descriptor instruction specifying an input vector operand comprising a plurality of segment descriptors, to generate per lane predicate information used by the vector processing circuitry when performing the segmented operation to maintain a boundary between each of the plurality of segments. As a result, interaction between lanes containing data elements from different segments is prevented. This allows very effective utilisation of the lanes of parallel processing within the vector processing circuitry to be achieved.
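
    The role of the per-lane predicates can be shown with a small software model. This is a sketch under stated assumptions, not the claimed hardware: it assumes each segment descriptor is a segment length, that the predicate is a flag marking the first lane of each segment, and that the segmented operation is an inclusive scan evaluated sequentially rather than across parallel lanes. The names head_predicates and segmented_scan are hypothetical.

```python
import operator


def head_predicates(segment_lengths, n_lanes):
    """Turn segment lengths (assumed descriptor format) into per-lane head flags."""
    if any(length <= 0 for length in segment_lengths) or sum(segment_lengths) > n_lanes:
        raise ValueError("segments must be non-empty and fit within the lanes")
    heads = [False] * n_lanes
    lane = 0
    for length in segment_lengths:
        heads[lane] = True  # this lane starts a new segment
        lane += length
    return heads


def segmented_scan(values, heads, op=operator.add):
    """Inclusive scan that restarts at every lane whose head flag is set."""
    out = []
    for value, is_head in zip(values, heads):
        # A head lane never combines with its left neighbour, so no data
        # crosses a segment boundary.
        out.append(value if is_head or not out else op(out[-1], value))
    return out


heads = head_predicates([3, 2, 3], 8)
print(heads)
print(segmented_scan([1, 2, 3, 4, 5, 6, 7, 8], heads))
```

    All three segments share one eight-lane vector, which is how the approach keeps the lanes busy even when individual segments are short.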
