Method of rotating data in a plurality of processing elements
    1.
    Granted invention patent (status: in force)

    Publication number: US08135940B2

    Publication date: 2012-03-13

    Application number: US13048082

    Filing date: 2011-03-15

    Applicant: Mark Beaumont

    Inventor: Mark Beaumont

    IPC classification: G06F15/76

    CPC classification: G06F5/015

    Abstract: A method of rotating data in a plurality of processing elements comprises a plurality of shifting operations and a plurality of storing operations, with the shifting and storing operations coordinated to enable a three shears operation to be performed on the data. The plurality of storing operations is responsive to the processing elements' positions.
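
    Illustrative sketch (not taken from the patent): the abstract names a three shears operation but gives no shear factors. The Python below assumes the standard decomposition of a planar rotation by theta into an x-shear by -tan(theta/2), a y-shear by sin(theta), and the same x-shear again, and checks numerically that the product equals the rotation matrix. All function names are hypothetical.

```python
import math

def matmul(a, b):
    """Multiply two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def three_shear_matrix(theta):
    """Compose x-shear, y-shear, x-shear using the assumed standard factors."""
    t = -math.tan(theta / 2)          # x-shear factor (assumption)
    s = math.sin(theta)               # y-shear factor (assumption)
    shear_x = [[1.0, t], [0.0, 1.0]]
    shear_y = [[1.0, 0.0], [s, 1.0]]
    return matmul(shear_x, matmul(shear_y, shear_x))

def rotation_matrix(theta):
    """Ordinary counter-clockwise rotation matrix, used as the reference."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]

def max_abs_difference(theta):
    """Largest element-wise gap between the shear product and the rotation."""
    got, want = three_shear_matrix(theta), rotation_matrix(theta)
    return max(abs(got[i][j] - want[i][j]) for i in range(2) for j in range(2))
```

    For a quarter turn (theta = pi/2) the three factors are -1, 1 and -1, so each shear moves data by whole positions, which is the kind of step that shifting between neighbouring processing elements could carry out.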

    Method of obtaining interleave interval for two data values
    2.
    Granted invention patent (status: in force)

    Publication number: US07503046B2

    Publication date: 2009-03-10

    Application number: US10689280

    Filing date: 2003-10-20

    Applicant: Mark Beaumont

    Inventor: Mark Beaumont

    IPC classification: G06F9/46 G06F15/00

    CPC classification: G06F9/5066

    Abstract: A method of determining an interleave pattern for n lots of A and y lots of B, when n plus y equals a power of two such that the expression 2^z − n may be used to represent the value of y, includes generating a key including the reverse bit order of a serially indexed count from 0 to 2^z. An interleave pattern can be generated from the key in which all values less than n are replaced by A and all other values are replaced by B. The key can be used to generate a table that contains all possible combinations of values of A and B. The table can then be stored such that an interleave pattern can be automatically selected based on either the number of lots of A or the number of lots of B.
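
    Illustrative sketch (not taken from the patent): one way to read the abstract is that the key is the z-bit reversal of each index in a serial count, and that key entries below n become A while the rest become B. The Python below assumes the count runs over the 2^z indices 0 to 2^z − 1; the function names and the dictionary-based table are hypothetical.

```python
def bit_reverse(value, bits):
    """Reverse the lowest `bits` bits of `value`."""
    result = 0
    for _ in range(bits):
        result = (result << 1) | (value & 1)
        value >>= 1
    return result

def interleave_key(z):
    """Bit-reversed form of the serial count 0 .. 2**z - 1 (assumed range)."""
    return [bit_reverse(i, z) for i in range(2 ** z)]

def interleave_pattern(n, z, a="A", b="B"):
    """Replace key values below n with `a` and all other values with `b`."""
    return [a if k < n else b for k in interleave_key(z)]

def build_table(z):
    """One pattern per possible number of lots of A, indexed by n."""
    return {n: interleave_pattern(n, z) for n in range(2 ** z + 1)}
```

    With z = 3 the key is 0, 4, 2, 6, 1, 5, 3, 7, so n = 3 gives the pattern A B A B A B B B: three lots of A spread among five lots of B.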

    Method for load balancing a line of parallel processing elements
    3.
    Granted invention patent (status: in force)

    Publication number: US07430742B2

    Publication date: 2008-09-30

    Application number: US10689345

    Filing date: 2003-10-20

    Applicant: Mark Beaumont

    Inventor: Mark Beaumont

    IPC classification: G06F9/46

    CPC classification: G06F9/5088

    Abstract: A method for balancing the load of a parallel processing system having parallel processing elements (PEs) linked serially in a line with first and second ends, wherein each of the PEs has a local number of tasks associated therewith, the method comprising determining a total number of tasks present on the line, notifying each of the PEs of the total number of tasks, calculating a local mean number of tasks for each of the PEs, and calculating a local deviation for each of the PEs. The method also comprises determining a first local cumulative deviation for each of the PEs, determining a second local cumulative deviation for each of the PEs, and redistributing tasks among the PEs in response to the first local cumulative deviation and the second local cumulative deviation.
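
    Illustrative sketch (not taken from the patent): the abstract lists the quantities involved but not their formulas. The Python below assumes that the local mean is the integer share of the total (with the remainder given to the first PEs), that the local deviation is the task count minus that mean, and that the two cumulative deviations are the sums of deviations to the left and to the right of each PE. All names are hypothetical, and a non-empty line is assumed.

```python
def balance_line(tasks):
    """Return (means, left_cumulative, right_cumulative, balanced) for a line of PEs."""
    n = len(tasks)
    total = sum(tasks)                       # total number of tasks on the line
    base, extra = divmod(total, n)
    # Assumed rounding rule: the first `extra` PEs take one additional task.
    means = [base + (1 if i < extra else 0) for i in range(n)]
    deviations = [t - m for t, m in zip(tasks, means)]
    # Surplus held to the left of PE i: net tasks that enter PE i from the left.
    left = [sum(deviations[:i]) for i in range(n)]
    # Surplus held to the right of PE i: net tasks that enter PE i from the right.
    right = [sum(deviations[i + 1:]) for i in range(n)]
    # Negative values mean tasks leave PE i in that direction.
    balanced = [tasks[i] + left[i] + right[i] for i in range(n)]
    return means, left, right, balanced
```

    For the line 9, 1, 2, 0 the deviations are 6, -2, -1, -3, the left cumulative deviations are 0, 6, 4, 3, the right ones are -6, -4, -3, 0, and every PE ends with 3 tasks. Because the deviations sum to zero, each PE's net intake equals the negative of its own deviation, which is why it lands on its local mean under these assumptions.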

    Method for manipulating data in a group of processing elements to transpose the data using a memory stack
    4.
    Granted invention patent (status: in force)

    Publication number: US07263543B2

    Publication date: 2007-08-28

    Application number: US10689300

    Filing date: 2003-10-20

    Applicant: Mark Beaumont

    Inventor: Mark Beaumont

    IPC classification: G06F7/78

    Abstract: A method for transposing data in a plurality of processing elements is comprised of a plurality of shifting operations and a plurality of storing operations. The shifting and storing operations are coordinated to enable data to be stored along a diagonal of processing elements from a first direction or first pair of directions and to be output from the diagonal in a second direction or a second pair of directions perpendicular to the first pair of directions, respectively. The plurality of storing operations are responsive to the processing elements' positions. The first and second pairs of directions are selected from among the dimensions of the array, e.g., the +x/−x, +z/−z and +y/−y pairs of directions.

    System of rotating data in a plurality of processing elements
    5.
    Granted invention patent (status: in force)

    Publication number: US08856493B2

    Publication date: 2014-10-07

    Application number: US13372683

    Filing date: 2012-02-14

    Applicant: Mark Beaumont

    Inventor: Mark Beaumont

    IPC classification: G06F15/76 G06F5/01

    CPC classification: G06F5/015

    Abstract: A method of rotating data in a plurality of processing elements comprises a plurality of shifting operations and a plurality of storing operations, with the shifting and storing operations coordinated to enable a three shears operation to be performed on the data. The plurality of storing operations is responsive to the processing elements' positions.

    System Of Rotating Data In A Plurality Of Processing Elements
    6.
    Invention patent application (status: in force)

    Publication number: US20120144155A1

    Publication date: 2012-06-07

    Application number: US13372683

    Filing date: 2012-02-14

    Applicant: Mark Beaumont

    Inventor: Mark Beaumont

    CPC classification: G06F5/015

    Abstract: A method of rotating data in a plurality of processing elements comprises a plurality of shifting operations and a plurality of storing operations, with the shifting and storing operations coordinated to enable a three shears operation to be performed on the data. The plurality of storing operations is responsive to the processing elements' positions.

    Method for manipulating data in a group of processing elements to perform a reflection of the data
    7.
    Granted invention patent (status: in force)

    Publication number: US07930518B2

    Publication date: 2011-04-19

    Application number: US12695474

    Filing date: 2010-01-28

    Applicant: Mark Beaumont

    Inventor: Mark Beaumont

    IPC classification: G06F15/00

    Abstract: A method for generating a reflection of data in a plurality of processing elements comprises shifting the data along, for example, each row in the array until each processing element in the row has received all the data held by every other processing element in that row. Each processing element stores and outputs final data as a function of its position in the row. A similar reflection along a horizontal line can be achieved by shifting data along columns instead of rows. Also disclosed is a method for reflecting data in a matrix of processing elements about a vertical line comprising shifting data between processing elements arranged in rows. An initial count is set in each processing element according to the expression (2×Col_Index) MOD (array size). In one embodiment, a counter counts down from the initial count in each processing element as a function of the number of shifts that have been performed. Output is selected as a function of the current count. A similar reflection about a horizontal line can be achieved by shifting data between processing elements arranged in columns and setting the initial count according to the expression (2×Row_Index) MOD (array size). The present invention represents an efficient method for obtaining the reflection of data.
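
    Illustrative sketch (not taken from the patent): the abstract gives the initial count, (2×Col_Index) MOD (array size), but not the shift direction or the exact moment at which output is selected. The Python below assumes a ring shift by one position toward higher column indices with wraparound, and assumes that a processing element latches the value arriving on the shift during which its counter reads zero, before the counter is decremented. The function name is hypothetical.

```python
def reflect_row(row):
    """Reverse one row using only neighbour shifts and per-column counters."""
    n = len(row)
    # Initial count from the abstract: (2 * Col_Index) MOD (array size).
    counts = [(2 * col) % n for col in range(n)]
    moving = list(row)            # values travelling along the row
    result = [None] * n           # value each processing element finally stores
    for _ in range(n):
        # Assumed ring shift: column `col` receives from column `col - 1`.
        moving = [moving[(col - 1) % n] for col in range(n)]
        for col in range(n):
            if counts[col] == 0:  # assumed selection rule: latch when count is zero
                result[col] = moving[col]
            counts[col] -= 1      # count down once per shift
    return result
```

    After k shifts column c holds the value that started in column (c − k) MOD n, and under these assumptions the latch fires at k = (2c MOD n) + 1, which selects the value from column n − 1 − c, the mirror position. Applying the same routine to each column instead of each row would give the reflection about a horizontal line.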

    Method of rotating data in a plurality of processing elements
    8.
    Granted invention patent (status: in force)

    Publication number: US07913062B2

    Publication date: 2011-03-22

    Application number: US10689390

    Filing date: 2003-10-20

    Applicant: Mark Beaumont

    Inventor: Mark Beaumont

    IPC classification: G06F15/76

    CPC classification: G06F5/015

    Abstract: A method of rotating data in a plurality of processing elements comprises a plurality of shifting operations and a plurality of storing operations, with the shifting and storing operations coordinated to enable a three shears operation to be performed on the data. The plurality of storing operations is responsive to the processing elements' positions.

    Method for finding local extrema of a set of values for a parallel processing element
    9.
    Granted invention patent (status: in force)

    Publication number: US07454451B2

    Publication date: 2008-11-18

    Application number: US10689335

    Filing date: 2003-10-20

    Applicant: Mark Beaumont

    Inventor: Mark Beaumont

    IPC classification: G06F7/00

    Abstract: A method for finding a local extrema for a single processing element having a set of values associated therewith includes separating the set of values into an odd set of values and an even set of values, determining a first extrema from the odd set of values, determining a second extrema from the even set of values, and determining the local extrema from the first extrema and the second extrema. The first extrema is found by comparing each odd-numbered value in the set to each other odd-numbered value in the set and the second extrema is found by comparing each even-numbered value in the set to each other even-numbered value in the set.
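
    Illustrative sketch (not taken from the patent): the abstract splits the values into odd-numbered and even-numbered sets, finds an extremum in each, and combines the two. The Python below assumes values are numbered from 1 and uses the built-in max or min as a stand-in for the pairwise comparisons the abstract describes. The function name and the `pick` parameter are hypothetical, and a non-empty list is assumed.

```python
def local_extremum(values, pick=max):
    """Find the maximum (or minimum, with pick=min) via an odd/even split."""
    odd_set = values[0::2]    # 1st, 3rd, 5th, ... values (numbering from 1)
    even_set = values[1::2]   # 2nd, 4th, 6th, ... values
    candidates = [pick(odd_set)]            # first extremum, from the odd set
    if even_set:                            # a single value has no even set
        candidates.append(pick(even_set))   # second extremum, from the even set
    return pick(candidates)                 # local extremum of the whole set
```

    For the values 3, 9, 2, 7, 5 the odd set is 3, 2, 5 and the even set is 9, 7, so the two maxima are 5 and 9 and the local maximum is 9.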

    Method for using filtering to load balance a loop of parallel processing elements
    10.
    Granted invention patent (status: in force)

    Publication number: US07448038B2

    Publication date: 2008-11-04

    Application number: US10689355

    Filing date: 2003-10-20

    Applicant: Mark Beaumont

    Inventor: Mark Beaumont

    IPC classification: G06F9/46

    CPC classification: G06F9/5088

    Abstract: One aspect of the present invention relates to a method for balancing the load of a parallel processing system having a plurality of parallel processing elements arranged in a loop, wherein each processing element has a local number of tasks associated therewith. The method comprises determining within each processing element a total number of tasks present within the loop, calculating a local mean number of tasks within each processing element, assigning a weight to each of said plurality of processing elements, and calculating a local weighted deviation within each processing element. The method also comprises determining the sum weighted deviations within each processing element for one-half the loop in an anti-clockwise direction and in a clockwise direction, determining clockwise and anti-clockwise transfer parameters within each processing element, and redistributing tasks among the processing elements in response to the clockwise and anti-clockwise transfer parameters.
