Approximate order statistics of real numbers in generic data

    公开(公告)号:US09645975B2

    公开(公告)日:2017-05-09

    申请号:US14255981

    申请日:2014-04-18

    Applicant: Splunk Inc.

    Inventor: Steve Yu Zhang

    Abstract: A method, system, and processor-readable storage medium are directed towards calculating approximate order statistics on a collection of real numbers. In one embodiment, the collection of real numbers is processed to create a digest comprising hierarchy of buckets. Each bucket is assigned a real number N having P digits of precision and ordinality O. The hierarchy is defined by grouping buckets into levels, where each level contains all buckets of a given ordinality. Each individual bucket in the hierarchy defines a range of numbers—all numbers that, after being truncated to that bucket's P digits of precision, are equal to that bucket's N. Each bucket additionally maintains a count of how many numbers have fallen within that bucket's range. Approximate order statistics may then be calculated by traversing the hierarchy and performing an operation on some or all of the ranges and counts associated with each bucket.

    Analog Sorter
    45.
    发明申请
    Analog Sorter 审中-公开

    公开(公告)号:US20170116262A1

    公开(公告)日:2017-04-27

    申请号:US14924005

    申请日:2015-10-27

    CPC classification number: G06F7/22

    Abstract: A list of digital elements to be sorted are converted to a group of analog signals. The group of analog signals are simultaneously compared to each other to determine the largest analog signal in the group. The largest analog signal is then compared to each of the analog signals in the group to determine which one or more of the analog signals in the group matches the largest analog signal. The matching one or more of the analog signals is removed from the group and the process is repeated until the group of analog signals have been sorted.

    INSTRUCTION AND LOGIC TO PROVIDE VECTOR HORIZONTAL MAJORITY VOTING FUNCTIONALITY
    46.
    发明申请
    INSTRUCTION AND LOGIC TO PROVIDE VECTOR HORIZONTAL MAJORITY VOTING FUNCTIONALITY 审中-公开
    指令和逻辑提供向量水平主要投票功能

    公开(公告)号:US20170003962A1

    公开(公告)日:2017-01-05

    申请号:US15267668

    申请日:2016-09-16

    Abstract: Instructions and logic provide vector horizontal majority voting functionality. Some embodiments, responsive to an instruction specifying: a destination operand, a size of the vector elements, a source operand, and a mask corresponding to a portion of the vector element data fields in the source operand; read a number of values from data fields of the specified size in the source operand, corresponding to the mask specified by the instruction and store a result value to that number of corresponding data fields in the destination operand, the result value computed from the majority of values read from the number of data fields of the source operand.

    Abstract translation: 指令和逻辑提供向量横向多数投票功能。 一些实施例,响应于指定目的地操作数,向量元素的大小,源操作数和对应于源操作数中的向量元素数据字段的一部分的掩码的指令; 从源操作数中的指定大小的数据字段读取一些数值,对应于指令指定的掩码,并将结果值存储到目标操作数中的相应数据字段数,从大多数 从源操作数的数据字段数读取的值。

    EFFICIENT SORTING OF LARGE DATA SET WITH DUPLICATE VALUES

    公开(公告)号:US20160378832A1

    公开(公告)日:2016-12-29

    申请号:US15191893

    申请日:2016-06-24

    Inventor: Yuke ZHUGE

    Abstract: Techniques are disclosed for sorting an input data set. A sort tool determines a distribution of values of a data set that includes a plurality of data records. The sort tool partitions the data set into a plurality of subsets based on the distribution. Each of the data records is inserted into one of the subsets based on a corresponding sort value of the data record. The sort tool identifies one or more of the subsets that contain at least two distinct sort values. In each of the identified subsets, the data records are sorted by a corresponding sort value of the data record.

    EFFICIENT SORTING OF LARGE DATA SET WITH DUPLICATE VALUES
    48.
    发明申请
    EFFICIENT SORTING OF LARGE DATA SET WITH DUPLICATE VALUES 有权
    大量数据的高效分配与重复值

    公开(公告)号:US20160378801A1

    公开(公告)日:2016-12-29

    申请号:US14750385

    申请日:2015-06-25

    Inventor: Yuke ZHUGE

    Abstract: Techniques are disclosed for sorting an input data set. A sort tool determines a distribution of values of a data set that includes a plurality of data records. The sort tool partitions the data set into a plurality of subsets based on the distribution. Each of the data records is inserted into one of the subsets based on a corresponding sort value of the data record. The sort tool identifies one or more of the subsets that contain at least two distinct sort values. In each of the identified subsets, the data records are sorted by a corresponding sort value of the data record.

    Abstract translation: 公开了用于排序输入数据集的技术。 排序工具确定包括多个数据记录的数据集的值的分布。 排序工具基于分布将数据集分成多个子集。 基于数据记录的相应分类值,将每个数据记录插入到一​​个子集中。 排序工具识别一个或多个包含至少两个不同排序值的子集。 在每个识别的子集中,数据记录按照数据记录的相应排序值排序。

    Fast approach to finding minimum and maximum values in a large data set using SIMD instruction set architecture
    49.
    发明授权
    Fast approach to finding minimum and maximum values in a large data set using SIMD instruction set architecture 有权
    使用SIMD指令集架构在大型数据集中找到最小值和最大值的快速方法

    公开(公告)号:US09152663B2

    公开(公告)日:2015-10-06

    申请号:US13853589

    申请日:2013-03-29

    CPC classification number: G06F17/30336 G06F7/22

    Abstract: Systems and methods may determine a boundary value data unit in a large data set in parallel with determining an associated index of the determined boundary value data unit into the large data set using a single instruction multiple data (SIMD) instruction set architecture and a specialized data layout of array entries. In one example, the specialized data layout of array entries combines a data value and its associated index to an array into a single array entry.

    Abstract translation: 系统和方法可以使用单个指令多数据(SIMD)指令集架构和专用数据来确定大数据集中的边界值数据单元并行确定所确定的边界值数据单元到大数据集中的相关索引 阵列条目的布局。 在一个示例中,阵列条目的专门的数据布局将数据值及其相关联的索引组合到一个阵列条目中。

    TUNABLE HARDWARE SORT ENGINE FOR PERFORMING COMPOSITE SORTING ALGORITHMS
    50.
    发明申请
    TUNABLE HARDWARE SORT ENGINE FOR PERFORMING COMPOSITE SORTING ALGORITHMS 有权
    用于执行复合分选算法的TUNABLE HARDWARE SORT发动机

    公开(公告)号:US20150046476A1

    公开(公告)日:2015-02-12

    申请号:US14018650

    申请日:2013-09-05

    Abstract: Embodiments include methods, systems and computer program products for performing a composite sort on a tunable hardware sort engine includes determining desired sort performance parameters, configuring a composite sort engine based on the desired sort performance parameters, and receiving a plurality of keys having a payload associated with each of the plurality of keys. The method also includes reserving DRAM storage for each of the payloads, generating a tag for each of the plurality of keys, the tag identifying the DRAM storage reserved for each of the payloads, and storing the payloads in the portions of the DRAM storage. The method further includes generating a composite key for each of the plurality of keys, sorting the composite keys by the composite sort engine, and retrieving the payloads associated with the sorted composite keys from the DRAM storage. The method also includes outputting the payloads associated the sorted composite keys.

    Abstract translation: 实施例包括用于在可调谐硬件排序引擎上执行复合排序的方法,系统和计算机程序产品,包括确定期望的排序性能参数,基于所需排序性能参数配置复合排序引擎,以及接收具有有效载荷的多个键 与多个键中的每一个。 该方法还包括为每个有效负载保留DRAM存储,为多个密钥中的每一个生成标签,该标签标识为每个有效载荷保留的DRAM存储器,以及将有效载荷存储在DRAM存储器的部分中。 所述方法还包括为所述多个密钥中的每一个生成复合密钥,由所述复合排序引擎对所述复合密钥进行排序,以及从所述DRAM存储器检索与所述排序的复合密钥相关联的有效载荷。 该方法还包括输出与排序的复合密钥相关联的有效载荷。

Patent Agency Ranking