Method for processing a database query

    公开(公告)号:US09959323B2

    公开(公告)日:2018-05-01

    申请号:US15064662

    申请日:2016-03-09

    CPC classification number: G06F17/30536 G06F17/30424 G06F17/30864

    Abstract: The invention relates to a computer-implemented method for processing a query in a database, the query comprising a search value. The database comprises a plurality of datasets the datasets comprising entries, wherein distance statistics are assigned to the datasets. The distance statistics describe the minimum and maximum distance between the values of the entries of a dataset of the plurality of datasets and a reference value. The method comprises determining the distance between the search value and the reference value, said determination resulting in a search distance, determining a subset of datasets from the plurality of datasets for which the search distance is within the limits given by the minimum and maximum distances described by the respective distance statistics, and searching for the search value in the subset of datasets.

    APPROXIMATE STRING MATCHING OPTIMIZATION FOR A DATABASE

    公开(公告)号:US20170228472A1

    公开(公告)日:2017-08-10

    申请号:US15494874

    申请日:2017-04-24

    CPC classification number: G06F17/30442 G06F17/30536

    Abstract: Software for processing a database query that includes: (i) receiving a query of a database including a search value; (ii) determining a distance between the search value and at least one reference value; (iii) determining a maximum distance from the search value to be used in searching a plurality of datasets of the database, wherein the maximum distance from the search value defines a search range and is based, at least in part, on the determined distance between the search value and the at least one reference value; (iv) determining a subset of datasets from the plurality of datasets that includes datasets for which a data range with respect to each reference value overlaps with the search range; and (v) performing approximate string matching for the search value on the subset of datasets.

    Efficient Calculation and Organization of Approximate Order Statistics of Real Numbers

    公开(公告)号:US20170206185A1

    公开(公告)日:2017-07-20

    申请号:US15476899

    申请日:2017-03-31

    Applicant: Splunk Inc.

    Inventor: Steve Yu Zhang

    Abstract: A method, system, and processor-readable storage medium are directed towards calculating approximate order statistics on a collection of real numbers. In one embodiment, the collection of real numbers is processed to create a digest comprising hierarchy of buckets. Each bucket is assigned a real number N having P digits of precision and ordinality O. The hierarchy is defined by grouping buckets into levels, where each level contains all buckets of a given ordinality. Each individual bucket in the hierarchy defines a range of numbers—all numbers that, after being truncated to that bucket's P digits of precision, are equal to that bucket's N. Each bucket additionally maintains a count of how many numbers have fallen within that bucket's range. Approximate order statistics may then be calculated by traversing the hierarchy and performing an operation on some or all of the ranges and counts associated with each bucket.

Patent Agency Ranking