Indexing partitions using distributed bloom filters

    Publication No.: US11531666B1

    Publication date: 2022-12-20

    Application No.: US16998922

    Filing date: 2020-08-20

    Abstract: Methods, systems, and computer-readable media for indexing partitions using distributed Bloom filters are disclosed. A data indexing system generates a plurality of indices for a plurality of partitions in a distributed object store. The indices comprise a plurality of Bloom filters. An individual one of the Bloom filters corresponds to one or more fields of an individual one of the partitions. Using the Bloom filters, the data indexing system determines a first portion of the partitions that possibly comprise a value and a second portion of the partitions that do not comprise the value. Based (at least in part) on a scan of the first portion of the partitions and not the second portion of the partitions, the data indexing system determines one or more partitions of the first portion of the partitions that comprise the value.
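The two-phase lookup described in this abstract (filter partitions by Bloom filter, then scan only the candidates) can be illustrated with a minimal sketch. This is not the patented implementation: the `BloomFilter` class, the salted SHA-256 hashing, the filter sizes, and the in-memory `partitions` dictionary are all assumptions chosen for illustration.

```python
import hashlib
from typing import Dict, Iterable, List


class BloomFilter:
    """Minimal Bloom filter: never a false negative, sometimes a false positive."""

    def __init__(self, num_bits: int = 1024, num_hashes: int = 3) -> None:
        self.num_bits = num_bits      # assumed size, not taken from the patent
        self.num_hashes = num_hashes  # assumed hash count
        self.bits = 0                 # a Python int used as a bit array

    def _positions(self, value: str) -> Iterable[int]:
        # Derive num_hashes bit positions by salting SHA-256 with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{value}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, value: str) -> None:
        for pos in self._positions(value):
            self.bits |= 1 << pos

    def might_contain(self, value: str) -> bool:
        return all((self.bits >> pos) & 1 for pos in self._positions(value))


def build_indices(partitions: Dict[str, List[dict]], field: str) -> Dict[str, BloomFilter]:
    """Build one Bloom filter per partition over a single field."""
    indices: Dict[str, BloomFilter] = {}
    for name, records in partitions.items():
        bloom = BloomFilter()
        for record in records:
            bloom.add(str(record[field]))
        indices[name] = bloom
    return indices


def find_partitions(
    partitions: Dict[str, List[dict]],
    indices: Dict[str, BloomFilter],
    field: str,
    value: str,
) -> List[str]:
    """Return the partitions that contain value, scanning only the candidates."""
    # Phase 1: partitions whose filter says "possibly present".
    candidates = [name for name, bloom in indices.items() if bloom.might_contain(value)]
    # Phase 2: scan the candidates to discard Bloom filter false positives.
    return [
        name
        for name in candidates
        if any(str(record[field]) == value for record in partitions[name])
    ]


# Hypothetical partitions keyed by name; "user_id" is an illustrative field.
partitions = {
    "p0": [{"user_id": "alice"}, {"user_id": "bob"}],
    "p1": [{"user_id": "carol"}],
    "p2": [{"user_id": "alice"}],
}
indices = build_indices(partitions, "user_id")
matches = find_partitions(partitions, indices, "user_id", "alice")
```

Because a Bloom filter cannot produce false negatives, skipping the partitions it rejects never loses a match; the scan in the second phase only has to remove false positives.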

Distributed data validation service

    Publication No.: US10248508B1

    Publication date: 2019-04-02

    Application No.: US14310429

    Filing date: 2014-06-20

    Abstract: A data validation service may validate data sets maintained for one or more data sources. Several rule sets may describe various rules used to validate one or more data sets. The rule sets may be automatically applied to respective data sets in order to validate the respective data sets according to a dynamically determined schedule for the application of the rule sets. Reporting events may be detected which correspond to a rule set. In response to detecting a reporting event, a responsive action may be performed as described in the rule set, such as providing notification of the reporting event.
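A rough sketch of the rule-set flow in this abstract follows: rules are applied to a data set, a reporting event is detected, and a responsive action runs. The `Rule` and `RuleSet` types, the `max_failures` threshold used to define a reporting event, and the callback-style notification are all hypothetical; the dynamically determined schedule mentioned in the abstract is left out.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Record = Dict[str, object]


@dataclass
class Rule:
    name: str
    check: Callable[[Record], bool]  # returns True when the record is valid


@dataclass
class RuleSet:
    rules: List[Rule]
    max_failures: int                 # hypothetical threshold for a reporting event
    on_report: Callable[[str], None]  # responsive action, e.g. send a notification


def validate(data_set: List[Record], rule_set: RuleSet) -> Dict[str, int]:
    """Apply every rule to every record and return failure counts per rule."""
    failures = {rule.name: 0 for rule in rule_set.rules}
    for record in data_set:
        for rule in rule_set.rules:
            if not rule.check(record):
                failures[rule.name] += 1
    # A reporting event here means a rule exceeded the allowed failure count.
    for name, count in failures.items():
        if count > rule_set.max_failures:
            rule_set.on_report(f"rule '{name}' failed {count} times")
    return failures


# Illustrative usage: collect notifications in a list instead of sending them.
notifications: List[str] = []
rule_set = RuleSet(
    rules=[Rule("non_negative_price", lambda record: record["price"] >= 0)],
    max_failures=0,
    on_report=notifications.append,
)
result = validate([{"price": 5}, {"price": -1}], rule_set)
```

Keeping the responsive action as a callable on the rule set mirrors the abstract's statement that the action is "described in the rule set", though a real service would likely describe it declaratively.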
