Row secure table plan generation
    Granted invention patent

    Publication No.: US11868349B2

    Publication date: 2024-01-09

    Application No.: US16867228

    Filing date: 2020-05-05

    IPC classification: G06F16/2453 G06F21/62

    CPC classification: G06F16/24544 G06F21/6227

    Abstract: Methods, computer program products, and systems are presented. The methods, computer program products, and systems can include, for instance: identifying an invoked database query for execution on a database, the invoked database query being associated with a user; generating an execution plan for executing the database query on the database, wherein generating the execution plan includes establishing an ordering of first and second tables, the ordering depending on an access privilege attribute of the user with respect to the first table; and executing the database query according to the execution plan.

    Method for migrating data records from a source database to a target database

    Publication No.: US10884998B2

    Publication date: 2021-01-05

    Application No.: US16131101

    Filing date: 2018-09-14

    Abstract: The present disclosure relates to a method using a database engine for migrating data records from a source database to a target database, where said data records are arranged in a sorted source table of the source database in accordance with a clustered-base-table (CBT) order following the Hilbert-Filling-Curve algorithm. The method comprises configuring the database engine for storing the CBT order of each data record of the source table in a reference table for said source table or in a column of said source table. A reading step may be performed to read said data records from said source table, wherein in said reading step said stored CBT order is used. The read data and the stored CBT order may be transferred to the target database in accordance with the stored CBT order. The transferred data may be written at the target database in accordance with the CBT order.
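
The ordering step in this abstract can be illustrated with a small sketch. It assumes two-dimensional integer keys on a power-of-two grid and uses the textbook Hilbert curve index. The names `hilbert_index`, `migrate`, and `cbt_order`, and the row layout, are hypothetical and not taken from the patent.

```python
def hilbert_index(n, x, y):
    """Position of cell (x, y) along a Hilbert curve over an n x n grid.

    n must be a power of two and 0 <= x, y < n.
    """
    d = 0
    s = n // 2
    while s > 0:
        rx = 1 if x & s else 0
        ry = 1 if y & s else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate/flip the quadrant so the sub-curve has the standard orientation.
        if ry == 0:
            if rx == 1:
                x, y = n - 1 - x, n - 1 - y
            x, y = y, x
        s //= 2
    return d


def migrate(source_rows, n):
    """Store the CBT order with each row, then read and write rows in that order."""
    # Assumption: each row is a dict with integer keys "x" and "y".
    with_order = [dict(row, cbt_order=hilbert_index(n, row["x"], row["y"]))
                  for row in source_rows]
    # Reading in the stored order means the target receives rows already
    # sorted, so it can append them without re-clustering.
    target_table = []
    for row in sorted(with_order, key=lambda r: r["cbt_order"]):
        target_table.append(row)
    return target_table
```

Storing the order as an extra column, as done here, corresponds to one of the two storage options named in the abstract; a separate reference table keyed by row identifier would serve the same purpose.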

    Efficient processing of data extents

    Publication No.: US10776354B2

    Publication date: 2020-09-15

    Application No.: US15833244

    Filing date: 2017-12-06

    Abstract: The present disclosure relates to a computer-implemented method, computer program product, and computer system for optimizing query processing on a set of data extents on which a table is stored. Attribute value information may be maintained for each data extent. The attribute value information indicates, as a range, the minimum and maximum values of an attribute of the entries stored in the respective extent. A first metric of a first data extent of the set may determine that splitting the first data extent into sub-extents increases query processing efficiency. A second metric of a second data extent and a third data extent may determine that merging the second data extent and the third data extent increases query processing efficiency.
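
A minimal sketch of per-extent minimum/maximum information and why splitting can help is given here. The abstract does not specify the split and merge metrics, so none is modeled; the `Extent` class and the helper names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Extent:
    values: list
    lo: float = field(init=False)  # minimum attribute value in the extent
    hi: float = field(init=False)  # maximum attribute value in the extent

    def __post_init__(self):
        self.lo, self.hi = min(self.values), max(self.values)


def extents_to_scan(extents, q_lo, q_hi):
    """Return the extents whose [lo, hi] range overlaps the query range."""
    return [e for e in extents if e.lo <= q_hi and e.hi >= q_lo]


def split(extent):
    """Split an extent at the median so each half covers a narrower range."""
    ordered = sorted(extent.values)
    mid = len(ordered) // 2
    return Extent(ordered[:mid]), Extent(ordered[mid:])


def merge(a, b):
    """Combine two extents into one with a recomputed value range."""
    return Extent(a.values + b.values)
```

For example, an extent holding `[1, 2, 99, 100]` must be scanned for a query on the range 40 to 60, while its two halves after `split` can both be skipped.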

    Joining two data tables on a join attribute

    Publication No.: US10380112B2

    Publication date: 2019-08-13

    Application No.: US15663896

    Filing date: 2017-07-31

    IPC classification: G06F16/00 G06F16/2453

    Abstract: The present disclosure relates to a computer-implemented method for joining two data tables on a join attribute. The data tables have at least a first and a second attribute. The second attribute is the join attribute. The method includes providing a function for associating a computing node to a given record. The function may be used to determine the associated computing node. The records of the two data tables may be distributed to the respective determined computing nodes. The relationship between the values of the first and second attributes may be modelled using a predefined dataset. For each record of the two data tables, the values of the first attribute may be re-determined using the corresponding values of the second attribute. The function may be used to re-determine the associated computing node.

    Providing multidimensional attribute value information

    Publication No.: US10360240B2

    Publication date: 2019-07-23

    Application No.: US15230509

    Filing date: 2016-08-08

    IPC classification: G06F17/30 G06F16/28

    Abstract: The invention relates to a method, computer program product, and computer system for providing attribute value information for a data extent comprising a set of data entries. For each multidimensional reference point of a set of one or more multidimensional reference points, the method comprises: calculating for each multidimensional data entry a reference-point-specific distance between the respective multidimensional data entry and the multidimensional reference point, resulting in a set of reference-point-specific distances for the data extent, the respective reference-point-specific distance being calculated using a combination of the attribute values of the multidimensional data entry and a combination of the reference attribute values of the respective multidimensional reference point; determining a minimum reference-point-specific distance and a maximum reference-point-specific distance of the set of reference-point-specific distances; and storing for the data extent, as attribute value information for further use with query processing, the minimum reference-point-specific distance and the maximum reference-point-specific distance.
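
The distance bookkeeping described in this abstract can be sketched as follows. Euclidean distance is an assumption, since the abstract only requires some combination of attribute values. The pruning check in `may_contain` is one possible use in query processing, based on the triangle inequality, and is not claimed to be the patented method. All function names are hypothetical.

```python
import math


def distance_range(entries, reference_point):
    """Minimum and maximum distance from an extent's entries to a reference point."""
    distances = [math.dist(entry, reference_point) for entry in entries]
    return min(distances), max(distances)


def may_contain(dist_range, reference_point, query_point, radius):
    """Return False only if no entry can lie within `radius` of `query_point`."""
    d_min, d_max = dist_range
    d_query = math.dist(query_point, reference_point)
    # Triangle inequality: an entry within `radius` of the query point has a
    # distance to the reference point in [d_query - radius, d_query + radius].
    return d_query - radius <= d_max and d_query + radius >= d_min
```

With entries `(3, 4)` and `(6, 8)` and the origin as reference point, the stored range is 5 to 10, so a query for points within distance 1 of the origin can skip the extent without reading it.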

    Storing data records
    Granted invention patent

    Publication No.: US10353885B2

    Publication date: 2019-07-16

    Application No.: US15223119

    Filing date: 2016-07-29

    IPC classification: G06F17/30 G06F16/22

    Abstract: Embodiments of the present invention provide a method, computer program product, and computer system for storing data records in extents. According to one embodiment, a data record comprising an attribute value is received. One or more data records stored in a first extent are identified, wherein the stored data records in the first extent have at least one attribute value. The attribute value of the received data record is compared to the attribute values of the identified data records stored in the first extent. It is then determined whether to store the received data record in the first extent. Responsive to determining not to store the received data record in the first extent, the received data record is stored in a second extent. If the received data record is stored in the second extent, attribute value information of the second extent is determined.

    Selectivity estimation for query execution planning in a database

    Publication No.: US10162860B2

    Publication date: 2018-12-25

    Application No.: US14517964

    Filing date: 2014-10-20

    IPC classification: G06F17/30

    Abstract: A computer-implemented method of estimating selectivity of a query may include generating, for data stored in a database in a memory, a one-dimensional value distribution for each of a plurality of attributes of the data. A multidimensional histogram may be generated, wherein the multidimensional histogram includes the one-dimensional value distributions for the plurality of attributes of the data. The multidimensional histogram may be converted to a one-dimensional histogram by assigning each bucket of the multidimensional histogram to corresponding buckets of the one-dimensional histogram and ordering the corresponding buckets according to a space-filling curve. One or more bucket ranges of the one-dimensional histogram may be determined by mapping the query conditions on the one-dimensional histogram. The selectivity of the query may be estimated by estimating how many data values in the one or more bucket ranges will meet the query conditions.
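
A simplified sketch of the histogram conversion follows. It assumes two attributes, an equal-width grid over a shared domain `[lo, hi)`, a Z-order (Morton) curve as the space-filling curve, and uniformly distributed values inside each bucket. None of these choices is stated in the abstract, and all names are hypothetical.

```python
def morton(ix, iy, bits):
    """Interleave the bits of (ix, iy) to get a Z-order curve position."""
    z = 0
    for b in range(bits):
        z |= ((ix >> b) & 1) << (2 * b)
        z |= ((iy >> b) & 1) << (2 * b + 1)
    return z


def build_histogram(points, bits, lo, hi):
    """Bucket 2-D points on a grid and store the counts in curve order."""
    side = 1 << bits
    width = (hi - lo) / side
    hist = [0] * (side * side)
    for x, y in points:
        ix = min(int((x - lo) / width), side - 1)
        iy = min(int((y - lo) / width), side - 1)
        hist[morton(ix, iy, bits)] += 1
    return hist


def overlap(b_lo, b_hi, q_lo, q_hi):
    """Length of the intersection of a bucket interval and a query interval."""
    return max(0.0, min(b_hi, q_hi) - max(b_lo, q_lo))


def estimate_selectivity(hist, bits, lo, hi, x_range, y_range):
    """Return (bucket ranges on the 1-D histogram, estimated selectivity)."""
    side = 1 << bits
    width = (hi - lo) / side
    hits = {}  # curve position -> fraction of the bucket covered by the query
    for ix in range(side):
        for iy in range(side):
            fx = overlap(lo + ix * width, lo + (ix + 1) * width, *x_range) / width
            fy = overlap(lo + iy * width, lo + (iy + 1) * width, *y_range) / width
            if fx * fy > 0:
                hits[morton(ix, iy, bits)] = fx * fy
    # Collapse consecutive curve positions into [start, end] bucket ranges.
    ranges = []
    for pos in sorted(hits):
        if ranges and ranges[-1][1] == pos - 1:
            ranges[-1][1] = pos
        else:
            ranges.append([pos, pos])
    # Uniformity assumption: a partly covered bucket contributes proportionally.
    expected = sum(hist[pos] * frac for pos, frac in hits.items())
    total = sum(hist)
    return ranges, (expected / total if total else 0.0)
```

The returned bucket ranges show how a rectangular query condition maps to one or more runs of positions on the one-dimensional histogram; a curve with better locality, such as a Hilbert curve, would tend to produce fewer and longer runs.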

    Approximate string matching optimization for a database

    Publication No.: US10095808B2

    Publication date: 2018-10-09

    Application No.: US15494874

    Filing date: 2017-04-24

    IPC classification: G06F17/30

    Abstract: Software for processing a database query that includes: (i) receiving a query of a database including a search value; (ii) determining a distance between the search value and at least one reference value; (iii) determining a maximum distance from the search value to be used in searching a plurality of datasets of the database, wherein the maximum distance from the search value defines a search range and is based, at least in part, on the determined distance between the search value and the at least one reference value; (iv) determining a subset of datasets from the plurality of datasets that includes datasets for which a data range with respect to each reference value overlaps with the search range; and (v) performing approximate string matching for the search value on the subset of datasets.
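
Steps (ii) through (v) can be sketched under stated assumptions: Levenshtein edit distance as the distance measure, a caller-supplied tolerance `k`, and a search range of `d(search, reference) ± k` per reference value. The abstract does not fix any of these, and the function names are hypothetical.

```python
def levenshtein(a, b):
    """Classic dynamic-programming edit distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]


def summarize(dataset, references):
    """Per reference value, the (min, max) distance of the dataset's strings."""
    summary = []
    for ref in references:
        distances = [levenshtein(s, ref) for s in dataset]
        summary.append((min(distances), max(distances)))
    return summary


def candidate_datasets(summaries, references, search, k):
    """Indexes of datasets whose data range overlaps the search range for every reference."""
    search_dists = [levenshtein(search, ref) for ref in references]
    selected = []
    for idx, summary in enumerate(summaries):
        # Edit distance is a metric, so a string within k of `search` has a
        # distance to each reference in [d - k, d + k].
        if all(d - k <= d_max and d + k >= d_min
               for d, (d_min, d_max) in zip(search_dists, summary)):
            selected.append(idx)
    return selected


def search_datasets(datasets, summaries, references, search, k):
    """Run approximate matching only on the datasets that survive pruning."""
    return [s
            for idx in candidate_datasets(summaries, references, search, k)
            for s in datasets[idx]
            if levenshtein(search, s) <= k]
```

Because the pruning relies only on the triangle inequality, it never discards a dataset that contains a true match; it only avoids running the expensive comparison on datasets that cannot contain one.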