Joining two data tables on a join attribute

    公开(公告)号:US11163769B2

    公开(公告)日:2021-11-02

    申请号:US16443958

    申请日:2019-06-18

    Abstract: A computer-implemented method for joining two data tables on a join attribute, where the data tables have at least a first and a second attribute and the second attribute is the join attribute. The method provides a function for associating a computing node to a given record. The function may be used to determine the associated computing node. The records of the two data tables may be distributed to the respective determined computing nodes. The relationship between the values of the first and second attributes may be modelled using a predefined dataset. For each record of the two data tables the values of the first attribute may be re-determined using the corresponding values of the second attribute. The function may be used to re-determine the associated computing node.

    Data warehouse single-row operation optimization

    公开(公告)号:US10353924B2

    公开(公告)日:2019-07-16

    申请号:US14945768

    申请日:2015-11-19

    Abstract: An approach for optimizing single-row operations in a data warehouse. Single-row operations are determined based on receiving database operations. Extends identifiers are received based on the single-row operations. Single-row usage statistics are stored in extends lists stores where single-row usage statistics include at least one of a hit count and a last hit date. Single-row usage statistics are maintained for extends identifiers based on single-row operations. A logical sequence of the extends identifiers in extends lists stores are sorted based on single-row usage statistics and the logical sequence is maintained based on determining a further single-row operations.

    Storing data records
    6.
    发明授权

    公开(公告)号:US10318513B2

    公开(公告)日:2019-06-11

    申请号:US15831535

    申请日:2017-12-05

    Abstract: Embodiments of the present invention provide a method, computer program product, and a computer system for storing data records in extents. According to one embodiment a data record comprising an attribute value is received. One or more data records stored in a first extent, are identified, wherein the stored one or more data records in the first extent have at least one attribute value. The attribute value of the received data record is compared to the attribute values of the identified data records stored in the first extent. It is then determined whether to store the received data record in the first extent. Responsive to determining, not to store the received data record in the first extent, the received data record is stored in a second extent. If the first received data record is stored in a second extent, determining, an attribute value information of the second extent.

    Method for processing a database query

    公开(公告)号:US09953065B2

    公开(公告)日:2018-04-24

    申请号:US14621466

    申请日:2015-02-13

    CPC classification number: G06F17/30536 G06F17/30424 G06F17/30864

    Abstract: The invention relates to a computer-implemented method for processing a query in a database, the query comprising a search value. The database comprises a plurality of datasets the datasets comprising entries, wherein distance statistics are assigned to the datasets. The distance statistics describe the minimum and maximum distance between the values of the entries of a dataset of the plurality of datasets and a reference value. The method comprises determining the distance between the search value and the reference value, said determination resulting in a search distance, determining a subset of datasets from the plurality of datasets for which the search distance is within the limits given by the minimum and maximum distances described by the respective distance statistics, and searching for the search value in the subset of datasets.

    EFFICIENT PROCESSING OF DATA EXTENTS
    8.
    发明申请

    公开(公告)号:US20180060386A1

    公开(公告)日:2018-03-01

    申请号:US15249509

    申请日:2016-08-29

    CPC classification number: G06F17/30448 G06F17/30395

    Abstract: The present disclosure relates to a computer-implemented method, computer program product, and computer system, for optimization of query processing a set of data extents on which a table is stored. Attribute value information may be maintained for each data extent. The attribute value information indicate as ranges the minimum and maximum values of an attribute of the entries stored in the respective extent. A first metric of a first data extent of the set may determine splitting the first data extent into sub-extents increases query processing efficiency. A second metric of a second data extent and a third data extent may determine merging the second data extent and the third data extent increases query processing efficiency.

Patent Agency Ranking