-
Publication No.: US11163769B2
Publication date: 2021-11-02
Application No.: US16443958
Filing date: 2019-06-18
Applicant: International Business Machines Corporation
Inventor: Michal Bodziony , Konrad K. Skibski , Tomasz Kazalski , Artur M. Gruszecki , Lukasz Gaza
IPC: G06F16/00 , G06F16/2453
Abstract: A computer-implemented method for joining two data tables on a join attribute, where the data tables have at least a first and a second attribute and the second attribute is the join attribute. The method provides a function for associating a computing node to a given record. The function may be used to determine the associated computing node. The records of the two data tables may be distributed to the respective determined computing nodes. The relationship between the values of the first and second attributes may be modelled using a predefined dataset. For each record of the two data tables the values of the first attribute may be re-determined using the corresponding values of the second attribute. The function may be used to re-determine the associated computing node.
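The re-distribution step described in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the field names `dist_key` (first attribute) and `join_key` (second attribute), the modulo distribution function, and the dictionary used as the "model" are all assumptions made for the example.

```python
from collections import defaultdict

NUM_NODES = 4  # assumed cluster size


def node_for(first_value):
    """Hypothetical distribution function: first attribute -> node."""
    return first_value % NUM_NODES


def build_model(sample):
    """Model the join_key -> dist_key relationship from a sample dataset."""
    model = {}
    for rec in sample:
        model.setdefault(rec["join_key"], rec["dist_key"])
    return model


def redistribute(table, model):
    """Re-determine the first attribute from the join attribute, then the node."""
    nodes = defaultdict(list)
    for rec in table:
        # Fall back to the join value itself when the model has no entry.
        predicted = model.get(rec["join_key"], rec["join_key"])
        nodes[node_for(predicted)].append(rec)
    return nodes


def local_join(left_nodes, right_nodes):
    """Join node by node; no record has to move between nodes."""
    out = []
    for node in range(NUM_NODES):
        by_key = defaultdict(list)
        for r in right_nodes.get(node, []):
            by_key[r["join_key"]].append(r)
        for l in left_nodes.get(node, []):
            for r in by_key.get(l["join_key"], []):
                out.append((l["id"], r["id"]))
    return sorted(out)
```

Because the node is now a function of the join attribute alone, records with equal join values from both tables land on the same node, which is what allows the join to run locally.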
-
Publication No.: US10929434B2
Publication date: 2021-02-23
Application No.: US16424548
Filing date: 2019-05-29
Applicant: International Business Machines Corporation
Inventor: Piotr L. Kania , Bartlomiej T. Malecki , Michal S. Paluch , Konrad K. Skibski
Abstract: An approach for optimizing single-row operations in a data warehouse. Single-row operations are determined based on receiving database operations. Extends identifiers are received based on the single-row operations. Single-row usage statistics are maintained for the extends identifiers based on the single-row operations. A logical sequence of the extends identifiers in extends lists stores is sorted based on the single-row usage statistics, and the logical sequence is maintained based on determining further single-row operations.
-
Publication No.: US20190340050A1
Publication date: 2019-11-07
Application No.: US16507771
Filing date: 2019-07-10
Applicant: International Business Machines Corporation
Inventor: Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Bartlomiej T. Malecki , Konrad K. Skibski , Tomasz Stradomski
IPC: G06F11/07 , G06F16/21 , G06F16/2457 , G06F16/2455 , G06F16/25 , G06F11/34
Abstract: A method, system and computer program product for providing early diagnosis of hardware, software or configuration problems in a data warehouse system. A received query is parsed to determine the properties of the query. The query may then be joined to existing groups of queries if those groups share properties with the query. After executing the query according to an execution plan, results from the execution of the query are received, which may include problem(s) that occurred during execution of the query. For those problems that reach a pre-defined threshold of becoming a “group problem” in the groups joined by the query, the problem is reported to the end user for each group in which the problem exceeds the pre-defined threshold. In this manner, problems in the data warehouse system that can cause delay and failure of query processing can be diagnosed early.
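The grouping and threshold logic in this abstract might look roughly like the sketch below. The class name `GroupDiagnostics`, the use of property strings as group keys, and the choice to report a problem once, at the moment its count reaches the threshold, are assumptions made for illustration only.

```python
from collections import Counter, defaultdict


class GroupDiagnostics:
    """Counts problems per query group and flags 'group problems'."""

    def __init__(self, threshold):
        self.threshold = threshold
        # group key -> Counter of problem name -> occurrences
        self.problems = defaultdict(Counter)

    def record(self, query_properties, problems):
        """Join the query to one group per property and count its problems.

        Returns the (group, problem) pairs whose count has just reached
        the threshold, so each group problem is reported once.
        """
        reported = []
        for group in query_properties:
            for problem in problems:
                self.problems[group][problem] += 1
                if self.problems[group][problem] == self.threshold:
                    reported.append((group, problem))
        return reported
```

A query that touches `table:sales` on `node:3` joins both groups; a timeout only becomes a group problem for `table:sales` once enough queries sharing that property have also timed out.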
-
Publication No.: US10353924B2
Publication date: 2019-07-16
Application No.: US14945768
Filing date: 2015-11-19
Applicant: International Business Machines Corporation
Inventor: Piotr L. Kania , Bartlomiej T. Malecki , Michal S. Paluch , Konrad K. Skibski
Abstract: An approach for optimizing single-row operations in a data warehouse. Single-row operations are determined based on receiving database operations. Extends identifiers are received based on the single-row operations. Single-row usage statistics are stored in extends lists stores, where the single-row usage statistics include at least one of a hit count and a last hit date. Single-row usage statistics are maintained for the extends identifiers based on the single-row operations. A logical sequence of the extends identifiers in the extends lists stores is sorted based on the single-row usage statistics, and the logical sequence is maintained based on determining further single-row operations.
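One way to picture the sorted list described here is the sketch below. The names `ExtentStats` and `ExtentList`, and the ordering rule (highest hit count first, ties broken by most recent hit), are assumptions for illustration; the abstract only says that the sequence is sorted based on the usage statistics.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class ExtentStats:
    extent_id: int
    hit_count: int = 0
    last_hit: date = date.min


class ExtentList:
    """Keeps extent identifiers ordered by single-row usage statistics."""

    def __init__(self, extent_ids):
        self._stats = {e: ExtentStats(e) for e in extent_ids}
        self._order = list(extent_ids)

    def record_hit(self, extent_id, when):
        """Update statistics after a single-row operation hit this extent."""
        stats = self._stats[extent_id]
        stats.hit_count += 1
        stats.last_hit = max(stats.last_hit, when)
        # Re-sort so frequently and recently hit extents come first.
        self._order.sort(
            key=lambda e: (self._stats[e].hit_count, self._stats[e].last_hit),
            reverse=True,
        )

    def scan_order(self):
        """Order in which extents would be probed for the next single-row lookup."""
        return list(self._order)
```

Probing extents in this order means a single-row lookup tends to find its row in one of the first extents checked when access is skewed.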
-
Publication No.: US10318517B2
Publication date: 2019-06-11
Application No.: US15281278
Filing date: 2016-09-30
Applicant: International Business Machines Corporation
Inventor: Umit Bektas , Pawel T. Januszek , Piotr Kania , Konrad K. Skibski
IPC: G06F17/30 , G06F16/23 , G06F11/30 , G06F16/2455 , H04L12/26
Abstract: A method for collecting monitoring data on a set of monitored computers. The method comprises providing definitions for collection methods for a set of monitoring data, wherein the monitoring data is collected from agents running at monitoring computers, wherein each item of monitoring data has at least one respective collection method. The method further comprises determining a subset of monitoring data to be updated. The method further comprises identifying collection methods corresponding to the subset of monitoring data and triggering the agents running at the monitoring computers to collect monitoring data using the identified collection methods.
-
Publication No.: US10318513B2
Publication date: 2019-06-11
Application No.: US15831535
Filing date: 2017-12-05
Applicant: International Business Machines Corporation
Inventor: Michal Bodziony , Artur M. Gruszecki , Tomasz Kazalski , Konrad K. Skibski
Abstract: Embodiments of the present invention provide a method, computer program product, and computer system for storing data records in extents. According to one embodiment, a data record comprising an attribute value is received. One or more data records stored in a first extent are identified, wherein the stored data records in the first extent have at least one attribute value. The attribute value of the received data record is compared to the attribute values of the identified data records stored in the first extent. It is then determined whether to store the received data record in the first extent. Responsive to determining not to store the received data record in the first extent, the received data record is stored in a second extent. If the received data record is stored in the second extent, attribute value information of the second extent is determined.
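The placement decision described above can be illustrated with a small sketch. The abstract does not state the comparison rule, so the criterion used here (keep the extent's value range no wider than `max_span`) is purely an assumption, as are the names `Extent`, `fits`, and `store`.

```python
class Extent:
    """A storage extent that tracks min/max attribute value information."""

    def __init__(self):
        self.values = []
        self.min = None
        self.max = None

    def add(self, value):
        self.values.append(value)
        self.min = value if self.min is None else min(self.min, value)
        self.max = value if self.max is None else max(self.max, value)


def fits(extent, value, max_span):
    """Assumed rule: adding the value must not widen the range past max_span."""
    if extent.min is None:
        return True
    return max(extent.max, value) - min(extent.min, value) <= max_span


def store(value, first, second, max_span=10):
    """Store in the first extent if it fits, otherwise in the second."""
    target = first if fits(first, value, max_span) else second
    target.add(value)  # also refreshes the target's min/max information
    return target
```

Keeping each extent's range narrow makes its min/max information more selective, so later range queries can skip more extents.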
-
Publication No.: US09953065B2
Publication date: 2018-04-24
Application No.: US14621466
Filing date: 2015-02-13
Applicant: International Business Machines Corporation
Inventor: Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Konrad K. Skibski , Tomasz Stradomski
CPC classification number: G06F17/30536 , G06F17/30424 , G06F17/30864
Abstract: The invention relates to a computer-implemented method for processing a query in a database, the query comprising a search value. The database comprises a plurality of datasets, the datasets comprising entries, wherein distance statistics are assigned to the datasets. The distance statistics describe the minimum and maximum distance between the values of the entries of a dataset of the plurality of datasets and a reference value. The method comprises determining the distance between the search value and the reference value, said determination resulting in a search distance; determining a subset of datasets from the plurality of datasets for which the search distance is within the limits given by the minimum and maximum distances described by the respective distance statistics; and searching for the search value in the subset of datasets.
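The pruning step in this abstract can be sketched for numeric values as follows. Using absolute difference as the distance and plain Python lists as datasets are assumptions made for the example; the function names are hypothetical.

```python
def distance_stats(values, reference):
    """Minimum and maximum distance of a dataset's entries to the reference."""
    distances = [abs(v - reference) for v in values]
    return min(distances), max(distances)


def search(datasets, stats, reference, search_value):
    """Return (candidate dataset indexes, indexes that contain the value)."""
    search_distance = abs(search_value - reference)
    # Only datasets whose [min, max] distance range covers the search
    # distance can possibly contain the search value.
    candidates = [
        i for i, (lo, hi) in enumerate(stats) if lo <= search_distance <= hi
    ]
    hits = [i for i in candidates if search_value in datasets[i]]
    return candidates, hits
```

The filter can produce false positives (a candidate dataset may not contain the value) but never false negatives, so only the candidates need to be scanned.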
-
Publication No.: US20180060386A1
Publication date: 2018-03-01
Application No.: US15249509
Filing date: 2016-08-29
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Michal Bodziony , Andreas Brodt , Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Konrad K. Skibski
IPC: G06F17/30
CPC classification number: G06F17/30448 , G06F17/30395
Abstract: The present disclosure relates to a computer-implemented method, computer program product, and computer system for optimizing query processing over a set of data extents on which a table is stored. Attribute value information may be maintained for each data extent. The attribute value information indicates, as ranges, the minimum and maximum values of an attribute of the entries stored in the respective extent. A first metric of a first data extent of the set may be used to determine that splitting the first data extent into sub-extents increases query processing efficiency. A second metric of a second data extent and a third data extent may be used to determine that merging the second data extent and the third data extent increases query processing efficiency.
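A rough sketch of split and merge decisions driven by min/max ranges is shown below. The abstract does not define the metrics, so both are assumptions: the split metric is the width of the value range, and the merge metric is range overlap combined with a capacity limit. Extents are modelled as plain lists of attribute values.

```python
def value_range(extent):
    """Attribute value information: (min, max) of the extent's entries."""
    return min(extent), max(extent)


def should_split(extent, max_span):
    """Assumed first metric: a wide range makes the extent hard to skip."""
    lo, hi = value_range(extent)
    return hi - lo > max_span


def split(extent):
    """Split into two sub-extents with narrower, non-overlapping ranges."""
    ordered = sorted(extent)
    mid = len(ordered) // 2
    return ordered[:mid], ordered[mid:]


def should_merge(a, b, capacity):
    """Assumed second metric: overlapping ranges that fit in one extent."""
    (a_lo, a_hi), (b_lo, b_hi) = value_range(a), value_range(b)
    overlap = a_lo <= b_hi and b_lo <= a_hi
    return overlap and len(a) + len(b) <= capacity
```

Splitting narrows the ranges so more extents can be skipped for a given predicate, while merging overlapping extents reduces the number of extents that a single predicate has to touch.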
-
Publication No.: US09785515B2
Publication date: 2017-10-10
Application No.: US14614847
Filing date: 2015-02-05
Applicant: International Business Machines Corporation
Inventor: Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Konrad K. Skibski , Tomasz Stradomski
CPC classification number: G06F11/1458 , G06F11/1448 , G06F11/1464 , G06F11/1469 , G06F17/30073 , G06F2201/80 , H04L67/10 , H04L67/1095
Abstract: Creating a data backup of data on a first computer system to restore to a second computer system, each of the first and second computer system including one or more nodes, each node configured to manage a subset of the data. Receiving, by the first computer system, identification of data to back up and node configuration information for the second computer system. Creating, by the first computer system, a backup of the data from the one or more nodes of the first computer system, configured in accordance with the node configuration information of the second computer system, such that the backed up data is directly manageable by the one or more nodes of the second computer system.
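The idea of laying out a backup according to the target system's node configuration can be sketched as below. Everything concrete here is an assumption for illustration: the node configuration is reduced to a node count, data is modelled as key/value dictionaries, and ownership is decided by a CRC32 hash of the key.

```python
import zlib


def owner(key, node_count):
    """Hypothetical placement rule: which node manages a given key."""
    return zlib.crc32(key.encode("utf-8")) % node_count


def create_backup(source_nodes, target_node_count):
    """Back up source data partitioned the way the target system expects.

    source_nodes: list of dicts, one per node of the first system.
    Returns one backup partition per node of the second system.
    """
    backup = {n: {} for n in range(target_node_count)}
    for node_data in source_nodes:
        for key, value in node_data.items():
            backup[owner(key, target_node_count)][key] = value
    return backup
```

With the backup already partitioned for the target, each node of the second system can restore its own partition directly, without a redistribution pass after the restore.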
-
Publication No.: US20170269982A1
Publication date: 2017-09-21
Application No.: US15617201
Filing date: 2017-06-08
Applicant: International Business Machines Corporation
Inventor: Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Bartlomiej T. Malecki , Konrad K. Skibski , Tomasz Stradomski
Abstract: A method, system and computer program product for providing early diagnosis of hardware, software or configuration problems in a data warehouse system. A received query is parsed to determine the properties of the query. The query may then be joined to existing groups of queries if those groups share properties with the query. After executing the query according to an execution plan, results from the execution of the query are received, which may include problem(s) that occurred during execution of the query. For those problems that reach a pre-defined threshold of becoming a “group problem” in the groups joined by the query, the problem is reported to the end user for each group in which the problem exceeds the pre-defined threshold. In this manner, problems in the data warehouse system that can cause delay and failure of query processing can be diagnosed early.