Automated partitioning of a distributed database system

    公开(公告)号:US11914586B2

    公开(公告)日:2024-02-27

    申请号:US17709802

    申请日:2022-03-31

    摘要: An embodiment includes generating a partition schema for a distributed database based on historical usage data indicative of usage of the distributed database, where the generating of the partition schema comprises determining a partition range of a partition of the partition schema. The embodiment also includes generating a node identifier for the partition using a hash function and a first weight value assigned to the partition. The embodiment also includes monitoring performance data indicative of a performance of the distributed database, the monitoring comprising detecting a failure of the performance to satisfy a performance threshold. The embodiment also includes initiating, responsive to detecting the failure, a redistribution procedure by changing the node identifier of the partition by replacing the first weight value with a second weight value.

    Dynamically inheriting accumulated attribution

    公开(公告)号:US11853697B2

    公开(公告)日:2023-12-26

    申请号:US17238515

    申请日:2021-04-23

    IPC分类号: G06F40/284 G06N7/01

    CPC分类号: G06F40/284 G06N7/01

    摘要: An approach is provided in which a method, system, and program product build a time series prediction model based on one or more relationships between a first set of keywords in a set of first news articles and a second set of keywords in a set of second news articles. The time series prediction model includes a time-based interest level adjustment corresponding to a publication time between the set of first news articles and the set second of news articles. The method, system, and program product use the time series prediction model to compute an inherited initial interest level of a third news article that includes a set of new keywords based on the set of new keywords and the time-based interest level adjustment. The method, system, and program product assign the inherited initial interest level to the third news article.

    Performance of SQL execution sequence in production database instance

    公开(公告)号:US11847120B2

    公开(公告)日:2023-12-19

    申请号:US17541322

    申请日:2021-12-03

    摘要: A method, computer program product, and computer system for improving performance of a SQL execution sequence of SQL statements. The SQL execution sequence is recorded in an event log. Original results of executing the SQL statements and an original CPU cost of executing the SQL statements in accordance with the original access path are recorded in a logical log. A new access path is generated from analysis of the event log and the logical log. The SQL statements are executed in accordance with the new access path resulting in new results of executing the SQL statements including a new CPU cost of executing the SQL statements in accordance with the new access path. In response to a determination that the new results replicate the original results and that the new CPU cost is less than the original CPU cost, the original access path is replaced with the new access path.

    DATABASE COMPRESSION ORIENTED TO COMBINATIONS OF RECORD FIELDS

    公开(公告)号:US20230401204A1

    公开(公告)日:2023-12-14

    申请号:US17806359

    申请日:2022-06-10

    摘要: This disclosure provides a computer-implemented method, a computer system and a computer program product for database compression oriented to combinations of fields of a database record. One or more combinations of fields of a record of a database are determined that satisfy a frequency criterion indicating that access frequencies of the one or more combinations of fields are higher than an access frequency threshold. The record is reorganized based on the one or more combinations of fields to store fields of each combination of the one or more combinations of fields in a respective contiguous storage space. The reorganized record is compressed by applying a compression scheme to the one or more combinations of fields.

    Database buffer pool optimization

    公开(公告)号:US11734277B2

    公开(公告)日:2023-08-22

    申请号:US17248757

    申请日:2021-02-05

    摘要: An approach is provided for optimizing a database buffer pool. Based on statistics about read and write operations in a range of pages, the range of pages is determined to be a candidate for a hot read range for which locks and latches are avoided in processing operations in the hot read range. Using an on-time trigger or pre-trigger process, the hot read range is created from the range of pages by marking start and end points in the range of pages. Write operation(s) are determined to be included in the hot read range by marking an object control block. The write operation(s) are added to a write pending list. The read operations in the hot read range are performed without a latch or lock. The write operation(s) are merged from the write pending list to the range of pages and the write operation(s) are performed.

    BUFFER POOL RESOURCE ALLOCATION
    8.
    发明公开

    公开(公告)号:US20230168952A1

    公开(公告)日:2023-06-01

    申请号:US17537669

    申请日:2021-11-30

    IPC分类号: G06F9/54 G06F16/2455 G06F9/50

    摘要: A computer implemented method for managing buffer pool usage includes identifying a database comprising one or more buffer pools and a set of database objects, adding a virtual layer between the set of database objects and the one or more buffer pools, wherein the virtual layer is configured to manage buffer pool resource utilization, managing, using the added virtual layer, buffer pool resource utilization with respect to the set of database objects, wherein the managing comprises defining buffer pool usage corresponding to the set of database objects, dynamically mapping database objects to the buffer pools based, at least in part, on the defined buffer pool usage, analyzing, by one or more processors, queues to identify updated buffer pool usage, and adjusting database object allocation according to the identified updated resource usage. A computer program product and computer system corresponding to the method are also disclosed.