Heterogeneous schema discovery for unstructured data

    公开(公告)号:US11947561B2

    公开(公告)日:2024-04-02

    申请号:US17807884

    申请日:2022-06-21

    摘要: An embodiment for analyzing and tracking data flow to determine proper schemas for unstructured data. The embodiment may automatically use a sidecar to collect schema discovery rules during conversion of raw data to unstructured data. The embodiment may automatically generate multiple schemas for different tenants using the collected schema discovery rules. The embodiment may automatically use ETL to export unstructured data to SQL databases with the generated multiple schemas for the different tenants. The embodiment may automatically monitor usage data of the SQL databases and collect the usage data. The embodiment may automatically optimize schema discovery using the collected usage data. The embodiment may automatically discover schemas with hot usage and apply the discovered schemas with hot usage to other tenants for consumption and further monitoring.

    Dynamically changing query mini-plan with trustworthy AI

    公开(公告)号:US11914594B1

    公开(公告)日:2024-02-27

    申请号:US18147435

    申请日:2022-12-28

    IPC分类号: G06F16/2453

    CPC分类号: G06F16/24542 G06F16/24549

    摘要: A disclosed database system and enhanced methods implement enhanced mini-plans and dynamically changing a query mini-plan with trustworthy Artificial Intelligence (AI) to improve query execution performance in a database system. An AI cost model evaluates candidate mini-plans for executing a query. AI truth monitors evaluate the execution of the mini-plans, such as predicted input factors and adjusted mini-plans of one or more AI running data models. The AI truth monitors provide feedback to adjust the AI cost model based on evaluating the execution of the mini-plans. The AI truth monitors validate adjusted mini-plans, provide feedback to the AI cost model with improved overall prediction accuracy, and enhanced mini-plans to gain query performance.

    DATABASE RESOURCE MANAGEMENT USING PREDICTIVE MODELS

    公开(公告)号:US20230196182A1

    公开(公告)日:2023-06-22

    申请号:US17557303

    申请日:2021-12-21

    摘要: Systems and methods are provided to predict connection configurations in a distributed database environment. In embodiments, a method includes: building, by a computing device, a dynamic machine learning predictive model for a first new node connection in a distributed database environment based on configuration data of the first new node connection; determining, by the computing device, a group of nodes in the distributed database environment of a same type of node as the new node connection; federating, by the computing device, the dynamic machine learning predictive model of the first new node connection with other dynamic machine learning predictive models of other nodes in the group of nodes, thereby generating a federated predictive model for the same type of node; and generating, by the computing device, a predicted connection configuration for a second new node connection of the same type of node using the federated predictive model.

    Application-based query transformations

    公开(公告)号:US11573960B2

    公开(公告)日:2023-02-07

    申请号:US17205044

    申请日:2021-03-18

    摘要: A computer-implemented method provides application-based query transformations. The method includes determining an application is initiated. The method includes identifying a set of execution units included in the application. The execution units are based on of a set of queries in the application and a set of actions in the application. The method also includes building a query dependency graph (QDG) comprising a plurality of nodes, wherein each node of the plurality of nodes is correlated to an execution unit, and each node is linked to at least one additional node, the link indicating a relative execution order and a common attribute each node and the additional node. The method includes merging, based on a performance architecture, two or more of the set of execution units into a section. The method includes processing the application according to the QDG.

    Parallel execution of applications

    公开(公告)号:US11573823B2

    公开(公告)日:2023-02-07

    申请号:US16686243

    申请日:2019-11-18

    IPC分类号: G06F9/46 G06F9/48

    摘要: In an approach, a processor, responsive to requesting to perform a plurality of applications including a first application and a second application, determines that the first application and the second application have been performed sequentially during a previous time period. A processor, responsive to determining that the first and second applications have been performed in sequence during the previous time period, obtains, a first set of database operations associated with the first application and a second set of database operations associated with the second application. A processors, responsive to determining that the first set of database operations and the second set of database operations are free of conflict, generates an execution schedule indicating that the first application and the second application are to be performed in parallel. A processors performs the plurality of applications based on the execution schedule.

    Database log performance
    8.
    发明授权

    公开(公告)号:US11526490B1

    公开(公告)日:2022-12-13

    申请号:US17348832

    申请日:2021-06-16

    摘要: A computer-implemented method improves the performance of write ahead logging. The method includes generating a set of query data and a set of log data, where the set of log data is configured to create a write ahead log, and the set of query data is configured to make changes to data in a database. The method also includes writing the set of query data to a virtual file system. The method further includes separating the set of log data into a set of control intervals, where each control interval includes an entry. The method includes writing, each entry into a log buffer, where the writing the set of query data to the virtual file system and the writing each entry into the log buffer are performed in parallel. The method also includes combining each entry into a complete log.

    APPLICATION - BASED QUERY TRANSFORMATIONS

    公开(公告)号:US20220300507A1

    公开(公告)日:2022-09-22

    申请号:US17205044

    申请日:2021-03-18

    摘要: A computer-implemented method provides application-based query transformations. The method includes determining an application is initiated. The method includes identifying a set of execution units included in the application. The execution units are based on of a set of queries in the application and a set of actions in the application. The method also includes building a query dependency graph (QDG) comprising a plurality of nodes, wherein each node of the plurality of nodes is correlated to an execution unit, and each node is linked to at least one additional node, the link indicating a relative execution order and a common attribute each node and the additional node. The method includes merging, based on a performance architecture, two or more of the set of execution units into a section. The method includes processing the application according to the QDG.

    ACCESS PATH FOR DATABASE OPTIMIZER
    10.
    发明申请

    公开(公告)号:US20220188314A1

    公开(公告)日:2022-06-16

    申请号:US17121367

    申请日:2020-12-14

    摘要: In a computer-implemented method for improving performance of a database optimizer, a processor calculates a first execution time for a database query executing through a first access path of a database system. A processor generates a second access path. A processor determines a second execution time via a simulation of the database query through the second access path. A processor determines that the second execution time is less than the first execution time. A processor, responsive to determining that the second execution time is less than the first execution time, sends the second access path to the database optimizer.