-
公开(公告)号:US11947561B2
公开(公告)日:2024-04-02
申请号:US17807884
申请日:2022-06-21
发明人: Peng Hui Jiang , Jun Su , Sheng Yan Sun , Hong Mei Zhang , Meng Wan
IPC分类号: G06F16/25 , G06F16/21 , G06F16/242 , G06F16/28
CPC分类号: G06F16/258 , G06F16/211 , G06F16/2433 , G06F16/254 , G06F16/284
摘要: An embodiment for analyzing and tracking data flow to determine proper schemas for unstructured data. The embodiment may automatically use a sidecar to collect schema discovery rules during conversion of raw data to unstructured data. The embodiment may automatically generate multiple schemas for different tenants using the collected schema discovery rules. The embodiment may automatically use ETL to export unstructured data to SQL databases with the generated multiple schemas for the different tenants. The embodiment may automatically monitor usage data of the SQL databases and collect the usage data. The embodiment may automatically optimize schema discovery using the collected usage data. The embodiment may automatically discover schemas with hot usage and apply the discovered schemas with hot usage to other tenants for consumption and further monitoring.
-
公开(公告)号:US11914594B1
公开(公告)日:2024-02-27
申请号:US18147435
申请日:2022-12-28
发明人: Hong Mei Zhang , Meng Wan , Sheng Yan Sun , Peng Hui Jiang
IPC分类号: G06F16/2453
CPC分类号: G06F16/24542 , G06F16/24549
摘要: A disclosed database system and enhanced methods implement enhanced mini-plans and dynamically changing a query mini-plan with trustworthy Artificial Intelligence (AI) to improve query execution performance in a database system. An AI cost model evaluates candidate mini-plans for executing a query. AI truth monitors evaluate the execution of the mini-plans, such as predicted input factors and adjusted mini-plans of one or more AI running data models. The AI truth monitors provide feedback to adjust the AI cost model based on evaluating the execution of the mini-plans. The AI truth monitors validate adjusted mini-plans, provide feedback to the AI cost model with improved overall prediction accuracy, and enhanced mini-plans to gain query performance.
-
公开(公告)号:US11704151B2
公开(公告)日:2023-07-18
申请号:US17034022
申请日:2020-09-28
发明人: Hong Mei Zhang , Xiaobo Wang , Sheng Yan Sun , Shuo Li
CPC分类号: G06F9/4881 , G06F9/3836 , G06F9/48 , G06F9/4806 , G06F9/4887 , G06F9/50 , G06F9/5005 , G06F9/5038 , G06F9/5061 , G06F9/5066
摘要: A method, system, and computer program product to plan and schedule executions of various utility tasks of a utility command during a maintain window, the method including receiving a utility command. The method may also include identifying possible utility tasks used to execute the utility command. The method may also include determining preferred utility tasks. The method may also include calculating a degree of parallelism for the preferred utility tasks. The method may also include generating a utility execution plan for the utility command. The method may also include analyzing the utility execution plan against resource constraints of a time window and sub time windows of the time window. The method may also include generating a time window execution plan for each sub time window of the sub time windows. The method may also include updating the utility execution plan with the time window execution plans.
-
公开(公告)号:US20230196182A1
公开(公告)日:2023-06-22
申请号:US17557303
申请日:2021-12-21
发明人: Hong Mei Zhang , Shuo Li , Xiaobo Wang , Sheng Yan Sun
CPC分类号: G06N20/00 , G06F16/27 , G06F16/256 , G06F9/4881
摘要: Systems and methods are provided to predict connection configurations in a distributed database environment. In embodiments, a method includes: building, by a computing device, a dynamic machine learning predictive model for a first new node connection in a distributed database environment based on configuration data of the first new node connection; determining, by the computing device, a group of nodes in the distributed database environment of a same type of node as the new node connection; federating, by the computing device, the dynamic machine learning predictive model of the first new node connection with other dynamic machine learning predictive models of other nodes in the group of nodes, thereby generating a federated predictive model for the same type of node; and generating, by the computing device, a predicted connection configuration for a second new node connection of the same type of node using the federated predictive model.
-
公开(公告)号:US20230169077A1
公开(公告)日:2023-06-01
申请号:US17539612
申请日:2021-12-01
发明人: Hong Mei Zhang , Shuo Li , Xiaobo Wang , Sheng Yan Sun
IPC分类号: G06F16/2453 , G06F11/34 , G06N5/02
CPC分类号: G06F16/24532 , G06F16/24542 , G06F11/3419 , G06N5/022
摘要: A method includes training, by a computing device, a model to identify SQL queries; identifying, by the computing device, incoming SQL queries; grouping, by the computing device, the incoming SQL queries into groups; monitoring, by the computing device, resource usage of the incoming SQL queries in the groups; and allocating, by the computing device, resources to an SQL query of the incoming SQL queries based on the monitoring.
-
公开(公告)号:US11573960B2
公开(公告)日:2023-02-07
申请号:US17205044
申请日:2021-03-18
发明人: Shuo Li , Xiaobo Wang , Hong Mei Zhang , Sheng Yan Sun
IPC分类号: G06F11/34 , G06F16/2453 , G06F16/2455 , G06F16/248 , G06F16/28
摘要: A computer-implemented method provides application-based query transformations. The method includes determining an application is initiated. The method includes identifying a set of execution units included in the application. The execution units are based on of a set of queries in the application and a set of actions in the application. The method also includes building a query dependency graph (QDG) comprising a plurality of nodes, wherein each node of the plurality of nodes is correlated to an execution unit, and each node is linked to at least one additional node, the link indicating a relative execution order and a common attribute each node and the additional node. The method includes merging, based on a performance architecture, two or more of the set of execution units into a section. The method includes processing the application according to the QDG.
-
公开(公告)号:US11573823B2
公开(公告)日:2023-02-07
申请号:US16686243
申请日:2019-11-18
发明人: Shuo Li , ShengYan Sun , Xiaobo Wang , Hong Mei Zhang
摘要: In an approach, a processor, responsive to requesting to perform a plurality of applications including a first application and a second application, determines that the first application and the second application have been performed sequentially during a previous time period. A processor, responsive to determining that the first and second applications have been performed in sequence during the previous time period, obtains, a first set of database operations associated with the first application and a second set of database operations associated with the second application. A processors, responsive to determining that the first set of database operations and the second set of database operations are free of conflict, generates an execution schedule indicating that the first application and the second application are to be performed in parallel. A processors performs the plurality of applications based on the execution schedule.
-
公开(公告)号:US11526490B1
公开(公告)日:2022-12-13
申请号:US17348832
申请日:2021-06-16
发明人: Hong Mei Zhang , Shuo Li , Xiaobo Wang , Sheng Yan Sun
IPC分类号: G06F15/16 , G06F16/23 , G06F16/14 , G06F16/188
摘要: A computer-implemented method improves the performance of write ahead logging. The method includes generating a set of query data and a set of log data, where the set of log data is configured to create a write ahead log, and the set of query data is configured to make changes to data in a database. The method also includes writing the set of query data to a virtual file system. The method further includes separating the set of log data into a set of control intervals, where each control interval includes an entry. The method includes writing, each entry into a log buffer, where the writing the set of query data to the virtual file system and the writing each entry into the log buffer are performed in parallel. The method also includes combining each entry into a complete log.
-
公开(公告)号:US20220300507A1
公开(公告)日:2022-09-22
申请号:US17205044
申请日:2021-03-18
发明人: Shuo Li , Xiaobo Wang , Hong Mei Zhang , Sheng Yan Sun
IPC分类号: G06F16/2453 , G06F16/2455 , G06F16/248 , G06F16/28 , G06F11/34
摘要: A computer-implemented method provides application-based query transformations. The method includes determining an application is initiated. The method includes identifying a set of execution units included in the application. The execution units are based on of a set of queries in the application and a set of actions in the application. The method also includes building a query dependency graph (QDG) comprising a plurality of nodes, wherein each node of the plurality of nodes is correlated to an execution unit, and each node is linked to at least one additional node, the link indicating a relative execution order and a common attribute each node and the additional node. The method includes merging, based on a performance architecture, two or more of the set of execution units into a section. The method includes processing the application according to the QDG.
-
公开(公告)号:US20220188314A1
公开(公告)日:2022-06-16
申请号:US17121367
申请日:2020-12-14
发明人: Shuo Li , Xiaobo Wang , Sheng Yan Sun , Hong Mei Zhang
IPC分类号: G06F16/2453 , G06F16/21 , G06F11/34 , G06F16/2458 , G06F16/25
摘要: In a computer-implemented method for improving performance of a database optimizer, a processor calculates a first execution time for a database query executing through a first access path of a database system. A processor generates a second access path. A processor determines a second execution time via a simulation of the database query through the second access path. A processor determines that the second execution time is less than the first execution time. A processor, responsive to determining that the second execution time is less than the first execution time, sends the second access path to the database optimizer.
-
-
-
-
-
-
-
-
-