-
Publication No.: US11914586B2
Publication date: 2024-02-27
Application No.: US17709802
Filing date: 2022-03-31
Inventors: Hong Mei Zhang, Sheng Yan Sun, Meng Wan, Peng Hui Jiang
IPC classification: G06F16/2453, G06F11/34, G06F16/22, G06F16/25
CPC classification: G06F16/2453, G06F11/3409, G06F16/2255, G06F16/2282, G06F16/256
Abstract: An embodiment includes generating a partition schema for a distributed database based on historical usage data indicative of usage of the distributed database, where the generating of the partition schema comprises determining a partition range of a partition of the partition schema. The embodiment also includes generating a node identifier for the partition using a hash function and a first weight value assigned to the partition. The embodiment also includes monitoring performance data indicative of a performance of the distributed database, the monitoring comprising detecting a failure of the performance to satisfy a performance threshold. The embodiment also includes initiating, responsive to detecting the failure, a redistribution procedure by changing the node identifier of the partition by replacing the first weight value with a second weight value.
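
The weight-based node assignment in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names `node_id` and `redistribute`, the use of SHA-256, and the modulo mapping onto a fixed node count are all assumptions made for illustration. The abstract only states that a hash function and a weight value produce the node identifier, and that swapping the weight changes it.

```python
import hashlib


def node_id(partition_key: str, weight: int, num_nodes: int) -> int:
    """Hash the partition key together with its weight and map it to a node.

    SHA-256 and the modulo step are illustrative choices, not from the source.
    """
    digest = hashlib.sha256(f"{partition_key}:{weight}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_nodes


def redistribute(partition_key, old_weight, num_nodes, candidate_weights):
    """Return (weight, node) for the first candidate weight that moves the
    partition to a different node; keep the old pair if none does."""
    current = node_id(partition_key, old_weight, num_nodes)
    for weight in candidate_weights:
        new_node = node_id(partition_key, weight, num_nodes)
        if new_node != current:
            return weight, new_node
    return old_weight, current


# Hypothetical usage: a monitored latency breaches a threshold,
# so the partition's weight is replaced to change its node identifier.
latency_ms, threshold_ms = 950, 500
weight, node = 1, node_id("orders_2022_q1", 1, 4)
if latency_ms > threshold_ms:
    weight, node = redistribute("orders_2022_q1", weight, 4, range(2, 50))
```

Because the node identifier depends only on the key and the weight, any node can recompute the placement without a lookup table, which is one plausible reason to change the weight rather than store an explicit mapping.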
-
Publication No.: US11874830B2
Publication date: 2024-01-16
Application No.: US17654110
Filing date: 2022-03-09
Inventors: Sheng Yan Sun, Hong Mei Zhang, Meng Wan, Peng Hui Jiang
IPC classification: G06F16/2453, G06F16/242, G06F11/34, G06F16/23
CPC classification: G06F16/24542, G06F11/3409, G06F16/2386, G06F16/2433
Abstract: In a computer-implemented method for improving performance of a database, a processor receives batch jobs for a relational database. The batch jobs may include a first member with a first buffer pool, and a second member with a second buffer pool. The processor may also identify a first actual object and an isolation level for the batch jobs, generate related queries based on the first actual object and the isolation level, calculate a cost for the first member and the second member to run the batch jobs based on the related queries, and assign the batch jobs to the first member based on a lower calculated cost.
-
Publication No.: US11853697B2
Publication date: 2023-12-26
Application No.: US17238515
Filing date: 2021-04-23
Inventors: Shuo Li, June-Ray Lin, Sheng Yan Sun, Xiaobo Wang
IPC classification: G06F40/284, G06N7/01
CPC classification: G06F40/284, G06N7/01
Abstract: An approach is provided in which a method, system, and program product build a time series prediction model based on one or more relationships between a first set of keywords in a set of first news articles and a second set of keywords in a set of second news articles. The time series prediction model includes a time-based interest level adjustment corresponding to a publication time between the set of first news articles and the set of second news articles. The method, system, and program product use the time series prediction model to compute an inherited initial interest level of a third news article that includes a set of new keywords based on the set of new keywords and the time-based interest level adjustment. The method, system, and program product assign the inherited initial interest level to the third news article.
-
Publication No.: US11847120B2
Publication date: 2023-12-19
Application No.: US17541322
Filing date: 2021-12-03
Inventors: Shuo Li, Xiaobo Wang, Min Li, Sheng Yan Sun
IPC classification: G06F16/30, G06F16/2453, G06F11/34, G06F16/23, G06F16/2452
CPC classification: G06F16/24545, G06F11/3409, G06F16/2358, G06F16/24524
Abstract: A method, computer program product, and computer system for improving performance of a SQL execution sequence of SQL statements. The SQL execution sequence is recorded in an event log. Original results of executing the SQL statements and an original CPU cost of executing the SQL statements in accordance with the original access path are recorded in a logical log. A new access path is generated from analysis of the event log and the logical log. The SQL statements are executed in accordance with the new access path resulting in new results of executing the SQL statements including a new CPU cost of executing the SQL statements in accordance with the new access path. In response to a determination that the new results replicate the original results and that the new CPU cost is less than the original CPU cost, the original access path is replaced with the new access path.
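
The replacement rule at the end of this abstract (same results, lower CPU cost) can be sketched as a small decision function. The `Run` record and the names `should_replace` and `choose_access_path` are hypothetical and introduced only for illustration; how the new access path is derived from the event log and logical log is not covered here.

```python
from dataclasses import dataclass
from typing import Any, List


@dataclass
class Run:
    """Outcome of executing the SQL sequence under one access path (illustrative)."""
    access_path: str
    results: List[Any]
    cpu_cost: float


def should_replace(original: Run, new: Run) -> bool:
    """Replace only if the new results replicate the original results
    and the new CPU cost is strictly lower."""
    return new.results == original.results and new.cpu_cost < original.cpu_cost


def choose_access_path(original: Run, new: Run) -> str:
    """Return the access path that should be kept after the comparison."""
    return new.access_path if should_replace(original, new) else original.access_path
```

Both conditions must hold: a cheaper path that changes the results is rejected, as is an equivalent path that costs the same or more.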
-
Publication No.: US20230401204A1
Publication date: 2023-12-14
Application No.: US17806359
Filing date: 2022-06-10
Inventors: Ying Zhang, Xiaobo Wang, Shuo Li, Sheng Yan Sun
IPC classification: G06F16/242, G06F16/23, G06F16/22
CPC classification: G06F16/244, G06F16/2358, G06F16/2282
Abstract: This disclosure provides a computer-implemented method, a computer system and a computer program product for database compression oriented to combinations of fields of a database record. One or more combinations of fields of a record of a database are determined that satisfy a frequency criterion indicating that access frequencies of the one or more combinations of fields are higher than an access frequency threshold. The record is reorganized based on the one or more combinations of fields to store fields of each combination of the one or more combinations of fields in a respective contiguous storage space. The reorganized record is compressed by applying a compression scheme to the one or more combinations of fields.
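
The three steps in this abstract (find frequently accessed field combinations, store each combination contiguously, compress per combination) can be sketched roughly as follows. Everything concrete here is an assumption for illustration: the access log format, the helper names, the use of dictionary key order to stand in for contiguous storage, and `zlib` over JSON as the compression scheme.

```python
import json
import zlib
from collections import Counter


def hot_combinations(access_log, threshold):
    """Return field combinations accessed more often than `threshold`.

    `access_log` is assumed to be an iterable of field-name tuples,
    one per access.
    """
    counts = Counter(frozenset(fields) for fields in access_log)
    return [sorted(combo) for combo, n in counts.items() if n > threshold]


def reorganize(record, combos):
    """Reorder fields so each hot combination is stored adjacently,
    followed by the remaining fields in their original order."""
    ordered = []
    for combo in combos:
        for field in combo:
            if field in record and field not in ordered:
                ordered.append(field)
    ordered += [field for field in record if field not in ordered]
    return {field: record[field] for field in ordered}


def compress_combinations(record, combos):
    """Compress each hot combination of fields as its own unit."""
    return [
        zlib.compress(json.dumps({f: record[f] for f in combo}).encode("utf-8"))
        for combo in combos
    ]
```

Compressing each combination separately means a query that touches only one hot combination would need to decompress only that unit, which is one plausible motivation for the layout described in the abstract.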
-
Publication No.: US20230325471A1
Publication date: 2023-10-12
Application No.: US17715190
Filing date: 2022-04-07
Inventors: Meng Wan, Sheng Yan Sun, Peng Hui Jiang, Hong Mei Zhang
IPC classification: G06K9/62, G06F16/2458, G06N3/08
CPC classification: G06K9/6215, G06K9/6219, G06K9/628, G06F16/2477, G06N3/08
Abstract: A supervised similarity measure machine learning method, system, and computer program product that includes generating embeddings by training a supervised deep neural network (DNN) on a feature data to determine which nodes correspond to which clustered learning group of clustered learning groups, performing half-distributed learning by distributing data in a time-series database to the clustered learning groups, and evaluating, based on the embeddings, new tenant data in the clustered learning groups with an upward bow pose.
-
Publication No.: US11734277B2
Publication date: 2023-08-22
Application No.: US17248757
Filing date: 2021-02-05
Inventors: Shuo Li, Xiaobo Wang, Hong Mei Zhang, Sheng Yan Sun
IPC classification: G06F16/2455, G06F12/0882, G06F16/25
CPC classification: G06F16/24552, G06F12/0882, G06F16/25
Abstract: An approach is provided for optimizing a database buffer pool. Based on statistics about read and write operations in a range of pages, the range of pages is determined to be a candidate for a hot read range for which locks and latches are avoided in processing operations in the hot read range. Using an on-time trigger or pre-trigger process, the hot read range is created from the range of pages by marking start and end points in the range of pages. Write operation(s) are determined to be included in the hot read range by marking an object control block. The write operation(s) are added to a write pending list. The read operations in the hot read range are performed without a latch or lock. The write operation(s) are merged from the write pending list to the range of pages and the write operation(s) are performed.
-
Publication No.: US20230168952A1
Publication date: 2023-06-01
Application No.: US17537669
Filing date: 2021-11-30
Inventors: Shuo Li, Xiaobo Wang, Sheng Yan Sun, Rui Wang
IPC classification: G06F9/54, G06F16/2455, G06F9/50
CPC classification: G06F9/544, G06F16/24552, G06F9/546, G06F9/5016
Abstract: A computer implemented method for managing buffer pool usage includes identifying a database comprising one or more buffer pools and a set of database objects, adding a virtual layer between the set of database objects and the one or more buffer pools, wherein the virtual layer is configured to manage buffer pool resource utilization, managing, using the added virtual layer, buffer pool resource utilization with respect to the set of database objects, wherein the managing comprises defining buffer pool usage corresponding to the set of database objects, dynamically mapping database objects to the buffer pools based, at least in part, on the defined buffer pool usage, analyzing, by one or more processors, queues to identify updated buffer pool usage, and adjusting database object allocation according to the identified updated resource usage. A computer program product and computer system corresponding to the method are also disclosed.
-
Publication No.: US11663179B2
Publication date: 2023-05-30
Application No.: US17128277
Filing date: 2020-12-21
Inventors: Shuo Li, Xiaobo Wang, Sheng Yan Sun, Hong Mei Zhang
IPC classification: G06F16/215, G06F16/2455
CPC classification: G06F16/215, G06F16/2456, G06F16/24565
Abstract: A simulated dataset is queried for regression by validating a structured query language (SQL) statement, determining a pattern type of the SQL statement, reconstructing the SQL statement according to a predetermined process for the pattern type, creating a mutated SQL statement for querying a simulated dataset, and validating the mutated SQL statement. The simulated dataset is based on a confidential dataset having the confidential elements removed or replaced.
-
Publication No.: US20230153300A1
Publication date: 2023-05-18
Application No.: US17455490
Filing date: 2021-11-18
Inventors: Hong Mei Zhang, Shuo Li, Xiaobo Wang, Sheng Yan Sun
IPC classification: G06F16/2453, G06F16/22
CPC classification: G06F16/24537, G06F16/2282
Abstract: A computer-implemented method to generate and utilize a hybrid index. The method includes receiving a first query, where the first query is configured to perform a first command. The method further includes generating a hybrid index, where the hybrid index defines a connection between a first table and a second table of a set of tables. The method also includes executing the first query using the hybrid index. The method includes returning a set of results for the first query to a source of the query.