Methods and system for optimal multi-environment data synchronization/replication

    公开(公告)号:US11086903B2

    公开(公告)日:2021-08-10

    申请号:US16456156

    申请日:2019-06-28

    申请人: Teradata US, Inc.

    摘要: A data-changing request is received on a first data source system. Metadata for a data source object associated with the request is obtained. The request is executed on a second data source system and results are obtained. A selection between multiple replication processes is made based on the metadata and the results. The selected process is performed on each of the remaining data source systems to synchronize/replicate data changes on the remaining data source systems. In an embodiment, bidirectional and concurrent data connections are permitted to a source object for one data source system and a replicated version of the source object on a different source system. In an embodiment, each data source system includes an agent that establishes an out-of-band connection to a synchronization/replication server, such that synchronization/replication is processed by the server without an affirmative connection being made from the data source systems to the server.

    Value list compression (VLC) aware qualification

    公开(公告)号:US11023469B2

    公开(公告)日:2021-06-01

    申请号:US15825175

    申请日:2017-11-29

    申请人: Teradata US, Inc.

    摘要: A database includes a Value List Compression (VLC) predicate evaluator. A table identified in a query that is being processed is identified as having compressed data values. The predicate evaluator compares a query predicate of the query against actual decompressed values noted in a dictionary for the table and the predicate evaluator maintains a bitmap for selective ones of the actual values that satisfy the query predicate. The matched bitmap positions are processed against an index maintained in the table for the actual values to provide selective decompressed table entries as results for the query.

    TRANSACTION RECOVERY FROM A FAILURE ASSOCIATED WITH A DATABASE SERVER

    公开(公告)号:US20210034477A1

    公开(公告)日:2021-02-04

    申请号:US16720386

    申请日:2019-12-19

    申请人: TERADATA US, INC.

    IPC分类号: G06F11/14 G06F16/2455

    摘要: In some examples, a system sends a transaction to a database server to cause storing of data of the transaction in a cache of the database server, where the data in the cache is for inclusion in a backup of data from the database server to a remote data store (e.g., the backup may be in a cloud and may be a snapshot). The system detects a failure associated with the database server, and in response to detecting the failure, requests, from the database server or a replacement database server, transaction information of at least one transaction that was successfully applied to the remote data store, the transaction information based on the backup of data. The system causes replay one or more transactions to recover data at the database server or the replacement database server, to perform recovery of the database server or the replacement database server to a current state.

    Self organizing list machine for reordering items of a list

    公开(公告)号:US10740307B2

    公开(公告)日:2020-08-11

    申请号:US15059719

    申请日:2016-03-03

    申请人: Teradata US, Inc.

    IPC分类号: G06F16/22

    摘要: A self-organizing list machine is provided for reordering items of a list in a manner that achieves both a constant and minimum time complexity. The self-organizing list machine comprises an array of symbol index generators, wherein (i) each symbol index generator has a unique location in the array between zero and N−1 inclusive, and (ii) N is the total number of unique symbols in a symbol alphabet.

    SYSTEM AND METHOD FOR SHARING DATABASE QUERY EXECUTION PLANS BETWEEN MULTIPLE PARSING ENGINES

    公开(公告)号:US20200151178A1

    公开(公告)日:2020-05-14

    申请号:US16236738

    申请日:2018-12-31

    申请人: Teradata US, Inc.

    IPC分类号: G06F16/2453 G06F16/2455

    摘要: Techniques for improving the execution of database queries in a multi-processor system or distributed processing system environment are provided. In a database system including multiple parsing engines (PEs) for parsing database queries, or requests, received by the system and generating execution plans for the requests, execution plans generated for requests can be saved in a global request cache accessible to each of the parsing engines. Requests which have been parsed and cached by a PE can be retrieved for use by other PEs, thereby avoiding unnecessarily parsing the same database request in multiple PEs. The global request cache may be a distributed cache consisting of request caches local to each parsing engine, with execution plans allocated to the local request caches using hashing techniques applied to the database requests associated with the execution plans.

    Multi-input SQL-MR
    18.
    发明授权

    公开(公告)号:US10452632B1

    公开(公告)日:2019-10-22

    申请号:US14317995

    申请日:2014-06-27

    申请人: Teradata US, Inc.

    摘要: A system may include an array of storage devices configured to store a data. The system may further include an array of processing nodes in communication with the array of storage devices. The array of processing nodes may receive a request to perform at least one task associated with the data. The request may include a function call to a function configured to operate on a first data table and a second data table included in the data. The array of processing nodes may partition the first data table among respective subsets of the processing nodes based on a partition key. The array of processing nodes may distribute the second data table among the partitions based on the partition key. The array of processing nodes may execute the function on the first data table and the second data table at each of the partitions. A method and computer-readable medium may also be implemented.

    Statistic normalization in a data store system

    公开(公告)号:US10372708B1

    公开(公告)日:2019-08-06

    申请号:US14588126

    申请日:2014-12-31

    申请人: Teradata US, Inc.

    发明人: Sung Jin Kim

    摘要: A data store system may include an array of persistent storage devices configured to store a plurality of data store tables and a set of rules associated with a plurality of statistics. Each rule may include a relationship between at least two statistics. The data store system may further include a processor in communication with the storage device. The processor may generate the plurality of statistics on a data store table and retrieve the set of rules. The processor may evaluate the set of rules based on the plurality of statistics and, in response to the determination that at least one rule of the set of rules is violated, adjust a value of at least one of the statistics. The processor may store values of the plurality of statistics in at least one of the persistent storage devices, receive a query on the data store table, and use the stored statistics to generate a response to the query. A method and computer-readable medium are also disclosed.