-
公开(公告)号:US11748369B2
公开(公告)日:2023-09-05
申请号:US17465298
申请日:2021-09-02
发明人: Alejandro Hernandez Saenz , Cristian Diaconu , Krystyna Ewa Reisteter , Naveen Prakash , Sheetal Shrotri , Rogério Ramos , Alexander Budovski , Hanumantha Rao Kodavalla
IPC分类号: G06F16/00 , G06F16/25 , G06F16/27 , G06F16/22 , G06F16/2455
CPC分类号: G06F16/256 , G06F16/278 , G06F16/2272 , G06F16/24557
摘要: Distributed database systems including compute nodes and page servers are described herein that enable separating logical and physical storage of database files in a distributed database system. A distributed database system includes a page server and a compute node and is configured to store a logical database file that includes data and is associated with a file identifier. Each page server is configurable to store slices (i.e., subportions) of the logical database file. The compute node is coupled to the plurality of page servers and configured to store the logical database file responsive to a received command. In an aspect, such storage may comprise slicing the data comprising the logical database file into a set of slices with each being associated with a respective page server, maintaining an endpoint mapping for each slice of the first set of slices, and transmitting each slice to the associated for storage thereby.
-
公开(公告)号:US11567839B2
公开(公告)日:2023-01-31
申请号:US17512337
申请日:2021-10-27
发明人: Alexander Budovski , Cristian Diaconu , Sandeep Lingam , Alejandro Hernandez Saenz , Naveen Prakash , Krystyna Ewa Reisteter , Rogerio Ramos , Huanhui Hu , Peter Byrne
摘要: Embodiments described herein detect data corruption in a distributed data set system. For example, a system comprises node(s) for processing queries with respect to a distributed data set comprising a plurality of storage segments. A write transaction resulting from a query with respect to a particular storage segment is logged in a log record that describes a modification to the storage segment. A log service provides the log record to a data server managing a portion of the distributed data set in which the storage segment is included, which performs the write transaction with respect to the storage segment. For redundancy purposes, the data server has replica(s) that manage respective replicas of the portion of the distributed data set managed thereby. For backup purposes, snapshots of the replica(s) are periodically generated. To determine a data corruption, a snapshot of one replica is cross-validated with a snapshot of another replica.
-
公开(公告)号:US11681631B2
公开(公告)日:2023-06-20
申请号:US17359087
申请日:2021-06-25
发明人: Krystyna Ewa Reisteter , Cristian Diaconu , Rogério Ramos , Sarika R. Iyer , Siddharth Deepak Mehta , Huanhui Hu
IPC分类号: G06F12/08 , G06F12/0882 , G06F3/06 , G06F12/02 , G06F12/0871
CPC分类号: G06F12/0882 , G06F3/0604 , G06F3/0656 , G06F3/0679 , G06F12/0246 , G06F12/0871
摘要: Data base performance is improved using write-behind optimization of covering cache. Non-volatile memory data cache includes a full copy of stored data file(s). Data cache and storage writes, checkpoints, and recovery may be decoupled (e.g., with separate writes, checkpoints and recoveries). A covering data cache supports improved performance by supporting database operation during storage delays or outages and/or by supporting reduced I/O operations using aggregate writes of contiguous data pages (e.g., clean and dirty pages) to stored data file(s). Aggregate writes reduce data file fragmentation and reduce the cost of snapshots. Performing write-behind operations in a background process with optimistic concurrency control may support improved database performance, for example, by not interfering with write operations to data cache. Data cache may store (e.g., in metadata) data cache checkpoint information and storage checkpoint information. A stored data file may store storage checkpoint information (e.g., in a file header).
-
公开(公告)号:US12118014B2
公开(公告)日:2024-10-15
申请号:US18351258
申请日:2023-07-12
发明人: Alejandro Hernandez Saenz , Cristian Diaconu , Krystyna Ewa Reisteter , Naveen Prakash , Sheetal Shrotri , Rogério Ramos , Alexander Budovski , Hanumantha Rao Kodavalla
IPC分类号: G06F16/00 , G06F16/25 , G06F16/27 , G06F16/22 , G06F16/2455
CPC分类号: G06F16/256 , G06F16/278 , G06F16/2272 , G06F16/24557
摘要: Distributed database systems including compute nodes and page servers are described herein that enable separating logical and physical storage of database files in a distributed database system. A distributed database system includes a page server and a compute node, and is configured to store a logical database file that includes data and is associated with a file identifier. Each page server is configurable to store slices (i.e., subportions) of the logical database file. The compute node is coupled to the plurality of page servers and configured to store the logical database file responsive to a received command. In an aspect, such storage may comprise slicing the data comprising the logical database file into a set of slices with each being associated with a respective page server, maintaining an endpoint mapping for each slice of the first set of slices, and transmitting each slice to the associated for storage thereby.
-
公开(公告)号:US11249866B1
公开(公告)日:2022-02-15
申请号:US17237707
申请日:2021-04-22
发明人: Alexander Budovski , Cristian Diaconu , Sandeep Lingam , Alejandro Hernandez Saenz , Naveen Prakash , Krystyna Ewa Reisteter , Rogerio Ramos , Huanhui Hu , Peter Byrne
摘要: Embodiments described herein detect data corruption in a distributed data set system. For example, a system comprises node(s) for processing queries with respect to a distributed data set comprising a plurality of storage segments. A write transaction resulting from a query with respect to a particular storage segment is logged in a log record that describes a modification to the storage segment. A log service provides the log record to a data server managing a portion of the distributed data set in which the storage segment is included, which performs the write transaction with respect to the storage segment. For redundancy purposes, the data server has replica(s) that manage respective replicas of the portion of the distributed data set managed thereby. For backup purposes, snapshots of the replica(s) are periodically generated. To determine a data corruption, a snapshot of one replica is cross-validated with a snapshot of another replica.
-
-
-
-