-
公开(公告)号:US20200097566A1
公开(公告)日:2020-03-26
申请号:US16138103
申请日:2018-09-21
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cristian DIACONU , Naveen PRAKASH , Alejandro Hernandez SAENZ , Alexander BUDOVSKI
Abstract: The bifurcation of a log into a fixed-size log portion and a growable log portion. Log records are first written to the fixed-size log portion. At some point, the older log records within the fixed-size log portion are destaged to the growable log portion. Destaging may involve copying the log records from the fixed-size log portion and tacking the log record onto the end of the growable log portion. This destaging might occur one sub-portion (e.g., one virtual log file) at a time. The more recent log records are within the fixed-size log portion. Accordingly, new writes to the log occur to the fixed-size log portion. Furthermore, a large portion of the reads are of the more recent log records within the fixed-size log portion. Thus, optimizing performance within the fixed-size log portion results in significant improvement in the performance of the overall log.
-
公开(公告)号:US20200097578A1
公开(公告)日:2020-03-26
申请号:US16138238
申请日:2018-09-21
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cristian DIACONU , Naveen PRAKASH , Alexander BUDOVSKI , Huanhui HU , Alejandro Hernandez SAENZ
IPC: G06F17/30
Abstract: Brokering log records so as to prevent log records that are not yet persisted in a persistent log from being disseminated. The log records may be generated as a primary compute system performs operations. Upon receiving a request for a log record, the broker component determines whether the requested log record has been persisted in a persistent log. If the broker component determines that the log record has been persisted in the persistent log, the broker component responds to the request by causing the requested log record to be provided to the requesting entity (e.g., a secondary compute system). On the other hand, if the log record cannot yet determine that the log record has been persisted in the persistent log, the broker component prevents the log record from being provided to the requesting entity. This prevents data from being inconsistent during recovery.
-
公开(公告)号:US20240427792A1
公开(公告)日:2024-12-26
申请号:US18825671
申请日:2024-09-05
Applicant: Microsoft Technology Licensing, LLC
Inventor: Alejandro Hernandez SAENZ , Cristian DIACONU , Krystyna Ewa REISTETER , Naveen PRAKASH , Sheetal SHROTRI , Rogério RAMOS , Alexander BUDOVSKI , Hanumantha Rao KODAVALLA
IPC: G06F16/25 , G06F16/22 , G06F16/2455 , G06F16/27
Abstract: Distributed database systems including compute nodes and page servers are described herein that enable separating logical and physical storage of database files in a distributed database system. A distributed database system includes a page server and a compute node, and is configured to store a logical database file that includes data and is associated with a file identifier. Each page server is configurable to store slices (i.e., subportions) of the logical database file. The compute node is coupled to the plurality of page servers and configured to store the logical database file responsive to a received command. In an aspect, such storage may comprise slicing the data comprising the logical database file into a set of slices with each being associated with a respective page server, maintaining an endpoint mapping for each slice of the first set of slices, and transmitting each slice to the associated for storage thereby.
-
公开(公告)号:US20220197892A1
公开(公告)日:2022-06-23
申请号:US17180519
申请日:2021-02-19
Applicant: Microsoft Technology Licensing, LLC
Inventor: Craig S. FREEDMAN , Adrian-Leonard RADU , Daniel G. SCHALL , Hanumantha R. KODAVALLA , Panagiotis ANTONOPOULOS , Raghavendra Thallam KODANDARAMAIH , Alejandro Hernandez SAENZ , Naveen PRAKASH
IPC: G06F16/23 , G06F16/27 , G06F16/2455 , G06F16/21
Abstract: Distributed database systems including compute nodes and page servers are described herein that enable compute nodes to pushdown certain query processing compute tasks to the page servers to take advantage of otherwise idle compute resources at the page servers, and to reduce the quantity of data that moves between compute nodes and page servers. A distributed database system includes a page server and a compute node, wherein the page server is configured to maintain multiple versions of stored data objects. The compute node is configured to receive a query and generate a transaction context (TC) and modified table schemas (MTS) scoped to the query, and pushdown the query, TC and MTS to the page server that is configured to determine which data objects at the page server satisfy the query, and for each such object, which version of the object should be returned based on the TC.
-
公开(公告)号:US20220197873A1
公开(公告)日:2022-06-23
申请号:US17180508
申请日:2021-02-19
Applicant: Microsoft Technology Licensing, LLC
Inventor: Craig S. FREEDMAN , Adrian-Leonard RADU , Daniel G. SCHALL , Hanumantha R. KODAVALLA , Panagiotis ANTONOPOULOS , Raghavendra Thallam KODANDARAMAIH , Alejandro Hernandez SAENZ , Naveen PRAKASH
IPC: G06F16/21 , G06F16/245
Abstract: Methods for page split detection and affinity in query processing pushdowns are performed by systems and devices. Page servers perform pushdown operations based on specific, and specifically formatted or generated, information, instructions, and data provided thereto from a compute node. Page servers also determine that page splits have occurred during reading of data pages maintained by page servers during pushdown operations, and also during fulfillment of compute node data requests. To detect a data page has split, page servers utilize information from a compute node of an expected next data page which is compared to a next data page in the page server page index. A mismatch in the comparison by page servers indicates data page was split. Compute nodes and page servers store and maintain off-row data generated during data operations via page affinity considerations where the off-row data is stored at the same page server as the data.
-
公开(公告)号:US20230350912A1
公开(公告)日:2023-11-02
申请号:US18351258
申请日:2023-07-12
Applicant: Microsoft Technology Licensing, LLC
Inventor: Alejandro Hernandez SAENZ , Cristian DIACONU , Krystyna Ewa REISTETER , Naveen PRAKASH , Sheetal SHROTRI , Rogério RAMOS , Alexander BUDOVSKI , Hanumantha Rao KODAVALLA
CPC classification number: G06F16/256 , G06F16/278 , G06F16/2272
Abstract: Distributed database systems including compute nodes and page servers are described herein that enable separating logical and physical storage of database files in a distributed database system. A distributed database system includes a page server and a compute node, and is configured to store a logical database file that includes data and is associated with a file identifier. Each page server is configurable to store slices (i.e., subportions) of the logical database file. The compute node is coupled to the plurality of page servers and configured to store the logical database file responsive to a received command. In an aspect, such storage may comprise slicing the data comprising the logical database file into a set of slices with each being associated with a respective page server, maintaining an endpoint mapping for each slice of the first set of slices, and transmitting each slice to the associated for storage thereby.
-
公开(公告)号:US20230014697A1
公开(公告)日:2023-01-19
申请号:US17935156
申请日:2022-09-26
Applicant: Microsoft Technology Licensing, LLC
Inventor: Craig S. FREEDMAN , Adrian-Leonard RADU , Daniel G. SCHALL , Hanumantha R. KODAVALLA , Panagiotis ANTONOPOULOS , Raghavendra Thallam KODANDARAMAIH , Alejandro Hernandez SAENZ , Naveen PRAKASH
IPC: G06F16/2457 , G06F16/2453 , G06F16/28 , G06F16/248 , G06F16/2455
Abstract: Methods for operation fragmentation with metadata serialization in query processing pushdowns are performed by systems and devices. A compute node receives a query directed to database data, and generates query text fragments. Portions of metadata of the database are read from different page servers, and are serialized by the compute node. Page identities of data pages in a page server that stores the data are determined from a page index at the compute node, and the compute node provides the text fragments, the serialized metadata, and the page identities to the page server storing the data. The page server compiles the text fragments based on the serialized metadata to generate an executable query plan for the query. The page server initializes and performs execution of the executable query plan against the data as a pushdown query operation, and a result of pushdown query operation is provided to the compute node.
-
公开(公告)号:US20200097384A1
公开(公告)日:2020-03-26
申请号:US16138083
申请日:2018-09-21
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cristian DIACONU , Alexander BUDOVSKI , Alejandro Hernandez SAENZ , Naveen PRAKASH , Krystyna REISTETER
Abstract: Use of a log service computing system to apply a log to storage segments in the context of there being a primary compute system that operates upon data, and secondary compute system(s) that are at the ready to be promoted to the new primary compute system should the old primary compute system fail. Rather than have each of the primary and secondary compute systems maintain their own replicated of the entire data locally, the data is separated in storage segments that are collectively distributed amongst multiple storage segment servers.
-
公开(公告)号:US20200097172A1
公开(公告)日:2020-03-26
申请号:US16138139
申请日:2018-09-21
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cristian DIACONU , Alejandro Hernandez SAENZ , Naveen PRAKASH , Alexander BUDOVSKI
Abstract: The mounting a drive to two or more computing systems. For instance, the drive may be mounted to a first computing system so as to be writable (and potentially readable) by the first computing system. But also, the drive is also mounted to one or more other computing systems so as to be only readable by those one or more computing systems. This allows for multiple computing systems to have access to the drive without risk that the data thereon will become corrupt. In one embodiment, the only user data stored on that drive is a single file of fixed size. Thus, even when user data is written into the fixed-size file, the management data stored (that keeps track of the files) on the drive does not change.
-
公开(公告)号:US20230376479A1
公开(公告)日:2023-11-23
申请号:US18362747
申请日:2023-07-31
Applicant: Microsoft Technology Licensing, LLC
Inventor: Craig S. FREEDMAN , Adrian-Leonard G. Radu , Daniel G. SCHALL , Hanumantha R. KODAVALLA , Panagiotis ANTONOPOULOS , Raghavendra Thallam KODANDARAMAIH , Alejandro Hernandez SAENZ , Naveen PRAKASH
IPC: G06F16/23 , G06F16/21 , G06F16/2455 , G06F16/27
CPC classification number: G06F16/2379 , G06F16/211 , G06F16/2455 , G06F16/27
Abstract: Distributed database systems including compute nodes and page servers are described herein that enable compute nodes to pushdown certain query processing compute tasks to the page servers to take advantage of otherwise idle compute resources at the page servers, and to reduce the quantity of data that moves between compute nodes and page servers. A distributed database system includes a page server and a compute node, wherein the page server is configured to maintain multiple versions of stored data objects. The compute node is configured to receive a query and generate a transaction context (TC) and modified table schemas (MTS) scoped to the query, and pushdown the query, TC and MTS to the page server that is configured to determine which data objects at the page server satisfy the query, and for each such object, which version of the object should be returned based on the TC.
-
-
-
-
-
-
-
-
-