VIRTUAL MACHINE FAILOVER MANAGEMENT FOR GEO-REDUNDANT DATA CENTERS

    公开(公告)号:US20240160538A1

    公开(公告)日:2024-05-16

    申请号:US18423112

    申请日:2024-01-25

    Applicant: Rubrik, Inc.

    Abstract: A data storage infrastructure may establish a partition that includes a first data center and a second data center that is geographically separated from the first data center. The data storage infrastructure may replicate a full snapshot and one or more incremental snapshots of a virtual machine from a first data management platform to a second data management platform, where the virtual machine is migrated from a first host of the first host group to a second host of the second host group upon a failover event occurring at the first data center. The data storage infrastructure may then capture an incremental snapshot of the virtual machine based on linking a first instance of the virtual machine that was replicated from the first data management platform and a second instance of the virtual machine that is managed by the second data management platform.

    MANAGING REFERENCE SNAPSHOTS ACROSS MULTIPLE SITES FOR EFFICIENT FAILOVER/FAILBACK

    公开(公告)号:US20240143460A1

    公开(公告)日:2024-05-02

    申请号:US17977248

    申请日:2022-10-31

    Applicant: Nutanix, Inc.

    CPC classification number: G06F11/2025 G06F11/1461 G06F11/1464 G06F2201/84

    Abstract: A technique provides network efficient data failover by explicitly protecting one or more common snapshot references at sites of a multi-site data replication environment to improve granularity of control of recovery point objectives (RPO) for data across the sites. A common snapshot reference or recovery point (RP) ensures that, in the event of failure to a site, data designated for failover may be quickly protected by replicating only small incremental changes to the RP so as to maintain RPO requirements across the sites. Illustratively, the technique enhances and extends a disaster recovery (DR) application programming interface (API) protocol through an extension that defines and applies a tag to the RP, wherein the tag enables protection and/or preservation of the RP by ensuring that the sites honor the tag applied to the RP. The tag essentially functions as an advisory lock for the RP that is shared among the sites to prevent deletion of the RP at the sites throughout the duration of the lock.

    COST-EFFECTIVE, FAILURE-AWARE RESOURCE ALLOCATION AND RESERVATION IN THE CLOUD

    公开(公告)号:US20240070038A1

    公开(公告)日:2024-02-29

    申请号:US17898824

    申请日:2022-08-30

    Applicant: NetApp, Inc.

    Abstract: Systems and methods for an improved HA resource reservation approach are provided. According to one embodiment, for a given cluster of greater than two nodes in which a number (f) of concurrent node failures are to be tolerated, more efficient utilization of resources for an HA system may be achieved by distributing HA reserved capacity across more than f nodes of the cluster rather than naively concentrating the HA reserved capacity in f nodes. As node failures are not a common occurrence, those of the nodes of the cluster having HA reserved capacity may allow for some bursting of one or more units of compute executing thereon unless or until f concurrent node failures occur, thereby promoting more efficient utilization of node resources.

    Blockchain enabled fault tolerance

    公开(公告)号:US11914488B2

    公开(公告)日:2024-02-27

    申请号:US17774160

    申请日:2019-11-06

    Inventor: Jerry Wald

    CPC classification number: G06F11/2025 G06F11/2094 G06F2201/805

    Abstract: Provided is a system, method, and computer program product for handling fault tolerance in a blockchain enabled network system. The system includes a computing system with at least one of a plurality of processors arranged as an active processor node, at least one data storage device including a first ledger corresponding to a first blockchain and a second ledger corresponding to a second blockchain, at least one standby processor node, and at least one standby data storage device. The at least one active processor node is programmed or configured to analyze and record blocks corresponding to data received through the network system on the first ledger and the second ledger such that the first ledger and the second ledger have matching data, detect at least one failure or anticipated failure, and in response to detecting the at least one failure or anticipated failure, generating a switch-over command.

    State management methods, methods for switching between master application server and backup application server, and electronic devices

    公开(公告)号:US11892922B2

    公开(公告)日:2024-02-06

    申请号:US17789780

    申请日:2020-12-30

    CPC classification number: G06F11/2038 G06F11/2025

    Abstract: The present disclosure provides a state management method, a method for switching between a master application server and a backup application server, and an electronic device. In present disclosure, the management server updates the recorded backup application server state in time by querying for the connection state of the hot-backup connection between the master application server and the backup application server, and when the master application server is in failure, instead of immediately controlling the master application server and the backup application server to perform switching between the master and backup application servers, the management server controls the master application server and the backup application server to perform master-backup switching between the application servers according to the recorded backup application server state.

Patent Agency Ranking