Memory-controller-embedded apparatus and procedure for achieving system-directed checkpointing without operating-system kernel support
    1.
    再颁专利
    Memory-controller-embedded apparatus and procedure for achieving system-directed checkpointing without operating-system kernel support 有权
    内存控制器嵌入式设备和程序,用于在无需操作系统内核支持的情况下实现系统导向的检查点

    公开(公告)号:USRE45632E1

    公开(公告)日:2015-07-28

    申请号:US13683670

    申请日:2012-11-21

    IPC分类号: G06F12/00 G06F11/14

    摘要: System-directed checkpointing is enabled in otherwise standard computers through relatively straightforward augmentations to the computer's memory controller hub. Firmware routines executed by a control and dispatch unit that is normally part of any memory controller hub enable it to implement any of six different checkpointing strategies: post-image checkpointing in which an image of the system state at the time of the last checkpoint is maintained in a local shadow memory; post-image checkpointing in which an image of the system state at the time of the last checkpoint is maintained in a shadow memory located in a second, backup computer; post-image checkpointing using a bit-map memory, having one bit representing each data block in system memory, to reduce the amount of memory-to-memory copying required to establish a checkpoint; post-image checkpointing to a local shadow memory using two bit map memories to enable normal processing to continue while the shadow is being updated, post-image checkpointing to a local shadow memory using a block-state memory that eliminates the need for any memory-to-memory copying; and local pre-image checkpointing that does not require a shadow memory. Since each of these implementations has advantages and disadvantages relative to the others and since similar mechanisms are used in the memory controller hub for all of these options, it can be designed to support all of them with hardwired or settable status bits defining which is to be supported in a given situation.

    摘要翻译: 通过对计算机的存储器控​​制器集线器的相对直接的增加,在其他标准计算机中启用系统导向的检查点。 由通常是任何存储器控制器集线器的一部分的控制和调度单元执行的固件例程使其能够实现六种不同的检查点策略中的任何一种:后维护点,其中保持最后一个检查点时的系统状态的图像被维持 在当地的影子记忆中; 图像检查点,其中在最后检查点时系统状态的图像被保持在位于第二备份计算机中的影子存储器中; 使用位图存储器的后图像检查点,具有表示系统存储器中的每个数据块的一个位,以减少建立检查点所需的存储器到存储器复制的量; 使用两个位图存储器对局部阴影存储器进行后映像检查点,以在阴影更新期间使正常处理能够继续,使用块状态存储器将映像后检查点定位到本地阴影存储器,从而消除对任何存储器 - 内存复制; 和不需要影子内存的本地预图像检查点。 由于这些实现中的每一个都具有相对于其他实现的优点和缺点,并且由于用于所有这些选项的存储器控​​制器集线器中使用类似的机制,所以可以将其设计为用硬连线或可设置的状态位来支持所有这些, 在特定情况下得到支持。

    System-directed checkpointing implementation using a hypervisor layer
    2.
    发明授权
    System-directed checkpointing implementation using a hypervisor layer 有权
    使用管理程序层进行系统导向的检查点实现

    公开(公告)号:US08966315B2

    公开(公告)日:2015-02-24

    申请号:US13767415

    申请日:2013-02-14

    摘要: While system-directed checkpointing can be implemented in various ways, for example by adding checkpointing support in the memory controller or in the operating system in otherwise standard computers, implementation at the hypervisor level enables the necessary state information to be captured efficiently while providing a number of ancillary advantages over those prior-art methods. This disclosure details procedures for realizing those advantages through relatively minor modifications to normal hypervisor operations. Specifically, by capturing state information in a guest-operating-system-specific manner, any guest operating system can be rolled back independently and resumed without losing either program or input/output (I/O) continuity and without affecting the operation of the other operating systems or their associated applications supported by the same hypervisor. Similarly, by managing I/O queues as described herein, rollback can be accomplished without requiring I/O operations to be repeated and I/O device failures can be circumvented without losing any I/O data in the process.

    摘要翻译: 虽然可以以各种方式实现系统导向的检查点,例如通过在存储器控制器中或在其他标准计算机中的操作系统中添加检查点支持,但在管理程序级别的实现使得能够有效地捕获必要的状态信息,同时提供数字 优于现有技术的辅助优点。 本公开详细描述了通过对正常管理程序操作的相对较小的修改来实现这些优点的过程。 具体来说,通过以特定于客户机操作的方式捕获状态信息,任何客户操作系统都可以独立地回滚并恢复,而不会丢失任何程序或输入/输出(I / O)连续性,而不影响其他操作系统的操作 操作系统或其相关应用程序由相同的管理程序支持。 类似地,通过如本文所述管理I / O队列,可以在不需要重复I / O操作的情况下实现回滚,并且可以避免I / O设备故障而不会丢失该过程中的任何I / O数据。

    SYSTEM-DIRECTED CHECKPOINTING IMPLEMENTATION USING A HYPERVISOR LAYER
    3.
    发明申请
    SYSTEM-DIRECTED CHECKPOINTING IMPLEMENTATION USING A HYPERVISOR LAYER 有权
    使用高级层的系统指导检查实现

    公开(公告)号:US20130166951A1

    公开(公告)日:2013-06-27

    申请号:US13767415

    申请日:2013-02-14

    IPC分类号: G06F11/14

    摘要: While system-directed checkpointing can be implemented in various ways, for example by adding checkpointing support in the memory controller or in the operating system in otherwise standard computers, implementation at the hypervisor level enables the necessary state information to be captured efficiently while providing a number of ancillary advantages over those prior-art methods. This disclosure details procedures for realizing those advantages through relatively minor modifications to normal hypervisor operations. Specifically, by capturing state information in a guest-operating-system-specific manner, any guest operating system can be rolled back independently and resumed without losing either program or input/output (I/O) continuity and without affecting the operation of the other operating systems or their associated applications supported by the same hypervisor. Similarly, by managing I/O queues as described herein, rollback can be accomplished without requiring I/O operations to be repeated and I/O device failures can be circumvented without losing any I/O data in the process.

    摘要翻译: 虽然可以以各种方式实现系统导向的检查点,例如通过在存储器控制器中或在其他标准计算机中的操作系统中添加检查点支持,但在管理程序级别的实现使得能够有效地捕获必要的状态信息,同时提供数字 优于现有技术的辅助优点。 本公开详细描述了通过对正常管理程序操作的相对较小的修改来实现这些优点的过程。 具体来说,通过以特定于客户机操作的方式捕获状态信息,任何客户操作系统都可以独立地回滚并恢复,而不会丢失任何程序或输入/输出(I / O)连续性,而不影响其他操作系统的操作 操作系统或其相关应用程序由相同的管理程序支持。 类似地,通过如本文所述管理I / O队列,可以在不需要重复I / O操作的情况下实现回滚,并且可以避免I / O设备故障而不会丢失该过程中的任何I / O数据。