Firmware corruption recovery
    Granted invention patent

    Publication No.: US12099612B2

    Publication date: 2024-09-24

    Application No.: US17794942

    Filing date: 2020-01-27

    IPC classification: G06F21/57 G06F11/14

    Abstract: In some examples, a computing device comprises a processing resource and a memory resource storing instructions to cause the processing resource to detect, by a basic input/output system (BIOS) of the computing device, firmware corruption in a firmware component of the computing device, generate a recovery agent based on the detected firmware corruption of the firmware component, determine a location of a back-up image of the firmware component based on the generated recovery agent, determine a recovery sequence based on the determined location of the back-up image of the firmware component, and recover the firmware of the firmware component by executing the determined recovery sequence.
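
    The flow in this abstract (detect corruption, create a recovery agent, locate the back-up image, choose a recovery sequence) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and is not taken from the patent: the SHA-256 comparison used to detect corruption, the `RecoveryAgent` class, and the component names, back-up locations, and step names in the two lookup tables.

```python
import hashlib
from dataclasses import dataclass
from typing import List

# Hypothetical lookup tables; the abstract does not name components,
# back-up locations, or recovery steps.
BACKUP_LOCATIONS = {
    "embedded_controller": "private_flash",
    "nic": "system_partition",
}
RECOVERY_SEQUENCES = {
    "private_flash": ["read_backup", "write_flash", "reboot"],
    "system_partition": ["mount_partition", "read_backup", "write_flash", "reboot"],
}


@dataclass
class RecoveryAgent:
    """Illustrative stand-in for the 'recovery agent' in the abstract."""
    component: str
    backup_location: str


def is_corrupted(image: bytes, expected_sha256: str) -> bool:
    # Assumed detection method: compare the image digest to a known-good value.
    return hashlib.sha256(image).hexdigest() != expected_sha256


def plan_recovery(component: str, image: bytes, expected_sha256: str) -> List[str]:
    """Return the recovery steps for a corrupted component, or [] if it is intact."""
    if not is_corrupted(image, expected_sha256):
        return []
    # The agent is created only after corruption is detected.
    agent = RecoveryAgent(component, BACKUP_LOCATIONS[component])
    # The sequence depends on where the back-up image is located.
    return RECOVERY_SEQUENCES[agent.backup_location]
```

    In this sketch the recovery sequence is only planned, not executed; a real BIOS would run each step against hardware.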

    NODE FAILURE SOURCE DETECTION IN DISTRIBUTED COMPUTING ENVIRONMENTS USING MACHINE LEARNING

    Publication No.: US20240289224A1

    Publication date: 2024-08-29

    Application No.: US18660839

    Filing date: 2024-05-10

    Applicant: RED HAT, INC.

    Inventor: Or Raz

    IPC classification: G06F11/14 G06N20/00

    Abstract: Sources of node failures in distributed computing environments can be determined using machine learning according to some aspects described herein. For example, prior to rebooting a node in a distributed computing environment, a computing system can execute a software agent to detect a failure with respect to the node. In response to detecting the failure, the computing system can input characteristics for the node into a trained machine learning model. The computing system can receive a source of the failure with respect to the node. The computing system can then automatically execute a recovery operation for the node based on the source of the failure.

    REPLICATION TECHNIQUES USING A METADATA LOG
    Published invention application

    Publication No.: US20240232020A1

    Publication date: 2024-07-11

    Application No.: US18095159

    Filing date: 2023-01-10

    IPC classification: G06F11/14 G06F16/182

    Abstract: Data replication techniques can include receiving, at a source system, a write directed to a source logical device configured for asynchronous remote replication to a destination system; performing processing that flushes a transaction log entry for the write; and performing replication processing that uses a replication queue including a replication queue entry corresponding to the write that stores the first content to a logical address. The processing can create a metadata (MD) log entry in a MD log for the write responsive to determining that the write is directed to the source logical device configured for asynchronous remote replication and that the first content has not been replicated. Responsive to the first content not being in cache, the first content can be retrieved using the reference to a storage location storing the first content. The reference can be obtained from the MD log entry or the replication queue entry.

    MEMORY RECOVERY PARTITIONS
    Published invention application

    Publication No.: US20240061748A1

    Publication date: 2024-02-22

    Application No.: US18351978

    Filing date: 2023-07-13

    IPC classification: G06F11/14 G06F21/60 G06F21/57

    Abstract: Methods, systems, and devices for memory recovery partitions are described. A memory system may include a memory array configured with one or more logical partitions. In some examples, a primary boot image may be stored to a first logical partition and a recovery boot image may be stored to a second logical partition. During a boot operation, the memory system may determine whether the primary boot image includes one or more errors. If the primary boot image includes relatively few (or no) errors, the memory system may boot using the primary boot image. If the primary boot image includes a relatively high quantity of errors (e.g., higher than a threshold quantity of errors), the memory system may autonomously load a recovery boot image stored to the second logical partition.
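
    The threshold decision described in this abstract can be sketched in a few lines of Python. The `BootImage` class, its `error_count` field, and the function name are hypothetical; the abstract does not say how errors are counted or what the threshold value is.

```python
from dataclasses import dataclass


@dataclass
class BootImage:
    name: str
    error_count: int  # hypothetical: number of errors found when checking the image


def select_boot_image(primary: BootImage, recovery: BootImage, threshold: int) -> BootImage:
    """Boot from the primary image unless its error count exceeds the threshold."""
    if primary.error_count > threshold:
        # Too many errors: fall back to the image in the second logical partition.
        return recovery
    return primary
```

    For example, with a threshold of 3, a primary image with 2 errors is still used, while one with 4 errors causes the recovery image to be selected.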

    Monitoring and reporting a status of a memory device

    Publication No.: US11880291B2

    Publication date: 2024-01-23

    Application No.: US17354690

    Filing date: 2021-06-22

    Abstract: Methods, systems, and devices for monitoring and reporting a status of a memory device are described. A memory device may include monitoring circuitry that may be configured to monitor health and wear information for the memory device. A host device may write to a dedicated register of the memory device, to configure the memory device with health status information reporting parameters. The memory device may monitor and report the health status information of the memory device based on the received reporting configuration or based on a default configuration, and may write one or more values indicative of the health status information to a dedicated register. The host device may perform a read on the readout register to obtain the health status information, as indicated by the one or more values, and may adjust operating procedures or take other actions based on the received health status information.

    METHOD, SYSTEM AND APPARATUS FOR FAULT POSITIONING IN STARTING PROCESS OF SERVER

    Publication No.: US20240012706A1

    Publication date: 2024-01-11

    Application No.: US18036573

    Filing date: 2021-09-28

    Inventor: Hongrui HAN

    IPC classification: G06F11/07 G06F11/14 G06F11/22

    Abstract: A method, system and apparatus for fault location in a boot process of a server, the method including: dividing a whole boot process of a server into a plurality of stages in advance, and setting a respective fault monitoring policy for each of the stages (S1); when the boot process proceeds to a target stage, monitoring a current boot process of the server according to the fault monitoring policy corresponding to the target stage, and obtaining a fault monitoring result corresponding to the target stage (S2); and in response to a boot failure of the server, performing server fault location according to the obtained fault monitoring result (S3).
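
    Steps S1 to S3 can be illustrated with a small Python sketch. The stage names, the two example policies, the 30-second limit, and the shape of the observation dictionaries are all assumptions chosen for illustration; the abstract does not specify them.

```python
from typing import Callable, Dict, List, Optional, Tuple

# A policy inspects one stage's observations and returns a fault
# description, or None if the stage looks healthy.
Policy = Callable[[dict], Optional[str]]


def power_policy(obs: dict) -> Optional[str]:
    return None if obs.get("voltage_ok", False) else "power rail out of range"


def post_policy(obs: dict) -> Optional[str]:
    # Hypothetical 30-second limit for the self-test stage.
    return "POST timed out" if obs.get("elapsed_s", 0) > 30 else None


# S1: the boot process is divided into stages, each with its own policy.
STAGES: List[Tuple[str, Policy]] = [("power_on", power_policy), ("post", post_policy)]


def monitor_boot(stages: List[Tuple[str, Policy]],
                 observations: Dict[str, dict]) -> Dict[str, Optional[str]]:
    """S2: apply each stage's policy to the stages the boot actually reached."""
    results: Dict[str, Optional[str]] = {}
    for name, policy in stages:
        if name not in observations:  # boot never reached this stage
            break
        results[name] = policy(observations[name])
    return results


def locate_fault(results: Dict[str, Optional[str]]) -> Optional[Tuple[str, str]]:
    """S3: after a boot failure, report the first stage with a recorded fault."""
    for name, finding in results.items():
        if finding is not None:
            return name, finding
    return None
```

    Keeping one result per stage means the fault can be attributed to a specific stage after the failure, rather than inferred from a single end-of-boot log.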

    BOOT DATA READING SYSTEM, BOOT DATA READING METHOD, AND PROCESSOR CIRCUIT

    Publication No.: US20230409428A1

    Publication date: 2023-12-21

    Application No.: US18314128

    Filing date: 2023-05-08

    IPC classification: G06F11/14 G06F9/4401

    Abstract: A boot data reading system includes a storage circuit and a processor circuit. The storage circuit is configured to store first boot data and second boot data. The first boot data includes a first segment and a second segment. The second boot data includes a third segment, and the third segment corresponds to the first segment. The processor circuit reads the first segment and determines whether the first segment is correct or not. When the first segment is correct, the processor circuit reads the second segment and determines whether the second segment is correct or not. When the first segment is incorrect, the processor circuit reads the third segment and determines whether the third segment is correct or not.
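
    The segment-by-segment fallback in this abstract can be sketched as follows. The CRC-32 check is an assumption: the abstract does not say how a segment is judged correct. The `Segment` type and the function names are likewise hypothetical.

```python
import zlib
from typing import List, NamedTuple


class Segment(NamedTuple):
    payload: bytes
    crc: int  # assumed integrity field; not specified in the abstract


def make_segment(payload: bytes) -> Segment:
    return Segment(payload, zlib.crc32(payload))


def is_correct(seg: Segment) -> bool:
    return zlib.crc32(seg.payload) == seg.crc


def read_boot_segments(first: Segment, second: Segment, third: Segment) -> List[str]:
    """Return the names of the segments that were read and verified, in order."""
    accepted: List[str] = []
    if is_correct(first):
        accepted.append("first")
        # Only after the first segment verifies is the second one read.
        if is_correct(second):
            accepted.append("second")
    elif is_correct(third):
        # The third segment is the copy of the first held in the second boot data.
        accepted.append("third")
    return accepted
```

    The sketch stops where the abstract stops: it does not say what happens after the third segment is checked, so nothing further is modeled here.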