System and method for inspection of system state during testing

    Publication No.: US09747181B2

    Publication date: 2017-08-29

    Application No.: US14596910

    Filing date: 2015-01-14

    Applicant: Red Hat, Inc.

    IPC classification: G06F11/00 G06F11/22 G06F11/30

    Abstract: A system and method for inspecting system state during testing includes determining one or more inspection modules for examining respective portions of a state of the system using a test inspector, initializing each of the inspection modules, saving the respective portions of the state of the system using the inspection modules, executing a test of the system, checking the respective portions of the state of the system using the inspection modules, and repeating the saving, executing, and checking for each additional test of the system. The test inspector is executed by one or more processors of the system. In some examples, saving a first one of the respective portions of the state of the system includes determining state variables and corresponding values associated with the first respective portion of the state of the system and saving the state variables and corresponding values in a state repository.
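    The save, execute, check loop in this abstract can be sketched as follows. This is a minimal illustration and not the patented implementation: the names `InspectionModule`, `run_tests`, and the dictionary standing in for the state repository are all assumptions introduced here.

```python
from typing import Callable, Dict, List

_MISSING = object()  # marks a state variable absent from one side of the comparison


class InspectionModule:
    """Hypothetical module that inspects one portion of the system state."""

    def __init__(self, name: str, read_state: Callable[[], Dict[str, object]]):
        # Initialization: bind the module to the state portion it examines.
        self.name = name
        self.read_state = read_state
        self.repository: Dict[str, object] = {}  # stand-in for a state repository

    def save(self) -> None:
        # Record the state variables and their current values.
        self.repository = dict(self.read_state())

    def check(self) -> List[str]:
        # Return the names of variables whose values differ from the saved ones.
        current = self.read_state()
        keys = set(self.repository) | set(current)
        return sorted(
            k for k in keys
            if self.repository.get(k, _MISSING) != current.get(k, _MISSING)
        )


def run_tests(modules: List[InspectionModule], tests: List[Callable[[], None]]):
    """Save state, execute a test, check state; repeat for each test."""
    report = {}
    for test in tests:
        for module in modules:
            module.save()
        test()
        report[test.__name__] = {m.name: m.check() for m in modules}
    return report


# Usage: one module watches a fake environment; one test leaks a change.
env = {"PATH": "/usr/bin", "LANG": "C"}


def clean_test():
    pass


def leaky_test():
    env["LANG"] = "en_US"


report = run_tests([InspectionModule("env", lambda: env)], [clean_test, leaky_test])
print(report)
```

    Saving a fresh snapshot before every test means each test is only blamed for changes it made itself, which matches the per-test repetition the abstract describes.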

    INFORMATION PROCESSING DEVICE AND METHOD OF STORING FAILURE INFORMATION

    Publication No.: US20170235655A1

    Publication date: 2017-08-17

    Application No.: US15398089

    Filing date: 2017-01-04

    Applicant: FUJITSU LIMITED

    IPC classification: G06F11/22

    CPC classification: G06F11/2268 G06F11/2284

    Abstract: An information processing device includes a processor configured to perform a diagnosis of hardware of the information processing device. The processor is configured to generate plural pieces of failure information. The plural pieces of failure information are classified into groups corresponding to different importance levels. The processor is configured to store the plural pieces of failure information in consecutive storage areas. The consecutive storage areas are divided into storage sections corresponding to the respective groups in order of importance level. The processor is configured to store a first piece of failure information at the head of a second storage section in the absence of free areas in a first storage section. The first storage section is secured for a first group including the first piece of failure information. The second storage section is secured for a second group corresponding to a second importance level that is one level lower than the importance level of the first group.
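    The sectioned storage layout can be sketched as below. This is one possible reading of the abstract, under stated assumptions: the class `FailureLog`, its slot list, and the choice to overwrite whatever already sits at the head of the lower-importance section are illustrative and are not taken from the patent text.

```python
from typing import List, Optional


class FailureLog:
    """Hypothetical failure log: one run of consecutive slots split into sections."""

    def __init__(self, section_sizes: List[int]):
        # section_sizes[0] belongs to the most important group, and so on downward.
        self.sizes = list(section_sizes)
        self.starts: List[int] = []
        offset = 0
        for size in self.sizes:
            self.starts.append(offset)
            offset += size
        self.slots: List[Optional[str]] = [None] * offset
        self.used = [0] * len(self.sizes)

    def store(self, level: int, info: str) -> Optional[int]:
        """Store info for the given importance level; return the slot index used."""
        if self.used[level] < self.sizes[level]:
            # Free area available in the section secured for this group.
            index = self.starts[level] + self.used[level]
            self.used[level] += 1
        elif level + 1 < len(self.sizes):
            # Section full: write at the head of the next lower section.
            # Assumption: any entry already at that head is overwritten.
            index = self.starts[level + 1]
            self.used[level + 1] = max(self.used[level + 1], 1)
        else:
            # Lowest level is full and has nowhere to spill; the entry is dropped.
            return None
        self.slots[index] = info
        return index


# Usage: three sections of two slots each.
log = FailureLog([2, 2, 2])
log.store(0, "cpu-a")
log.store(0, "cpu-b")
log.store(1, "fan")
spill_index = log.store(0, "cpu-c")  # level 0 is full, so this lands in section 1
print(log.slots)
```

    Under this reading, more important entries can displace less important ones when space runs out, but never the reverse.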

    Systems and methods for smart diagnoses and triage of failures with identity continuity

    Publication No.: US09665452B2

    Publication date: 2017-05-30

    Application No.: US14742253

    Filing date: 2015-06-17

    IPC classification: G06F11/00 G06F11/22

    Abstract: Systems and methods for smart diagnoses and triage of failures with identity continuity. In some embodiments, an Information Handling System (IHS) includes a processor and a memory coupled to the processor, the memory having program instructions stored thereon that, upon execution by the processor, cause the IHS to: execute a Power-On Self Test (POST) routine; in response to a determination that the POST routine has failed, execute a firmware-based diagnostics routine; in response to a determination that the firmware-based diagnostics routine has failed, execute, via a service Operating System (OS), a service OS-based diagnostics routine configured to identify whether the firmware-based diagnostics failure is due to a hardware or software fault; and in response to the service OS-based diagnostics routine identifying a hardware fault or failing to remediate a software fault, obtain a user's account information and report the hardware fault or the software remediation failure.
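    The escalation chain in this abstract (POST, then firmware diagnostics, then service OS diagnostics, then reporting) can be sketched as a short decision function. Every callable and return label below is a hypothetical stand-in; the abstract does not say what happens when the firmware diagnostics pass, so that branch is an assumption.

```python
from enum import Enum
from typing import Callable


class Fault(Enum):
    HARDWARE = "hardware"
    SOFTWARE = "software"


def triage(
    post_ok: Callable[[], bool],
    firmware_ok: Callable[[], bool],
    service_os_diagnose: Callable[[], Fault],
    remediate: Callable[[], bool],
    get_account: Callable[[], str],
    report: Callable[[str, Fault], None],
) -> str:
    """Walk the escalation chain and return a label for the outcome."""
    if post_ok():
        return "boot"
    if firmware_ok():
        # Assumption: nothing further is escalated when firmware diagnostics pass.
        return "firmware-diagnostics-passed"
    fault = service_os_diagnose()
    if fault is Fault.SOFTWARE and remediate():
        return "remediated"
    # Hardware fault, or a software fault that could not be remediated:
    # obtain the user's account information and report.
    report(get_account(), fault)
    return "reported"


# Usage: POST and firmware diagnostics both fail, service OS finds a hardware fault.
reports = []
outcome = triage(
    post_ok=lambda: False,
    firmware_ok=lambda: False,
    service_os_diagnose=lambda: Fault.HARDWARE,
    remediate=lambda: False,
    get_account=lambda: "user-123",
    report=lambda account, fault: reports.append((account, fault)),
)
print(outcome, reports)
```

    Passing each stage in as a callable keeps the ordering logic separate from the stages themselves, so each later stage runs only after the earlier one has failed.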

    IDENTIFYING ROOT CAUSES OF FAILURES IN A DEPLOYED DISTRIBUTED APPLICATION USING HISTORICAL FINE GRAINED MACHINE STATE DATA

    Document type: Invention application (legal status: in force)

    Publication No.: US20170075744A1

    Publication date: 2017-03-16

    Application No.: US14852006

    Filing date: 2015-09-11

    Abstract: Methods and arrangements for identifying root causes of system failures in a distributed system, the method including: utilizing at least one processor to execute computer code that performs the steps of: recording, in a storage device, collected machine state data, wherein the collected machine state data are added to historical machine state data; creating, based on the historical machine state data, a healthy map model; detecting at least one failed machine state in the distributed system; comparing the failed machine state against the healthy map model; identifying, based on the comparison, at least one root cause of the failed machine state; and displaying, on a display device, a ranked list comprising the at least one root cause. Other variants and embodiments are broadly contemplated herein.
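    The record, model, compare, rank sequence can be sketched with a simple frequency model. This is an illustrative simplification and not the "healthy map model" of the application: the functions `build_healthy_model` and `rank_root_causes`, the flat key/value machine states, and the rarity score are all assumptions made here.

```python
from collections import Counter
from typing import Dict, List, Tuple

State = Dict[str, object]


def build_healthy_model(history: List[State]) -> Dict[str, Counter]:
    """Count how often each (variable, value) pair appears in healthy states."""
    model: Dict[str, Counter] = {}
    for state in history:
        for key, value in state.items():
            model.setdefault(key, Counter())[value] += 1
    return model


def rank_root_causes(
    model: Dict[str, Counter], failed: State, total: int
) -> List[Tuple[str, object, float]]:
    """Rank variables of a failed state by how rare their values were when healthy."""
    scored = []
    for key, value in failed.items():
        seen = model.get(key, Counter())[value]  # 0 if never observed
        score = 1.0 - seen / total               # 1.0 means never seen while healthy
        if score > 0:
            scored.append((score, key, value))
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [(key, value, round(score, 2)) for score, key, value in scored]


# Usage: four recorded healthy states, then one failed state to explain.
history = [
    {"pkg": "1.8", "port": 80, "mem_mb": 512},
    {"pkg": "1.8", "port": 80, "mem_mb": 512},
    {"pkg": "1.8", "port": 80, "mem_mb": 1024},
    {"pkg": "1.8", "port": 80, "mem_mb": 512},
]
failed_state = {"pkg": "1.9", "port": 80, "mem_mb": 1024}

model = build_healthy_model(history)
ranked = rank_root_causes(model, failed_state, total=len(history))
for key, value, score in ranked:
    print(f"{key}={value!r} (score {score})")
```

    Values that always appeared in healthy states score zero and drop out, so the displayed list contains only the deviations, with the most unusual first.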
