Method and sequencer for detecting a malfunction occurring in a high performance computer

    Publication No.: US10152365B2

    Publication Date: 2018-12-11

    Application No.: US14944385

    Filing Date: 2015-11-18

    Applicant: BULL SAS

    Abstract: A method for monitoring the operation of an IT infrastructure that includes a plurality of calculation nodes. The method includes: selecting calculation nodes for performing a calculation; performing the calculation via the selected calculation nodes; attributing, via a sequencer, a score to each calculation node that participated in the calculation, each score reflecting a difference between a measured operating parameter of that node and a reference operating parameter of that same node; and verifying the operation of the participating calculation nodes using the scores attributed to them.
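
The scoring and verification steps of the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say which operating parameter is measured, how the difference is computed, or how scores are used in verification. The sketch assumes the parameter is a run time in seconds, the score is the relative deviation from the node's own reference, and a node is flagged when the absolute score exceeds a fixed threshold. The names `NodeSample`, `attribute_scores`, `verify_nodes` and the value `0.2` are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class NodeSample:
    node_id: str
    measured: float   # measured operating parameter (assumed: run time in seconds)
    reference: float  # reference value for this same node (assumed known and non-zero)


def attribute_scores(samples):
    """Score each node by its relative deviation from its own reference (assumed formula)."""
    return {s.node_id: (s.measured - s.reference) / s.reference for s in samples}


def verify_nodes(scores, threshold=0.2):
    """Return the ids of nodes whose absolute score exceeds the threshold (hypothetical rule)."""
    return sorted(node for node, score in scores.items() if abs(score) > threshold)


# Nodes that took part in one calculation, with illustrative (made-up) values.
samples = [
    NodeSample("node-a", measured=10.2, reference=10.0),
    NodeSample("node-b", measured=14.0, reference=10.0),
    NodeSample("node-c", measured=9.9, reference=10.0),
]
scores = attribute_scores(samples)
suspects = verify_nodes(scores)
print(suspects)
```

With these sample values only `node-b` deviates from its reference by more than 20%, so it is the only node flagged for further checking.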
