-
公开(公告)号:US10783410B1
公开(公告)日:2020-09-22
申请号:US16779563
申请日:2020-01-31
Applicant: CORE SCIENTIFIC, INC.
Inventor: Eric Hullander
Abstract: A system and method for managing large numbers of computing devices in a data center are disclosed. The computing devices are configured to flash their indicator lights in a pattern that encodes a device ID, and an image capture device such as a mobile phone or tablet captures the flashes in a series of images/video of the data center. The images/video are processed to create a three-dimensional (3D) model of the data center with computing device IDs positioned therein. The 3D model, including correctly positioned device ID indicators, can be rendered for the user of the mobile device to enable the user to more easily identify computing device locations.
-
公开(公告)号:US11249835B2
公开(公告)日:2022-02-15
申请号:US16879157
申请日:2020-05-20
Applicant: CORE SCIENTIFIC, INC.
Inventor: Ian Ferreira , Ganesh Balakrishnan , Evan Adams , Carla Cortez , Eric Hullander
Abstract: A management device for managing a plurality of computing devices in a data center may comprise a network interface, a first module that periodically sends health status queries to the computing devices via the network interface, a second module configured to receive responses to the health status queries and collect and store health status data for the computing devices, a third module configured to create support tickets, and/or a fourth module configured to (i) create and periodically update a Cox proportional hazards (CPH) model based on the health status data; (ii) apply a deep neural network (DNN) to the input of the CPH model; (iii) determine a probability of failure for each computing device; (iv) compare each probability of failure with a threshold; and (v) cause the third module to generate a pre-failure support ticket for each computing device having determined probabilities of failure above the threshold.
-
公开(公告)号:US10691528B1
公开(公告)日:2020-06-23
申请号:US16776213
申请日:2020-01-29
Applicant: CORE SCIENTIFIC, INC.
Inventor: Ian Ferreira , Ganesh Balakrishnan , Evan Adams , Carla Cortez , Eric Hullander
Abstract: A system and method for automating management and repair of a plurality of computing devices located in a data center is disclosed. Health status queries are issued for one or more of the computing devices. If responses not indicative of good device health are received, one or more repair instructions are automatically sent to the unhealthy computing device to repair the computing device by moving it to an acceptable state. If the repair instructions are not successful, a support ticket is automatically generated for the corresponding computing device or devices. Problematic statuses across areas of the data center may be detected and ticketed in addition to individual problematic devices. So-called repeat offender devices may be detected and ticketed even if the repair instructions are successful.
-
公开(公告)号:US20210240953A1
公开(公告)日:2021-08-05
申请号:US16992093
申请日:2020-08-12
Applicant: CORE SCIENTIFIC, INC.
Inventor: Eric Hullander
Abstract: A system and method for managing large numbers of computing devices in a data center are disclosed. The computing devices are configured to flash their indicator lights in a pattern that encodes a device ID, and an image capture device such as a mobile phone or tablet captures the flashes in a series of images/video of the data center. The images/video are processed to create a three-dimensional (3D) model of the data center with computing device IDs positioned therein. The 3D model, including correctly positioned device ID indicators, can be rendered for the user of the mobile device to enable the user to more easily identify computing device locations.
-
公开(公告)号:US20210026729A1
公开(公告)日:2021-01-28
申请号:US16879157
申请日:2020-05-20
Applicant: CORE SCIENTIFIC, INC.
Inventor: Ian Ferreira , Ganesh Balakrishnan , Evan Adams , Carla Cortez , Eric Hullander
IPC: G06F11/07
Abstract: A management device for managing a plurality of computing devices in a data center may comprise a network interface, a first module that periodically sends health status queries to the computing devices via the network interface, a second module configured to receive responses to the health status queries and collect and store health status data for the computing devices, a third module configured to create support tickets, and/or a fourth module configured to (i) create and periodically update a Cox proportional hazards (CPH) model based on the health status data; (ii) apply a deep neural network (DNN) to the input of the CPH model; (iii) determine a probability of failure for each computing device; (iv) compare each probability of failure with a threshold; and (v) cause the third module to generate a pre-failure support ticket for each computing device having determined probabilities of failure above the threshold.
-
-
-
-