Detecting lagging nodes in a time-synchronized distributed environment
摘要:
Systems and methods include determining log data for a time period at a plurality of senders, wherein each sender is a node in the cloud-based system, and the log data is associated with one or more cloud services; providing the log data to one or more storage clusters, via one or more distributors, for the time period; responsive to all of the plurality of senders performing the providing, moving to a next time period and repeating the determining and the providing; detecting a given sender is a faulty data source or a slow data source; and moving the given sender to a deferred processing list where the given sender does not hold up the moving to the next time period.
信息查询
0/0