Fault tolerant duplex computer system and its control method
    61.
    发明授权
    Fault tolerant duplex computer system and its control method 失效
    容错双工计算机系统及其控制方法

    公开(公告)号:US07751310B2

    公开(公告)日:2010-07-06

    申请号:US11312116

    申请日:2005-12-20

    申请人: Masahiro Yoshida

    发明人: Masahiro Yoshida

    摘要: There is disclosed a fault tolerant duplex computer system capable of increasing accuracy of processing to be continued by collecting trouble information without stopping duplex running. CPU's (112, 122), memories (113, 123), and IO processors (114, 124) of systems (110, 120) announce a reparable trouble to fault diagnosis processors (116, 126) when the generated trouble can be repaired, and an irreparable trouble when the generated trouble cannot be repaired. When an out-of-sync situation is confirmed, the out-of-sync situation is announced. A fault monitoring section (130) updates reparable trouble information (131) of a relevant system when the reparable trouble is received, and irreparable trouble information (132) of a relevant system when the irreparable trouble is received. Upon reception of the out-of-sync situation, a synchronous processing instruction is made by setting the system of a smaller amount of trouble information as an active system and the system of a larger number of trouble information as a standby system.

    摘要翻译: 公开了一种容错双工计算机系统,其能够通过在不停止双工运行的情况下收集故障信息来提高要继续的处理精度。 当所产生的故障可以被修复时,系统(110,120)的CPU(112,122),存储器(113,123)和IO处理器(114,124)向故障诊断处理器(116,126)通知可修复的故障, 当无法修复产生的故障时,无法弥补的麻烦。 当同步异常情况得到确认时,就会宣布不同步的情况。 当接收到可修复的故障时,故障监视部分(130)更新相关系统的可修复故障信息(131),以及当接收到不可修复的故障时相关系统的不可修复的故障信息(132)。 在接收到不同步的情况下,通过将较小数量的故障信息的系统设置为活动系统,并将更多数量的故障信息的系统设置为备用系统来进行同步处理指令。

    Self-Monitoring Cluster of Network Security Devices
    62.
    发明申请
    Self-Monitoring Cluster of Network Security Devices 审中-公开
    网络安全设备的自我监控集群

    公开(公告)号:US20100162036A1

    公开(公告)日:2010-06-24

    申请号:US12643548

    申请日:2009-12-21

    IPC分类号: G06F11/07 G06F15/16

    摘要: A computing device may be joined to a cluster by discovering the device, determining whether the device is eligible to join the cluster, configuring the device, and assigning the device a cluster role. A device may be assigned to act as a cluster master, backup master, active device, standby device, or another role. The cluster master may be configured to assign tasks, such as network flow processing to the cluster devices. The cluster master and backup master may maintain global, run-time synchronization data pertaining to each of the network flows, shared resources, cluster configuration, and the like. The devices within the cluster may monitor one another. Monitoring may include transmitting status messages comprising indicators of device health to the other devices in the cluster. In the event a device satisfies failover conditions, a failover operation to replace the device with another standby device, may be performed.

    摘要翻译: 计算设备可以通过发现设备,确定设备是否有资格加入集群,配置设备以及为设备分配集群角色而与集群相连。 可以将设备分配为集群主控,备份主控,主动设备,备用设备或其他角色。 集群主机可以被配置为向集群设备分配诸如网络流处理的任务。 集群主备份主机可以维护与每个网络流,共享资源,集群配置等相关的全局运行时同步数据。 群集中的设备可能会相互监视。 监视可以包括将包括设备运行状况的指示符的状态消息发送到集群中的其他设备。 在设备满足故障切换条件的情况下,可以执行将设备替换为另一备用设备的故障转移操作。

    Fast Paxos recovery
    63.
    发明授权
    Fast Paxos recovery 有权
    快速Paxos恢复

    公开(公告)号:US07555516B2

    公开(公告)日:2009-06-30

    申请号:US10996350

    申请日:2004-11-23

    申请人: Leslie B. Lamport

    发明人: Leslie B. Lamport

    IPC分类号: G06F15/16 G06F15/173

    摘要: A distributed computing system can achieve consensus while introducing fewer message delays by using an algorithm that allows the constituent devices to vote on functions received directly from one or more clients. If a conflict occurs, a leader device from among the devices can be selected such that the leader device already knows of the other devices' previous votes, and can determine an appropriate function to propose, using an immediately subsequent proposal number, without performing the first phase of the Paxos algorithm. Alternatively, each device can independently determine, by using the same repeatable mechanism used by a leader device, what function the leader device would propose, and can then vote for that function using the immediately subsequent proposal number. If the devices' votes again result in a conflict, the Paxos algorithm can be used, or additional iterations can be performed prior to resorting to the Paxos algorithm.

    摘要翻译: 分布式计算系统可以通过使用允许组成设备对从一个或多个客户端直接接收的功能进行投票的算法来实现共同点,同时引入更少的消息延迟。 如果发生冲突,则可以选择来自设备中的引导设备,使得领导者设备已经知道其他设备的先前投票,并且可以使用即时后续的提案号码来确定适当的功能来提出,而不执行第一 Paxos算法的相位。 或者,每个设备可以通过使用领导者设备使用的相同的可重复机制来独立地确定领导者设备将提出什么功能,并且然后可以使用紧随其后的提案号码对该功能进行投票。 如果设备的投票再次导致冲突,则可以使用Paxos算法,或者在使用Paxos算法之前可以执行其他迭代。

    Cheap paxos
    66.
    发明申请
    Cheap paxos 失效
    便宜的paxos

    公开(公告)号:US20050283659A1

    公开(公告)日:2005-12-22

    申请号:US10872338

    申请日:2004-06-18

    IPC分类号: G06F11/00

    CPC分类号: G06F11/187 G06F11/182

    摘要: A distributed computing system can be operated in a fault tolerant manner using a collection of auxiliary computing devices and more main computing devices than the number of faults the system can tolerate. A quorum of all of the main computing devices can be used. In the event of a failure, an alternative quorum from a selected set of quorums, comprising at least one main computing device and some or all of the auxiliary computing devices, can be used to complete pending operations and to select a new set of quorums. Alternatively, another state machine, comprising at least one main computing device and some or all of the auxiliary computing devices, can select a new quorum comprising the currently operating main computing devices, and the new quorum can then complete pending operations and can continue to select proposals using the proposal number assigned by the other state machine.

    摘要翻译: 分布式计算系统可以使用辅助计算设备的集合和比系统可容忍的故障数量更多的主计算设备以容错方式操作。 可以使用所有主要计算设备的法定人数。 在发生故障的情况下,可以使用包括至少一个主计算设备和一些或所有辅助计算设备的所选仲裁集合的备选法定人数来完成待决操作并选择一组新的仲裁。 或者,包括至少一个主计算设备和一些或全部辅助计算设备的另一状态机可以选择包括当前操作的主计算设备的新的定额,并且新的仲裁可以完成等待操作并且可以继续选择 使用其他状态机分配的提案编号的提案。

    Redundant processing architecture for single fault tolerance
    67.
    发明申请
    Redundant processing architecture for single fault tolerance 有权
    用于单容错的冗余处理架构

    公开(公告)号:US20050278567A1

    公开(公告)日:2005-12-15

    申请号:US10867894

    申请日:2004-06-15

    IPC分类号: G06F11/00 G06F11/18

    摘要: An electronic module is provided. The module includes a first logic device having at least two processors and a first comparator and a second logic device having at least one processor and a second comparator. Each of the at least two processors are coupled to each of the first and second comparators. The first and second comparators operate as a distributed comparator system. Each comparator independently identifies faults in the processors.

    摘要翻译: 提供电子模块。 该模块包括具有至少两个处理器的第一逻辑器件和具有至少一个处理器和第二比较器的第一比较器和第二逻辑器件。 所述至少两个处理器中的每一个耦合到第一和第二比较器中的每一个。 第一和第二比较器作为分布式比较器系统运行。 每个比较器独立地识别处理器中的故障。

    Method and apparatus for processing control using a multiple redundant processor control system
    69.
    发明申请
    Method and apparatus for processing control using a multiple redundant processor control system 有权
    使用多重冗余处理器控制系统处理控制的方法和装置

    公开(公告)号:US20050022054A1

    公开(公告)日:2005-01-27

    申请号:US10848542

    申请日:2004-05-17

    IPC分类号: G05B9/03 G05B19/042 G06F11/00

    摘要: A system and method for synchronizing a plurality of main processors. At a first time and in response to a first time reference, a first rendezvous signal is sent from a first to a second of the plurality of main processors. At a second time, and in response to a second time reference, a second rendezvous signal is sent from the second of the plurality of main processors, to the first of said plurality of main processors. After the first rendezvous signal is received by the second of the plurality of main processors and the second rendezvous signal is received by the first of said plurality of main processors, substantially simultaneous scanning of control information is initiated by the first and second of the plurality of main processors. In variations, a difference between the first and second times signals a fault condition.

    摘要翻译: 一种用于同步多个主处理器的系统和方法。 在第一时间并且响应于第一次参考,第一会合信号从多个主处理器中的第一到第二信号发送。 在第二时间,并且响应于第二时间参考,第二会合信号从多个主处理器中的第二个发送到所述多个主处理器中的第一个。 在第一会话信号被多个主处理器中的第二个处理器接收到第一会合信号并且第二会合信号由所述多个主处理器中的第一个处理器接收后,控制信息的基本上同时的扫描由多个主处理器中的第一和第二处理器 主要处理器。 在变化中,第一和第二次之间的差异表示故障状况。

    Method and apparatus for improved security in distributed-environment voting
    70.
    发明授权
    Method and apparatus for improved security in distributed-environment voting 有权
    改善分布式环境投票安全性的方法和装置

    公开(公告)号:US06704887B2

    公开(公告)日:2004-03-09

    申请号:US09804751

    申请日:2001-03-08

    IPC分类号: G06F1100

    CPC分类号: G06F11/182

    摘要: A method and apparatus is disclosed which provides improved security in distributed-environment voting. At least three voting processors running a voting algorithm are connected to a local area network (LAN) and exchange their individually determined results of a process application. Each result is committed to an interface module where it is checked, authenticated and buffered. The allotted time for receiving and buffering committed results is constrained by a first timed interval within the interface module. The first timed interval may be reset several times. The allotted time for checking and comparing the committed results from each processor is constrained by a second timed interval within each voting processor. A majority vote of those authenticated committed results is formed once all necessary iterations of the both the first and second timed intervals are completed. Enhanced security is thereby afforded to the overall voting process and yields a majority vote that is correct despite the introduction of errors associated with faulty or hostile processors.

    摘要翻译: 公开了一种在分布式环境投票中提供改进的安全性的方法和装置。 运行投票算法的至少三个投票处理器连接到局域网(LAN),并交换其单独确定的流程应用的结果。 每个结果都被提交到一个接口模块,在该模块中进行检查,验证和缓冲。 接收和缓存提交结果的分配时间受到接口模块内第一个定时间隔的限制。 第一个定时间隔可以重置多次。 用于检查和比较来自每个处理器的提交结果的分配时间受到每个投票处理器中的第二计时间隔的限制。 一旦完成了第一和第二定时间隔的所有必要的迭代,形成那些经认证的承诺结果的多数票。 因此,增强的安全性可以提供整体投票过程,尽管引入与故障或敌对处理器相关的错误,但是多数投票是正确的。