Method and system for cooperatively backing up data on computers in a network

    公开(公告)号:US07529834B1

    公开(公告)日:2009-05-05

    申请号:US09668643

    申请日:2000-09-22

    CPC分类号: G06F11/1464 Y10S707/99953

    摘要: A method for backing up data in a plurality of computers connected via a network. The method includes forming partnerships between the plurality of computers such that each computer in a partnership commits under agreements to help backup the data of its partners. The method further includes periodically verifying that previously backed up data is being retained by the computers committed to act as backup partners in accordance with the agreements. In another embodiment, the method provides a distributed cooperative backing up of data in a system that includes a loose confederation of computers connected via a network. In this embodiment the method includes selecting computers as potential backup partners from among the loose confederation of computers connected via the network based on predetermined criteria, and negotiating a reciprocal backup partnership agreement between the computers based on predetermined requirements, including backup requirements. Once the negotiations are complete and the agreements are made, the method proceeds to form partnerships between the computers. The computers become backup partners by agreeing to cooperatively provide backup services to each other so that a distributed cooperative backing up of data can be administered in the absence of central control. The method further includes periodically backing up data at the backup partners, where the data being backed up is encoded. The method also includes periodically verifying that previously backed up data is retained by the backup partners. Another aspect of the invention is a distributed cooperative backup system that includes a network and a loose confederation of computers connected via the network. A plurality of computers from among the loose confederation of computers is configured for distributed cooperative backing up of data and for functioning as backup partners. Each computer of the plurality of computers has a storage that can be used for providing reciprocal backup services. Each computer of the plurality of computers respectively also has a computer readable medium embodying computer program code configured to cause the computer to perform functions comparable to the method steps as described above.

    Copying a differential data store into temporary storage media in response to a request
    12.
    发明授权
    Copying a differential data store into temporary storage media in response to a request 有权
    响应请求将差分数据存储复制到临时存储介质中

    公开(公告)号:US09141621B2

    公开(公告)日:2015-09-22

    申请号:US12432807

    申请日:2009-04-30

    IPC分类号: G06F7/00 G06F17/30 G06F3/06

    摘要: A plurality of differential data stores are stored in persistent storage media. In response to receiving a first request to store a particular data object, one of the differential data stores that are stored in the persistent storage media is selected, wherein selecting the one differential data store is according to a criterion relating to compression of data objects in the differential data stores. The selected differential data store is copied into temporary storage media, where the copying is not delayed after receiving the first request to await receipt of more requests. The particular data object is inserted into the copy of the selected differential data store in the temporary storage media, where the inserting is performed without having to retrieve more data from the selected differential store in the persistent storage media. The selected differential data store in the persistent storage media is replaced with the copy of the selected differential data store in the temporary storage media that has been modified.

    摘要翻译: 多个差分数据存储器存储在持久存储介质中。 响应于接收到存储特定数据对象的第一请求,选择存储在永久存储介质中的差分数据存储之一,其中选择一个差分数据存储是根据与数据对象的压缩有关的标准 差分数据存储。 所选择的差分数据存储被复制到临时存储介质中,其中在接收到等待接收更多请求的第一请求之后复制不被延迟。 将特定数据对象插入临时存储介质中所选择的差分数据存储的副本,其中执行插入,而不必从永久存储介质中的所选择的差分存储中检索更多的数据。 永久存储介质中所选择的差分数据存储被所修改的临时存储介质中所选差分数据存储的副本所替代。

    DATA PROCESSING APPARATUS AND METHOD OF PROCESSING DATA
    14.
    发明申请
    DATA PROCESSING APPARATUS AND METHOD OF PROCESSING DATA 有权
    数据处理装置和数据处理方法

    公开(公告)号:US20090112945A1

    公开(公告)日:2009-04-30

    申请号:US12257659

    申请日:2008-10-24

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30162 G06F11/1451

    摘要: Data processing apparatus comprising: a chunk store containing specimen data chunks, a manifest store containing a plurality of manifests, each of which represents at least a part of a data set and each of which comprises at least one reference to at least one of said specimen data chunks, a sparse chunk index containing information on only some specimen data chunks, the processor being operable to: process input data into input data chunks; identify manifests having at least one reference to one of said specimen data chunks that corresponds to one of said input data chunks and on which there is information contained in the sparse chunk index; and prioritize the identified manifests for subsequent operation.

    摘要翻译: 数据处理装置,包括:包含标本数据块的块存储器,包含多个清单的清单存储器,每个清单代表数据集的至少一部分,每个清单包括至少一个对所述样本 数据块,仅包含一些标本数据块的信息的稀疏组块索引,所述处理器可操作以:将输入数据处理成输入数据块; 识别具有至少一个对所述样本数据块中的一个的对应于所述输入数据块中的一个的清单,并且其中包含在所述稀疏块索引中的信息; 并将识别的清单优先于后续操作。

    Policy applicability determination
    16.
    发明授权
    Policy applicability determination 失效
    政策适用性确定

    公开(公告)号:US08627403B1

    公开(公告)日:2014-01-07

    申请号:US11831342

    申请日:2007-07-31

    IPC分类号: H04L29/06

    摘要: Provided are, among other things, systems, methods and techniques for determining applicability of a policy defined by reference to a source document. A first sketch that was generated based on content of the source document is obtained, and a matching criterion is defined based on the first sketch. Also obtained is a second sketch that was generated based on content of a potential target document. A determination is made as to whether the policy applies to the potential target document based on whether the second sketch satisfies the matching criterion and, if the policy applies, a notification regarding applicability of the policy automatically is provided and/or an action automatically is blocked so as to prevent a violation of the policy.

    摘要翻译: 除其他之外,提供了用于确定通过引用源文档定义的策略的适用性的系统,方法和技术。 获得基于源文档的内容生成的第一个草图,并且基于第一个草图来定义匹配标准。 还获得了基于潜在目标文档的内容生成的第二个草图。 根据第二草图是否满足匹配标准,确定该策略是否适用于潜在目标文档,如果策略适用,则自动提供关于策略适用性的通知和/或自动阻止的动作 以防止违反政策。

    Identifying similar files in an environment having multiple client computers
    18.
    发明授权
    Identifying similar files in an environment having multiple client computers 有权
    在具有多个客户端计算机的环境中识别类似的文件

    公开(公告)号:US08489612B2

    公开(公告)日:2013-07-16

    申请号:US12409978

    申请日:2009-03-24

    IPC分类号: G06F17/30

    CPC分类号: G06N5/02 G06F17/3015

    摘要: To identify similar files in an environment having multiple client computers, a first client computer receives, from a coordinator computer, a request to find files located at the first client computer that are similar to at least one comparison file, wherein the request has also been sent to other client computers by the coordinator computer to request that the other client computers also find files that are similar to the at least one comparison file. In response to the request, the first client computer compares signatures of the files located at the first client computer with a signature of the at least one comparison file to identify at least a subset of the files located at the first client computer that are similar to the at least one comparison file according to a comparison metric. The first client computer sends, to the coordinator computer, a response relating to the comparing.

    摘要翻译: 为了在具有多个客户端计算机的环境中识别类似的文件,第一客户端计算机从协调器计算机接收查找位于第一客户端计算机上的文件的请求,其类似于至少一个比较文件,其中该请求也已被 由协调器计算机发送到其他客户端计算机,以请求其他客户端计算机还查找与至少一个比较文件类似的文件。 响应于该请求,第一客户端计算机将位于第一客户端计算机的文件的签名与至少一个比较文件的签名进行比较,以识别位于第一客户端计算机的文件的至少一个子集,其类似于 所述至少一个比较文件根据比较度量。 第一个客户端计算机向协调者计算机发送与比较有关的响应。

    PROCESSING A REQUEST TO RESTORE DEDUPLICATED DATA
    19.
    发明申请
    PROCESSING A REQUEST TO RESTORE DEDUPLICATED DATA 有权
    处理恢复重复数据的请求

    公开(公告)号:US20120317359A1

    公开(公告)日:2012-12-13

    申请号:US13155566

    申请日:2011-06-08

    IPC分类号: G06F12/00 G06F12/08

    摘要: For a restore request, at least a portion of a recipe that refers to chunks is read. Based on the recipe portion, a container having plural chunks is retrieved. From the recipe portion, it is identified which of the plural chunks of the container to save, where some of the chunks identified do not, at a time of the identifying, have to be presently communicated to a requester. The identified chunks are stored in a memory area from which chunks are read for the restore operation.

    摘要翻译: 对于恢复请求,读取指向块的配方的至少一部分。 基于配方部分,检索具有多个块的容器。 从配方部分中,识别容器的多个块中的哪一个要保存,其中在识别时识别的一些块不在当前被传送到请求者。 所识别的块被存储在从其读取块以用于恢复操作的存储器区域中。

    Data processing apparatus and method of processing data
    20.
    发明授权
    Data processing apparatus and method of processing data 有权
    数据处理装置及数据处理方法

    公开(公告)号:US08332404B2

    公开(公告)日:2012-12-11

    申请号:US12257659

    申请日:2008-10-24

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30162 G06F11/1451

    摘要: Data processing apparatus comprising: a chunk store containing specimen data chunks, a manifest store containing a plurality of manifests, each of which represents at least a part of a data set and each of which comprises at least one reference to at least one of said specimen data chunks, a sparse chunk index containing information on only some specimen data chunks, the processor being operable to: process input data into input data chunks; identify manifests having at least one reference to one of said specimen data chunks that corresponds to one of said input data chunks and on which there is information contained in the sparse chunk index; and prioritize the identified manifests for subsequent operation.

    摘要翻译: 数据处理装置,包括:包含标本数据块的块存储器,包含多个清单的清单存储器,每个清单代表数据集的至少一部分,每个清单包括至少一个对所述样本 数据块,仅包含一些标本数据块的信息的稀疏组块索引,所述处理器可操作以:将输入数据处理成输入数据块; 识别具有至少一个对所述样本数据块中的一个的对应于所述输入数据块中的一个的清单,并且其中包含在所述稀疏块索引中的信息; 并将识别的清单优先于后续操作。