Identifying similar files in an environment having multiple client computers
    1.
    发明授权
    Identifying similar files in an environment having multiple client computers 有权
    在具有多个客户端计算机的环境中识别类似的文件

    公开(公告)号:US08489612B2

    公开(公告)日:2013-07-16

    申请号:US12409978

    申请日:2009-03-24

    IPC分类号: G06F17/30

    CPC分类号: G06N5/02 G06F17/3015

    摘要: To identify similar files in an environment having multiple client computers, a first client computer receives, from a coordinator computer, a request to find files located at the first client computer that are similar to at least one comparison file, wherein the request has also been sent to other client computers by the coordinator computer to request that the other client computers also find files that are similar to the at least one comparison file. In response to the request, the first client computer compares signatures of the files located at the first client computer with a signature of the at least one comparison file to identify at least a subset of the files located at the first client computer that are similar to the at least one comparison file according to a comparison metric. The first client computer sends, to the coordinator computer, a response relating to the comparing.

    摘要翻译: 为了在具有多个客户端计算机的环境中识别类似的文件,第一客户端计算机从协调器计算机接收查找位于第一客户端计算机上的文件的请求,其类似于至少一个比较文件,其中该请求也已被 由协调器计算机发送到其他客户端计算机,以请求其他客户端计算机还查找与至少一个比较文件类似的文件。 响应于该请求,第一客户端计算机将位于第一客户端计算机的文件的签名与至少一个比较文件的签名进行比较,以识别位于第一客户端计算机的文件的至少一个子集,其类似于 所述至少一个比较文件根据比较度量。 第一个客户端计算机向协调者计算机发送与比较有关的响应。

    Scheduling Data Analysis Operations In A Computer System
    3.
    发明申请
    Scheduling Data Analysis Operations In A Computer System 有权
    计算机系统中的计划数据分析操作

    公开(公告)号:US20100251256A1

    公开(公告)日:2010-09-30

    申请号:US12413969

    申请日:2009-03-30

    IPC分类号: G06F9/46

    摘要: A technique receiving identifiers from a plurality of nodes. Each identifier identifies an associated data object, and at least some of the data objects being replicated on different nodes. The technique includes scheduling analysis of the data objects on the nodes based at least in part on a distribution of replicas of the data objects among the nodes and modeled performances of the nodes.

    摘要翻译: 一种从多个节点接收标识符的技术。 每个标识符标识相关联的数据对象,并且至少一些数据对象被复制在不同的节点上。 该技术包括至少部分地基于节点之间的数据对象的副本的分布和节点的建模性能来对节点上的数据对象进行调度分析。

    IDENTIFYING SIMILAR FILES IN AN ENVIRONMENT HAVING MULTIPLE CLIENT COMPUTERS
    4.
    发明申请
    IDENTIFYING SIMILAR FILES IN AN ENVIRONMENT HAVING MULTIPLE CLIENT COMPUTERS 有权
    在具有多个客户端计算机的环境中识别类似文件

    公开(公告)号:US20100250480A1

    公开(公告)日:2010-09-30

    申请号:US12409978

    申请日:2009-03-24

    IPC分类号: G06N5/02 G06F17/30 G06Q10/00

    CPC分类号: G06N5/02 G06F17/3015

    摘要: To identify similar files in an environment having multiple client computers, a first client computer receives, from a coordinator computer, a request to find files located at the first client computer that are similar to at least one comparison file, wherein the request has also been sent to other client computers by the coordinator computer to request that the other client computers also find files that are similar to the at least one comparison file. In response to the request, the first client computer compares signatures of the files located at the first client computer with a signature of the at least one comparison file to identify at least a subset of the files located at the first client computer that are similar to the at least one comparison file according to a comparison metric. The first client computer sends, to the coordinator computer, a response relating to the comparing.

    摘要翻译: 为了在具有多个客户端计算机的环境中识别类似的文件,第一客户端计算机从协调器计算机接收查找位于第一客户端计算机上的文件的请求,其类似于至少一个比较文件,其中该请求也已被 由协调器计算机发送到其他客户端计算机,以请求其他客户端计算机还查找与至少一个比较文件类似的文件。 响应于该请求,第一客户端计算机将位于第一客户端计算机的文件的签名与至少一个比较文件的签名进行比较,以识别位于第一客户端计算机的文件的至少一个子集,其类似于 所述至少一个比较文件根据比较度量。 第一个客户端计算机向协调者计算机发送与比较相关的响应。

    Scheduling data analysis operations in a computer system
    6.
    发明授权
    Scheduling data analysis operations in a computer system 有权
    在计算机系统中调度数据分析操作

    公开(公告)号:US08650571B2

    公开(公告)日:2014-02-11

    申请号:US12413969

    申请日:2009-03-30

    IPC分类号: G06F9/46

    摘要: A technique receiving identifiers from a plurality of nodes. Each identifier identifies an associated data object, and at least some of the data objects being replicated on different nodes. The technique includes scheduling analysis of the data objects on the nodes based at least in part on a distribution of replicas of the data objects among the nodes and modeled performances of the nodes.

    摘要翻译: 一种从多个节点接收标识符的技术。 每个标识符标识相关联的数据对象,并且至少一些数据对象被复制在不同的节点上。 该技术包括至少部分地基于节点之间的数据对象的副本的分布和节点的建模性能来对节点上的数据对象进行调度分析。

    ASSIGNING RESOURCES FOR TASKS
    9.
    发明申请
    ASSIGNING RESOURCES FOR TASKS 有权
    资助任务资助

    公开(公告)号:US20120291041A1

    公开(公告)日:2012-11-15

    申请号:US13105294

    申请日:2011-05-11

    IPC分类号: G06F9/50

    CPC分类号: G06F9/5011

    摘要: A processing subsystem has plural processing stages, where output of one of the plural processing stages is provided to another of the processing stages. Resources are dynamically assigned to the plural processing stages.

    摘要翻译: 处理子系统具有多个处理级,其中多个处理级之一的输出被提供给另一个处理级。 资源被动态分配给多个处理阶段。