Systems and methods for load-balancing by secondary processors in parallelized indexing

    Publication No.: US10572515B2

    Publication date: 2020-02-25

    Application No.: US15728066

    Filing date: 2017-10-09

    Applicant: Nuix Pty Ltd

    IPC classification: G06F17/30 G06F16/31 G06F9/50

    Abstract: The invention relates to electronic indexing, and more particularly, to the parallelization of indexing. Systems and methods of the invention index data archives by breaking a job into work items and sending the work items to multiple processors that can each determine whether to index data associated with the work item or to create a new work item and have a different processor index the data. This gives the system an internal load-balancing that results in indexing jobs during which no processor stands idle while another processor indexes data of unexpected complexity.
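The load-balancing behaviour described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: `WorkItem`, `SPLIT_THRESHOLD`, `worker`, and `run_job` are hypothetical names, and using the number of nested items as the measure of complexity is an assumption made only for the example.

```python
import queue
import threading
from dataclasses import dataclass, field


@dataclass
class WorkItem:
    name: str
    children: list = field(default_factory=list)  # nested data, e.g. attachments


SPLIT_THRESHOLD = 2  # hypothetical limit on how much one worker indexes itself


def count_items(item):
    """Number of items in the subtree rooted at `item`."""
    return 1 + sum(count_items(child) for child in item.children)


def subtree_names(item):
    """Yield the name of `item` and of every item nested inside it."""
    yield item.name
    for child in item.children:
        yield from subtree_names(child)


def worker(worker_id, work_queue, index, lock):
    while True:
        item = work_queue.get()
        try:
            if item is None:  # sentinel: no more work
                return
            if count_items(item) > SPLIT_THRESHOLD:
                # Too complex: create new work items so other workers can help.
                for child in item.children:
                    work_queue.put(child)
                indexed = [item.name]
            else:
                # Small enough: index the whole subtree on this worker.
                indexed = list(subtree_names(item))
            with lock:
                index.extend((worker_id, name) for name in indexed)
        finally:
            work_queue.task_done()


def run_job(root, num_workers=4):
    """Index `root` and return a list of (worker_id, item_name) pairs."""
    work_queue = queue.Queue()
    index, lock = [], threading.Lock()
    threads = [
        threading.Thread(target=worker, args=(i, work_queue, index, lock))
        for i in range(num_workers)
    ]
    for t in threads:
        t.start()
    work_queue.put(root)
    work_queue.join()          # returns once every queued item is processed
    for _ in threads:
        work_queue.put(None)   # one sentinel per worker
    for t in threads:
        t.join()
    return index
```

Because each worker enqueues the child items before it marks its own item as done, the count of unfinished tasks never reaches zero while work is still outstanding, so `work_queue.join()` returns only when the whole job has been indexed.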

    Data processing system for parallelizing electronic document indexing

    Publication No.: US10185717B2

    Publication date: 2019-01-22

    Application No.: US15606248

    Filing date: 2017-05-26

    Applicant: Nuix Pty Ltd

    IPC classification: G06F15/16 G06F17/30 G06F9/50

    Abstract: A system and method for parallelizing document indexing in a data processing system. The data processing system includes a primary processor for receiving a list of data having embedded data associated therewith, at least one secondary processor to process the data as provided by the primary processor, a data processor to determine a characteristic of the embedded data and process the embedded data based upon the characteristic, and a messaging module to exchange at least one status message between the primary processor and the at least one secondary processor.
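The division of roles named in this abstract (primary processor, secondary processor, data processor, messaging module) can be sketched as below. This is a sequential illustration under stated assumptions, not the patented design: every name is hypothetical, the "characteristic" is assumed to be a file-extension check, and a `queue.Queue` stands in for the messaging module.

```python
import queue
from dataclasses import dataclass


@dataclass
class StatusMessage:
    sender: str
    item: str
    status: str


def characterize(embedded):
    """Hypothetical characteristic: treat archive-like names as containers."""
    if embedded.endswith((".zip", ".pst")):
        return "container"
    return "simple"


def process_embedded(embedded):
    """Data-processor role: handle embedded data according to its characteristic."""
    if characterize(embedded) == "container":
        return f"queued {embedded} for separate processing"
    return f"indexed {embedded} inline"


def secondary(worker_name, item, embedded_list, messages):
    """Secondary-processor role: process one item and report status."""
    messages.put(StatusMessage(worker_name, item, "started"))
    results = [process_embedded(e) for e in embedded_list]
    messages.put(StatusMessage(worker_name, item, "done"))
    return results


def primary(data_list, num_secondaries=2):
    """Primary-processor role: hand out items and collect status messages."""
    messages = queue.Queue()  # stands in for the messaging module
    results = {}
    for n, (item, embedded_list) in enumerate(data_list.items()):
        worker_name = f"secondary-{n % num_secondaries}"
        results[item] = secondary(worker_name, item, embedded_list, messages)
    log = []
    while not messages.empty():
        log.append(messages.get())
    return results, log
```

In a real deployment the secondaries would run concurrently on separate processes or machines; the sketch runs them in turn only to keep the message flow easy to follow.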

    Systems and methods for scalable delocalized information governance

    Publication No.: US11030170B2

    Publication date: 2021-06-08

    Application No.: US15935231

    Filing date: 2018-03-26

    Applicant: Nuix Pty Ltd

    IPC classification: G06F16/22 G06F16/31 G06F9/50

    Abstract: The invention relates to electronic indexing, and more particularly, to the indexing, in a cloud, of data held in a cloud. Systems and methods of the invention index data by accessing the data in place in the cloud and breaking a job into work items and sending the work items to multiple cloud processes that can each determine whether to index data associated with the work item or to create a new work item and have a different cloud process index the data. Each cloud process is proximal to an item that it indexes. This gives the system scale as well as an internal load-balancing.
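The statement that each cloud process is proximal to the item it indexes can be illustrated with a small assignment routine. This is a sketch under stated assumptions: the abstract does not define "proximal", so the example assumes it means "in the same region", and `CloudProcess`, `CloudItem`, `pick_proximal`, and `assign` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CloudProcess:
    name: str
    region: str


@dataclass(frozen=True)
class CloudItem:
    uri: str
    region: str


def pick_proximal(item, processes):
    """Return a process in the item's region, or the first process as a fallback."""
    for process in processes:
        if process.region == item.region:
            return process
    return processes[0]


def assign(items, processes):
    """Map each item URI to the name of the process chosen to index it."""
    return {item.uri: pick_proximal(item, processes).name for item in items}
```

For example, with one process in each of two regions, an item stored in the second region is assigned to the second process, so the data is read in place rather than copied across regions.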

    SYSTEMS AND METHODS FOR SCALABLE DELOCALIZED INFORMATION GOVERNANCE
    Document type: Invention application; legal status: in force

    Publication No.: US20140081984A1

    Publication date: 2014-03-20

    Application No.: US14083742

    Filing date: 2013-11-19

    Applicant: Nuix Pty Ltd.

    IPC classification: G06F17/30

    Abstract: The invention relates to electronic indexing, and more particularly, to the indexing, in a cloud, of data held in a cloud. Systems and methods of the invention index data by accessing the data in place in the cloud and breaking a job into work items and sending the work items to multiple cloud processes that can each determine whether to index data associated with the work item or to create a new work item and have a different cloud process index the data. Each cloud process is proximal to an item that it indexes. This gives the system scale as well as an internal load-balancing.


    PARALLELIZATION OF ELECTRONIC DISCOVERY DOCUMENT INDEXING

    Publication No.: US20180004738A1

    Publication date: 2018-01-04

    Application No.: US15606248

    Filing date: 2017-05-26

    Applicant: Nuix Pty Ltd

    IPC classification: G06F17/30 G06F9/50

    Abstract: A system and method for parallelizing document indexing in a data processing system. The data processing system includes a primary processor for receiving a list of data having embedded data associated therewith, at least one secondary processor to process the data as provided by the primary processor, a data processor to determine a characteristic of the embedded data and process the embedded data based upon the characteristic, and a messaging module to exchange at least one status message between the primary processor and the at least one secondary processor.

    SYSTEMS AND METHODS FOR LOAD-BALANCING BY SECONDARY PROCESSORS IN PARALLELIZED INDEXING
    Document type: Invention application; legal status: in force

    Publication No.: US20130325873A1

    Publication date: 2013-12-05

    Application No.: US13961030

    Filing date: 2013-08-07

    Applicant: Nuix Pty Ltd

    IPC classification: G06F17/30

    Abstract: The invention relates to electronic indexing, and more particularly, to the parallelization of indexing. Systems and methods of the invention index data archives by breaking a job into work items and sending the work items to multiple processors that can each determine whether to index data associated with the work item or to create a new work item and have a different processor index the data. This gives the system an internal load-balancing that results in indexing jobs during which no processor stands idle while another processor indexes data of unexpected complexity.


    SYSTEMS AND METHODS FOR SCALABLE DELOCALIZED INFORMATION GOVERNANCE

    Publication No.: US20210286788A1

    Publication date: 2021-09-16

    Application No.: US17332526

    Filing date: 2021-05-27

    Applicant: Nuix Pty Ltd

    IPC classification: G06F16/22 G06F16/31 G06F9/50

    Abstract: The invention relates to electronic indexing, and more particularly, to the indexing, in a cloud, of data held in a cloud. Systems and methods of the invention index data by accessing the data in place in the cloud and breaking a job into work items and sending the work items to multiple cloud processes that can each determine whether to index data associated with the work item or to create a new work item and have a different cloud process index the data. Each cloud process is proximal to an item that it indexes. This gives the system scale as well as an internal load-balancing.