Data set scoring
    Granted invention patent

    Publication number: US10339147B1

    Publication date: 2019-07-02

    Application number: US15189735

    Filing date: 2016-06-22

    Abstract: Technology is provided for data set scoring. In one example, a method includes analyzing first and second characteristics of a data set. The first and second characteristics represent a quality of data values in the data set. At least the first characteristic is independent of the data values in the data set. The method further includes assigning a score to the data set based on the first and second characteristics. The data set may be ranked against a plurality of other data sets based on the score. The score of the data set may be provided together with a scoring scale to enable a determination of the quality of the data values based on the score.
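
As a rough illustration of the scoring idea in this abstract, the sketch below combines one characteristic that does not depend on the data values with one that does, and ranks data sets by the result. The specific characteristics (`has_description`, completeness), the equal weighting, and the 0-100 scale are assumptions chosen for the example; the abstract does not specify them.

```python
from dataclasses import dataclass


@dataclass
class DataSet:
    name: str
    values: list           # data values; None marks a missing value
    has_description: bool  # hypothetical characteristic independent of the values


def score(ds: DataSet) -> float:
    """Return a score on an assumed 0-100 scale (higher means better quality)."""
    # Characteristic 1 (independent of the data values): is the set documented?
    documented = 1.0 if ds.has_description else 0.0
    # Characteristic 2 (derived from the values): fraction of non-missing values.
    if ds.values:
        completeness = sum(v is not None for v in ds.values) / len(ds.values)
    else:
        completeness = 0.0
    # Equal weights are an assumption made for this sketch.
    return 100.0 * (0.5 * documented + 0.5 * completeness)


def rank(data_sets):
    """Order data sets from highest to lowest score."""
    return sorted(data_sets, key=score, reverse=True)
```

Reporting the score together with its scale (here 0 to 100) is what lets a reader judge quality from the number alone.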

    Job execution with managed compute environments

    Publication number: US11281498B1

    Publication date: 2022-03-22

    Application number: US15195893

    Filing date: 2016-06-28

    Abstract: Methods, systems, and computer-readable media for job execution with managed compute environments are disclosed. A specification of a managed compute environment comprises one or more constraints associated with computing resources in the managed compute environment. A queue or other data structure that is associated with the managed compute environment is monitored. The data structure is configured to store jobs. Data indicative of a job is detected in the data structure. One or more computing resources are reserved for the job from a pool of available computing resources. The one or more computing resources are selected for the job based at least in part on the one or more constraints associated with computing resources in the managed compute environment. Execution of the job using the one or more computing resources is initiated.
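
The flow in this abstract (watch a queue, detect a job, reserve resources that satisfy the environment's constraints, start execution) can be sketched as follows. The `Resource` and `Environment` types and the two constraints (`min_vcpus`, `max_resources`) are hypothetical and exist only for this example; returning the reserved resources stands in for initiating execution.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Resource:
    name: str
    vcpus: int
    reserved: bool = False


@dataclass
class Environment:
    min_vcpus: int      # hypothetical constraint from the environment specification
    max_resources: int  # hypothetical cap on resources reserved per job


def run_next_job(queue: deque, pool: list, env: Environment):
    """Take one job from the queue and reserve matching resources for it.

    Returns (job, reserved_resources), or None when there is no job or no
    available resource satisfies the constraints (the job then stays queued).
    """
    if not queue:
        return None
    candidates = [r for r in pool if not r.reserved and r.vcpus >= env.min_vcpus]
    if not candidates:
        return None
    job = queue.popleft()
    chosen = candidates[: env.max_resources]
    for resource in chosen:
        resource.reserved = True
    return job, chosen
```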

    Remote durable logging for journaling file systems

    Publication number: US11868324B2

    Publication date: 2024-01-09

    Application number: US16415944

    Filing date: 2019-05-17

    CPC classification number: G06F16/1873

    Abstract: A journaling file system may implement remote durable logging. Updates to a file system may be received, and log records describing the updates may be stored in a locally-accessible file system change log. The update may then be acknowledged as committed. The log records may then be sent to be stored in a network-based data store in a remote version of the file system change log. Once it may be determined that the log records are stored in the remote version, storage space for the log records in the local file system change log may be reclaimed. Various types of restoration and duplication techniques may be implemented based on the remote version of the change log to restore a file system at an originating device or to duplicate the file system at a different device.
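
A minimal in-memory sketch of the sequence this abstract describes: log locally, acknowledge, ship records to the remote log, reclaim local space only for records confirmed remotely, and rebuild state by replaying the remote log. The class and function names are hypothetical, and a plain Python list stands in for the network-based data store.

```python
class JournaledStore:
    def __init__(self, remote_log: list):
        self.local_log = []           # locally accessible change log
        self.remote_log = remote_log  # stands in for the network-based data store
        self.state = {}
        self.next_seq = 0

    def update(self, key, value) -> bool:
        """Log the update locally, apply it, and acknowledge it as committed."""
        self.local_log.append((self.next_seq, key, value))
        self.next_seq += 1
        self.state[key] = value
        return True

    def flush(self) -> None:
        """Send local records to the remote log, then reclaim confirmed ones."""
        for record in self.local_log:
            self.remote_log.append(record)
        confirmed = {seq for seq, _, _ in self.remote_log}
        self.local_log = [r for r in self.local_log if r[0] not in confirmed]


def replay(remote_log: list) -> dict:
    """Rebuild state from the remote log, e.g. on a different device."""
    state = {}
    for _, key, value in sorted(remote_log, key=lambda r: r[0]):
        state[key] = value
    return state
```

Acknowledging before the remote write keeps update latency local, while reclaiming only after remote confirmation means a record always exists in at least one log.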

    Job scheduling based on job execution history

    Publication number: US11507417B2

    Publication date: 2022-11-22

    Application number: US16739870

    Filing date: 2020-01-10

    Abstract: Methods, systems, and computer-readable media for job scheduling based on job execution history are disclosed. A request is received to schedule a workload comprising a plurality of jobs. A resource allocation score for the workload is determined. The resource allocation score represents (at least in part) an estimated likelihood of successful execution of the workload. A first portion of the workload is scheduled for execution, and a remaining portion (if any) of the workload is delayed. A quantity of jobs in the first portion of the workload is determined based (at least in part) on the resource allocation score. Execution of the first portion of the workload is initiated.
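
One way to picture the scheduling step in this abstract is sketched below. The score here is simply the fraction of past executions that succeeded, and the size of the first portion is the score times the workload size, rounded up. Both formulas, the neutral default of 0.5, and the function names are assumptions made for the example; the abstract does not define how the score is computed or mapped to a job count.

```python
import math


def resource_allocation_score(history: list) -> float:
    """Estimate the likelihood of success from past runs (True = succeeded)."""
    if not history:
        return 0.5  # hypothetical neutral default when there is no history
    return sum(history) / len(history)


def split_workload(jobs: list, score: float):
    """Return (jobs to schedule now, jobs to delay) based on the score."""
    if not jobs:
        return [], []
    # Always schedule at least one job so the workload can make progress.
    count = max(1, math.ceil(score * len(jobs)))
    return jobs[:count], jobs[count:]
```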

    Job scheduling based on job execution history

    Publication number: US10534655B1

    Publication date: 2020-01-14

    Application number: US15188865

    Filing date: 2016-06-21

    Abstract: Methods, systems, and computer-readable media for job scheduling based on job execution history are disclosed. A request is received to schedule a workload comprising a plurality of jobs. A resource allocation score for the workload is determined. The resource allocation score represents (at least in part) an estimated likelihood of successful execution of the workload. A first portion of the workload is scheduled for execution, and a remaining portion (if any) of the workload is delayed. A quantity of jobs in the first portion of the workload is determined based (at least in part) on the resource allocation score. Execution of the first portion of the workload is initiated.

    Task-level optimization with compute environments

    Publication number: US10402227B1

    Publication date: 2019-09-03

    Application number: US15253699

    Filing date: 2016-08-31

    Abstract: Methods, systems, and computer-readable media for task-level optimization of compute environments are disclosed. Execution is initiated of one or more tasks using a plurality of computing resources provisioned from a multi-tenant provider network. At least some of the computing resources vary in configuration. One or more metrics are determined that are associated with the execution of the one or more tasks. A configuration of the computing resources is selected based at least in part on the one or more metrics. A modified job definition associated with the one or more tasks is generated. The modified job definition indicates the selected configuration.
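
The selection step in this abstract can be illustrated with a small sketch: given metrics gathered while tasks ran on differently configured resources, pick a configuration and write it into a copy of the job definition. Using mean task runtime as the metric, and the `"configuration"` key in the job definition, are assumptions made for this example only.

```python
def select_configuration(metrics: dict) -> str:
    """Pick the configuration with the lowest mean runtime.

    metrics maps a configuration name to a non-empty list of observed
    task runtimes in seconds (an assumed metric for this sketch).
    """
    return min(metrics, key=lambda name: sum(metrics[name]) / len(metrics[name]))


def modify_job_definition(job_definition: dict, metrics: dict) -> dict:
    """Return a copy of the job definition that records the selected configuration."""
    modified = dict(job_definition)
    modified["configuration"] = select_configuration(metrics)
    return modified
```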

    Remote durable logging for journaling file systems

    Publication number: US10303663B1

    Publication date: 2019-05-28

    Application number: US14303549

    Filing date: 2014-06-12

    Abstract: A journaling file system may implement remote durable logging. Updates to a file system may be received, and log records describing the updates may be stored in a locally-accessible file system change log. The update may then be acknowledged as committed. The log records may then be sent to be stored in a network-based data store in a remote version of the file system change log. Once it may be determined that the log records are stored in the remote version, storage space for the log records in the local file system change log may be reclaimed. Various types of restoration and duplication techniques may be implemented based on the remote version of the change log to restore a file system at an originating device or to duplicate the file system at a different device.
