Abstract:
Data set valuation techniques are provided. For example, a request is obtained from a client to utilize one or more cloud computing services managed by at least one service provider. A valuation is determined for delivering the one or more requested cloud computing services to the client. The valuation determination includes determining a valuation of one or more data sets associated with the one or more cloud computing services.
Abstract:
A data protection ecosystem-based data valuation methodology includes the following steps. One or more of backup data, metadata, and analytics results maintained by a data protection ecosystem are accessed. The backup data, metadata, and analytics results are obtained during the course of the data protection ecosystem providing data backup and recovery functionalities for a data storage environment that stores one or more data sets. A valuation is calculated for at least one of the one or more data sets of the data storage environment based on at least a portion of the accessed backup data, metadata, and analytics results maintained by the data protection ecosystem.
Abstract:
A method, article of manufacture, and apparatus for creating dynamically composed compute nodes from disaggregated hardware components is discussed. These components may be dynamically allocated from resource pools to the compute nodes.
Abstract:
A data set is obtained. A set of data relevance scores is calculated for the data set for a set of specific domains associated with an entity. The set of data relevance scores is updated as the relevance of the data set to one or more of the set of specific domains changes over time. A valuation is calculated for the data set based on the set of data relevance scores.
Abstract:
A database benchmark configuration is selected via an interface. At least one database partitioning scheme from a plurality of database partitioning schemes is selected via the interface. The selected database partitioning scheme is configured through the interface. The selected database partitioning scheme is evaluated based on the configuring step and the selected database benchmark configuration, and evaluation results are generated. A presentation is generated for the interface based on at least a portion of the evaluation results, wherein the presentation is configured to provide at least an indication of a performance of the selected database partitioning scheme given the configuring step and the selected database benchmark configuration.
Abstract:
Data generated in accordance with execution of one or more phases of an automated data analytics lifecycle associated with a given data science project is collected. At least a portion of the collected data is analyzed. At least one future outcome associated with the given data science project is predicted based at least in part on the collecting and analyzing steps.
Abstract:
Information processing techniques for generating privacy ratings for services that are available via a cloud infrastructure, e.g., cloud services. For example, a method comprises the following steps. Data indicative of privacy attributable to at least one of a service and a provider of the service accessible in a cloud infrastructure is collected. A privacy rating is generated for at least one of the service and the provider of the service based on at least a portion of the collected data.
Abstract:
A method, article of manufacture, and apparatus for creating dynamically composed compute nodes from disaggregated hardware components is discussed. These components may be dynamically allocated from resource pools to the compute nodes.
Abstract:
An initial data analytic plan for analyzing a given data set associated with a given data problem is defined. At least a portion of original data in the given data set is conditioned to generate conditioned data. At least one model is selected to analyze at least one of the original data and the conditioned data. The at least one selected model is executed on at least one of a portion of the original data and a portion of the conditioned data. Results of the model execution are communicated to at least one entity, the results comprising a refined data analytic plan for analyzing the given data set. One or more computing resources are provisioned to implement the refined data analytic plan. The defining, conditioning, selecting, executing, communicating and provisioning steps are performed on one or more processing elements associated with a computing system and automate a data analytics lifecycle.
Abstract:
At least part of an analytic process is executed on one or more data sets. Execution of the analytic process is performed within an analytic computing environment. During the course of execution of the analytic process, a data structure is generated comprising data structure elements. The data structure elements represent attributes associated with execution of the analytic process. Value is assigned to at least a portion of the data structure elements. The data structure generated during execution of the analytic process may be stored in an accessible catalog of other data structures generated during execution of other analytic processes.