Abstract:
A computer-implemented method for managing datasets of a storage system is provided, wherein the datasets have respective sets of metadata, the method including: successively feeding first sets of metadata to a spiking neural network (SNN), the first sets of metadata fed corresponding to datasets of the storage system that are labeled with respect to classes they belong to, so as to be associated with class labels, for the SNN to learn representations of said classes in terms of connection weights that weight the metadata fed; successively feeding second sets of metadata to the SNN, the second sets of metadata corresponding to unlabeled datasets of the storage system, for the SNN to infer class labels for the unlabeled datasets, based on the second sets of metadata fed and the representations learned; and managing datasets in the storage system, based on class labels of the datasets, these including the inferred class labels.
Abstract:
A device and method for determining the number of storage devices for each of a plurality of storage tiers and the assignment of data to be stored in the plurality of storage tiers. The device, which computes an optimized number of storage devices for each tier, includes a classifier that receives data units to be stored in the system as input. Based on the characteristics of the storage devices, the classifier outputs a data-unit-to-storage-tier assignment. An optimizer receives the data-unit-to-storage-tier assignment from the classifier and a cost budget for the system, and then calculates the number of storage devices for each storage tier. A method for determining the number of storage devices for each of a plurality of storage tiers and the assignment of data to be stored in the plurality of storage tiers is also provided.
Abstract:
Embodiments include methods for operating a first storage system having a first number of data storage drives for enabling access to a first set of removable media. Aspects include providing a second storage system having a number K of data storage drives for enabling access to a second set of removable media and providing a set of parameters describing operational characteristics of the second storage system. Aspects also include determining an analytical model using the set of parameters, the analytical model describing the variation of average waiting time as a function of system load over a predefined range covering multiple system load regime domains and determining values of the set of parameters using the analytical model and data of the second storage system. Aspects further include using the analytical model and the values of the set of parameters for reconfiguring the first storage system.
Abstract:
Embodiments include methods, computer systems and computer program products for selecting a new redundancy scheme for data relocation in a computer system having multiple data storage tiers and multiple redundancy schemes. Aspects include: receiving, at a processor of the computer system, a request to relocate a data object to a destination data storage tier; retrieving a first and a second data reliability threshold, the redundancy scheme, and data access pattern information of the data object; generating a set of available redundancy schemes for the data object with at least one copy on the destination data storage tier; calculating a total cost for each of the available redundancy schemes generated, using performance cost, re-encoding cost, and storage cost; selecting a redundancy scheme; and relocating the data object to the selected destination data storage tier by re-encoding the data object using the selected redundancy scheme.
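The cost-based selection step can be illustrated with a minimal Python sketch. It is not the claimed method: the abstract only says that a total cost is calculated from performance, re-encoding, and storage cost and that a scheme is then selected. The sketch assumes the three costs are simply summed and that the cheapest candidate wins. The names `Scheme` and `select_scheme` and all cost values are hypothetical, and the reliability thresholds are left out because the abstract does not say how they are applied.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Scheme:
    """Hypothetical description of one redundancy scheme."""
    name: str
    tiers: tuple            # tiers that would hold at least one copy
    performance_cost: float
    reencoding_cost: float
    storage_cost: float

    def total_cost(self) -> float:
        # Assumption: the total cost is the plain sum of the three components.
        return self.performance_cost + self.reencoding_cost + self.storage_cost


def select_scheme(schemes, destination_tier):
    """Return the cheapest scheme that keeps a copy on the destination tier."""
    candidates = [s for s in schemes if destination_tier in s.tiers]
    if not candidates:
        raise ValueError(f"no scheme places a copy on tier {destination_tier!r}")
    # Assumption: "selecting" means picking the minimum total cost.
    return min(candidates, key=Scheme.total_cost)


schemes = [
    Scheme("3-way replication", ("ssd", "hdd"), 1.0, 0.5, 9.0),
    Scheme("RS(6,3) erasure code", ("hdd", "tape"), 3.0, 2.0, 1.5),
    Scheme("2 copies on tape", ("tape",), 5.0, 1.0, 0.5),
]
chosen = select_scheme(schemes, "tape")
print(chosen.name, chosen.total_cost())
```

In practice the candidate set would first be filtered against the reliability thresholds before costs are compared; that filter is omitted here on purpose.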
Abstract:
A computing device writes data across storage devices in an erasure-coded system. The computing device computes data blocks and parity blocks from data and computes a portion of the data to be stored in the system. The computing is performed by one or more controllers included in a redundant array of independent disks (RAID) controller. The computing device provides the locations of the data blocks and the parity blocks in the storage devices of the erasure-coded system. The locations are determined using one or more placement nodes. The placement nodes are configured for managing placement schemes of data blocks and parity blocks on the storage devices.
Abstract:
A method for data storage tape access optimization is provided. The method identifies a plurality of data segments stored on a data storage tape, each segment having a location and a size on the data storage tape. The method determines a subset of data segments stored on the data storage tape selected for retrieval. The method determines an access order for the subset of data segments. The access order represents a minimized tape cycle time for retrieving the subset of data segments. The method retrieves the subset of data segments from the data storage tape based on the access order.
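As a rough illustration of what "an access order that minimizes tape cycle time" can mean, the Python sketch below searches all orderings of a small subset of segments under a deliberately simple cost model. Everything in it is an assumption rather than the claimed method: the tape is treated as a single linear track, seek time is taken to be proportional to distance, read time to segment size, and the functions `cycle_time` and `best_order` are hypothetical. Real tapes use serpentine layouts, and exhaustive search is only feasible for a handful of segments.

```python
from itertools import permutations


def cycle_time(order, start=0):
    """Total seek plus read time for visiting (location, size) segments in order."""
    position = start
    total = 0
    for location, size in order:
        total += abs(location - position)  # seek to the start of the segment
        total += size                      # read the segment forward
        position = location + size         # head ends up at the segment's end
    return total


def best_order(segments, start=0):
    """Exhaustively pick the ordering with the smallest cycle time (small inputs only)."""
    return min(permutations(segments), key=lambda order: cycle_time(order, start))


# Hypothetical (location, size) pairs of the segments selected for retrieval.
selected = [(50, 10), (10, 5), (30, 5)]
order = best_order(selected)
print(order, cycle_time(order))
```

Under this linear model with the head starting at position 0, the best order is simply ascending by location; the exhaustive search is kept so that a different cost function can be dropped in without changing the rest of the code.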
Abstract:
The present invention is notably directed to a computer-implemented method of accessing data by an application. The method comprises detecting an application that is accessing an initial file stored in a storage system; fetching an application model of the detected application, the application model comprising an association between access patterns of the detected application when accessing first files and first metadata information of said first files; identifying one or more second files stored in the storage system, the identified one or more second files having second metadata information that is similar to the first metadata information; and prefetching the identified one or more second files.
Abstract:
A computer-implemented method for managing a tiered storage system having an archive tier and an active storage tier comprises determining a workload for moving data between the active tier and the archive tier; and determining an assignment of data to be stored across the active tier and the archive tier, based on the determined workload.
Abstract:
A storage system and a method for storing a data segment, a storage capacity manager and a method for managing a capacity of a storage unit, and a storage tier relocation manager and a method for relocating a data segment. The storage system includes at least two storage tiers, an access pattern evaluator, a classification unit, a selector, and logic. The storage capacity manager includes a monitoring unit and a capacity managing unit. The storage tier relocation manager includes a target storage tier, the data segment relocated to the target storage tier, and a protection measure.