Managing datasets of a cognitive storage system with a spiking neural network

    公开(公告)号:US11823038B2

    公开(公告)日:2023-11-21

    申请号:US16015897

    申请日:2018-06-22

    IPC分类号: G06N3/08 G06N3/049

    CPC分类号: G06N3/08 G06N3/049

    摘要: A computer-implemented method for managing datasets of a storage system is provided, wherein the datasets have respective sets of metadata, the method including: successively feeding first sets of metadata to a spiking neural network (SNN), the first sets of metadata fed corresponding to datasets of the storage system that are labeled with respect to classes they belong to, so as to be associated with class labels, for the SNN to learn representations of said classes in terms of connection weights that weight the metadata fed; successively feeding second sets of metadata to the SNN, the second sets of metadata corresponding to unlabeled datasets of the storage system, for the SNN to infer class labels for the unlabeled datasets, based on the second sets of metadata fed and the representations learned; and managing datasets in the storage system, based on class labels of the datasets, these including the inferred class labels.

    Method for dimensioning and reconfiguring of a storage system with removable media

    公开(公告)号:US09933942B2

    公开(公告)日:2018-04-03

    申请号:US14941980

    申请日:2015-11-16

    IPC分类号: G06F12/00 G06F3/06

    摘要: Embodiments include methods for operating a first storage system having a first number of data storage drives for enabling access to a first set of removable media. Aspects include providing a second storage system having a number K of data storage drives for enabling access to a second set of removable media and providing a set of parameters describing operational characteristics of the second storage system. Aspects also include determining an analytical model using the set of parameters, the analytical model describing the variation of average waiting time as a function of system load over a predefined range covering multiple system load regime domains and determining values of the set of parameters using the analytical model and data of the second storage system. Aspects further include using the analytical model and the values of the set of parameters for reconfiguring the first storage system.

    Methods and computer program products of selecting a new redundancy scheme for data relocation
    4.
    发明授权
    Methods and computer program products of selecting a new redundancy scheme for data relocation 有权
    选择新的冗余方案进行数据重定位的方法和计算机程序产品

    公开(公告)号:US09535626B1

    公开(公告)日:2017-01-03

    申请号:US14964808

    申请日:2015-12-10

    IPC分类号: G06F3/06

    摘要: Embodiments include methods, computer systems and computer program products for selecting a new redundancy scheme for data relocation in computer system having multiple data storage tiers and multiple redundancy schemes. Aspects include: receiving, at a processor of computer system, a request to relocate a data object to a destination data storage tier, retrieving a first and a second data reliability thresholds, redundancy scheme and data access pattern information of the data object, generating a set of available redundancy schemes for the data object with at least one copy on the destination data storage tier, calculating total cost for each of available redundancy schemes using performance cost, re-encoding cost, and storage cost for each of available redundancy schemes generated, selecting a redundancy scheme, and relocating data object to selected destination data storage tier by re-encoding the data object using the selected redundancy scheme.

    摘要翻译: 实施例包括用于在具有多个数据存储层和多个冗余方案的计算机系统中选择用于数据重定位的新冗余方案的方法,计算机系统和计算机程序产品。 方面包括:在计算机系统的处理器处接收将数据对象重新定位到目的地数据存储层的请求,检索数据对象的第一和第二数据可靠性阈值,冗余方案和数据访问模式信息,生成 在目的地数据存储层上具有至少一个副本的数据对象的可用冗余方案集合,使用生成的每个可用冗余方案的性能成本,重新编码成本和存储成本来计算每个可用冗余方案的总成本, 选择冗余方案,并且通过使用所选择的冗余方案对所述数据对象进行重新编码来将数据对象重定位到所选择的目的地数据存储层。

    WRITING DATA ACROSS STORAGE DEVICES IN AN ERASURE-CODED SYSTEM
    5.
    发明申请
    WRITING DATA ACROSS STORAGE DEVICES IN AN ERASURE-CODED SYSTEM 有权
    在擦除编码系统中的存储设备上写数据

    公开(公告)号:US20150121169A1

    公开(公告)日:2015-04-30

    申请号:US14524382

    申请日:2014-10-27

    IPC分类号: G06F11/10

    CPC分类号: G06F11/1076 G06F2211/1009

    摘要: A computing device writes data across storage devices in an erasure-coded system. The computing device computes data blocks and parity blocks from data and computes a portion of the data to be stored in the system. The computing is performed by one or more controllers included in a redundant array of an independent disks controller. The computing device provides the locations of the data blocks and the parity blocks in storage devices of an erasure-coded system. The location is determined using one or more placement nodes. The placement nodes are configured for managing placement schemes of data blocks and parity blocks on the storage devices.

    摘要翻译: 计算设备在擦除编码系统中的存储设备之间写入数据。 计算设备从数据计算数据块和奇偶校验块,并计算要存储在系统中的一部分数据。 计算由包括在独立磁盘控制器的冗余阵列中的一个或多个控制器执行。 计算设备在擦除编码系统的存储设备中提供数据块和奇偶校验块的位置。 使用一个或多个放置节点确定位置。 配置节点配置用于管理存储设备上的数据块和奇偶校验块的布局方案。

    Access ordering for tape cycle optimization

    公开(公告)号:US11061579B2

    公开(公告)日:2021-07-13

    申请号:US16566964

    申请日:2019-09-11

    IPC分类号: G06F3/06

    摘要: A method for data storage tape access optimization is provided. The method identifies a plurality of data segments stored on a data storage tape. Each segment having a location and a size on the data storage tape. The method determines a subset of data segments stored on the data storage tape selected for retrieval. The method determines an access order for the subset of data segments. The access order represents a minimized tape cycle time for retrieving the subset of data segments. The method retrieves the subset of data segments from the data storage tape based on the access order.

    ACCESS ORDERING FOR TAPE CYCLE OPTIMIZATION

    公开(公告)号:US20210072900A1

    公开(公告)日:2021-03-11

    申请号:US16566964

    申请日:2019-09-11

    IPC分类号: G06F3/06

    摘要: A method for data storage tape access optimization is provided. The method identifies a plurality of data segments stored on a data storage tape. Each segment having a location and a size on the data storage tape. The method determines a subset of data segments stored on the data storage tape selected for retrieval. The method determines an access order for the subset of data segments. The access order represents a minimized tape cycle time for retrieving the subset of data segments. The method retrieves the subset of data segments from the data storage tape based on the access order.

    Data prefetching for large data systems

    公开(公告)号:US10397368B2

    公开(公告)日:2019-08-27

    申请号:US14749912

    申请日:2015-06-25

    摘要: The present invention is notably directed to a computer-implemented method of accessing data by an application. The method comprises detecting an application that is accessing an initial file stored in a storage system; fetching an application model of the detected application, the application model comprising an association between access patterns of the detected application when accessing first files with first metadata information of the said first files; identifying one or more second files stored in the storage system, the identified one or more second files having second metadata information that are similar to the first metadata information; and prefetching the identified one or more second files.

    STORAGE SYSTEM
    10.
    发明申请
    STORAGE SYSTEM 审中-公开
    存储系统

    公开(公告)号:US20160062689A1

    公开(公告)日:2016-03-03

    申请号:US14837297

    申请日:2015-08-27

    IPC分类号: G06F3/06 G06F12/02 G06F12/12

    摘要: A storage system and a method for storing a data segment, a storage capacity manager and a method for managing a capacity of a storage unit, and a storage tier relocation manager and a method for relocating a data segment. The storage system includes at least two storage tiers, an access pattern evaluator, a classification unit, a selector, and logic. The storage capacitor manager includes a monitoring unit and capacity managing unit. The storage tier relocation manager includes a target storage tier, the data segment relocated to the target storage tier, and a protection measure.

    摘要翻译: 用于存储数据段的存储系统和方法,存储容量管理器和用于管理存储单元的容量的方法,以及存储层重定位管理器和重新定位数据段的方法。 存储系统包括至少两个存储层,访问模式评估器,分类单元,选择器和逻辑。 存储电容器管理器包括监视单元和容量管理单元。 存储层重定位管理器包括目标存储层,重定位到目标存储层的数据段以及保护措施。