专利检索 ap:("INTERNATIONAL BUSINESS MACHINES CORPORATION") AND inv:"Taras Lehinevych" 第 1 页

1.

发明申请
DATASET RELEVANCE ESTIMATION IN STORAGE SYSTEMS 审中-公开

公开(公告)号：US20190243546A1

公开(公告)日：2019-08-08

申请号：US16390214

申请日：2019-04-22

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Giovanni Cherubini , Mark A. Lantz , Taras Lehinevych , Vinodh Venkatesan

IPC分类号： G06F3/06

CPC分类号： G06F3/064 , G06F3/0604 , G06F3/0611 , G06F3/0631 , G06F3/0649 , G06F3/067 , G06F3/0685

摘要： The invention is notably directed to computer-implemented methods and systems for managing datasets in a storage system. In such systems, it is assumed that a (typically small) subset of datasets are labeled with respect to their relevance, so as to be associated with respective relevance values. Essentially, the present methods determine, for each unlabeled dataset of the datasets, a respective probability distribution over a set of relevance values. From this probability distribution, a corresponding relevance value can be obtained. This probability distribution is computed based on distances (or similarities), in terms of metadata values, between said each unlabeled dataset and the labeled datasets. Based on their associated relevance values, datasets can then be efficiently managed in a storage system.

2.

发明授权
Dataset relevance estimation in storage systems 有权

公开(公告)号：US10592147B2

公开(公告)日：2020-03-17

申请号：US15660434

申请日：2017-07-26

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Giovanni Cherubini , Mark A. Lantz , Taras Lehinevych , Vinodh Venkatesan

IPC分类号： G06F16/30 , G06F3/06

摘要： The invention is notably directed to computer-implemented methods and systems for managing datasets in a storage system. In such systems, it is assumed that a (typically small) subset of datasets are labeled with respect to their relevance, so as to be associated with respective relevance values. Essentially, the present methods determine, for each unlabeled dataset of the datasets, a respective probability distribution over a set of relevance values. From this probability distribution, a corresponding relevance value can be obtained. This probability distribution is computed based on distances (or similarities), in terms of metadata values, between said each unlabeled dataset and the labeled datasets. Based on their associated relevance values, datasets can then be efficiently managed in a storage system.

3.

发明申请
DATASET RELEVANCE ESTIMATION IN STORAGE SYSTEMS 审中-公开

公开(公告)号：US20190034083A1

公开(公告)日：2019-01-31

申请号：US15660434

申请日：2017-07-26

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Giovanni Cherubini , Mark A. Lantz , Taras Lehinevych , Vinodh Venkatesan

IPC分类号： G06F3/06

CPC分类号： G06F3/064 , G06F3/0604 , G06F3/0611 , G06F3/0631 , G06F3/0649 , G06F3/067 , G06F3/0685

摘要： The invention is notably directed to computer-implemented methods and systems for managing datasets in a storage system. In such systems, it is assumed that a (typically small) subset of datasets are labeled with respect to their relevance, so as to be associated with respective relevance values. Essentially, the present methods determine, for each unlabeled dataset of the datasets, a respective probability distribution over a set of relevance values. From this probability distribution, a corresponding relevance value can be obtained. This probability distribution is computed based on distances (or similarities), in terms of metadata values, between said each unlabeled dataset and the labeled datasets. Based on their associated relevance values, datasets can then be efficiently managed in a storage system.