-
Publication No.: US11307894B1
Publication Date: 2022-04-19
Application No.: US16659798
Filing Date: 2019-10-22
Applicant: Pure Storage, Inc.
Inventors: Ivan Jibaja, Stefan Dorsett, Prashant Jaikumar, Roy Kim, Curtis Pullen
Abstract: Executing a big data analytics pipeline in a storage system that includes compute resources and shared storage resources, including: receiving, from a data producer, a dataset; storing, within the storage system, the dataset; allocating processing resources to an analytics application; and executing the analytics application on the processing resources, including ingesting the dataset from the storage system.
-
Publication No.: US10671434B1
Publication Date: 2020-06-02
Application No.: US16040846
Filing Date: 2018-07-20
Applicant: PURE STORAGE, INC.
Inventors: Brian Gold, Emily Watkins, Ivan Jibaja, Igor Ostrovsky, Roy Kim
Abstract: Data transformation offloading in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: storing, within the storage system, a dataset; identifying, in dependence upon one or more machine learning models to be executed on the GPU servers, one or more transformations to apply to the dataset; and generating, by the storage system in dependence upon the one or more transformations, a transformed dataset.
-
Publication No.: US11210140B1
Publication Date: 2021-12-28
Application No.: US16888135
Filing Date: 2020-05-29
Applicant: PURE STORAGE, INC.
Inventors: Brian Gold, Emily Potyraj, Ivan Jibaja, Igor Ostrovsky, Roy Kim
IPC Classification: G06N99/00, G06F9/50, G06F3/06, G06N20/00, G06F16/245, G06F9/48, G06N3/063, G06N3/08, G06T1/20, G06T1/60, G06F16/958, G06F16/248
Abstract: Data transformation offloading in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: storing, within the storage system, a dataset; identifying, in dependence upon one or more machine learning models to be executed on the GPU servers, one or more transformations to apply to the dataset; and generating, by the storage system in dependence upon the one or more transformations, a transformed dataset.
-
Publication No.: US10275285B1
Publication Date: 2019-04-30
Application No.: US16046337
Filing Date: 2018-07-26
Applicant: PURE STORAGE, INC.
Inventors: Brian Gold, Emily Watkins, Ivan Jibaja, Igor Ostrovsky, Roy Kim
Abstract: Data transformation caching in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: identifying, in dependence upon one or more machine learning models to be executed on the GPU servers, one or more transformations to apply to a dataset; generating, in dependence upon the one or more transformations, a transformed dataset; storing, within one or more of the storage systems, the transformed dataset; receiving a plurality of requests to transmit the transformed dataset to one or more of the GPU servers; and responsive to each request, transmitting, from the one or more storage systems to the one or more GPU servers without re-performing the one or more transformations on the dataset, the transformed dataset.
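Illustrative note: the caching steps in this abstract (transform once, store the result, serve repeated requests without re-transforming) can be sketched roughly as follows. This is a minimal sketch and not the patented implementation; `TransformCache`, `scale`, `center`, and the dataset id `images-v1` are hypothetical names chosen for illustration, and an in-memory dictionary stands in for the storage system.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Transform = Callable[[List[float]], List[float]]


class TransformCache:
    """Hypothetical stand-in for a storage system that keeps transformed datasets."""

    def __init__(self) -> None:
        self._store: Dict[Tuple[str, Tuple[str, ...]], List[float]] = {}
        self.transform_runs = 0  # how many times the transformations were actually applied

    def get(
        self,
        dataset_id: str,
        dataset: List[float],
        transforms: Sequence[Tuple[str, Transform]],
    ) -> List[float]:
        # The cache key combines the dataset id with the ordered transformation names.
        key = (dataset_id, tuple(name for name, _ in transforms))
        if key not in self._store:
            data = list(dataset)
            for _, fn in transforms:
                data = fn(data)
            self.transform_runs += 1
            self._store[key] = data
        # Later requests are served from the stored copy without re-transforming.
        return self._store[key]


def scale(values: List[float]) -> List[float]:
    """Assumed example transformation: map 0..255 values into 0..1."""
    return [v / 255.0 for v in values]


def center(values: List[float]) -> List[float]:
    """Assumed example transformation: subtract the mean."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


cache = TransformCache()
pipeline = [("scale", scale), ("center", center)]
raw = [0.0, 127.5, 255.0]

first = cache.get("images-v1", raw, pipeline)   # transformations run here
second = cache.get("images-v1", raw, pipeline)  # served from the stored copy
print(first, cache.transform_runs)
```

Keying on the transformation names as well as the dataset id means a model that expects a different input format would get its own stored copy rather than the wrong one.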
-
Publication No.: US11768636B2
Publication Date: 2023-09-26
Application No.: US18146807
Filing Date: 2022-12-27
Applicant: PURE STORAGE, INC.
Inventors: Brian Gold, Emily Watkins, Ivan Jibaja, Igor Ostrovsky, Roy Kim
IPC Classification: H04L67/12, G06F3/06, G06N20/00, G06F16/245, G06F16/178, G06Q30/0242, G06F9/48, G06F9/50, G06N3/063, G06N3/08, G06T1/20, G06T1/60, G06F16/958, G06F16/248
CPC Classification: G06F3/0679, G06F3/0604, G06F3/067, G06F3/0608, G06F3/0646, G06F3/0649, G06F9/4881, G06F9/5027, G06F16/1794, G06F16/245, G06N3/063, G06N3/08, G06N20/00, G06Q30/0243, G06T1/20, G06T1/60, G06F16/248, G06F16/972, G06T2200/28
Abstract: Generating a transformed dataset for use by a machine learning model in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: storing, within one or more storage systems, a transformed dataset generated by applying one or more transformations to a dataset that are identified based on one or more expected input formats of data received as input data by one or more machine learning models to be executed on one or more servers; and transmitting, from the one or more storage systems to the one or more servers without reapplying the one or more transformations on the dataset, the transformed dataset including data in the one or more expected formats of data to be received as input data by the one or more machine learning models.
-
Publication No.: US11556280B2
Publication Date: 2023-01-17
Application No.: US16888402
Filing Date: 2020-05-29
Applicant: PURE STORAGE, INC.
Inventors: Brian Gold, Emily Watkins, Ivan Jibaja, Igor Ostrovsky, Roy Kim
IPC Classification: G06F3/06, G06N20/00, G06F16/245, G06F16/178, G06Q30/02, G06F9/48, G06F9/50, G06N3/063, G06N3/08, G06T1/20, G06T1/60, G06F16/958, G06F16/248
Abstract: Data transformation caching in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: identifying, in dependence upon one or more machine learning models to be executed on the GPU servers, one or more transformations to apply to a dataset; generating, in dependence upon the one or more transformations, a transformed dataset; storing, within one or more of the storage systems, the transformed dataset; receiving a plurality of requests to transmit the transformed dataset to one or more of the GPU servers; and responsive to each request, transmitting, from the one or more storage systems to the one or more GPU servers without re-performing the one or more transformations on the dataset, the transformed dataset.
-
Publication No.: US11263095B1
Publication Date: 2022-03-01
Application No.: US17010565
Filing Date: 2020-09-02
Applicant: PURE STORAGE, INC.
Inventors: Ivan Jibaja, Curtis Pullen, Prashant Jaikumar, Stefan Dorsett, Gaurav Jain, Neil Vachharajani, Srinivas Chellappa
Abstract: Providing for high availability in a data analytics pipeline without replicas, including: creating a data analytics pipeline, wherein each component of the data analytics pipeline is deployed within a container; creating a failover container; detecting that a component within the data analytics pipeline has failed; and responsive to detecting that the component within the data analytics pipeline has failed, deploying the component within the data analytics pipeline that has failed in the failover container.
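Illustrative note: the failover flow in this abstract (one container per component, a single standby failover container, redeploying a failed component into the standby) might look roughly like the toy simulation below. It is a sketch under stated assumptions and not the patented method; `Container`, `Pipeline`, the container names, and the `healthy` flag used as the failure signal are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Container:
    name: str
    component: Optional[str] = None  # component currently deployed here, if any
    healthy: bool = True


class Pipeline:
    """Toy model: one container per component plus one empty failover container."""

    def __init__(self, components: List[str]) -> None:
        self.containers: Dict[str, Container] = {
            c: Container(name=f"ctr-{c}", component=c) for c in components
        }
        self.failover = Container(name="ctr-failover")

    def detect_failed(self) -> Optional[str]:
        # Assumption: a failure shows up as an unhealthy container.
        for component, ctr in self.containers.items():
            if not ctr.healthy:
                return component
        return None

    def recover(self) -> Optional[str]:
        failed = self.detect_failed()
        # Nothing to do if no component failed or the standby is already in use.
        if failed is None or self.failover.component is not None:
            return None
        # Deploy the failed component in the failover container.
        self.failover.component = failed
        self.containers[failed] = self.failover
        return failed


pipeline = Pipeline(["ingest", "index", "search"])
pipeline.containers["index"].healthy = False  # simulate a component failure
recovered = pipeline.recover()
print(recovered, pipeline.containers["index"].name)
```

The sketch keeps no replica of any component; only the empty standby container exists before the failure, which is the property the abstract emphasizes.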
-
Publication No.: US20210216631A1
Publication Date: 2021-07-15
Application No.: US17074261
Filing Date: 2020-10-19
Applicant: Pure Storage, Inc.
Inventors: Roy Child, Robert Lee, Ivan Jibaja, Ronald Karr
IPC Classification: G06F21/56
Abstract: An illustrative method includes a data protection system identifying a first attribute set associated with a first file stored in a storage system, determining that the first file is replaced in the storage system with a second file, identifying a second attribute set associated with the second file, and determining, based on the determining that the first file is replaced in the storage system with the second file and on one or more attributes in at least one of the first attribute set or the second attribute set, that data stored by the storage system is possibly being targeted by a security threat.
-
Publication No.: US12008404B2
Publication Date: 2024-06-11
Application No.: US17721175
Filing Date: 2022-04-14
Applicant: PURE STORAGE, INC.
Inventors: Ivan Jibaja, Prashant Jaikumar, Stefan Dorsett, Curtis Pullen, Roy Kim
CPC Classification: G06F9/5011, G06F9/4856, G06F9/505, G06F16/2272, G06F16/258
Abstract: Executing a big data analytics pipeline in a storage system that includes compute resources and shared storage resources, including: receiving, from a data producer, a dataset; storing, within the storage system, the dataset; allocating processing resources to an analytics application; and executing the analytics application on the processing resources, including ingesting the dataset from the storage system.
-
Publication No.: US11768635B2
Publication Date: 2023-09-26
Application No.: US17728886
Filing Date: 2022-04-25
Applicant: PURE STORAGE, INC.
Inventors: Taher Vohra, Par Botes, Naveen Neelakantam, Ivan Jibaja
IPC Classification: G06F3/06
CPC Classification: G06F3/0665, G06F3/0605, G06F3/067, G06F3/0632, G06F3/0644, G06F3/0653
Abstract: Scaling storage resources in a storage volume, including: monitoring a usage of a volume in a storage pool that includes one or more cloud-based storage systems; determining that the usage of the volume exceeds a threshold usage; and based on the determination, expanding the resources that are included in the storage pool for servicing the volume, including: instantiating one or more new virtual drives that are included in the one or more cloud-based storage systems; and adding the one or more new virtual drives to the storage pool.
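Illustrative note: the threshold-driven expansion in this abstract (monitor usage, compare against a threshold, add virtual drives to the pool) could be sketched as below. This is a minimal sketch and not the patented method; `StoragePool`, `scale_if_needed`, the fixed per-drive capacity, and the 0.8 default threshold are assumptions made only for the example.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class StoragePool:
    """Hypothetical pool in which every virtual drive has the same capacity."""

    drive_capacity_gib: int
    drives: List[str] = field(default_factory=lambda: ["vdrive-0"])

    @property
    def capacity_gib(self) -> int:
        return self.drive_capacity_gib * len(self.drives)


def scale_if_needed(pool: StoragePool, used_gib: float, threshold: float = 0.8) -> List[str]:
    """Add virtual drives until usage is at or below the threshold; return the new drive names."""
    added: List[str] = []
    while used_gib / pool.capacity_gib > threshold:
        name = f"vdrive-{len(pool.drives)}"  # stands in for instantiating a new virtual drive
        pool.drives.append(name)             # add the new drive to the storage pool
        added.append(name)
    return added


pool = StoragePool(drive_capacity_gib=100)
new_drives = scale_if_needed(pool, used_gib=170)
print(new_drives, pool.capacity_gib)
```

Expressing the threshold as a fraction of pool capacity is one possible reading; the abstract itself does not say how the threshold usage is defined.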