-
公开(公告)号:US11687243B2
公开(公告)日:2023-06-27
申请号:US17382447
申请日:2021-07-22
Applicant: EMC IP Holding Company LLC
Inventor: Venkata Ippatapu , Ramesh Doddaiah , Sweetesh Singh
CPC classification number: G06F3/061 , G06F3/067 , G06F3/0655 , G06F3/0679 , G06F12/0692
Abstract: Aspects of the present disclosure relate to reducing the latency of data deduplication. In embodiments, an input/output (IO) workload received by a storage array is monitored. Further, at least one IO write operation in the IO workload is identified. A space-efficient probabilistic data structure is used to determine if a director board is associated with the IO write. Additionally, the IO write operation is processed based on the determination.
-
公开(公告)号:US11609695B2
公开(公告)日:2023-03-21
申请号:US17009822
申请日:2020-09-02
Applicant: EMC IP HOLDING COMPANY LLC
Inventor: John Krasner , Sweetesh Singh
Abstract: A data model is trained to determine whether data is raw, compressed, and/or encrypted. The data model may also be trained to recognize which compression algorithm was used to compress data and predict compression ratios for the data using different compression algorithms. A storage system uses the data model to independently identify raw data. The raw data is grouped based on similarity of statistical features and group members are compressed with the same compression algorithm and may be encrypted after compression with the same encryption algorithm. The data model may also be used to identify sub-optimally compressed data, which may be uncompressed and grouped for compression using a different compression algorithm.
-
公开(公告)号:US10963177B2
公开(公告)日:2021-03-30
申请号:US15966138
申请日:2018-04-30
Applicant: EMC IP Holding Company LLC
Inventor: Sweetesh Singh
Abstract: A fingerprint trie is used to store fingerprints for data portions stored on a storage system for use in implementing data deduplication on a storage system. The fingerprint trie may be used to compare fingerprint values to determine duplicate data portions, for example, in response to I/O operations. Leaf nodes of the fingerprint trie may be keyed by fingerprints, and a value of each leaf node may be a reference to the physical storage location of the data portion from which the fingerprint was generated. When an I/O operation is received, a fingerprint may be generated for each of one or more data portions included in the I/O operation. A fingerprint trie may be searched, for example by traversing multiple nodes of the trie according to pointers provided by the nodes, to determine whether there is any matching fingerprint specified in the fingerprint trie.
-
公开(公告)号:US20200348958A1
公开(公告)日:2020-11-05
申请号:US16400937
申请日:2019-05-01
Applicant: EMC IP Holding Company LLC
Inventor: Jonathan I. Krasner , Steven R. Chalmer , Chakib Ourraoui , Sweetesh Singh
Abstract: A method for processing data includes monitoring, by a virtual machine (VM), a plurality of computing resources, receiving an offload request by the VM, selecting, based on the monitoring, a computing resource from the plurality of computing resources, issuing, by the VM and in response to the offload request, the processing request to the computing resource, and servicing, by the computing resource, the processing request to obtain a result, wherein the VM and the computing resource are executing on a computing device.
-
公开(公告)号:US20190332300A1
公开(公告)日:2019-10-31
申请号:US15966138
申请日:2018-04-30
Applicant: EMC IP Holding Company LLC
Inventor: Sweetesh Singh
Abstract: A fingerprint trie is used to store fingerprints for data portions stored on a storage system for use in implementing data deduplication on a storage system. The fingerprint trie may be used to compare fingerprint values to determine duplicate data portions, for example, in response to I/O operations. Leaf nodes of the fingerprint trie may be keyed by fingerprints, and a value of each leaf node may be a reference to the physical storage location of the data portion from which the fingerprint was generated. When an I/O operation is received, a fingerprint may be generated for each of one or more data portions included in the I/O operation. A fingerprint trie may be searched, for example by traversing multiple nodes of the trie according to pointers provided by the nodes, to determine whether there is any matching fingerprint specified in the fingerprint trie.
-
公开(公告)号:US20220179829A1
公开(公告)日:2022-06-09
申请号:US17113237
申请日:2020-12-07
Applicant: EMC IP HOLDING COMPANY LLC
Inventor: John Krasner , Sweetesh Singh
IPC: G06F16/174 , G06N5/04 , G06N20/00
Abstract: A data model is trained to predict compressibility of binary data structures based on component entropy and predict relative compression efficiency for various compression algorithms based on component size. A recommendation engine in a storage system uses the data model to predict compressibility of binary data and determines whether to compress the binary data based on predicted compressibility. If the recommendation engine determines that compression of the binary data is justified, then a compression algorithm is recommended based on predicted relative compression efficiency. For example, the compression algorithm predicted to yield the greatest compression ratio or shortest compression/decompression time may be recommended.
-
公开(公告)号:US20220026970A1
公开(公告)日:2022-01-27
申请号:US16939133
申请日:2020-07-27
Applicant: EMC IP Holding Company LLC
Inventor: John Krasner , Clifford Lim , Sweetesh Singh
IPC: G06F1/30 , G06F11/30 , G06F1/28 , G06F1/3296
Abstract: One or more aspects of the present disclosure relate to data protection techniques in response to power disruptions a power supply from a continuous power source for a storage device can be monitored. A power disruption event interrupting the power supply from the continuous power source can further be identified. In response to detecting an event, a storage system can be switched to a backup power supply, power consumption of one or more components of the storage device can be controlled based on information associated with each component and an amount of power available in the backup power supply. Further, one or more power interruption operations can be performed while the backup power supply includes sufficient power for performing the power interruption operations.
-
公开(公告)号:US20200348957A1
公开(公告)日:2020-11-05
申请号:US16400926
申请日:2019-05-01
Applicant: EMC IP Holding Company LLC
Inventor: Jonathan I. Krasner , Sweetesh Singh , Steven R. Chalmer
Abstract: A method for processing data includes receiving a write request by a host operating system during a predetermined time window, storing data associated with the write request in a shared memory, making a first determination that a threshold number of write requests are obtained within the predetermined time window, and, in response to the first determination, sending an offload request to a virtual machine (VM), wherein the offload request specifies at least the write request.
-
公开(公告)号:US11977959B2
公开(公告)日:2024-05-07
申请号:US16412970
申请日:2019-05-15
Applicant: EMC IP Holding Company LLC
Inventor: Jonathan Krasner , Sweetesh Singh
IPC: G06N20/00 , G06F9/30 , G06F18/23213 , G06F18/2413 , G06N3/088
CPC classification number: G06N20/00 , G06F9/30029 , G06F18/23213 , G06F18/24147 , G06N3/088
Abstract: Disclosed are techniques for compressing data in a data storage system comprising searching a cluster of nearest neighbors, wherein the cluster has been created using a locality sensitive hashing algorithm, to determine if a data block can be compressed. In alternate embodiments, nearest neighbor clusters can be formed using unsupervised learning. Additionally, nearest neighbors can also be formed in alternate embodiments using one or more of the following algorithms: a k-means clustering algorithm, a k-medoids clustering algorithm, a mean shift algorithm, a generalized method of moment (GMM) algorithm, or a density based spatial clustering of applications with noise (DBSCAN) algorithm.
-
公开(公告)号:US11119802B2
公开(公告)日:2021-09-14
申请号:US16400926
申请日:2019-05-01
Applicant: EMC IP Holding Company LLC
Inventor: Jonathan I. Krasner , Sweetesh Singh , Steven R. Chalmer
Abstract: A method for processing data includes receiving a write request by a host operating system during a predetermined time window, storing data associated with the write request in a shared memory, making a first determination that a threshold number of write requests are obtained within the predetermined time window, and, in response to the first determination, sending an offload request to a virtual machine (VM), wherein the offload request specifies at least the write request.
-
-
-
-
-
-
-
-
-