-
公开(公告)号:US11263128B2
公开(公告)日:2022-03-01
申请号:US15795650
申请日:2017-10-27
Applicant: Google LLC
Inventor: Wangyuan Zhang , Sandeep Singhal , Sangho Yoon , Guangda Lai , Arash Baratloo , Zhifan Zhang , Gael Hatchue Njouyep , Pramod Gaud
Abstract: A method includes receiving data objects, determining a predicted lifespan of each data object, and instantiating multiple shard files. Each shard file has an associated predicted lifespan range. The method also includes writing each data object into a corresponding shard file having the associated predicted lifespan range that includes the predicted lifespan of the respective data object and storing the shard files in a distributed system. The method also includes determining whether any stored shard files satisfy a compaction criteria based on a number of deleted data objects in each corresponding stored shard file. For each stored shard file satisfying the compaction criteria, the method also includes compacting the stored shard file by rewriting the remaining data objects of the stored shard file into a new shard file.
-
公开(公告)号:US20190129844A1
公开(公告)日:2019-05-02
申请号:US15795650
申请日:2017-10-27
Applicant: Google LLC
Inventor: Wangyuan Zhang , Sandeep Singhal , Sangho Yoon , Guangda Lai , Arash Baratloo , Zhifan Zhang , Gael Hatchue Njouyep , Pramod Gaud
Abstract: A method includes receiving data objects, determining a predicted lifespan of each data object, and instantiating multiple shard files. Each shard file has an associated predicted lifespan range. The method also includes writing each data object into a corresponding shard file having the associated predicted lifespan range that includes the predicted lifespan of the respective data object and storing the shard files in a distributed system. The method also includes determining whether any stored shard files satisfy a compaction criteria based on a number of deleted data objects in each corresponding stored shard file. For each stored shard file satisfying the compaction criteria, the method also includes compacting the stored shard file by rewriting the remaining data objects of the stored shard file into a new shard file.
-
公开(公告)号:US20210136150A1
公开(公告)日:2021-05-06
申请号:US16673128
申请日:2019-11-04
Applicant: Google LLC
Inventor: Wangyuan Zhang , Vivienne Zhang , Pramod Gaud , Sangho Yoon , Xudong Shi , Kaifeng Yao
IPC: H04L29/08 , G06F16/182 , G06N5/00 , G06N20/00 , H04L29/06
Abstract: A system and method for storing data in a distributed network having a plurality of datacenters distributed over a plurality of geographic regions. The method may involve receiving data, including metadata, uploaded to a first datacenter of the distributed network, receiving access information about previous data that was previously stored in the plurality of datacenters of the distributed network, predicting one or more of the plurality of geographic regions from which the uploaded data will be accessed based on the metadata and the access information, and instructing the uploaded data to be transferred from the first datacenter to one or more second datacenters located at each of the one or more predicted geographic regions.
-
公开(公告)号:US11588891B2
公开(公告)日:2023-02-21
申请号:US16673128
申请日:2019-11-04
Applicant: Google LLC
Inventor: Wangyuan Zhang , Vivienne Zhang , Pramod Gaud , Sangho Yoon , Xudong Shi , Kaifeng Yao
IPC: H04L67/1097 , G06F16/182 , G06N20/00 , G06N5/00 , H04L65/401 , H04L67/06 , H04L67/568
Abstract: A system and method for storing data in a distributed network having a plurality of datacenters distributed over a plurality of geographic regions. The method may involve receiving data, including metadata, uploaded to a first datacenter of the distributed network, receiving access information about previous data that was previously stored in the plurality of datacenters of the distributed network, predicting one or more of the plurality of geographic regions from which the uploaded data will be accessed based on the metadata and the access information, and instructing the uploaded data to be transferred from the first datacenter to one or more second datacenters located at each of the one or more predicted geographic regions.
-
公开(公告)号:US11954024B2
公开(公告)日:2024-04-09
申请号:US17648724
申请日:2022-01-24
Applicant: Google LLC
Inventor: Wangyuan Zhang , Sandeep Singhal , Sangho Yoon , Guangda Lai , Arash Baratloo , Zhifan Zhang , Gael Hatchue Njouyep , Pramod Gaud
CPC classification number: G06F12/0261 , G06F3/0604 , G06F3/0608 , G06F16/122 , G06F16/1734 , G06F16/182 , G06F16/2219 , G06F3/0649
Abstract: A method includes receiving data objects, determining a predicted lifespan of each data object, and instantiating multiple shard files. Each shard file has an associated predicted lifespan range. The method also includes writing each data object into a corresponding shard file having the associated predicted lifespan range that includes the predicted lifespan of the respective data object and storing the shard files in a distributed system. The method also includes determining whether any stored shard files satisfy a compaction criteria based on a number of deleted data objects in each corresponding stored shard file. For each stored shard file satisfying the compaction criteria, the method also includes compacting the stored shard file by rewriting the remaining data objects of the stored shard file into a new shard file.
-
公开(公告)号:US20230164219A1
公开(公告)日:2023-05-25
申请号:US18156541
申请日:2023-01-19
Applicant: Google LLC
Inventor: Wangyuan Zhang , Vivienne Zhang , Pramod Gaud , Sangho Yoon , Xudong Shi , Saifeng Yao
IPC: H04L67/1097 , G06F16/182 , G06N20/00 , H04L65/401 , H04L67/06 , H04L67/568 , G06N5/01
CPC classification number: H04L67/1097 , G06F16/183 , G06N20/00 , H04L65/4015 , H04L67/06 , H04L67/568 , G06N5/01
Abstract: A system and method for storing data in a distributed network having a plurality of datacenters distributed over a plurality of geographic regions. The method may involve receiving data, including metadata, uploaded to a first datacenter of the distributed network, receiving access information about previous data that was previously stored in the plurality of datacenters of the distributed network, predicting one or more of the plurality of geographic regions from which the uploaded data will be accessed based on the metadata and the access information, and instructing the uploaded data to be transferred from the first datacenter to one or more second datacenters located at each of the one or more predicted geographic regions.
-
公开(公告)号:US20220147448A1
公开(公告)日:2022-05-12
申请号:US17648724
申请日:2022-01-24
Applicant: Google LLC
Inventor: Wangyuan Zhang , Sandeep Singhal , Sangho Yoon , Guangda Lai , Arash Baratloo , Zhifan Zhang , Gael Hatchue Njouyep , Pramod Gaud
Abstract: A method includes receiving data objects, determining a predicted lifespan of each data object, and instantiating multiple shard files. Each shard file has an associated predicted lifespan range. The method also includes writing each data object into a corresponding shard file having the associated predicted lifespan range that includes the predicted lifespan of the respective data object and storing the shard files in a distributed system. The method also includes determining whether any stored shard files satisfy a compaction criteria based on a number of deleted data objects in each corresponding stored shard file. For each stored shard file satisfying the compaction criteria, the method also includes compacting the stored shard file by rewriting the remaining data objects of the stored shard file into a new shard file.
-
-
-
-
-
-