-
公开(公告)号:US20210357764A1
公开(公告)日:2021-11-18
申请号:US16476410
申请日:2019-01-11
Applicant: Google LLC
Inventor: Zachary Charles Frazier , Andreas Ulbrich , Thomas A. Vaughan , Zhe Wang , Shen-fu Tsai , Evgeny Skvortsov
Abstract: Methods, systems, and computer readable medium for reducing inconsistencies in output between an original model and a new model. The method includes receiving an original model and a new model, mapping structures of the new model to structures of the original model, classifying each structure of the new model as belonging to a group of the original model, an unused group not in the original model, a subset of a group of the original model, or a merged set of a first and a second, different group of the original model, generating a merged model based on the mapping and classifying, and classifying a unique entities, using the merged model, by applying consistent hashing to each of the unique entities.
-
公开(公告)号:US10594813B1
公开(公告)日:2020-03-17
申请号:US15487229
申请日:2017-04-13
Applicant: Google LLC
Inventor: James R. Koehler , Zachary Charles Frazier , Thomas A. Vaughan , Sheng Ma , Evgeny Skvortsov
Abstract: Methods, systems, and apparatus include computer programs encoded on a computer-readable storage medium, including a method for discovering unique entities over multiple devices. A virtual pool of entities is created and divided into subpools, each including fewer than all entities. Subpools are subdivided into delta pools. Cookies are recorded for each delta pool when the particular portion of content is presented to or accessed by entities in the delta pool. Recorded cookies are divided into cookie types based on cookie characteristics. Machine learning and statistical analysis algorithms are used to automatically determine sizes of delta pools and probabilities of each cookie type being classified as belonging to particular delta pools. Virtual entities are assigned from the virtual pool to each of the recorded cookies that were recorded when the particular portion of content was presented. A number of unique entities that accessed the particular portion of content is determined.
-