USING HIERARCHICAL RESERVOIR SAMPLING TO COMPUTE PERCENTILES AT SCALE
    1.
    发明申请
    USING HIERARCHICAL RESERVOIR SAMPLING TO COMPUTE PERCENTILES AT SCALE 有权
    使用分层储存器计算计算符号

    公开(公告)号:US20160277490A1

    公开(公告)日:2016-09-22

    申请号:US14664043

    申请日:2015-03-20

    Applicant: Yahoo! Inc.

    CPC classification number: H04L67/1029 H04L41/044 H04L43/022 H04L43/0876

    Abstract: In one embodiment, in a hierarchy of nodes, a master node having two or more child nodes obtains from the two or more child nodes two or more sets of data samples or summaries associated therewith, the two or more sets of data samples being representative of traffic processed via two or more sets of servers corresponding to the two or more child nodes, wherein a size of each of the two or more sets of data samples is proportional to an allocation of traffic among the two or more sets of servers corresponding to the two or more child nodes. Each of the two or more sets of data samples is obtained from a different one of the two or more child nodes and represents traffic processed by a corresponding one of the two or more sets of servers. The master node combines the two or more sets of data samples or summaries associated therewith such that a combined set of data is generated. The master node ascertains a numerical value from the combined set of data.

    Abstract translation: 在一个实施例中,在节点的层次中,具有两个或多个子节点的主节点从两个或更多个子节点获得两组或更多组数据样本或与其相关联的摘要,所述两组或更多组数据样本表示 经由与两个或更多个子节点对应的两组或多组服务器处理的流量,其中两组或更多组数据样本中的每一组的大小与对应于该两个或更多个子节点的两组或更多服务器集合中的流量分配成比例 两个或多个子节点。 从两个或多个子节点中的不同的一个子节点获得两组或更多组数据样本中的每一组,并且表示由两组或更多组服务器中的对应的一个服务器处理的流量。 主节点组合两组或多组数据样本或与之相关联的摘要,以便生成一组组合的数据。 主节点根据组合的数据集确定数值。

    Using hierarchical reservoir sampling to compute percentiles at scale

    公开(公告)号:US09756122B2

    公开(公告)日:2017-09-05

    申请号:US14664043

    申请日:2015-03-20

    Applicant: Yahoo! Inc.

    CPC classification number: H04L67/1029 H04L41/044 H04L43/022 H04L43/0876

    Abstract: In one embodiment, in a hierarchy of nodes, a master node having two or more child nodes obtains from the two or more child nodes two or more sets of data samples or summaries associated therewith, the two or more sets of data samples being representative of traffic processed via two or more sets of servers corresponding to the two or more child nodes, wherein a size of each of the two or more sets of data samples is proportional to an allocation of traffic among the two or more sets of servers corresponding to the two or more child nodes. Each of the two or more sets of data samples is obtained from a different one of the two or more child nodes and represents traffic processed by a corresponding one of the two or more sets of servers. The master node combines the two or more sets of data samples or summaries associated therewith such that a combined set of data is generated. The master node ascertains a numerical value from the combined set of data.

Patent Agency Ranking