Clustering cookies for identifying unique mobile devices
    1.
    Type: Granted patent
    Status: In force

    Publication number: US08935194B2

    Publication date: 2015-01-13

    Application number: US13767695

    Filing date: 2013-02-14

    Applicant: Yahoo! Inc.

    CPC classification number: G06N99/005 G06N7/005

    Abstract: Embodiments are directed towards clustering cookies for identifying unique mobile devices for associating activities over a network with a given mobile device. The cookies are clustered based on a Bayes Factor similarity model that is trained from cookie features of known mobile devices. The clusters may be used to determine the number of unique mobile devices that access a website. The clusters may also be used to provide targeted content to each unique mobile device.
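
The abstract gives no formulas, so the following is only a rough sketch of how a Bayes factor similarity model could be trained on cookies from known devices and then used to cluster cookies. The feature names (`user_agent`, `ip_prefix`, `timezone`), the naive independence assumption across features, the Laplace smoothing, and the union-find clustering step are all assumptions made for illustration, not details taken from the patent.

```python
import math
from itertools import combinations

# Hypothetical cookie features; the abstract does not name any.
FEATURES = ("user_agent", "ip_prefix", "timezone")

def agreement(a, b):
    """Per-feature booleans: does each feature match between two cookies?"""
    return tuple(a[f] == b[f] for f in FEATURES)

def train(labeled_cookies):
    """Estimate P(feature agrees | same device) and P(agrees | different devices).

    labeled_cookies: list of (device_id, cookie_dict) pairs from known devices.
    Laplace smoothing (+1 / +2) keeps every probability strictly inside (0, 1).
    """
    agree = {True: [0] * len(FEATURES), False: [0] * len(FEATURES)}
    total = {True: 0, False: 0}
    for (dev_a, ca), (dev_b, cb) in combinations(labeled_cookies, 2):
        same = dev_a == dev_b
        total[same] += 1
        for i, match in enumerate(agreement(ca, cb)):
            agree[same][i] += match
    p_same = [(agree[True][i] + 1) / (total[True] + 2) for i in range(len(FEATURES))]
    p_diff = [(agree[False][i] + 1) / (total[False] + 2) for i in range(len(FEATURES))]
    return p_same, p_diff

def log_bayes_factor(a, b, model):
    """log of P(evidence | same device) / P(evidence | different devices)."""
    p_same, p_diff = model
    score = 0.0
    for i, match in enumerate(agreement(a, b)):
        num = p_same[i] if match else 1 - p_same[i]
        den = p_diff[i] if match else 1 - p_diff[i]
        score += math.log(num / den)
    return score

def cluster(cookies, model, threshold=0.0):
    """Link every pair whose log Bayes factor exceeds the threshold (union-find)."""
    parent = list(range(len(cookies)))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, j in combinations(range(len(cookies)), 2):
        if log_bayes_factor(cookies[i], cookies[j], model) > threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(cookies)):
        groups.setdefault(find(i), []).append(i)
    return list(groups.values())
```

Under these assumptions, `len(cluster(cookies, model))` serves as an estimate of the number of unique devices, and each cluster is the unit to which targeted content would be addressed. The pairwise loop is quadratic in the number of cookies, so a real system would need some form of blocking or candidate pruning.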

    System and method for performing set operations with defined sketch accuracy distribution
    2.
    Type: Granted patent
    Status: In force

    Publication number: US08819038B1

    Publication date: 2014-08-26

    Application number: US14078301

    Filing date: 2013-11-12

    Applicant: Yahoo! Inc.

    Abstract: Techniques are provided for improving the speed and accuracy of analytics on big data using theta sketches, by converting fixed-size sketches to theta sketches, and by performing set operations on sketches. In a technique for performing a set operation, two sketches are analyzed to identify the maximum value of each sketch. The maximum values of the two sketches are compared. Based on the comparison, one or more values are removed from the sketch whose maximum value is greater. After the removal, a set operation (e.g., union, intersection, or difference) is performed based on the modified sketch and the unmodified sketch. A result of the set operation is a third sketch, which may be used to estimate a cardinality of the larger data sets that are represented by the two input sketches.
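
The set-operation procedure in the abstract can be illustrated roughly as follows. This is a minimal sketch under stated assumptions, not the patented implementation: each sketch is modeled as a pair `(theta, hashes)` where `theta` plays the role of the abstract's "maximum value" and `hashes` holds the retained hash values strictly below it. The helper names (`h`, `build_sketch`, `set_operation`, `estimate`), the SHA-256 based hash, and the estimator `len(hashes) / theta` are illustrative choices.

```python
import hashlib

def h(item):
    """Map an item to a pseudo-uniform float in [0, 1) using SHA-256."""
    digest = hashlib.sha256(str(item).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") / 2**64

def build_sketch(items, k):
    """Return (theta, hashes): up to k smallest distinct hashes, all below theta."""
    values = sorted({h(x) for x in items})
    if len(values) <= k:
        return 1.0, set(values)      # everything fits: the sketch is exact
    return values[k], set(values[:k])

def set_operation(a, b, op):
    """Combine two sketches with 'union', 'intersection', or 'difference'."""
    (theta_a, hashes_a), (theta_b, hashes_b) = a, b
    # Compare the two thresholds and trim only the sketch whose threshold is greater.
    if theta_a > theta_b:
        hashes_a = {v for v in hashes_a if v < theta_b}
    elif theta_b > theta_a:
        hashes_b = {v for v in hashes_b if v < theta_a}
    theta = min(theta_a, theta_b)
    combine = {
        "union": set.union,
        "intersection": set.intersection,
        "difference": set.difference,
    }[op]
    # The result is itself a sketch (the "third sketch" of the abstract).
    return theta, combine(hashes_a, hashes_b)

def estimate(sketch):
    """Estimate the cardinality of the data set the sketch represents."""
    theta, hashes = sketch
    return len(hashes) / theta

a = build_sketch(range(0, 1000), k=64)
b = build_sketch(range(500, 1500), k=64)
print(estimate(set_operation(a, b, "union")))
```

Trimming to the smaller threshold first means both inputs sample the same region of hash space, which is why the same estimator can be applied to the result of any of the three operations. Note that a union built this way may hold more than `k` values; a production sketch would typically re-trim it.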

    System and method for performing set operations with defined sketch accuracy distribution

    Publication number: US09043348B2

    Publication date: 2015-05-26

    Application number: US14448487

    Filing date: 2014-07-31

    Applicant: Yahoo! Inc.

    Abstract: Techniques are provided for improving the speed and accuracy of analytics on big data using theta sketches, by converting fixed-size sketches to theta sketches, and by performing set operations on sketches. In a technique for performing a set operation, two sketches are analyzed to identify the maximum value of each sketch. The maximum values of the two sketches are compared. Based on the comparison, one or more values are removed from the sketch whose maximum value is greater. After the removal, a set operation (e.g., union, intersection, or difference) is performed based on the modified sketch and the unmodified sketch. A result of the set operation is a third sketch, which may be used to estimate a cardinality of the larger data sets that are represented by the two input sketches.
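
The abstract also mentions converting fixed-size sketches to theta sketches without saying how. One plausible reading, shown here purely as an assumption, is that a fixed-size sketch stores the `k` smallest hash values seen so far, and that conversion promotes the largest stored value to the threshold `theta` while keeping the remaining values as entries. The function names and the `(theta, entries)` representation are hypothetical.

```python
def fixed_size_to_theta(min_values, k):
    """Convert a fixed-size sketch (up to k smallest hashes in [0, 1)) to (theta, entries).

    If the sketch holds fewer than k values it never overflowed, so it is
    exact: theta stays at 1.0 and every value is kept as an entry.
    Otherwise the largest value becomes theta and the rest are entries.
    """
    values = sorted(set(min_values))
    if len(values) < k:
        return 1.0, set(values)
    return values[-1], set(values[:-1])

def estimate(sketch):
    """Cardinality estimate: number of entries below theta, scaled by 1 / theta."""
    theta, entries = sketch
    return len(entries) / theta

# A full fixed-size sketch with k = 4: theta becomes 0.5, three entries remain.
full = fixed_size_to_theta([0.125, 0.25, 0.375, 0.5], k=4)
print(full, estimate(full))
```

For a full sketch this reproduces the familiar k-minimum-values estimate of `(k - 1)` divided by the k-th smallest hash, so the conversion under this reading changes the representation but not the estimate.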

    System and method for performing set operations with defined sketch accuracy distribution
    5.
    Type: Granted patent
    Status: In force

    Publication number: US09152691B2

    Publication date: 2015-10-06

    Application number: US14692477

    Filing date: 2015-04-21

    Applicant: Yahoo! Inc.

    Abstract: Techniques are provided for improving the speed and accuracy of analytics on big data using theta sketches, by converting fixed-size sketches to theta sketches, and by performing set operations on sketches. In a technique for performing a set operation, two sketches are analyzed to identify the maximum value of each sketch. The maximum values of the two sketches are compared. Based on the comparison, one or more values are removed from the sketch whose maximum value is greater. After the removal, a set operation (e.g., union, intersection, or difference) is performed based on the modified sketch and the unmodified sketch. A result of the set operation is a third sketch, which may be used to estimate a cardinality of the larger data sets that are represented by the two input sketches.

    SYSTEM AND METHOD FOR PERFORMING SET OPERATIONS WITH DEFINED SKETCH ACCURACY DISTRIBUTION
    6.
    Type: Patent application
    Status: In force

    Publication number: US20150100596A1

    Publication date: 2015-04-09

    Application number: US14448487

    Filing date: 2014-07-31

    Applicant: Yahoo! Inc.

    Abstract: Techniques are provided for improving the speed and accuracy of analytics on big data using theta sketches, by converting fixed-size sketches to theta sketches, and by performing set operations on sketches. In a technique for performing a set operation, two sketches are analyzed to identify the maximum value of each sketch. The maximum values of the two sketches are compared. Based on the comparison, one or more values are removed from the sketch whose maximum value is greater. After the removal, a set operation (e.g., union, intersection, or difference) is performed based on the modified sketch and the unmodified sketch. A result of the set operation is a third sketch, which may be used to estimate a cardinality of the larger data sets that are represented by the two input sketches.

    CLUSTERING COOKIES FOR IDENTIFYING UNIQUE MOBILE DEVICES
    8.
    Type: Patent application
    Status: In force

    Publication number: US20130159227A1

    Publication date: 2013-06-20

    Application number: US13767695

    Filing date: 2013-02-14

    Applicant: Yahoo! Inc.

    CPC classification number: G06N99/005 G06N7/005

    Abstract: Embodiments are directed towards clustering cookies for identifying unique mobile devices for associating activities over a network with a given mobile device. The cookies are clustered based on a Bayes Factor similarity model that is trained from cookie features of known mobile devices. The clusters may be used to determine the number of unique mobile devices that access a website. The clusters may also be used to provide targeted content to each unique mobile device.

    SYSTEM AND METHOD FOR PERFORMING SET OPERATIONS WITH DEFINED SKETCH ACCURACY DISTRIBUTION

    Publication number: US20150227608A1

    Publication date: 2015-08-13

    Application number: US14692477

    Filing date: 2015-04-21

    Applicant: Yahoo! Inc.

    Abstract: Techniques are provided for improving the speed and accuracy of analytics on big data using theta sketches, by converting fixed-size sketches to theta sketches, and by performing set operations on sketches. In a technique for performing a set operation, two sketches are analyzed to identify the maximum value of each sketch. The maximum values of the two sketches are compared. Based on the comparison, one or more values are removed from the sketch whose maximum value is greater. After the removal, a set operation (e.g., union, intersection, or difference) is performed based on the modified sketch and the unmodified sketch. A result of the set operation is a third sketch, which may be used to estimate a cardinality of the larger data sets that are represented by the two input sketches.
