Determining a quality measure for a resource
    1.
    发明授权
    Determining a quality measure for a resource 有权
    确定资源的质量度量

    公开(公告)号:US09558233B1

    公开(公告)日:2017-01-31

    申请号:US13731354

    申请日:2012-12-31

    Applicant: Google Inc.

    CPC classification number: G06F17/30386 G06F17/30 G06F17/30864

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining a measure of quality for a resource. In one aspect, a method includes determining a seed score for each seed resource in a set. The seed score for a seed resource can be based on a number of resources that include a link to the seed resource and a number of selections of the links A set of source resources is identified. A source score is determined for each source resource. The source score for a source resource is based on the seed score for each seed resource linked to by the source resource. Source-referenced resources are identified. A resource score is determined for each source-referenced resource. The resource score for a source-referenced resource can be based on the source score for each source resource that includes a link to the source-referenced resource.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于确定资源的质量度量。 一方面,一种方法包括确定一组中每个种子资源的种子分数。 种子资源的种子分数可以基于包括到种子资源的链接的资源的数量以及链接的多个选择一组源资源被识别。 确定每个源资源的源分数。 源资源的源分数基于源资源链接的每个种子资源的种子分数。 源引用的资源被识别。 为每个源引用的资源确定资源分数。 源引用资源的资源分数可以基于每个源资源的源分数,其中包括到源引用资源的链接。

    SYSTEMS AND METHODS FOR RE-RANKING RANKED SEARCH RESULTS
    2.
    发明申请
    SYSTEMS AND METHODS FOR RE-RANKING RANKED SEARCH RESULTS 审中-公开
    重新排列排名搜索结果的系统和方法

    公开(公告)号:US20150169584A1

    公开(公告)日:2015-06-18

    申请号:US14401828

    申请日:2013-05-17

    Applicant: GOOGLE INC.

    CPC classification number: G06F16/24578 G06F16/2228 G06F16/248 G06F16/951

    Abstract: A system, computer-readable storage medium storing at least one program, and a computer-implemented method for re-ranking ranked search results is presented. Ranked search results satisfying a search query are obtained, where the ranked search results include a first search result corresponding to a first document associated with a first entity and a second search result corresponding to a second document associated with a second entity, and where the first search result is ranked higher than the second search result. The first document and the second document are determined to satisfy a similarity criterion. The second entity is determined to satisfy a predefined authorship differential with respect to the first entity. Responsive to determining that the second entity satisfies the predefined authorship differential with respect to the first entity, the second search result and the first search result in the ranked search results are swapped to produce re-ranked search results.

    Abstract translation: 提出了存储至少一个程序的系统,计算机可读存储介质和用于重新排列排名的搜索结果的计算机实现的方法。 获得满足搜索查询的排名搜索结果,其中排名的搜索结果包括对应于与第一实体相关联的第一文档的第一搜索结果和对应于与第二实体相关联的第二文档的第二搜索结果,并且其中第一 搜索结果排名高于第二搜索结果。 确定第一文件和第二文件以满足相似性标准。 确定第二实体以满足关于第一实体的预定义的作者差异。 响应于确定第二实体满足关于第一实体的预定义作者差异,排列的搜索结果中的第二搜索结果和第一搜索结果被交换以产生重新排序的搜索结果。

    Detecting content scraping
    3.
    发明授权
    Detecting content scraping 有权
    检测内容刮

    公开(公告)号:US08909628B1

    公开(公告)日:2014-12-09

    申请号:US13668106

    申请日:2012-11-02

    Applicant: Google Inc.

    CPC classification number: G06F17/30864 G06Q30/0201

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a plurality of n-grams in a plurality of resources found in a particular site; determining, for each of the plurality of resources, a count of n-grams that originated in the resource; determining, based on counts of n-grams that originated in the resources, a first aggregate count of n-grams that originated in the particular site; determining a second aggregate count of the plurality of n-grams that were identified in the plurality of resources found in the particular site; and determining, based on the first and second aggregate counts, a site originality score for the particular site.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于识别在特定站点中发现的多个资源中的多个n克; 为所述多个资源中的每一个确定源自所述资源的n克的计数; 根据源自资源的n-gram的计数确定起源于该特定地点的n克的第一个总计数; 确定在所述特定站点中发现的所述多个资源中识别的所述多个n-gram的第二聚合计数; 以及基于所述第一和第二聚合计数确定所述特定站点的站点原创性得分。

Patent Agency Ranking