Index server architecture using tiered and sharded phrase posting lists
    21.
    发明授权
    Index server architecture using tiered and sharded phrase posting lists 有权
    索引服务器架构使用分层和分层的短语发布列表

    公开(公告)号:US08943067B1

    公开(公告)日:2015-01-27

    申请号:US13842731

    申请日:2013-03-15

    Applicant: Google Inc.

    Abstract: An information retrieval system uses phrases to index, retrieve, organize and describe documents. Phrases are extracted from the document collection. Documents are the indexed according to their included phrases, using phrase posting lists. The phrase posting lists are stored in an cluster of index servers. The phrase posting lists can be tiered into groups, and sharded into partitions. Phrases in a query are identified based on possible phrasifications. A query schedule based on the phrases is created from the phrases, and then optimized to reduce query processing and communication costs. The execution of the query schedule is managed to further reduce or eliminate query processing operations at various ones of the index servers.

    Abstract translation: 信息检索系统使用短语来索引,检索,组织和描述文档。 短语从文档集中提取。 文件根据所包含的短语索引,使用短语发布列表。 短语发布列表存储在索引服务器的集群中。 短语列表可以分组成分组,并分成分区。 查询中的短语是根据可能的短语来确定的。 从短语中创建基于短语的查询调度,然后进行优化,以减少查询处理和通信成本。 管理查询调度的执行以进一步减少或消除索引服务器中的各个查询处理操作。

    Customizing content in a social stream

    公开(公告)号:US09762629B1

    公开(公告)日:2017-09-12

    申请号:US13693680

    申请日:2012-12-04

    Applicant: Google Inc.

    CPC classification number: H04L65/40 G06Q50/01

    Abstract: The disclosure includes a system and method for providing a customized stream of content to a user. The system includes: an item sourcer for gathering one or more content items from one or more content sources; a behavior indicator module and scorer for determining one or more behavior scores for the one or more content items; a content indicator module and scorer for determining one or more content scores for the one or more content items; a score combiner for aggregating the one or more behavior scores and the one or more content scores to generate one or more item scores for the one or more content items; a content diversifier for determining one or more diverse items from the one or more content items; and a stream generator for generating a customized stream of content for the user from the one or more diverse items.

    Pruning of Blob Replicas
    23.
    发明申请
    Pruning of Blob Replicas 有权
    修剪Blob副本

    公开(公告)号:US20140304240A1

    公开(公告)日:2014-10-09

    申请号:US14293966

    申请日:2014-06-02

    Applicant: Google Inc.

    Abstract: A method allocates object replicas in a distributed storage system. The method identifies a plurality of objects in the distributed storage system. Each object has an associated storage policy that specifies a target number of object replicas stored at distinct instances of the distributed storage system. The method identifies an object of the plurality of objects whose number of object replicas exceeds the target number of object replicas specified by the storage policy associated with the object. The method selects a first replica of the object for removal based on last access times for replicas of the object, and transmits a request to a first instance of the distributed storage system that stores the first replica. The request instructs the first instance to remove the first replica of the object.

    Abstract translation: 一种方法在分布式存储系统中分配对象副本。 该方法识别分布式存储系统中的多个对象。 每个对象具有关联的存储策略,其指定存储在分布式存储系统的不同实例处的对象副本的目标数量。 该方法识别多个对象的对象,其对象副本的数量超过与对象相关联的存储策略指定的对象副本的目标数量。 该方法基于对象的副本的最后访问时间选择要删除的对象的第一副本,并将请求发送到存储第一副本的分布式存储系统的第一实例。 请求指示第一个实例删除对象的第一个副本。

Patent Agency Ranking