Disambiguation of Entities
    1.
    发明申请
    Disambiguation of Entities 有权
    实体的歧义

    公开(公告)号:US20120096033A1

    公开(公告)日:2012-04-19

    申请号:US12904300

    申请日:2010-10-14

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864 G06F17/30672

    摘要: Methods, systems, algorithms, and media are provided for disambiguating entities present in a received search query. Lists of categories from semi-structured data from external sites as well as internal sources are used to detect if ambiguity exists in an entity within the search query. Multiple senses or categories of the ambiguous entity are determined by ascertaining the primary intent of an entity extracted from a main term of a document. The probability of each sense is calculated by computing a total amount of traffic received for each of the senses of the ambiguous entity. The sense with the highest amount of computed traffic is the most probable determined sense.

    摘要翻译: 提供了方法,系统,算法和媒体,用于消除存在于接收到的搜索查询中的实体。 来自外部站点的半结构化数据以及内部源的类别列表用于检测搜索查询中的实体是否存在歧义。 通过确定从文档的主要术语中提取的实体的主要意图来确定模糊实体的多个意义或类别。 通过计算对于模糊实体的每个感觉接收到的业务的总量来计算每个感觉的概率。 具有最高计算量的交通量的意义是最可能确定的意义。

    Disambiguation of entities
    3.
    发明授权
    Disambiguation of entities 有权
    消除实体的歧义

    公开(公告)号:US09418155B2

    公开(公告)日:2016-08-16

    申请号:US12904300

    申请日:2010-10-14

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864 G06F17/30672

    摘要: Methods, systems, algorithms, and media are provided for disambiguating entities present in a received search query. Lists of categories from semi-structured data from external sites as well as internal sources are used to detect if ambiguity exists in an entity within the search query. Multiple senses or categories of the ambiguous entity are determined by ascertaining the primary intent of an entity extracted from a main term of a document. The probability of each sense is calculated by computing a total amount of traffic received for each of the senses of the ambiguous entity. The sense with the highest amount of computed traffic is the most probable determined sense.

    摘要翻译: 提供了方法,系统,算法和媒体,用于消除存在于接收到的搜索查询中的实体。 来自外部站点的半结构化数据以及内部源的类别列表用于检测搜索查询中的实体是否存在歧义。 通过确定从文档的主要术语提取的实体的主要意图来确定模糊实体的多个感官或类别。 通过计算对于模糊实体的每个感觉接收到的业务的总量来计算每个感觉的概率。 具有最高计算量的交通量的意义是最可能确定的意义。