Classifying text into hierarchical categories
    1.
    发明授权
    Classifying text into hierarchical categories 有权
    将文本分类为分层类别

    公开(公告)号:US08725732B1

    公开(公告)日:2014-05-13

    申请号:US13426974

    申请日:2012-03-22

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30707

    摘要: Systems, methods and program products for classifying text. A system classifies text into first subject matter categories. The system identifies one or more second subject matter categories in a collection of second subject matter categories, each of the second categories is a hierarchical classification of a collection of confirmed valid search results for queries, in which at least one query for each identified second category includes a term in the text. The system filters the identified categories by excluding identified categories whose ancestors are not among the first categories. The system selects categories from the filtered categories based on one or more thresholds in which a threshold specifies a degree of relatedness between a selected category and the text. The selected categories are a sufficient basis for recommending content to a user, the content being associated with one or more of the selected categories.

    摘要翻译: 用于分类文本的系统,方法和程序产品。 系统将文本分类为第一主题类别。 系统识别第二主题类别的集合中的一个或多个第二主题类别,第二类别中的每一个是用于查询的确认的有效搜索结果的集合的分级分类,其中针对每个识别的第二类别进行至少一个查询 在文本中包含一个术语。 系统通过排除其祖先不在第一类别中的已识别类别来过滤所识别的类别。 该系统基于一个或多个阈值来选择来自过滤的类别的类别,其中阈值指定所选类别和文本之间的相关程度。 所选择的类别是向用户推荐内容的充分基础,内容与所选择的一个或多个类别相关联。

    Anticipated query generation and processing in a search engine
    2.
    发明授权
    Anticipated query generation and processing in a search engine 有权
    搜索引擎中预期的查询生成和处理

    公开(公告)号:US08156109B2

    公开(公告)日:2012-04-10

    申请号:US12916330

    申请日:2010-10-29

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30646 G06F17/3064

    摘要: A search system monitors the input of a search query by a user. Before the user finishes entering the search query, the search system identifies and sends a portion of the query as a partial query to the search engine. Based on the partial query, the search engine creates a set of predicted queries. This process may take into account prior queries submitted by a community of users, and may take into account a user profile. The predicted queries are be sent back to the user for possible selection. The search system may also cache search results corresponding to one or more of the predicted queries in anticipation of the user selecting one of the predicted queries. The search engine may also return at least a portion of the search results corresponding to one or more of the predicted queries.

    摘要翻译: 搜索系统监视用户对搜索查询的输入。 在用户完成输入搜索查询之前,搜索系统将查询的一部分作为部分查询识别并发送到搜索引擎。 基于部分查询,搜索引擎创建一组预测查询。 此过程可能会考虑用户社区提交的先前查询,并可能会考虑用户个人资料。 预测的查询将被发送回用户以进行可能的选择。 搜索系统还可以缓存对应于一个或多个预测查询的搜索结果,预期用户选择预测查询之一。 搜索引擎还可以返回对应于一个或多个预测查询的搜索结果的至少一部分。

    Customization of search results for search queries received from third party sites
    3.
    发明授权
    Customization of search results for search queries received from third party sites 有权
    自定义从第三方网站收到的搜索查询的搜索结果

    公开(公告)号:US07565630B1

    公开(公告)日:2009-07-21

    申请号:US10869492

    申请日:2004-06-15

    IPC分类号: G06F7/00 G06F17/30

    摘要: A third party website provides a search interface to a general search engine. A site profile of the third party website describes various topics, keywords, or domains that are potentially relevant or of interest to users who access the third party website. The topics are associated with a topical directory, with domains associated with each topic; the domains in a given topic are given various weightings. When a search is submitted to the general search engine from the third party website via the search interface, the general search engine uses the site profile to customize the search results. The search results are customized by weighting the ranking of documents from websites associated with the topics in the site profile. The site profile can be manually or automatically constructed.

    摘要翻译: 第三方网站向一般搜索​​引擎提供搜索界面。 第三方网站的网站简介描述了访问第三方网站的用户可能相关或感兴趣的各种主题,关键字或域。 主题与主题目录相关联,域与每个主题相关联; 给定主题中的域被赋予各种权重。 当通过搜索界面从第三方网站向普通搜索引擎提交搜索时,一般搜索引擎使用站点配置文件自定义搜索结果。 搜索结果通过对站点配置文件中与主题相关的网站的文档排名进行加权来定制。 站点配置文件可以手动或自动构建。

    Predicted query generation from partial search query input
    4.
    发明授权
    Predicted query generation from partial search query input 有权
    部分搜索查询输入的预测查询生成

    公开(公告)号:US09245004B1

    公开(公告)日:2016-01-26

    申请号:US13402840

    申请日:2012-02-22

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30646 G06F17/3064

    摘要: A search system monitors the input of a search query by a user. Before the user finishes entering the search query, the search system identifies and sends a portion of the query as a partial query to the search engine. Based on the partial query, the search engine creates a set of predicted queries. This process may take into account prior queries submitted by a community of users, and may take into account a user profile. The predicted queries are to be sent back to the user for possible selection. The search system may also cache search results corresponding to one or more of the predicted queries in anticipation of the user selecting one of the predicted queries. The search engine may also return at least a portion of the search results corresponding to one or more of the predicted queries.

    摘要翻译: 搜索系统监视用户对搜索查询的输入。 在用户完成输入搜索查询之前,搜索系统将查询的一部分作为部分查询识别并发送到搜索引擎。 基于部分查询,搜索引擎创建一组预测查询。 此过程可能会考虑用户社区提交的先前查询,并可能会考虑用户个人资料。 预测的查询将被发送回用户以进行可能的选择。 搜索系统还可以缓存对应于一个或多个预测查询的搜索结果,预期用户选择预测查询之一。 搜索引擎还可以返回对应于一个或多个预测查询的搜索结果的至少一部分。

    Displaying autocompletion of partial search query with predicted search results
    5.
    发明授权
    Displaying autocompletion of partial search query with predicted search results 有权
    显示具有预测搜索结果的部分搜索查询的自动完成

    公开(公告)号:US08515954B2

    公开(公告)日:2013-08-20

    申请号:US13218416

    申请日:2011-08-25

    IPC分类号: G06F17/30

    摘要: A set of ordered predicted completion strings are presented to a user as the user enters text in a text entry box (e.g., a browser or a toolbar). The predicted completion strings can be in the form of URLs or query strings. The ordering may be based on any number of factors (e.g., a query's frequency of submission from a community of users). URLs can be ranked based on an importance value of the URL. Privacy is taken into account in a number of ways, such as using a previously submitted query only when more than a certain number of unique requestors have made the query. The sets of ordered predicted completion strings is obtained by matching a fingerprint value of the user's entry string to a fingerprint to table map which contains the set of ordered predicted completion strings.

    摘要翻译: 当用户在文本输入框(例如,浏览器或工具栏)中输入文本时,将一组有序的预测完成字符串呈现给用户。 预测的完成字符串可以是URL或查询字符串的形式。 排序可以基于任何数量的因素(例如,查询从用户社区提交的频率)。 URL可以根据URL的重要性值进行排名。 通过多种方式考虑隐私,例如仅当超过一定数量的唯一请求者进行查询时才使用以前提交的查询。 通过将用户的条目串的指纹值与包含一组有序预测完成字符串的指纹到表格映射相匹配来获得有序预测完成字符串的集合。

    Method for Detecting Link Spam in Hyperlinked Databases
    6.
    发明申请
    Method for Detecting Link Spam in Hyperlinked Databases 有权
    超链接数据库中链接垃圾邮件检测方法

    公开(公告)号:US20110270890A1

    公开(公告)日:2011-11-03

    申请号:US13149806

    申请日:2011-05-31

    IPC分类号: G06F17/30

    摘要: A computer-implemented method identifies nodes that are beneficiaries of node importance inflating links in a directed graph of linked nodes. The directed graph of linked nodes corresponds to a linked database, and the nodes correspond to documents within the linked database. The method is performed by a computer system including one or more processors and memory storing one or more programs, the one or more processors executing the one or more programs to perform the method. The method includes computing, for each of at least a subset of the nodes in the directed graph, a respective quantity corresponding to a mathematical derivative of a node importance function, and performing a remedial action on a respective node in the directed graph in accordance with the respective computed quantity computed for the respective node.

    摘要翻译: 计算机实现的方法识别在链接节点的有向图中节点重要性充气链路的受益者的节点。 链接节点的有向图对应于链接的数据库,并且节点对应于链接数据库内的文档。 该方法由包括一个或多个处理器的计算机系统和存储一个或多个程序的存储器执行,所述一个或多个处理器执行一个或多个程序以执行该方法。 该方法包括针对有向图中的节点的至少一个子集中的每一个计算对应于节点重要性函数的数学导数的相应数量,并根据有向图对相关节点执行补救动作 针对相应节点计算的相应计算量。

    Anticipated Query Generation and Processing in a Search Engine
    7.
    发明申请
    Anticipated Query Generation and Processing in a Search Engine 有权
    搜索引擎中预期的查询生成和处理

    公开(公告)号:US20110047120A1

    公开(公告)日:2011-02-24

    申请号:US12916330

    申请日:2010-10-29

    IPC分类号: G06N5/02

    CPC分类号: G06F17/30646 G06F17/3064

    摘要: A search system monitors the input of a search query by a user. Before the user finishes entering the search query, the search system identifies and sends a portion of the query as a partial query to the search engine. Based on the partial query, the search engine creates a set of predicted queries. This process may take into account prior queries submitted by a community of users, and may take into account a user profile. The predicted queries are be sent back to the user for possible selection. The search system may also cache search results corresponding to one or more of the predicted queries in anticipation of the user selecting one of the predicted queries. The search engine may also return at least a portion of the search results corresponding to one or more of the predicted queries.

    摘要翻译: 搜索系统监视用户对搜索查询的输入。 在用户完成输入搜索查询之前,搜索系统将查询的一部分作为部分查询识别并发送到搜索引擎。 基于部分查询,搜索引擎创建一组预测查询。 此过程可能会考虑用户社区提交的先前查询,并可能会考虑用户个人资料。 预测的查询将被发送回用户以进行可能的选择。 搜索系统还可以缓存对应于一个或多个预测查询的搜索结果,预期用户选择预测查询之一。 搜索引擎还可以返回对应于一个或多个预测查询的搜索结果的至少一部分。

    Variable Personalization of Search Results in a Search Engine
    8.
    发明申请
    Variable Personalization of Search Results in a Search Engine 有权
    搜索引擎中搜索结果的变量个性化

    公开(公告)号:US20100169297A1

    公开(公告)日:2010-07-01

    申请号:US12720479

    申请日:2010-03-09

    IPC分类号: G06F17/30 G06F3/048

    摘要: A search engine provides personalized rankings of search results. A user interest profile identifies topics of interest to a user. Each topic is associated with one or more sites, and a boost value, which can be used to augment an information retrieval score of any document from the site. Search results from any search are provided to the user, with a variable control of the ranking of the results. The results can be ranked by their unboosted information retrieval score, thus reflecting no personalization, or by their fully or partially boosted information retrieval scores. This allows the user to selectively control how their interests affect the ranking of the documents.

    摘要翻译: 搜索引擎提供搜索结果的个性化排名。 用户兴趣简档识别用户感兴趣的主题。 每个主题都与一个或多个站点相关联,并且可以使用增强值来增加站点中任何文档的信息检索分数。 通过对结果排名的可变控制,向用户提供来自任何搜索的搜索结果。 结果可以通过其未启动的信息检索分数进行排名,从而反映出没有个性化,或者完全或部分提升的信息检索分数。 这允许用户选择性地控制他们的兴​​趣如何影响文档的排名。

    Method and System for Autocompletion Using Ranked Results
    9.
    发明申请
    Method and System for Autocompletion Using Ranked Results 有权
    使用排名结果进行自动完成的方法和系统

    公开(公告)号:US20090119289A1

    公开(公告)日:2009-05-07

    申请号:US12345564

    申请日:2008-12-29

    IPC分类号: G06F7/06 G06F17/30

    摘要: A set of ordered predicted completion strings are presented to a user as the user enters text in a text entry box (e.g., a browser or a toolbar). The predicted completion strings can be in the form of URLs or query strings. The ordering may be based on any number of factors (e.g., a query's frequency of submission from a community of users). URLs can be ranked based on an importance value of the URL. Privacy is taken into account in a number of ways, such as using a previously submitted query only when more than a certain number of unique requesters have made the query. The sets of ordered predicted completion strings is obtained by matching a fingerprint value of the user's entry string to a fingerprint to table map which contains the set of ordered predicted completion strings.

    摘要翻译: 当用户在文本输入框(例如,浏览器或工具栏)中输入文本时,将一组有序的预测完成字符串呈现给用户。 预测的完成字符串可以是URL或查询字符串的形式。 排序可以基于任何数量的因素(例如,查询从用户社区提交的频率)。 URL可以根据URL的重要性值进行排名。 通过多种方式考虑隐私,例如仅当超过一定数量的唯一请求者进行查询时才使用以前提交的查询。 通过将用户的条目串的指纹值与包含一组有序预测完成字符串的指纹到表格映射相匹配来获得有序预测完成字符串的集合。

    Variable Personalization of Search Results in a Search Engine
    10.
    发明申请
    Variable Personalization of Search Results in a Search Engine 有权
    搜索引擎中搜索结果的变量个性化

    公开(公告)号:US20130103683A1

    公开(公告)日:2013-04-25

    申请号:US13620611

    申请日:2012-09-14

    IPC分类号: G06F17/30

    摘要: A search engine provides personalized rankings of search results. A user interest profile identifies topics of interest to a user. Each topic is associated with one or more sites, and a boost value, which can be used to augment an information retrieval score of any document from the site. Search results from any search are provided to the user, with a variable control of the ranking of the results. The results can be ranked by their unboosted information retrieval score, thus reflecting no personalization, or by their fully or partially boosted information retrieval scores. This allows the user to selectively control how their interests affect the ranking of the documents.

    摘要翻译: 搜索引擎提供搜索结果的个性化排名。 用户兴趣简档识别用户感兴趣的主题。 每个主题都与一个或多个站点相关联,并且可以使用增强值来增加站点中任何文档的信息检索分数。 通过对结果排名的可变控制,向用户提供来自任何搜索的搜索结果。 结果可以通过其未启动的信息检索分数进行排名,从而反映出没有个性化,或者完全或部分提升的信息检索分数。 这允许用户选择性地控制他们的兴​​趣如何影响文档的排名。