Search results ranking using editing distance and document information
    1.
    发明授权
    Search results ranking using editing distance and document information 有权
    使用编辑距离和文档信息搜索结果排名

    公开(公告)号:US08812493B2

    公开(公告)日:2014-08-19

    申请号:US12101951

    申请日:2008-04-11

    CPC classification number: G06F17/2211 G06F17/30864

    Abstract: Architecture for extracting document information from documents received as search results based on a query string, and computing an edit distance between the data string and the query string. The edit distance is employed in determining relevance of the document as part of result ranking by detecting near-matches of a whole query or part of the query. The edit distance evaluates how close the query string is to a given data stream that includes document information such as TAUC (title, anchor text, URL, clicks) information, etc. The architecture includes the index-time splitting of compound terms in the URL to allow the more effective discovery of query terms. Additionally, index-time filtering of anchor text is utilized to find the top N anchors of one or more of the document results. The TAUC information can be input to a neural network (e.g., 2-layer) to improve relevance metrics for ranking the search results.

    Abstract translation: 用于基于查询字符串从作为搜索结果接收的文档提取文档信息的结构,以及计算数据串和查询字符串之间的编辑距离。 编辑距离用于通过检测整个查询或部分查询的近似匹配来确定文档作为结果排名的一部分的相关性。 编辑距离评估查询字符串与包含诸如TAUC(标题,锚文本,URL,点击)信息等文档信息的给定数据流的距离。该体系结构包括索引时间分割URL中的复合术语 以便更有效地发现查询条款。 另外,使用锚文本的索引时间过滤来查找一个或多个文档结果的前N个锚点。 可以将TAUC信息输入到神经网络(例如,2层),以改进用于对搜索结果排序的相关性度量。

    Name search using a ranking function
    2.
    发明授权
    Name search using a ranking function 有权
    使用排序功能命名搜索

    公开(公告)号:US08645417B2

    公开(公告)日:2014-02-04

    申请号:US12141082

    申请日:2008-06-18

    CPC classification number: G06F17/30657 G06F17/30864

    Abstract: An approach is described for performing a name search using a name search operation and a ranking operation. The name search operation may take text as input and apply a fuzzy matching operation and a lookup operation to generate a collection of candidate names with respective probability scores. In other cases, speech or handwriting recognition may generate the collection of candidate names and probability scores. The ranking operation may then rank these candidate names using a ranking function. The ranking function may rank the candidate names based on the probability scores associated with the names and at least one other factor. One such factor may reflect whether information provided by a user matches profile information associated with a candidate name under consideration. Another factor may reflect an extent of a nexus between the user and a person associated with the candidate name. Other types of factors can be used.

    Abstract translation: 描述了使用名称搜索操作和排序操作执行姓名搜索的方法。 名称搜索操作可以将文本作为输入并应用模糊匹配操作和查找操作以生成具有相应概率得分的候选名称的集合。 在其他情况下,语音或手写识别可能产生候选名称和概率分数的集合。 然后,排序操作可以使用排序函数对这些候选名称进行排名。 排名函数可以基于与名称和至少一个其他因素相关联的概率分数对候选名称进行排名。 一个这样的因素可以反映用户提供的信息是否匹配与考虑的候选名称相关联的简档信息。 另一个因素可能反映了用户与与候选人名称相关联的人之间的关联程度。 可以使用其他类型的因素。

    USER PIPELINE CONFIGURATION FOR RULE-BASED QUERY TRANSFORMATION, GENERATION AND RESULT DISPLAY
    3.
    发明申请
    USER PIPELINE CONFIGURATION FOR RULE-BASED QUERY TRANSFORMATION, GENERATION AND RESULT DISPLAY 有权
    用户管道配置,用于基于规则的查询转换,生成和结果显示

    公开(公告)号:US20130110860A1

    公开(公告)日:2013-05-02

    申请号:US13287717

    申请日:2011-11-02

    CPC classification number: G06F17/30448

    Abstract: A query pipeline for an enterprise search system is configurable by a user of the system. A user may create rules for custom query transformation and parallel query generation, federation of queries, mixing of results and application of display layouts to the received search results. A user interface (UI) assists a user in configuring the search pipeline. For example, a user may enter condition action rules for queries that affect how a query is transformed, how parallel queries are generated, how queries are federated, how search results are ranked and displayed, how rules are ordered and the like.

    Abstract translation: 用于企业搜索系统的查询流水线可由系统的用户配置。 用户可以创建用于自定义查询转换和并行查询生成,查询联合,结果混合和显示布局应用于接收到的搜索结果的规则。 用户界面(UI)帮助用户配置搜索管道。 例如,用户可以为影响查询如何转换的查询,如何并行查询生成,查询如何联合,查询结果如何排序和显示,规则如何排序等输入条件操作规则。

    DISCOVERING EXPERTISE USING DOCUMENT METADATA IN PART TO RANK AUTHORS
    4.
    发明申请
    DISCOVERING EXPERTISE USING DOCUMENT METADATA IN PART TO RANK AUTHORS 有权
    发现使用文件元数据的部分作者

    公开(公告)号:US20120310928A1

    公开(公告)日:2012-12-06

    申请号:US13150710

    申请日:2011-06-01

    CPC classification number: G06F17/30979

    Abstract: Expertise mining features are provided based in part on the use of an expertise mining algorithm and expertise mining queries. A method of an embodiment operates to provide an expanded feedback query based in part on search results using an expertise mining query and a number of author-ranking heuristics used to rank authors and/or co-authors (e.g., primary authors, secondary authors, etc.) as part of an expertise mining operation. A search system of an embodiment includes an author ranker component to rank authors based in part on an expertise mining query and author-ranking heuristics, and a query expander component to provide expanded queries as part of identifying relevant search results. Other embodiments are also disclosed.

    Abstract translation: 专业挖掘功能部分基于专业挖掘算法和专业挖掘查询的使用而提供。 实施例的方法用于使用专业知识挖掘查询和用于对作者和/或共同作者进行排名的多个作者排名启发法(例如,主要作者,次要作者, 等等)作为专业挖掘操作的一部分。 实施例的搜索系统包括作者角色组件,其部分地基于专业挖掘查询和作者排名启发式排序作者,以及查询扩展器组件,用于提供扩展查询作为标识相关搜索结果的一部分。 还公开了其他实施例。

    Techniques to perform relative ranking for search results
    5.
    发明授权
    Techniques to perform relative ranking for search results 有权
    执行搜索结果相对排名的技术

    公开(公告)号:US08266144B2

    公开(公告)日:2012-09-11

    申请号:US13175043

    申请日:2011-07-01

    CPC classification number: G06F17/3053

    Abstract: Techniques to perform relative ranking for search results are described. An apparatus may include an enhanced search component operative to receive a search query and provide ranked search results responsive to the search query. The enhanced search component may comprise a resource search module operative to search for resources using multiple search terms from the search query, and output a set of resources having some or all of the search terms. The enhanced search component may also comprise a proximity generation module communicatively coupled to the resource search module, the proximity generation module operative to receive the set of resources, retrieve search term position information for each resource, and generate a proximity feature value based on the search term position information. The enhanced search component may further comprise a resource ranking module communicatively coupled to the resource search module and the proximity generation module, the resource ranking module to receive the proximity feature values, and rank the resources based in part on the proximity feature values. Other embodiments are described and claimed.

    Abstract translation: 描述了对搜索结果执行相对排名的技术。 装置可以包括增强的搜索组件,其操作以接收搜索查询并且响应于搜索查询提供排名的搜索结果。 增强搜索组件可以包括资源搜索模块,其可操作以使用来自搜索查询的多个搜索项来搜索资源,并且输出具有部分或全部搜索项的一组资源。 增强搜索组件还可以包括通信地耦合到资源搜索模块的邻近生成模块,用于接收资源集合的邻近生成模块,检索每个资源的搜索项位置信息,以及基于搜索生成接近特征值 期限位置信息。 增强搜索组件还可以包括资源排序模块,其通信地耦合到资源搜索模块和邻近生成模块,用于接收邻近特征值的资源排名模块,以及部分地基于邻近特征值对资源进行排名。 描述和要求保护其他实施例。

    Relevant individual searching using managed property and ranking features
    6.
    发明授权
    Relevant individual searching using managed property and ranking features 有权
    使用管理财产和排名特征的相关个人搜索

    公开(公告)号:US08224847B2

    公开(公告)日:2012-07-17

    申请号:US12608181

    申请日:2009-10-29

    CPC classification number: G06F17/30699

    Abstract: Embodiments are configured to provide information relevant to individuals of interest to a searching user. In an embodiment, a method includes identifying relevant individuals of a network using a relevance model that includes the use of a number of managed properties and ranking features to identify relevant individuals of a defined network. The relevance model of one embodiment is defined by a schema that includes a textual matching ranking feature, social distance ranking feature, a levels to top ranking feature, and a proximity ranking feature.

    Abstract translation: 实施例被配置为向搜索用户提供与感兴趣的个人相关的信息。 在一个实施例中,一种方法包括使用包括使用多个管理属性和排序特征来识别所定义的网络的相关个体的相关性模型来识别网络的相关个体。 一个实施例的相关性模型由包括文本匹配排名特征,社交距离排名特征,级别与顶级排名特征以及接近度排名特征的模式来定义。

    RANKING FUNCTIONS USING DOCUMENT USAGE STATISTICS
    7.
    发明申请
    RANKING FUNCTIONS USING DOCUMENT USAGE STATISTICS 审中-公开
    使用文件使用统计的排名函数

    公开(公告)号:US20120041960A9

    公开(公告)日:2012-02-16

    申请号:US12359939

    申请日:2009-01-26

    Abstract: Methods of providing a document relevance score to a document on a network are disclosed. Computer readable medium having stored thereon computer-executable instructions for performing a method of providing a document relevance score to a document on a network are also disclosed. Further, computing systems containing at least one application module, wherein the at least one application module comprises application code for performing methods of providing a document relevance score to a document on a network are disclosed.

    Abstract translation: 公开了向网络上的文档提供文档相关性分数的方法。 还公开了其上存储有用于执行向网络上的文档提供文档相关性得分的方法的计算机可执行指令的计算机可读介质。 此外,公开了包含至少一个应用模块的计算系统,其中所述至少一个应用模块包括用于执行向网络上的文档提供文档相关性分数的方法的应用代码。

    Techniques to perform relative ranking for search results
    8.
    发明授权
    Techniques to perform relative ranking for search results 有权
    执行搜索结果相对排名的技术

    公开(公告)号:US07974974B2

    公开(公告)日:2011-07-05

    申请号:US12051847

    申请日:2008-03-20

    CPC classification number: G06F17/3053

    Abstract: Techniques to perform relative ranking for search results are described. An apparatus may include an enhanced search component operative to receive a search query and provide ranked search results responsive to the search query. The enhanced search component may comprise a resource search module operative to search for resources using multiple search terms from the search query, and output a set of resources having some or all of the search terms. The enhanced search component may also comprise a proximity generation module communicatively coupled to the resource search module, the proximity generation module operative to receive the set of resources, retrieve search term position information for each resource, and generate a proximity feature value based on the search term position information. The enhanced search component may further comprise a resource ranking module communicatively coupled to the resource search module and the proximity generation module, the resource ranking module to receive the proximity feature values, and rank the resources based in part on the proximity feature values. Other embodiments are described and claimed.

    Abstract translation: 描述了对搜索结果执行相对排名的技术。 装置可以包括增强的搜索组件,其操作以接收搜索查询并且响应于搜索查询提供排名的搜索结果。 增强搜索组件可以包括资源搜索模块,其可操作以使用来自搜索查询的多个搜索项来搜索资源,并且输出具有部分或全部搜索项的一组资源。 增强搜索组件还可以包括通信地耦合到资源搜索模块的邻近生成模块,用于接收资源集合的邻近生成模块,检索每个资源的搜索项位置信息,以及基于搜索生成接近特征值 期限位置信息。 增强搜索组件还可以包括资源排序模块,其通信地耦合到资源搜索模块和邻近生成模块,用于接收邻近特征值的资源排名模块,以及部分地基于邻近特征值对资源进行排名。 描述和要求保护其他实施例。

    CUSTOM RANKING MODEL SCHEMA
    9.
    发明申请
    CUSTOM RANKING MODEL SCHEMA 有权
    自定义排名模式

    公开(公告)号:US20110137893A1

    公开(公告)日:2011-06-09

    申请号:US12630981

    申请日:2009-12-04

    CPC classification number: G06F17/30675

    Abstract: A customizable ranking model of a search engine using custom ranking model configuration and parameters of a pre-defined human-readable format. The architecture can employ a markup language schema to represent the custom ranking model. In one implementation, the schema developed utilizes XML (extensible markup language) for representing the custom ranking model. Weights for dynamic and static relevance ingredients can be altered per ranking model and new relevance ingredients can be added. Additionally, features are provided for improving relevance such as adding terms to a thesaurus for synonym expansion, for example, the ability to deal with single terms either as compounds, and/or using custom word breaking rules.

    Abstract translation: 使用自定义排名模型配置和预定义的人类可读格式的参数的可定制的搜索引擎排名模型。 该架构可以采用标记语言模式来表示自定义排名模型。 在一个实现中,开发的模式利用XML(可扩展标记语言)来表示自定义排名模型。 动态和静态相关成分的重量可以根据排名模型更改,并可添加新的相关成分。 另外,提供了用于提高相关性的功能,例如将术语添加到同义词扩展的词库中,例如,将单个术语作为化合物处理的能力和/或使用自定义单词断开规则。

    Relevant Individual Searching Using Managed Property and Ranking Features
    10.
    发明申请
    Relevant Individual Searching Using Managed Property and Ranking Features 有权
    使用托管属性和排名特征的相关个人搜索

    公开(公告)号:US20110106850A1

    公开(公告)日:2011-05-05

    申请号:US12608181

    申请日:2009-10-29

    CPC classification number: G06F17/30699

    Abstract: Embodiments are configured to provide information relevant to individuals of interest to a searching user. In an embodiment, a method includes identifying relevant individuals of a network using a relevance model that includes the use of a number of managed properties and ranking features to identify relevant individuals of a defined network. The relevance model of one embodiment is defined by a schema that includes a textual matching ranking feature, social distance ranking feature, a levels to top ranking feature, and a proximity ranking feature.

    Abstract translation: 实施例被配置为向搜索用户提供与感兴趣的个人相关的信息。 在一个实施例中,一种方法包括使用包括使用多个管理属性和排序特征来识别所定义的网络的相关个体的相关性模型来识别网络的相关个体。 一个实施例的相关性模型由包括文本匹配排名特征,社交距离排名特征,级别与顶级排名特征以及接近度排名特征的模式来定义。

Patent Agency Ranking