Contextual query suggestion in result pages
    1.
    Type: granted invention patent
    Status: in force

    Publication No.: US08275759B2

    Publication date: 2012-09-25

    Application No.: US12391274

    Filing date: 2009-02-24

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30867

    Abstract: Described is a search technology in which a search engine constructs a results page for a query that integrates suggested queries with the individual query results (e.g., displayed URLs). When rendered, the proximity of the suggested queries to their corresponding individual query result provides context as to the specific URL to which the suggested query is related. Suggested queries may appear alongside their associated search result, e.g., a displayed URL, and/or in an expandable panel proximate that individual search result. Suggested queries may appear within text accompanying a URL, and/or in a drop-down menu following interaction with such text or the like. Related queries may be found by using a search result URL to find a query, by analyzing a search result's text snippet, by accessing historical data, and/or by accessing current user session data.
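
The abstract mentions finding related queries from a search result URL and from historical data. A minimal sketch of one way that could look, assuming a hypothetical click log of (query, clicked URL) pairs; the function name, the log format, and the example.com URLs are illustrative and not taken from the patent:

```python
from collections import Counter, defaultdict


def suggestions_per_result(current_query, result_urls, click_log, max_per_result=2):
    """Map each result URL to past queries that led to clicks on that URL."""
    # Count how often each historical query ended in a click on each URL.
    queries_by_url = defaultdict(Counter)
    for query, clicked_url in click_log:
        queries_by_url[clicked_url][query] += 1

    page = {}
    for url in result_urls:
        counts = queries_by_url[url]
        # Most frequent first, ties broken alphabetically; skip the current query.
        ranked = sorted(
            (q for q in counts if q != current_query),
            key=lambda q: (-counts[q], q),
        )
        page[url] = ranked[:max_per_result]
    return page


# Hypothetical historical data (assumption, for illustration only).
CLICK_LOG = [
    ("python tutorial", "https://example.com/python"),
    ("learn python", "https://example.com/python"),
    ("learn python", "https://example.com/python"),
    ("python snake", "https://example.com/snakes"),
    ("python", "https://example.com/python"),
]

PAGE = suggestions_per_result(
    "python",
    ["https://example.com/python", "https://example.com/snakes", "https://example.com/other"],
    CLICK_LOG,
)
```

Each URL in `PAGE` carries its own short list of suggestions, which a renderer could place next to that specific result to provide the per-result context the abstract describes.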

    Using language models to expand wildcards
    2.
    Type: granted invention patent
    Status: in force

    Publication No.: US07277029B2

    Publication date: 2007-10-02

    Application No.: US11159711

    Filing date: 2005-06-23

    IPC classification: H03M11/00

    CPC classification: G06F17/276

    Abstract: A method of inputting text is provided in which a first portion of an input string is received from a user, the first portion of the input string including at least one keystroke representing a wildcard character of the input string. A second portion of the input string is then received, with the second portion including one or more keystrokes all representing non-wildcard characters of the input string.
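
The abstract covers only how the input string is received; the title indicates that a language model is then used to expand the wildcard. A minimal sketch of that expansion step, assuming a hypothetical unigram model (a word-to-probability dictionary with made-up values) and `*` as the wildcard character:

```python
import re

# Hypothetical unigram language model: word -> probability (made-up values).
UNIGRAM_PROBS = {
    "cat": 0.05,
    "cut": 0.03,
    "coat": 0.02,
    "cart": 0.01,
    "cot": 0.005,
    "dog": 0.04,
}


def expand_wildcard(pattern, model, top_k=3):
    """Return the top_k most probable words in the model that match the pattern.

    '*' stands for any run of characters (including none); every other
    character must match literally.
    """
    regex = re.compile(
        "".join(".*" if ch == "*" else re.escape(ch) for ch in pattern)
    )
    matches = [word for word in model if regex.fullmatch(word)]
    # Highest probability first; ties broken alphabetically.
    matches.sort(key=lambda word: (-model[word], word))
    return matches[:top_k]


# First portion "c*" contains the wildcard, second portion "t" does not.
EXPANSIONS = expand_wildcard("c*t", UNIGRAM_PROBS)
```

A real system would likely score candidates in context (for example with an n-gram model) rather than by unigram probability alone; the unigram table is used here only to keep the sketch short.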

    Method for retrieving texts which are similar to a sample text
    3.
    Type: granted invention patent
    Status: no longer in force

    Publication No.: US5970484A

    Publication date: 1999-10-19

    Application No.: US654044

    Filing date: 1996-05-28

    IPC classification: G06F17/30 H04N1/00 H04N1/327

    Abstract: An information retrieval method wherein users may submit a query via a graphical bitmapping technique. The user provides an information retrieval system with a bitmap of a printed, written, or graphical query by either scanning the query with a graphical scanner, or employing a standard facsimile transmission machine. The information retrieval system then performs an optical image/character recognition process upon the received bitmap to determine the content of the query; information is then retrieved based upon the recognized characters and images. In a particular method of the invention, the user is provided with a bitmap of the retrieved information.

    Data compression method and apparatus
    4.
    Type: granted invention patent
    Status: in force

    Publication No.: US07720878B2

    Publication date: 2010-05-18

    Application No.: US11110554

    Filing date: 2005-04-20

    IPC classification: G06F17/30

    Abstract: An improved data compression method and apparatus is provided, particularly with regard to the compression of data in tabular form such as database records. The present invention achieves improved compression ratios by utilizing metadata to transform the data in a manner that optimizes known compression techniques. In one embodiment of the invention, a schema is generated which is utilized to reorder and partition the data into low entropy and high entropy portions which are separately compressed by conventional compression methods. The high entropy portion is further reordered and partitioned to take advantage of row and column dependencies in the data. The present invention enables not only much greater compression ratios but also greater speed than is achieved by compressing the untransformed data.
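
The partition-then-compress idea in the abstract can be illustrated roughly as follows. This is a sketch under stated assumptions, not the patented method: columns are classed as low or high entropy by a simple distinct-value threshold (`max_distinct`, a made-up heuristic), each group is stored column by column and compressed separately with `zlib`, and cell values are assumed to contain no tab or newline characters. The further reordering of the high entropy portion is not attempted.

```python
import zlib


def split_columns(rows, max_distinct=4):
    """Class columns as low or high entropy by counting distinct values (heuristic)."""
    n_cols = len(rows[0])
    low = [c for c in range(n_cols) if len({row[c] for row in rows}) <= max_distinct]
    high = [c for c in range(n_cols) if c not in low]
    return low, high


def pack_columns(rows, cols):
    """Serialize the chosen columns in column-major order and compress them."""
    text = "\n".join("\t".join(row[c] for row in rows) for c in cols)
    return zlib.compress(text.encode("utf-8"))


def unpack_columns(blob, cols):
    """Invert pack_columns: return {column index: list of cell values}."""
    if not cols:
        return {}
    lines = zlib.decompress(blob).decode("utf-8").split("\n")
    return {c: line.split("\t") for c, line in zip(cols, lines)}


def compress_table(rows):
    low, high = split_columns(rows)
    return {
        "low": (low, pack_columns(rows, low)),
        "high": (high, pack_columns(rows, high)),
    }


def decompress_table(packed):
    columns = {}
    for cols, blob in packed.values():
        columns.update(unpack_columns(blob, cols))
    n_rows = len(next(iter(columns.values())))
    return [tuple(columns[c][i] for c in sorted(columns)) for i in range(n_rows)]


# Hypothetical table: an ID column (many distinct values) and two categorical columns.
ROWS = [
    ("1001", "US", "active"),
    ("1002", "US", "active"),
    ("1003", "DE", "inactive"),
    ("1004", "US", "active"),
    ("1005", "DE", "active"),
    ("1006", "FR", "active"),
]

PACKED = compress_table(ROWS)
RESTORED = decompress_table(PACKED)
```

Grouping similar values together tends to help general-purpose compressors on larger tables, but the sketch only demonstrates the lossless round trip, not any particular compression ratio.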

    Contingency table estimation via sketches
    5.
    Type: granted invention patent
    Status: no longer in force

    Publication No.: US07536366B2

    Publication date: 2009-05-19

    Application No.: US11319992

    Filing date: 2005-12-28

    IPC classification: G06E1/00

    Abstract: Systems and methods that enhance estimate(s) of features (e.g., word associations), via employing a sampling component (e.g., sketches) that facilitates computations of sample contingency tables, and designates occurrences (or absence) of features in data (e.g., words in document lists). The sampling component can further include a contingency table generator and an estimation that employs a likelihood argument (e.g., partial likelihood, maximum likelihood, and the like) to estimate features/word pair(s) associations in the contingency tables.
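
One possible reading of a sketch-based sample contingency table, offered as an assumption rather than the patent's actual construction: document IDs are taken to be 1..D and already randomly permuted, a sketch is the `k` smallest IDs in a word's postings list, and the sample consists of all documents with ID up to the smaller of the two sketch maxima. The final estimate here is a plain scale-up of the sample count, which stands in for the likelihood-based estimation the abstract mentions.

```python
def sample_contingency_table(postings_a, postings_b, k):
    """Build a 2x2 sample table (a, b, c, d) for two words from their sketches.

    a: docs with both words, b: only word A, c: only word B, d: neither,
    all counted among the first d_s documents. Postings must be non-empty.
    """
    sketch_a = sorted(postings_a)[:k]
    sketch_b = sorted(postings_b)[:k]
    # Both sketches are complete for every document ID up to d_s.
    d_s = min(sketch_a[-1], sketch_b[-1])
    in_a = {doc for doc in sketch_a if doc <= d_s}
    in_b = {doc for doc in sketch_b if doc <= d_s}
    a = len(in_a & in_b)
    b = len(in_a - in_b)
    c = len(in_b - in_a)
    d = d_s - a - b - c
    return (a, b, c, d), d_s


def scaled_cooccurrence_estimate(postings_a, postings_b, k, total_docs):
    """Scale the sampled co-occurrence count up to the whole collection."""
    (a, _, _, _), d_s = sample_contingency_table(postings_a, postings_b, k)
    return a * total_docs / d_s


# Hypothetical postings lists over a collection of 20 documents.
POSTINGS_A = [1, 3, 4, 7, 9, 12, 15, 18]
POSTINGS_B = [2, 3, 7, 8, 12, 13, 19]

TABLE, SAMPLE_SIZE = sample_contingency_table(POSTINGS_A, POSTINGS_B, k=4)
ESTIMATE = scaled_cooccurrence_estimate(POSTINGS_A, POSTINGS_B, k=4, total_docs=20)
```

With such a tiny sample the scaled estimate is rough; a likelihood-based estimator that also uses the known list lengths would be expected to do better, which is the refinement the abstract points to.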

    Contextual Query Suggestion in Result Pages
    6.
    Type: invention patent application
    Status: in force

    Publication No.: US20100228710A1

    Publication date: 2010-09-09

    Application No.: US12391274

    Filing date: 2009-02-24

    IPC classification: G06F17/30

    CPC classification: G06F17/30867

    Abstract: Described is a search technology in which a search engine constructs a results page for a query that integrates suggested queries with the individual query results (e.g., displayed URLs). When rendered, the proximity of the suggested queries to their corresponding individual query result provides context as to the specific URL to which the suggested query is related. Suggested queries may appear alongside their associated search result, e.g., a displayed URL, and/or in an expandable panel proximate that individual search result. Suggested queries may appear within text accompanying a URL, and/or in a drop-down menu following interaction with such text or the like. Related queries may be found by using a search result URL to find a query, by analyzing a search result's text snippet, by accessing historical data, and/or by accessing current user session data.

    Personalized information retrieval search with backoff
    7.
    Type: invention patent application
    Status: in force

    Publication No.: US20080082485A1

    Publication date: 2008-04-03

    Application No.: US11529134

    Filing date: 2006-09-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30657

    Abstract: Query logs are accessed to obtain queries and user information that specifies the user from which each query was received, along with the result that was selected by the user who authored the query. This query log information is used to identify classes of users that looked for a similar result given a similar query. Those classes can then be used by a search engine in order to rank or provide search results to a user in response to a query input by the user.

    Data compression method and apparatus
    8.
    Type: granted invention patent
    Status: in force

    Publication No.: US06959300B1

    Publication date: 2005-10-25

    Application No.: US09383889

    Filing date: 1999-08-26

    IPC classification: G06F7/00 G06F17/30 H03M7/30

    Abstract: An improved data compression method and apparatus is provided, particularly with regard to the compression of data in tabular form such as database records. The present invention achieves improved compression ratios by utilizing metadata to transform the data in a manner that optimizes known compression techniques. In one embodiment of the invention, a schema is generated which is utilized to reorder and partition the data into low entropy and high entropy portions which are separately compressed by conventional compression methods. The high entropy portion is further reordered and partitioned to take advantage of row and column dependencies in the data. The present invention enables not only much greater compression ratios but also greater speed than is achieved by compressing the untransformed data.

    Methods and apparatus for detecting and displaying similarities in large data sets
    9.
    Type: granted invention patent
    Status: no longer in force

    Publication No.: US5953006A

    Publication date: 1999-09-14

    Application No.: US853459

    Filing date: 1992-03-18

    Abstract: Interactive methods and apparatus for studying similarities of values in very large data sets. The methods and apparatus employ a dot plot in an interactive graphical user interface to make the relationship between the similarities and the data set visible. A variety of filtering, weighting, and compression techniques make it possible to employ the dot plot with sequences of more than 10,000 tokens and to interactively magnify the dot plot, change weighting and display quantization, and view the underlying data. Also disclosed is a technique which is employed in the apparatus for identifying long sequences of similar tokens. The apparatus is used in the study of large bodies of text and code.
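
For readers unfamiliar with dot plots, the basic construction is easy to sketch: cell (i, j) is marked when token i equals token j, so repeated sequences show up as diagonal runs. The following is a plain, unweighted version for small inputs; the filtering, weighting, and compression techniques the abstract relies on for sequences of more than 10,000 tokens are not attempted here, and the function name is illustrative.

```python
def dot_plot(tokens):
    """Return one text row per token: '*' where two tokens are equal, '.' elsewhere."""
    return [
        "".join("*" if row_token == col_token else "." for col_token in tokens)
        for row_token in tokens
    ]


# Hypothetical token sequence; in practice tokens could be words or lines of code.
TOKENS = "a b a c b".split()
PLOT = dot_plot(TOKENS)

for line in PLOT:
    print(line)
```

The main diagonal is always marked because every token matches itself; the off-diagonal marks are the similarities of interest.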

    Glossary construction tool
    10.
    Type: granted invention patent
    Status: no longer in force

    Publication No.: US5850561A

    Publication date: 1998-12-15

    Application No.: US312243

    Filing date: 1994-09-23

    IPC classification: G06F17/27 G06F9/00

    CPC classification: G06F17/2795

    Abstract: A glossary construction tool for generating and maintaining a translation glossary, consisting of a number of terms and their translations. The glossary construction tool includes a terminology list development tool for generating a terminology list in the source language and a glossary development tool for automatically obtaining candidate translations for the terms in the terminology list. The terminology list development tool will construct the terminology list in the source language by analyzing the source text document to be translated and automatically extracting a list of candidate terms, comprised of multiple word noun phrases and single words not appearing on a standard or predefined stop list of "noise" words. The glossary development tool will obtain candidate translations for terms in the final terminology list by searching the source text document of a word-aligned text pair for a term to be translated and then provide candidate translations based on the indicated alignment with the target text document of the aligned text pair. A concordance tool provides monolingual and bilingual concordances in order to facilitate the user's evaluation of the automatically generated list of candidate terms and candidate translations, respectively.
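
The single-word half of the candidate-term extraction described in the abstract can be sketched as follows. The stop list and the tokenizer are assumptions chosen for illustration; extracting multi-word noun phrases would need part-of-speech information and is left out.

```python
import re
from collections import Counter

# Hypothetical stop list of "noise" words (assumption, not from the patent).
STOP_WORDS = {"the", "a", "an", "of", "and", "is"}


def candidate_terms(text, stop_words=STOP_WORDS):
    """Return (term, count) pairs for words not on the stop list.

    Results are ordered by descending count, then alphabetically.
    """
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(token for token in tokens if token not in stop_words)
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


CANDIDATES = candidate_terms("The pump controls the valve. The valve opens.")
```

A user would then review this list, which is the evaluation step the abstract says the concordance tool is meant to support.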
