Processing collocation mistakes in documents
    1.
    发明授权
    Processing collocation mistakes in documents 有权
    处理文件中的并置错误

    公开(公告)号:US07574348B2

    公开(公告)日:2009-08-11

    申请号:US11177136

    申请日:2005-07-08

    IPC分类号: G06F17/27

    摘要: A sentence is accessed and at least one query is generated based on the sentence. At least one query can be compared to text within a collection of documents, for example using a web search engine. Collocation errors in the sentence can be detected and/or corrected based on the comparison of the at least one query and the text within the collection of documents.

    摘要翻译: 访问一个句子,并且基于该句子生成至少一个查询。 至少可以将一个查询与文档集合中的文本进行比较,例如使用Web搜索引擎。 可以基于至少一个查询与文档集合内的文本的比较来检测和/或修正该句子中的配置错误。

    Web-based collocation error proofing
    2.
    发明申请
    Web-based collocation error proofing 有权
    基于Web的搭配错误打样

    公开(公告)号:US20080133444A1

    公开(公告)日:2008-06-05

    申请号:US11633788

    申请日:2006-12-05

    IPC分类号: G06N7/02 G06F17/30 G06F3/048

    摘要: Collocation errors can be automatically proofed using local and network-based corpora, including the Web. For example, according to one illustrative method, one or more collocations from a text sample are compared with a corpus such as the content of the Web. The collocations are identified for whether they are disfavored in the corpus. Indications are provided via an output device of whether the collocations are disfavored in the corpus. Additional steps may then be taken such as searching for and providing potentially proper word collocations via a user output.

    摘要翻译: 可以使用本地和基于网络的语料库(包括Web)自动验证并置错误。 例如,根据一个说明性方法,将来自文本样本的一个或多个并置与诸如Web的内容的语料库进行比较。 识别他们是否在语料库中不利的搭配。 通过输出设备提供指示是否在语料库中不匹配。 然后可以采取额外的步骤,例如通过用户输出搜索并提供潜在的适当的单词搭配。

    Processing collocation mistakes in documents
    3.
    发明申请
    Processing collocation mistakes in documents 有权
    处理文件中的并置错误

    公开(公告)号:US20070010992A1

    公开(公告)日:2007-01-11

    申请号:US11177136

    申请日:2005-07-08

    IPC分类号: G06F17/27

    摘要: A sentence is accessed and at least one query is generated based on the sentence. At least one query can be compared to text within a collection of documents, for example using a web search engine. Collocation errors in the sentence can be detected and/or corrected based on the comparison of the at least one query and the text within the collection of documents.

    摘要翻译: 访问一个句子,并且基于该句子生成至少一个查询。 至少可以将一个查询与文档集合中的文本进行比较,例如使用Web搜索引擎。 可以基于至少一个查询与文档集合内的文本的比较来检测和/或修正该句子中的配置错误。

    Proofing of word collocation errors based on a comparison with collocations in a corpus
    4.
    发明授权
    Proofing of word collocation errors based on a comparison with collocations in a corpus 有权
    基于与语料库中的搭配进行比较来验证单词搭配错误

    公开(公告)号:US07774193B2

    公开(公告)日:2010-08-10

    申请号:US11633788

    申请日:2006-12-05

    IPC分类号: G06F17/28 G06F17/21

    摘要: Collocation errors can be automatically proofed using local and network-based corpora, including the Web. For example, according to one illustrative method, one or more collocations from a text sample are compared with a corpus such as the content of the Web. The collocations are identified for whether they are disfavored in the corpus. Indications are provided via an output device of whether the collocations are disfavored in the corpus. Additional steps may then be taken such as searching for and providing potentially proper word collocations via a user output.

    摘要翻译: 可以使用本地和基于网络的语料库(包括Web)自动验证并置错误。 例如,根据一个说明性方法,将来自文本样本的一个或多个并置与诸如Web的内容的语料库进行比较。 识别他们是否在语料库中不利的搭配。 通过输出设备提供指示是否在语料库中不匹配。 然后可以采取额外的步骤,例如通过用户输出搜索并提供潜在的适当的单词搭配。

    Joint ranking model for multilingual web search
    5.
    发明授权
    Joint ranking model for multilingual web search 有权
    多语言网络搜索的联合排名模型

    公开(公告)号:US08326785B2

    公开(公告)日:2012-12-04

    申请号:US12241078

    申请日:2008-09-30

    CPC分类号: G06F17/30675

    摘要: A classifier is built to rank documents of different languages found in a query based at least in part on similarity to other documents and the relevance of those other documents to the query. A joint ranking model, e.g., based upon a Boltzmann machine, is used to represent the content similarity among documents, and to help determine joint relevance probability for a set of documents. The relevant documents of one language are thus leveraged to improve the relevance estimation for documents of different languages. In one aspect, a hidden layer of units (neurons) represents clusters (corresponding to relevant topics) among the retrieved documents, with an output layer representing the relevant documents and their features, and edges representing a relationship between clusters and documents.

    摘要翻译: 构建分类器至少部分地基于与其他文档的相似性以及这些其他文档与查询的相关性来对查询中发现的不同语言的文档进行排序。 联合排名模型,例如基于玻尔兹曼(Boltzmann)机器,用于表示文档之间的内容相似性,并且帮助确定一组文档的联合相关概率。 因此,利用一种语言的相关文件来改进不同语言文件的相关性估计。 在一个方面,隐藏的单位(神经元)表示检索的文档中的集群(对应于相关主题),输出层表示相关文档及其特征,边缘表示集群和文档之间的关系。

    JOINT RANKING MODEL FOR MULTILINGUAL WEB SEARCH
    6.
    发明申请
    JOINT RANKING MODEL FOR MULTILINGUAL WEB SEARCH 有权
    多浏览网络联合排名模型

    公开(公告)号:US20100082511A1

    公开(公告)日:2010-04-01

    申请号:US12241078

    申请日:2008-09-30

    IPC分类号: G06F7/06 G06F17/30 G06F15/18

    CPC分类号: G06F17/30675

    摘要: Described is a technology in which a classifier is built to rank documents of different languages found in a query based at least in part on similarity to other documents and the relevance of those other documents to the query. A joint ranking model, e.g., based upon a Boltzmann machine, is used to represent the content similarity among documents, and to help determine joint relevance probability for a set of documents. The relevant documents of one language are thus leveraged to improve the relevance estimation for documents of different languages. In one aspect, a hidden layer of units (neurons) represents clusters (corresponding to relevant topics) among the retrieved documents, with an output layer representing the relevant documents and their features, and edges representing a relationship between clusters and documents.

    摘要翻译: 描述了一种技术,其中构建分类器以至少部分地基于与其他文档的相似性以及这些其他文档与查询的相关性来对在查询中发现的不同语言的文档进行排名。 联合排名模型,例如基于玻尔兹曼(Boltzmann)机器,用于表示文档之间的内容相似性,并且帮助确定一组文档的联合相关概率。 因此,利用一种语言的相关文件来改进不同语言文件的相关性估计。 在一个方面,隐藏的单位(神经元)表示检索的文档中的集群(对应于相关主题),输出层表示相关文档及其特征,边缘表示集群和文档之间的关系。

    NETWORK SEARCH FOR WRITING ASSISTANCE
    7.
    发明申请
    NETWORK SEARCH FOR WRITING ASSISTANCE 审中-公开
    网络搜索书面帮助

    公开(公告)号:US20120297294A1

    公开(公告)日:2012-11-22

    申请号:US13109021

    申请日:2011-05-17

    IPC分类号: G06F17/21 G06F17/30

    摘要: Architecture that utilizes web search implicitly to assist users in improving writing and associated productivity. The architecture extends the authoring experience of applications of office suite applications which can draw on a web search engine to offer contextual suggestions for revision, word auto-complete, and text prediction. Web-based research and reference to users is enabled as the user writes or revises text. Suggestions are made as to how to complete a phrase or sentence using data from networks such as the Internet or intranet, to how a user how revises a word or phrase in an already-written sentence using data from the network, and to problems in writing style/writing rules. Paragraph analysis is performed to find improper language usage or errors. Prediction and revision suggestions are extracted from web search or enterprise search document summaries, and intent of the user to obtain word completion, revision assistance, and prediction suggestions is identified.

    摘要翻译: 利用网页搜索隐式地协助用户改进写作和相关生产力的体系结构。 该架构扩展了办公套件应用程序的创作经验,可以利用Web搜索引擎提供修订,字自动完成和文本预测的上下文建议。 当用户编写或修改文本时,可以启用基于Web的研究和用户参考。 建议如何使用来自诸如因特网或内联网之类的网络的数据来完成短语或句子,以及用户如何使用来自网络的数据修改已经写入的句子中的单词或短语,以及如何修改文字中的问题 风格/写作规则。 进行段落分析以查找不正确的语言使用或错误。 从网络搜索或企业搜索文档摘要中提取预测和修订建议,并确定用户获取单词完成,修订协助和预测建议的意图。

    Method and system for retrieving confirming sentences
    8.
    发明授权
    Method and system for retrieving confirming sentences 有权
    检索确认句子的方法和系统

    公开(公告)号:US07974963B2

    公开(公告)日:2011-07-05

    申请号:US11187567

    申请日:2005-07-22

    IPC分类号: G06F17/00

    CPC分类号: G06F17/3069 Y10S707/99933

    摘要: A method, computer readable medium and system are provided which retrieve confirming sentences from a sentence database in response to a query. A search engine retrieves confirming sentences from the sentence database in response to the query. IN retrieving the confirming sentences, the search engine defines indexing units based upon the query, with the indexing units including both lemma from the query and extended indexing units associated with the query. The search engine then retrieves a plurality of sentences from the sentence database using the defined indexing units as search parameters. A similarity between each of the plurality of retrieved sentences and the query is determined by the search engine, wherein each similarity is determined as a function of a linguistic weight of a term in the query. The search engine then ranks the plurality of retrieved sentences based upon the determined similarities.

    摘要翻译: 提供了一种方法,计算机可读介质和系统,其响应于查询从句子数据库中检索确认句子。 搜索引擎响应于查询从句子数据库中检索确认句子。 在检索确认语句中,搜索引擎基于查询来定义索引单元,索引单元包括来自查询的引理和与查询相关联的扩展索引单元。 然后,搜索引擎使用定义的索引单元作为搜索参数从句子数据库中检索多个句子。 由搜索引擎确定多个检索到的句子和查询中的每一个之间的相似度,其中每个相似度被确定为查询中的术语的语言权重的函数。 然后,搜索引擎基于所确定的相似度对多个检索到的句子进行排序。

    Query speller
    9.
    发明授权
    Query speller 有权
    查询拼写器

    公开(公告)号:US07818332B2

    公开(公告)日:2010-10-19

    申请号:US11465023

    申请日:2006-08-16

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/3064

    摘要: Candidate suggestions for correcting misspelled query terms input into a search application are automatically generated. A score for each candidate suggestion can be generated using a first decoding pass and paths through the suggestions can be ranked in a second decoding pass. Candidate suggestions can be generated based on typographical errors, phonetic mistakes and/or compounding mistakes. Furthermore, a ranking model can be developed to rank candidate suggestions to be presented to a user.

    摘要翻译: 自动生成用于纠正输入到搜索应用程序中的拼错查询条件的候选建议。 可以使用第一解码通道来生成每个候选建议的得分,并且通过建议的路径可以被排列在第二解码通行证中。 可以根据印刷错误,语音错误和/或复合错误生成候选建议。 此外,可以开发排名模型来排列要呈现给用户的候选建议。

    Method and system for retrieving confirming sentences
    10.
    发明申请
    Method and system for retrieving confirming sentences 有权
    检索确认句子的方法和系统

    公开(公告)号:US20050273318A1

    公开(公告)日:2005-12-08

    申请号:US11187567

    申请日:2005-07-22

    CPC分类号: G06F17/3069 Y10S707/99933

    摘要: A method, computer readable medium and system are provided which retrieve confirming sentences from a sentence database in response to a query. A search engine retrieves confirming sentences from the sentence database in response to the query. IN retrieving the confirming sentences, the search engine defines indexing units based upon the query, with the indexing units including both lemma from the query and extended indexing units associated with the query. The search engine then retrieves a plurality of sentences from the sentence database using the defined indexing units as search parameters. A similarity between each of the plurality of retrieved sentences and the query is determined by the search engine, wherein each similarity is determined as a function of a linguistic weight of a term in the query. The search engine then ranks the plurality of retrieved sentences based upon the determined similarities.

    摘要翻译: 提供了一种方法,计算机可读介质和系统,其响应于查询从句子数据库中检索确认句子。 搜索引擎响应于查询从句子数据库中检索确认句子。 在检索确认语句中,搜索引擎基于查询来定义索引单元,索引单元包括来自查询的引理和与查询相关联的扩展索引单元。 然后,搜索引擎使用定义的索引单元作为搜索参数从句子数据库中检索多个句子。 由搜索引擎确定多个检索到的句子和查询中的每一个之间的相似度,其中每个相似度被确定为查询中的术语的语言权重的函数。 然后,搜索引擎基于所确定的相似度对多个检索到的句子进行排序。