Methods and systems for alignment of parallel text corpora
    1.
    发明授权
    Methods and systems for alignment of parallel text corpora 有权
    对齐并行文本语料库的方法和系统

    公开(公告)号:US09047275B2

    公开(公告)日:2015-06-02

    申请号:US13464447

    申请日:2012-05-04

    CPC classification number: G06F17/2881 G06F17/2785

    Abstract: Computer-implemented systems and methods align fragments of a first text with corresponding fragments of a second text, which is a translation of the first text. One preferred embodiment preliminarily divides the first and second texts into fragments; generates a hypothesis about the correspondence between the fragments of the first and second texts; performs a lexico-morphological analysis of the fragments using linguistic descriptions; performs a syntactic analysis of the fragments using linguistic descriptions and generates syntactic structures for the fragments; generates semantic structures for the fragments; and estimates the degree of correspondence between the semantic structures.

    Abstract translation: 计算机实现的系统和方法将第一文本的片段与第二文本的相应片段对齐,第二文本是第一文本的翻译。 一个优选实施例预先将第一和第二文本划分成片段; 产生关于第一和第二文本的片段之间的对应关系的假设; 使用语言描述进行片段的词典 - 形态分析; 使用语言描述对片段进行语法分析,并为片段生成句法结构; 为片段生成语义结构; 并估计语义结构之间的对应程度。

    CROSS-LANGUAGE TEXT CLASSIFICATION
    2.
    发明申请
    CROSS-LANGUAGE TEXT CLASSIFICATION 有权
    跨语言文本分类

    公开(公告)号:US20120271627A1

    公开(公告)日:2012-10-25

    申请号:US13535638

    申请日:2012-06-28

    Abstract: Methods are described for performing classification (categorization) of text documents written in various languages. Language-independent semantic structures are constructed before classifying documents. These structures reflect lexical, morphological, syntactic, and semantic properties of documents. The methods suggested are able to perform cross-language text classification which is based on document properties reflecting their meaning. The methods are applicable to genre classification, topic detection, news analysis, authorship analysis, etc.

    Abstract translation: 描述用于执行用各种语言编写的文本文档的分类(分类)的方法。 在分类文档之前构建语言无关的语义结构。 这些结构反映了文献的词汇,形态,句法和语义属性。 所提出的方法能够执行基于反映其含义的文档属性的跨语言文本分类。 该方法适用于类型分类,主题检测,新闻分析,作者分析等。

    Multilevel bit-mapped image analysis method
    3.
    发明授权
    Multilevel bit-mapped image analysis method 有权
    多级位映像图像分析方法

    公开(公告)号:US08103103B2

    公开(公告)日:2012-01-24

    申请号:US10386541

    申请日:2003-03-13

    CPC classification number: G06K9/00442

    Abstract: The present invention discloses a multilevel method of bitmapped image analysis that comprises a whole image data representation via its components—objects of different levels of complexity—hierarchically connected therebetween by spatially-parametrical links. The said method comprises preliminarily generating a classifier of the objects that possibly may be present in the image consisting of one or more levels differing in complexity; parsing the image into objects; attaching each object to one of predetermined levels; establishing hierarchical links between objects of different levels; establishing links between objects within the same level; and performing an object feature analysis. The objects feature analysis comprises at least generating and examining a hypothesis about object features and correcting the object's features of the same and other levels in response to the hypothesis examination results. The step of object features analysis may also comprise execution of a recursive RX-cut within the same level.

    Abstract translation: 本发明公开了一种位图图像分析的多级方法,其包括通过其组件的整体图像数据表示 - 通过空间参数链接层级连接在其间的不同复杂级别的对象。 所述方法包括预先生成可能存在于由复杂度不同的一个或多个级别组成的图像中的对象的分类器; 将图像解析成对象; 将每个物体附接到预定水平之一; 建立不同层次对象之间的分层联系; 建立同级别对象之间的链接; 并执行对象特征分析。 对象特征分析至少包括生成和检查关于对象特征的假设,并且响应于假设检查结果来校正相同和其他级别的对象的特征。 对象特征分析的步骤还可以包括在相同水平内执行递归RX切割。

    AUTOMATIC FILE NAME GENERATION IN OCR SYSTEMS
    4.
    发明申请
    AUTOMATIC FILE NAME GENERATION IN OCR SYSTEMS 有权
    OCR系统中的自动文件名称生成

    公开(公告)号:US20100215272A1

    公开(公告)日:2010-08-26

    申请号:US12749525

    申请日:2010-03-30

    CPC classification number: G06K9/00469

    Abstract: Methods and system for processing document images in OCR systems, particularly for selecting a proper file name for a recognized document. The method comprises generating at least one document type hypothesis for the document; verifying each document type hypothesis; selecting a best document type hypothesis and saving the document with a proper name based on the best type hypothesis and unique features. The method further includes determining a logical structure of a document and selecting a best document model hypothesis that has the best degree of correspondence with the selected best block hypotheses for the document. On the basis of the best document model hypothesis the text document reflecting the logical structure of the source document in extended computer-editable format is formed and saved with a proper file name.

    Abstract translation: 用于在OCR系统中处理文档图像的方法和系统,特别是用于为识别的文档选择适当的文件名。 该方法包括为文档生成至少一个文档类型假设; 验证每种文件类型假说; 选择最佳文件类型假设,并根据最佳类型假设和独特功能,以正确的名称保存文档。 该方法还包括确定文档的逻辑结构并选择与文档的所选最佳块假设具有最佳对应关系的最佳文档模型假设。 在最佳文件模型假设的基础上,反映出扩展计算机可编辑格式的源文档的逻辑结构的文本文档被形成并以适当的文件名保存。

    METHOD AND SYSTEM FOR ANALYZING VARIOUS LANGUAGES AND CONSTRUCTING LANGUAGE-INDEPENDENT SEMANTIC STRUCTURES
    5.
    发明申请
    METHOD AND SYSTEM FOR ANALYZING VARIOUS LANGUAGES AND CONSTRUCTING LANGUAGE-INDEPENDENT SEMANTIC STRUCTURES 有权
    分析各种语言和构建语言独立语义结构的方法与系统

    公开(公告)号:US20080091405A1

    公开(公告)日:2008-04-17

    申请号:US11548214

    申请日:2006-10-10

    CPC classification number: G06F17/277 G06F17/2755 G06F17/2785 G06F17/2872

    Abstract: A method and computer system for analyzing sentences of various languages and constructing a language-independent semantic structure are provided. On the basis of comprehensive knowledge about languages and semantics, exhaustive linguistic descriptions are created, and lexical, morphological, syntactic, and semantic analyses for one or more sentences of a natural or artificial language are performed. A computer system is also provided to implement, analyze and store various linguistic structures and to perform lexical, morphological, syntactic, and semantic analyses. As result, a generalized data structure, such as a semantic structure, is generated and used to describe the meaning of one or more sentences in language-independent form, applicable to automated abstracting, machine translation, control systems, Internet information retrieval, etc.

    Abstract translation: 提供了一种用于分析各种语言句子和构建语言无关语义结构的方法和计算机系统。 在语言和语义综合知识的基础上,创造了详尽的语言描述,并对自然或人工语言的一个或多个句子进行词汇,形态,句法和语义分析。 还提供了一种计算机系统来实现,分析和存储各种语言结构,并进行词法,形态,句法和语义分析。 因此,生成广义数据结构,如语义结构,并用于描述语言无关形式的一个或多个句子的含义,适用于自动抽象,机器翻译,控制系统,互联网信息检索等。

    Method and system for translating sentence between languages based on semantic structure of the sentence

    公开(公告)号:US09817818B2

    公开(公告)日:2017-11-14

    申请号:US13477021

    申请日:2012-05-21

    CPC classification number: G06F17/28 G06F17/2881

    Abstract: A method and computer system for translating sentences between languages from an intermediate language-independent semantic representation is provided. On the basis of comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, to build syntactic structures and language independent semantic structures and representations, and to synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and to perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language. The method and computer system can be applied to in automated abstracting, machine translation, natural language processing, control systems, Internet information retrieval, etc.

    Method and system for analyzing and translating various languages with use of semantic hierarchy
    7.
    发明授权
    Method and system for analyzing and translating various languages with use of semantic hierarchy 有权
    使用语义层次分析和翻译各种语言的方法和系统

    公开(公告)号:US09471562B2

    公开(公告)日:2016-10-18

    申请号:US13288953

    申请日:2011-11-03

    Abstract: A method and computer system for analyzing sentences of various languages and constructing a language-independent semantic structure are provided. On the basis of comprehensive knowledge about languages and semantics, exhaustive linguistic descriptions are created, and lexical, morphological, syntactic, and semantic analyses for one or more sentences of a natural or artificial language are performed. A computer system is also provided to implement, analyze and store various linguistic structures and to perform lexical, morphological, syntactic, and semantic analyses. As result, a generalized data structure, such as a semantic structure, is generated and used to describe the meaning of one or more sentences in language-independent form, applicable to automated abstracting, machine translation, control systems, Internet information retrieval, etc.

    Abstract translation: 提供了一种用于分析各种语言句子和构建语言无关语义结构的方法和计算机系统。 在语言和语义综合知识的基础上,创造了详尽的语言描述,并对自然或人工语言的一个或多个句子进行词汇,形态,句法和语义分析。 还提供了一种计算机系统来实现,分析和存储各种语言结构,并进行词法,形态,句法和语义分析。 因此,生成广义数据结构,如语义结构,并用于描述语言无关形式的一个或多个句子的含义,适用于自动抽象,机器翻译,控制系统,互联网信息检索等。

    Translation of a selected text fragment of a screen
    9.
    发明授权
    Translation of a selected text fragment of a screen 有权
    翻译屏幕的所选文本片段

    公开(公告)号:US09262409B2

    公开(公告)日:2016-02-16

    申请号:US13552601

    申请日:2012-07-18

    Abstract: Disclosed is a method for translating text fragments displayed on a screen from an input language into an output language and displaying the result. Translation may use electronic dictionaries, machine translation, natural language processing, control systems, information searches, (e.g., search engine via an Internet protocol), semantic searches, computer-aided learning, and expert systems. For a word combination, appropriate local or network accessible dictionaries are consulted. The disclosed method provides a translation in grammatical agreement in accordance with grammatical rules of the output language in consideration of the context of the text.

    Abstract translation: 公开了一种将屏幕上显示的文本片段从输入语言翻译成输出语言并显示结果的方法。 翻译可以使用电子词典,机器翻译,自然语言处理,控制系统,信息搜索(例如,通过因特网协议的搜索引擎),语义搜索,计算机辅助学习和专家系统。 对于单词组合,可以查询适当的本地或网络可访问字典。 考虑到文本的上下文,所公开的方法根据输出语言的语法规则提供了语法协议的翻译。

    Translating sentences between languages
    10.
    发明授权
    Translating sentences between languages 有权
    翻译语言之间的句子

    公开(公告)号:US08892418B2

    公开(公告)日:2014-11-18

    申请号:US13554695

    申请日:2012-07-20

    CPC classification number: G06F17/28 G06F17/2881

    Abstract: A method and computer system for translating sentences between languages from an intermediate language-independent semantic representation is provided. On the basis of comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, to build syntactic structures and language independent semantic structures and representations, and to synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and to perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language. The method and computer system can be applied to in automated abstracting, machine translation, natural language processing, control systems, Internet information retrieval, etc.

    Abstract translation: 提供了一种从中间语言无关语义表示来翻译语言之间的句子的方法和计算机系统。 在全面了解语言和语义的基础上,将全面的语言描述用于分析句子,构建语法结构和语言独立语义结构和表示,并以自然或人工语言合成一个或多个句子。 还提供了一种计算机系统来分析和综合各种语言结构并执行各种各样的句子类型的翻译。 结果,从输入语言的句子生成诸如语义结构的广义数据结构,并且可以在输出语言中被正确地转换成表达其含义的自然语句。 该方法和计算机系统可应用于自动抽象,机器翻译,自然语言处理,控制系统,互联网信息检索等。

Patent Agency Ranking