Cognitive information security using a behavioral recognition system
    41.
    Granted invention patent
    Status: in force

    Publication No.: US09507768B2

    Publication date: 2016-11-29

    Application No.: US14457060

    Filing date: 2014-08-11

    Abstract: Embodiments presented herein describe a method for processing streams of data of one or more networked computer systems. According to one embodiment of the present disclosure, an ordered stream of normalized vectors corresponding to information security data obtained from one or more sensors monitoring a computer network is received. A neuro-linguistic model of the information security data is generated by clustering the ordered stream of vectors and assigning a letter to each cluster, outputting an ordered sequence of letters based on a mapping of the ordered stream of normalized vectors to the clusters, building a dictionary of words from the ordered output of letters, outputting an ordered stream of words based on the ordered output of letters, and generating a plurality of phrases based on the ordered output of words.
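The vectors-to-letters-to-words-to-phrases pipeline in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the distance-threshold clustering, the fixed word length, the frequency cut-offs, and all function names are hypothetical.

```python
from collections import Counter
from math import dist
import string


def letters_from_vectors(vectors, radius=0.25):
    """Map each vector to a letter via a simple online clustering (assumed).

    A vector joins the nearest existing cluster whose seed lies within
    `radius`; otherwise it seeds a new cluster. Supports at most 26 clusters.
    """
    seeds, letters = [], []
    for v in vectors:
        best = None
        for i, seed in enumerate(seeds):
            d = dist(v, seed)
            if d <= radius and (best is None or d < best[0]):
                best = (d, i)
        if best is None:
            seeds.append(tuple(v))
            idx = len(seeds) - 1
        else:
            idx = best[1]
        letters.append(string.ascii_lowercase[idx])
    return "".join(letters)


def build_dictionary(letters, word_len=2, min_count=2):
    """Keep every fixed-length letter run that occurs at least `min_count` times."""
    counts = Counter(
        letters[i:i + word_len] for i in range(len(letters) - word_len + 1)
    )
    return {word for word, n in counts.items() if n >= min_count}


def words_from_letters(letters, dictionary, word_len=2):
    """Greedily segment the letter stream into dictionary words."""
    words, i = [], 0
    while i <= len(letters) - word_len:
        chunk = letters[i:i + word_len]
        if chunk in dictionary:
            words.append(chunk)
            i += word_len
        else:
            i += 1  # skip a letter that starts no known word
    return words


def phrases_from_words(words, min_count=2):
    """Treat recurring adjacent word pairs as phrases (assumed definition)."""
    counts = Counter(zip(words, words[1:]))
    return {pair for pair, n in counts.items() if n >= min_count}
```

Each stage consumes only the ordered output of the previous one, which mirrors the ordering the abstract describes; a real system would presumably learn word lengths and statistics rather than fix them.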

    METHOD AND SYSTEM FOR GENERATING A DEFINITION OF A WORD FROM MULTIPLE SOURCES
    42.
    Invention application
    Status: in force

    Publication No.: US20160335248A1

    Publication date: 2016-11-17

    Application No.: US15108824

    Filing date: 2014-10-22

    Abstract: There is provided a method of performing an on-line definition of a first word, the first word received from a user of an electronic device via a communication network. The method can be executed at a server. The method comprises: obtaining a first definition set from a first source, the first definition set being based on the first word; obtaining a second definition set from a second source, the second definition set being based on the first word; parsing the first definition set to obtain individual first set words; parsing the second definition set to obtain individual second set words; organizing the individual first set words into at least one definition cluster; causing the electronic device to display to the user at least the first cluster.
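As a rough illustration of the parse-and-cluster steps above, the sketch below groups definitions from two sources by word overlap. The Jaccard similarity, the threshold, and the function names are assumptions for illustration only; the abstract does not say how the clusters are formed.

```python
import re


def parse_words(definition):
    """Split a definition into its individual lowercase words."""
    return frozenset(re.findall(r"[a-z]+", definition.lower()))


def jaccard(a, b):
    """Word-set overlap in [0, 1]; 0.0 when both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_definitions(first_set, second_set, threshold=0.3):
    """Group definitions whose words overlap with a cluster's first member."""
    clusters = []
    for definition in list(first_set) + list(second_set):
        words = parse_words(definition)
        for cluster in clusters:
            if jaccard(words, parse_words(cluster[0])) >= threshold:
                cluster.append(definition)
                break
        else:
            clusters.append([definition])  # no similar cluster: start a new one
    return clusters
```

With two senses of "bank" drawn from two hypothetical sources, the financial definitions land in one cluster and the riverside definitions in another, and a client could then display the first cluster to the user.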

    INDEX-SIDE DIACRITICAL CANONICALIZATION
    43.
    Invention application
    Status: under examination (published)

    Publication No.: US20160307000A1

    Publication date: 2016-10-20

    Application No.: US12942967

    Filing date: 2010-11-09

    CPC classification number: G06F21/64 G06F16/3337 G06F17/273 G06F17/2795

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for index-side synonym expansion. One method includes obtaining a token sequence for a resource and indexing a particular token in the token sequence. The indexing includes obtaining a diacritically canonicalized form of the particular token; determining that the diacritically canonicalized form of the particular token is different from the particular token; and storing data associating the resource with both the particular token and the different diacritically canonicalized form of the particular token as index terms for the resource in a search engine.
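The indexing step described above can be sketched as follows. This is a minimal illustration that assumes canonicalization means stripping Unicode combining marks after NFD decomposition; the patent may define the canonical form differently, and `canonicalize` and `index_resource` are hypothetical names.

```python
import unicodedata
from collections import defaultdict


def canonicalize(token):
    """Strip diacritics: decompose to NFD, then drop combining marks."""
    decomposed = unicodedata.normalize("NFD", token)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def index_resource(index, resource_id, tokens):
    """Index each token, plus its canonical form when the two differ."""
    for token in tokens:
        index[token].add(resource_id)
        canonical = canonicalize(token)
        if canonical != token:
            index[canonical].add(resource_id)


index = defaultdict(set)
index_resource(index, "doc1", ["caf\u00e9", "menu"])
```

Because both forms are stored as index terms at indexing time, a query for either the accented or the unaccented spelling can reach the resource without any rewriting on the query side.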

    Method for preserving conceptual distance within unstructured documents
    45.
    Granted invention patent
    Status: in force

    Publication No.: US09424299B2

    Publication date: 2016-08-23

    Application No.: US14641527

    Filing date: 2015-03-09

    Abstract: A method, system and computer-usable medium are disclosed for preserving conceptual distance within unstructured documents by characterizing conceptual relationships. Natural language processing is applied to content in a plurality of documents to identify topics and subjects. Analytic analysis is then applied to the identified topics and subjects to identify concepts. The content in each of the plurality of documents is partitioned into a first structured hierarchy, preserving at least one structure inherent in each document. Access is then provided to the content through a first index based upon utilizing the first structured hierarchy and through a second index utilizing a second structured hierarchy. The conceptual relationship criteria are based upon a directed graph with weights based upon a similarity and a distance based upon concepts.

    Preserving conceptual distance within unstructured documents
    46.
    Granted invention patent
    Status: in force

    Publication No.: US09424298B2

    Publication date: 2016-08-23

    Application No.: US14508200

    Filing date: 2014-10-07

    Abstract: A method, system and computer-usable medium are disclosed for preserving conceptual distance within unstructured documents by characterizing conceptual relationships. Natural language processing is applied to content in a plurality of documents to identify topics and subjects. Analytic analysis is then applied to the identified topics and subjects to identify concepts. The content in each of the plurality of documents is partitioned into a first structured hierarchy, preserving at least one structure inherent in each document. Access is then provided to the content through a first index based upon utilizing the first structured hierarchy and through a second index utilizing a second structured hierarchy. The conceptual relationship criteria are based upon a directed graph with weights based upon a similarity and a distance based upon concepts.

    Context based synonym filtering for natural language processing systems
    49.
    Granted invention patent
    Status: in force

    Publication No.: US09378204B2

    Publication date: 2016-06-28

    Application No.: US14285019

    Filing date: 2014-05-22

    Abstract: Mechanisms are provided for performing context based synonym filtering for natural language processing. Content is parsed into one or more conceptual units, wherein each conceptual unit comprises a portion of text of the content that is associated with a single concept. For each conceptual unit, a term in the conceptual unit is identified that has a synonym to be utilized during natural language processing of the content. A first measure of relatedness of the term to at least one other term in the conceptual unit is determined. A second measure of relatedness of the synonym of the term to the at least one other term in the conceptual unit is determined. A determination whether or not to utilize the synonym when performing natural language processing on the conceptual unit is made based on the first and second measures of relatedness, and natural language processing on the content is performed accordingly.
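The two-measure comparison above can be sketched as below. The abstract does not specify how relatedness is measured or how the two measures are combined, so the co-occurrence lookup table, the `tolerance` ratio, and both function names are purely hypothetical.

```python
def relatedness(term, other, cooccurrence):
    """Look up a symmetric relatedness score; unknown pairs score 0.0."""
    return cooccurrence.get(frozenset((term, other)), 0.0)


def keep_synonym(term, synonym, context_terms, cooccurrence, tolerance=0.5):
    """Decide whether a synonym fits the conceptual unit (assumed rule).

    The synonym is kept when its best relatedness to the other terms in the
    unit is at least `tolerance` times that of the original term.
    """
    first = max(
        (relatedness(term, o, cooccurrence) for o in context_terms), default=0.0
    )
    second = max(
        (relatedness(synonym, o, cooccurrence) for o in context_terms), default=0.0
    )
    return second >= tolerance * first
```

In a conceptual unit about a river, with illustrative scores only, "shore" would pass as a synonym for "bank" while "lender" would be filtered out, since it is barely related to the other terms in the unit.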