COMMON PHRASE IDENTIFICATION AND LANGUAGE DICTATION RECOGNITION SYSTEMS AND METHODS FOR USING THE SAME

    公开(公告)号:US20170286393A1

    公开(公告)日:2017-10-05

    申请号:US15624411

    申请日:2017-06-15

    申请人: InfraWare, Inc.

    摘要: In at least one exemplary embodiment for common phrase identification and language dictation recognition systems and methods for using the same, the system comprises a database capable of receiving a plurality of verbal records, the verbal record comprising at least one identifier and at least one verbal feature and a processor operably coupled to the database, where the processor has and executes a software program. The processor being operational to identify a subset of the plurality of verbal records from the database, extract at least one verbal feature from the identified records, analyze the at least one verbal feature of the subset of the plurality of verbal records, process the subset of the plurality of records using the analyzed feature according to at least one reasoning approach, generate a processed verbal record using the processed subset of the plurality of records, and deliver the processed verbal record to a recipient. The processor being further operational to identify common phrases in parts of the verbal record, identifying a body of work for building a set of common phrases, analyze documents in a training set to find some common phrases, and replacing phrases with the common phrases.

    Systems and methods for high-speed searching and filtering of large datasets

    公开(公告)号:US09697250B1

    公开(公告)日:2017-07-04

    申请号:US14678982

    申请日:2015-04-04

    发明人: Roy W. Ward

    IPC分类号: G06F17/30

    摘要: A binary data file embodies an inline tree data structure storing fields of a hierarchical dataset. The inline tree comprises first-level binary string segments, each comprising substantially contiguous second-level binary string segments, corresponding to subranges of first and second subsets of data fields. Size is reduced by substituting: binary string indices for alphanumeric strings; a data clump index for a set of correlated/anticorrelated strings; field masks for unoccupied data fields. A dedicated conversion program generates the inline tree from conventional database formats, which is read entirely into RAM to be searched/filtered by a dedicated search/filter program. Small size ( 106 records (>100 data fields) in

    NON-TRANSITORY COMPUTER-READABLE RECORDING MEDIUM, ENCODING METHOD, ENCODING APPARATUS, DECODING METHOD, AND DECODING APPARATUS
    57.
    发明申请
    NON-TRANSITORY COMPUTER-READABLE RECORDING MEDIUM, ENCODING METHOD, ENCODING APPARATUS, DECODING METHOD, AND DECODING APPARATUS 有权
    非终端计算机可读记录介质,编码方法,编码设备,解码方法和解码设备

    公开(公告)号:US20170017629A1

    公开(公告)日:2017-01-19

    申请号:US15207876

    申请日:2016-07-12

    申请人: FUJITSU LIMITED

    IPC分类号: G06F17/22

    摘要: A code converting unit encodes input text data based on an code assignment table stored in a storage device that defines a conversion rule for encoding text data, wherein; the code assignment table being generated by assigning a part of character strings assigned to a 1-byte region of a first code assignment table to a 2-byte region of the code assignment table, and by assigning one or more codes each having two or more bytes to at least a part of character strings assigned to the 2-byte region of the code assignment table.

    摘要翻译: 代码转换单元基于存储在定义用于编码文本数据的转换规则的存储设备中的代码分配表来编码输入文本数据,其中; 通过将分配给第一代码分配表的1字节区域的字符串的一部分分配给代码分配表的2字节区域,并通过分配一个或多个代码来生成代码分配表,每个代码具有两个或更多个 字节到分配给代码分配表的2字节区域的字符串的至少一部分。

    ASSIGNING CONTENT OBJECTS TO DELIVERY NETWORKS
    58.
    发明申请
    ASSIGNING CONTENT OBJECTS TO DELIVERY NETWORKS 审中-公开
    评估内容交付网络的对象

    公开(公告)号:US20160283480A1

    公开(公告)日:2016-09-29

    申请号:US14669910

    申请日:2015-03-26

    摘要: A system, method, and apparatus are provided for assigning or allocating multiple content objects, within a content page (e.g., web page) or other content collection (e.g., a set of pages), to different content delivery networks for delivery in response to a content request. The objects are ranked by importance (e.g., importance in rendering or presenting the page), and the networks are ranked by performance (e.g., throughput). In order of importance, the objects are assigned to the best-performing network that is “available.” Some or all networks are initially available, and a given network becomes “unavailable” after it has been assigned its portion of the objects (e.g., based on content, number of objects, amount of data, percentage). If a total accumulated cost of delivering the objects exceeds a target before all objects have been allocated, the allocation process may terminate early and the remaining objects may be assigned to the least-expensive network.

    摘要翻译: 提供了一种系统,方法和装置,用于在内容页面(例如,网页)或其他内容集合(例如,一组页面)中分配或分配多个内容对象到不同的内容传送网络以响应于 内容请求。 对象按重要性排列(例如,在呈现或呈现页面中的重要性),并且通过性能(例如,吞吐量)对网络进行排名。 按照重要性的顺序,将对象分配给“可用”的性能最好的网络。一些或所有网络最初可用,并且给定网络在分配了其部分对象之后变为“不可用”(例如, 基于内容,对象数量,数据量,百分比)。 如果在分配所有对象之前传递对象的总累积成本超过目标,则分配过程可以提前终止,并且剩余的对象可以被分配给最便宜的网络。

    Managing a display of results of a keyword search on a web page by modifying attributes of DOM tree structure
    59.
    发明授权
    Managing a display of results of a keyword search on a web page by modifying attributes of DOM tree structure 有权
    通过修改DOM树结构的属性来管理网页上的关键字搜索结果的显示

    公开(公告)号:US09448979B2

    公开(公告)日:2016-09-20

    申请号:US13859866

    申请日:2013-04-10

    摘要: An approach is provided for managing a display of a keyword search result. The search for the keyword on a web page includes identifying first Document Object Model (DOM) element(s) including a subset of DOM element(s) that include the keyword. Based on preference(s), second DOM element(s) are identified, which are unrelated to the subset of DOM element(s). Based on the preference(s), styles of the first and second DOM element(s) are modified to generate a display of the search result that includes content of the web page specified by the first DOM element(s), and that (1) does not include other content of the web page specified by the second DOM element(s) or (2) emphasizes the content specified by the first DOM element(s) over the other content specified by the second DOM element(s), in accordance with the modified styles.

    摘要翻译: 提供了一种用于管理关键词搜索结果的显示的方法。 在网页上搜索关键字包括识别包括包含关键字的DOM元素的子集的第一文档对象模型(DOM)元素。 基于偏好,识别与DOM元素的子集无关的第二DOM元素。 基于偏好,修改第一和第二DOM元素的样式以生成包括由第一DOM元素指定的网页的内容的搜索结果的显示,并且(1 )不包括由第二DOM元素指定的网页的其他内容,或者(2)强调第一DOM元素指定的内容超过由第二DOM元素指定的另一内容,在 按照修改后的样式。

    Apparatus, method, and computer program product for searching structured document
    60.
    发明授权
    Apparatus, method, and computer program product for searching structured document 有权
    用于搜索结构化文档的装置,方法和计算机程序产品

    公开(公告)号:US09378301B2

    公开(公告)日:2016-06-28

    申请号:US12503439

    申请日:2009-07-15

    申请人: Masakazu Hattori

    发明人: Masakazu Hattori

    IPC分类号: G06F17/30 G06F17/27 G06F17/22

    摘要: A structured document searching apparatus that stores structured document data each including hierarchized elements stores a data stream in which the elements included in the structured document data are arranged in the order of the syntactic analysis result, and stores while at least one index stream in which the elements included in the structured document data and serving as an index in a structured document data search are arranged in the order of the syntactic analysis. The structured document searching apparatus creates a scanning plan that instructs the scanning of the data stream and the index stream, based on a search criterion for the structured document data search, and executes the scanning of at least either one of the data stream and the index stream instructed by the scanning plan.

    摘要翻译: 存储结构化文档数据的结构化文档搜索装置,每个结构化文档数据包括分级元素,存储其中包含在结构化文档数据中的元素按照句法分析结果的顺序排列的数据流,并存储至少一个索引流,其中 包含在结构化文档数据中并作为结构化文档数据搜索中的索引的元素按照句法分析的顺序排列。 结构化文档搜索装置基于结构化文档数据搜索的搜索条件创建指示数据流和索引流的扫描的扫描计划,并且执行数据流和索引中的至少一个的扫描 扫描计划指示的流。