Method and Apparatus for Self Optimizing Data Selection
    2.
    发明申请
    Method and Apparatus for Self Optimizing Data Selection 有权
    自动优化数据选择的方法和装置

    公开(公告)号:US20120030218A1

    公开(公告)日:2012-02-02

    申请号:US12845254

    申请日:2010-07-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30985

    摘要: A method, system, and article for improving performance of a Boolean combination of at least two filters to a data stream. Stream processing is applied to an expression having to or more logical operators. As the data stream is processed, efficiency of the operators in the expression is evaluated. A sort algorithm is dynamically invoked to ensure that a more efficient operator precedes processing of a less efficient operator.

    摘要翻译: 一种用于提高至少两个过滤器到数据流的布尔组合的性能的方法,系统和文章。 流处理应用于具有或多个逻辑运算符的表达式。 随着数据流的处理,表达式中运算符的效率得到评估。 动态调用排序算法,以确保更高效的运算符在较低效率的运算符的处理之前进行。

    SIMPLIFYING COMPLEX DATA STREAM PROBLEMS INVOLVING FEATURE EXTRACTION FROM NOISY DATA
    3.
    发明申请
    SIMPLIFYING COMPLEX DATA STREAM PROBLEMS INVOLVING FEATURE EXTRACTION FROM NOISY DATA 失效
    简化涉及从噪声数据中提取特征的复杂数据流问题

    公开(公告)号:US20100011192A1

    公开(公告)日:2010-01-14

    申请号:US12171053

    申请日:2008-07-10

    IPC分类号: G06F9/06

    摘要: Methods, systems and computer program products for simplifying complex data stream problems involving feature extraction from noisy data. Exemplary embodiments include a method for processing a data stream, including applying multiple operators to the data stream, wherein an operation by each of the multiple operators includes retrieving the next chunk for each of set of input parameters, performing digital processing operations on a respective next chunk, producing sets of output parameters and adding data to one or more internal data stores, each internal data store acting as a data stream source.

    摘要翻译: 方法,系统和计算机程序产品,用于简化从嘈杂数据中涉及特征提取的复杂数据流问题。 示例性实施例包括用于处理数据流的方法,包括将多个运算符应用于数据流,其中由多个运算符中的每个运算符进行的运算包括为每组输入参数检索下一个块,在相应的下一个 块,产生输出参数集合并将数据添加到一个或多个内部数据存储器,每个内部数据存储器充当数据流源。

    METHOD, COMPUTER PROGRAM PRODUCT, AND DEVICE FOR CONDUCTING A MULTI-CRITERIA SIMILARITY SEARCH
    4.
    发明申请
    METHOD, COMPUTER PROGRAM PRODUCT, AND DEVICE FOR CONDUCTING A MULTI-CRITERIA SIMILARITY SEARCH 审中-公开
    方法,计算机程序产品和用于执行多标准类似搜索的设备

    公开(公告)号:US20080133496A1

    公开(公告)日:2008-06-05

    申请号:US11565748

    申请日:2006-12-01

    IPC分类号: G06F17/30

    摘要: Similarities among multiple near-neighbor objects are searched for based on multiple criteria. A query is received for an object closest to an object provided by a user, and weights are assigned by a user to distance functions among the multiple objects at the time of the query. Each distance function represents a different criterion. The weighted average is calculated for the distance functions, and the closest object to the query object based on the weighted average for the distance functions.

    摘要翻译: 基于多个标准来搜索多个近邻对象之间的相似性。 对于最接近用户提供的对象的对象,接收到查询,并且在查询时由用户将权重分配给多个对象之间的距离函数。 每个距离函数代表不同的标准。 根据距离函数的加权平均值计算距离函数的加权平均值,以及与查询对象最接近的对象。

    System and Method for Identifying Similar Molecules
    5.
    发明申请
    System and Method for Identifying Similar Molecules 有权
    识别相似分子的系统和方法

    公开(公告)号:US20080004810A1

    公开(公告)日:2008-01-03

    申请号:US11428147

    申请日:2006-06-30

    IPC分类号: G06F19/00

    CPC分类号: G06F19/705 G06F19/708

    摘要: A vectorization process is employed in which chemical identifier strings are converted into respective vectors. These vectors may then be searched to identify molecules that are identical or similar to each other. The dimensions of the vector space can be defined by sequences of symbols that make up the chemical identifier strings. The International Chemical Identifier (InChI) string defined by the International Union of Pure and Applied Chemistry (IUPAC) is particularly well suited for these methods.

    摘要翻译: 采用向量化过程,其中化学品标识符串被转换成各自的向量。 然后可以搜索这些载体以鉴别彼此相同或相似的分子。 向量空间的维度可以由构成化学标识符串的符号序列来定义。 由国际纯粹和应用化学联合会(IUPAC)定义的国际化学标识符(InChI)字符串特别适用于这些方法。

    Failure recovery and error correction techniques for data loading in information warehouses
    6.
    发明授权
    Failure recovery and error correction techniques for data loading in information warehouses 有权
    信息仓库中数据加载的故障恢复和纠错技术

    公开(公告)号:US09218377B2

    公开(公告)日:2015-12-22

    申请号:US12134065

    申请日:2008-06-05

    IPC分类号: G06F17/30 G06F11/14 G06F7/00

    摘要: A method of data loading for large information warehouses includes performing checkpointing concurrently with data loading into an information warehouse, the checkpointing ensuring consistency among multiple tables; and recovering from a failure in the data loading using the checkpointing. A method is also disclosed for performing versioning concurrently with data loading into an information warehouse. The versioning method enables processing undo and redo operations of the data loading between a later version and a previous version. Data load failure recovery is performed without starting a data load from the beginning but rather from a latest checkpoint for data loading at an information warehouse level using a checkpoint process characterized by a state transition diagram having a multiplicity of states; and tracking state transitions among the states using a system state table.

    摘要翻译: 大型信息仓库的数据加载方法包括:将数据加载到信息仓库中同时进行检查点检查,确保多个表格之间的一致性; 并使用检查点从数据加载失败中恢复。 还公开了一种与数据加载到信息仓库中同时进行版本控制的方法。 版本控制方法可以处理在更高版本和先前版本之间的数据加载的撤消和重做操作。 执行数据加载失败恢复,而不从一开始就开始数据加载,而是从最新检查点开始,使用特征在于具有多个状态的状态转换图的检查点进程在信息仓库级别进行数据加载。 并使用系统状态表来跟踪状态之间的状态转换。

    System and method for identifying similar molecules
    7.
    发明授权
    System and method for identifying similar molecules 有权
    识别类似分子的系统和方法

    公开(公告)号:US08515684B2

    公开(公告)日:2013-08-20

    申请号:US13333408

    申请日:2011-12-21

    IPC分类号: G01N33/50

    CPC分类号: G06F19/705 G06F19/708

    摘要: A vectorization process is employed in which chemical identifier strings are converted into respective vectors. These vectors may then be searched to identify molecules that are identical or similar to each other. The dimensions of the vector space can be defined by sequences of symbols that make up the chemical identifier strings. The International Chemical Identifier (InChI) string defined by the International Union of Pure and Applied Chemistry (IUPAC) is particularly well suited for these methods.

    摘要翻译: 采用向量化过程,其中化学品标识符串被转换成各自的向量。 然后可以搜索这些载体以鉴别彼此相同或相似的分子。 向量空间的维度可以由构成化学标识符串的符号序列来定义。 由国际纯粹和应用化学联合会(IUPAC)定义的国际化学标识符(InChI)字符串特别适用于这些方法。

    System, method, and apparatus for managing price information with locking mechanisms
    8.
    发明授权
    System, method, and apparatus for managing price information with locking mechanisms 失效
    用锁定机制管理价格信息的系统,方法和装置

    公开(公告)号:US08244554B2

    公开(公告)日:2012-08-14

    申请号:US12629698

    申请日:2009-12-02

    IPC分类号: G06Q99/00

    摘要: A computer-implemented method for managing price information. Embodiments include receiving a mapping of interconnected components, identifying as a first subset components subject to a first fixed price agreement not subject to a second fixed price agreement that overlaps the first fixed price agreement, identifying as a second subset the components subject to the second fixed price agreement not subject to the first fixed price agreement, and identifying as a third subset the components subject to both the first fixed price agreement and the second fixed price agreement. The method also includes receiving a price change for a price associated with a component in one of the subsets of components, and distributing an offset of the price change to components in the other subsets of components.

    摘要翻译: 一种用于管理价格信息的计算机实现方法。 实施例包括接收互连组件的映射,识别为不受第一固定价格协议重叠的第二固定价格协议的第一固定价格协议的第一子集组件,将第二固定价格协议识别为第二固定价格协议的组件 价格协议不受第一个固定价格协议的约束,并将第一个固定价格协议和第二个固定价格协议的组件确定为第三个子集。 该方法还包括接收与组件中的一个子集中的组件相关联的价格的价格变化,以及将价格变动的偏移分布到组件的其他子集中的组件。

    Methodologies and analytics tools for identifying white space opportunities in a given industry
    9.
    发明授权
    Methodologies and analytics tools for identifying white space opportunities in a given industry 有权
    用于识别给定行业中的空白机会的方法和分析工具

    公开(公告)号:US08060505B2

    公开(公告)日:2011-11-15

    申请号:US11674598

    申请日:2007-02-13

    IPC分类号: G06F7/00 G06F17/30

    摘要: A method for analyzing predefined subject matter in a patent database being for use with a set of target patents, each target patent related to the predefined subject matter, the method comprising: creating a feature space based on frequently occurring terms found in the set of target patents; creating a partition taxonomy based on a clustered configuration of the feature space; editing the partition taxonomy using domain expertise to produce an edited partition taxonomy; creating a classification taxonomy based on structured features present in the edited partition taxonomy; creating a contingency table by comparing the edited partition taxonomy and the classification taxonomy to provide entries in the contingency table; and identifying all significant relationships in the contingency table to help determine the presence of any white space.

    摘要翻译: 一种用于分析专利数据库中预定主题的方法,用于与一组目标专利一起使用,每个目标专利与预定义的主题相关,所述方法包括:基于在目标集合中发现的经常出现的项来创建特征空间 专利; 基于特征空间的集群配置创建分区分类; 使用领域专业知识编辑分区分类,以产生编辑的分区分类; 根据编辑的分区分类中存在的结构化特征创建分类分类; 通过比较编辑的分区分类法和分类分类法来创建应急表,以提供应急表中的条目; 并确定应急表中的所有重要关系,以帮助确定任何空白的存在。

    Simplifying complex data stream problems involving feature extraction from noisy data
    10.
    发明授权
    Simplifying complex data stream problems involving feature extraction from noisy data 有权
    简化从噪声数据中涉及特征提取的复杂数据流问题

    公开(公告)号:US07805445B2

    公开(公告)日:2010-09-28

    申请号:US12171678

    申请日:2008-07-11

    IPC分类号: G06F7/00 G06F17/30

    摘要: Methods, systems and computer program products for simplifying complex data stream problems involving feature extraction from noisy data. Exemplary embodiments include a method for processing a data stream, including applying multiple operators to the data stream, wherein an operation by each of the multiple operators includes retrieving the next chunk for each of set of input parameters, performing digital processing operations on a respective next chunk, producing sets of output parameters and adding data to one or more internal data stores, each internal data store acting as a data stream source.

    摘要翻译: 方法,系统和计算机程序产品,用于简化从嘈杂数据中涉及特征提取的复杂数据流问题。 示例性实施例包括用于处理数据流的方法,包括将多个运算符应用于数据流,其中由多个运算符中的每个运算符进行的运算包括为每组输入参数检索下一个块,在相应的下一个 块,产生输出参数集合并将数据添加到一个或多个内部数据存储器,每个内部数据存储器充当数据流源。