Methods for obtaining improved text similarity measures which replace similar characters with a string pattern representation by using a semantic data tree
    1.
    发明授权
    Methods for obtaining improved text similarity measures which replace similar characters with a string pattern representation by using a semantic data tree 失效
    用于通过使用语义数据树获得用字符串模式表示替换相似字符的改进的文本相似性度量的方法

    公开(公告)号:US07945525B2

    公开(公告)日:2011-05-17

    申请号:US11937550

    申请日:2007-11-09

    IPC分类号: G06F17/00

    摘要: The embodiments of the invention provide methods for obtaining improved text similarity measures. More specifically, a method of measuring similarity between at least two electronic documents begins by identifying similar terms between the electronic documents. This includes basing similarity between the similar terms on patterns, wherein the patterns can include word patterns, letter patterns, numeric patterns, and/or alphanumeric patterns. The identifying of the similar terms also includes identifying multiple pattern types between the electronic documents. Moreover, the basing of the similarity on patterns identifies terms within the electronic documents that are within a category of a hierarchy. Specifically, the identifying of the terms reviews a hierarchical data tree, wherein nodes of the tree represent terms within the electronic documents. Lower nodes of the tree have specific terms; and, wherein higher nodes of the tree have general terms.

    摘要翻译: 本发明的实施例提供了用于获得改进的文本相似性度量的方法。 更具体地说,一种测量至少两个电子文档之间的相似性的方法,首先是识别电子文档之间的类似术语。 这包括在模式上的类似术语之间的基础相似性,其中模式可以包括字模式,字母模式,数字模式和/或字母数字模式。 类似术语的识别还包括识别电子文档之间的多种模式类型。 此外,模式上的相似性的基础确定电子文档内的层次结构类别内的术语。 具体地,术语的识别审查分层数据树,其中树的节点表示电子文档内的术语。 树的下层节点有特定的术语; 并且其中树的较高节点具有一般术语。

    Efficient discovery of keys in a database
    3.
    发明授权
    Efficient discovery of keys in a database 有权
    在数据库中有效发现密钥

    公开(公告)号:US08838648B2

    公开(公告)日:2014-09-16

    申请号:US11465155

    申请日:2006-08-17

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30306

    摘要: A method and system for discovering keys in a database. A minimal set of non-keys of the database are found. The database includes at least two entities and at least two attributes. The minimal set of non-keys includes at least two non-keys. Each entity independently includes a value of each attribute. A set of keys of the database is generated from the minimal set of non-keys. Each key of the generated set of keys independently is a unitary key consisting of one attribute or a composite key consisting of at least two attributes.

    摘要翻译: 用于发现数据库中的密钥的方法和系统。 找到数据库的一小组非密钥。 该数据库包括至少两个实体和至少两个属性。 最小的一组非键包括至少两个非键。 每个实体独立地包括每个属性的值。 数据库的一组键从最小的非键集合生成。 所生成的一组密钥的每个密钥独立地是由包括至少两个属性的一个属性或复合密钥组成的整体密钥。

    Invocation of web services from a database
    4.
    发明授权
    Invocation of web services from a database 有权
    从数据库调用Web服务

    公开(公告)号:US08166006B2

    公开(公告)日:2012-04-24

    申请号:US09968353

    申请日:2001-09-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30483

    摘要: The present invention provides a method, system and program product for integrating a service external to a database into a database such that the service may be easily invoked from the database. Preferably, the service is a web service available over the internet The service may be invoked from any of a number of invoking mechanisms of the database. In a first specific embodiment, the mechanism comprises a user-defined function within an SQL statement. In a second specific embodiment, the mechanism comprises a virtual table. In a third specific embodiment, the mechanism comprises a stored procedure. In a fourth specific embodiment, the mechanism comprises a trigger. In a fifth specific embodiment, the mechanism comprises a federated table accessed via a nickname and implemented using a wrapper.

    摘要翻译: 本发明提供了一种用于将数据库外部的服务集成到数据库中的方法,系统和程序产品,使得可以容易地从数据库调用该服务。 优选地,该服务是通过互联网可用的web服务。可以从数据库的多个调用机制中的任何一个调用该服务。 在第一具体实施例中,该机制包括SQL语句内的用户定义的函数。 在第二具体实施例中,机构包括虚拟表。 在第三具体实施例中,机构包括存储过程。 在第四具体实施例中,该机构包括触发器。 在第五具体实施例中,该机制包括通过昵称访问并且使用包装器实现的联合表。

    System and method for adaptive database caching
    5.
    发明授权
    System and method for adaptive database caching 有权
    自适应数据库缓存的系统和方法

    公开(公告)号:US07945577B2

    公开(公告)日:2011-05-17

    申请号:US12123101

    申请日:2008-05-19

    IPC分类号: G06F17/00

    CPC分类号: G06F17/3048 Y10S707/99933

    摘要: A local database cache enabling persistent, adaptive caching of either full or partial content of a remote database is provided. Content of tables comprising a local cache database is defined on per-table basis. A table is either: defined declaratively and populated in advance of query execution, or is determined dynamically and asynchronously populated on-demand during query execution. Based on a user input query originally issued against a remote DBMS and referential cache constraints between tables in a local database cache, a Janus query plan, comprising local, remote, and probe query portions is determined. A probe query portion of a Janus query plan is executed to determine whether up-to-date results can be delivered by the execution of a local query portion against a local database cache, or whether it is necessary to retrieve results from a remote database by executing a remote query portion of Janus query plan.

    摘要翻译: 提供本地数据库缓存,实现远程数据库的完整或部分内容的持久自适应缓存。 包含本地缓存数据库的表的内容基于每个表进行定义。 表是:在查询执行之前以声明方式和填充方式定义,或者在查询执行期间按需动态和异步地填充表。 基于最初针对远程DBMS发出的用户输入查询和本地数据库高速缓存中的表之间的参考高速缓存约束,确定包括本地,远程和探测查询部分的Janus查询计划。 执行Janus查询计划的探查查询部分以确定是否可以通过对本地数据库高速缓存执行本地查询部分来传递最新结果,或者是否需要通过以下方式从远程数据库检索结果: 执行Janus查询计划的远程查询部分。

    Discovering topical structures of databases
    6.
    发明授权
    Discovering topical structures of databases 有权
    发现数据库的主题结构

    公开(公告)号:US07818323B2

    公开(公告)日:2010-10-19

    申请号:US12034964

    申请日:2008-02-21

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30572 G06F17/30306

    摘要: A system and method for automatically discovering topical structures of databases includes a model builder adapted to compute various kinds of representations for the database based on schema information and data values of the database. A plurality of base clusterers is also provided, one for each representation. Each base clusterer is adapted to perform, for the representation, preliminary topical clustering of tables within the database to produce a plurality of clusters, such that each of the clusters corresponds to a set of tables on the same topic. A meta-clusterer aggregates results of the clusterers into a final clustering, such that the final clustering comprises a plurality of the clusters. A representative finder identifies representative tables from the clusters in the final clustering. The representative finder identifies at least one representative table for each of the clusters in the final clustering. The representative finder also arranges the representative tables by topic as a topical directory and outputs the topical directory.

    摘要翻译: 用于自动发现数据库的主题结构的系统和方法包括:模型构建器,其适于基于数据库的模式信息和数据值来计算数据库的各种表示。 还提供了多个基本聚类器,每个表示一个。 每个基本聚类器适于为表示执行数据库内的表的初步主题聚类以产生多个聚类,使得每个聚类对应于同一主题上的一组表。 元聚类器将聚类器的结果聚合成最终聚类,使得最终聚类包括多个聚类。 代表性查找器从最终聚类中的聚类中识别代表性表。 代表查找器识别最终聚类中每个聚类的至少一个代表表。 代表查找器还将主题的代表表排列为主题目录,并输出主题目录。

    System and method for adaptive database caching
    7.
    发明申请
    System and method for adaptive database caching 有权
    自适应数据库缓存的系统和方法

    公开(公告)号:US20060026154A1

    公开(公告)日:2006-02-02

    申请号:US10902583

    申请日:2004-07-30

    IPC分类号: G06F17/00

    CPC分类号: G06F17/3048 Y10S707/99933

    摘要: A local database cache enabling persistent, adaptive caching of either full or partial content of a remote database is provided. Content of tables comprising a local cache database is defined on per-table basis. A table is either: defined declaratively and populated in advance of query execution, or is determined dynamically and asynchronously populated on-demand during query execution. Based on a user input query originally issued against a remote DBMS and referential cache constraints between tables in a local database cache, a Janus query plan, comprising local, remote, and probe query portions is determined. A probe query portion of a Janus query plan is executed to determine whether up-to-date results can be delivered by the execution of a local query portion against a local database cache, or whether it is necessary to retrieve results from a remote database by executing a remote query portion of Janus query plan.

    摘要翻译: 提供本地数据库缓存,实现远程数据库的完整或部分内容的持久自适应缓存。 包含本地缓存数据库的表的内容基于每个表进行定义。 表是:在查询执行之前以声明方式和填充方式定义,或者在查询执行期间按需动态和异步地填充表。 基于最初针对远程DBMS发出的用户输入查询和本地数据库高速缓存中的表之间的参考高速缓存约束,确定包括本地,远程和探测查询部分的Janus查询计划。 执行Janus查询计划的探查查询部分以确定是否可以通过对本地数据库高速缓存执行本地查询部分来传递最新结果,或者是否需要通过以下方式从远程数据库检索结果: 执行Janus查询计划的远程查询部分。

    Scaling dynamic authority-based search using materialized subgraphs
    8.
    发明授权
    Scaling dynamic authority-based search using materialized subgraphs 有权
    使用实体化子图扩展基于权限的动态搜索

    公开(公告)号:US09171077B2

    公开(公告)日:2015-10-27

    申请号:US12394371

    申请日:2009-02-27

    IPC分类号: G06F7/00 G06F17/30

    摘要: According to one embodiment of the present invention, a method for processing a query is provided. The method includes generating a set of pre-computed materialized sub-graphs from a dataset and receiving a search query having one or more search query terms. A particular one of the pre-computed materialized sub-graphs is accessed and a dynamic authority-based keyword search is executed on the particular one of the pre-computed materialized sub-graphs. Nodes in the dataset are then retrieved based on the executing, and a response to the search query is provided which includes the retrieved nodes.

    摘要翻译: 根据本发明的一个实施例,提供了一种处理查询的方法。 该方法包括从数据集生成一组预先计算的物化子图并接收具有一个或多个搜索查询项的搜索查询。 访问预先计算的物化子图中的特定一个,并且对预先计算的物化子图中的特定一个执行基于动态权限的关键字搜索。 然后基于执行来检索数据集中的节点,并且提供对搜索查询的响应,其包括检索到的节点。

    System, method, and apparatus for multidimensional exploration of content items in a content store
    9.
    发明授权
    System, method, and apparatus for multidimensional exploration of content items in a content store 有权
    用于内容存储中的内容项的多维探索的系统,方法和装置

    公开(公告)号:US08560548B2

    公开(公告)日:2013-10-15

    申请号:US12544090

    申请日:2009-08-19

    IPC分类号: G06F7/00

    CPC分类号: G06F17/3071

    摘要: A computer-implemented method for accessing content items in a content store are described. In one embodiment, the computer-implemented method includes maintaining a text index of content items in a content store to enable a keyword search on the content items, receiving a query having a keyword and generating a hit list from the text index using the keyword, and extracting frequent phrases from text within content items of the hit list. The computer-implemented method also includes assigning a relative relevance to the frequent phrases and grouping content items into topics based on presence of relevant phrases within the content items of the hit list. The hit list includes one or more content items of the content store. The frequent phrases having a relatively high relevance are relevant phrases.

    摘要翻译: 描述用于访问内容存储中的内容项的计算机实现的方法。 在一个实施例中,计算机实现的方法包括维护内容商店中的内容项目的文本索引,以便能够对内容项进行关键字搜索,接收具有关键字的查询并使用该关键字从文本索引生成命中列表, 并从命中列表的内容项目中的文本中提取频繁的短语。 计算机实现的方法还包括分配与频繁短语相关的相关性,并且基于在命中列表的内容项中的相关短语的存在将内容项分组成主题。 命中列表包括内容存储的一个或多个内容项。 具有相对较高相关性的频繁短语是相关短语。

    Graph search system and method for querying loosely integrated data
    10.
    发明授权
    Graph search system and method for querying loosely integrated data 失效
    用于查询松散集成数据的图形搜索系统和方法

    公开(公告)号:US08326847B2

    公开(公告)日:2012-12-04

    申请号:US12053597

    申请日:2008-03-22

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30395 G06F17/30554

    摘要: A system, method and computer program product for executing a query on linked data sources. Embodiments of the invention generate an instance graph expressing relationships between objects in the linked data sources and receive a query including at least first and second search terms. The first search term is then executed on the instance graph and a summary graph is generated using the results of the executing step. A second search term is then executed on the summary graph.

    摘要翻译: 一种用于执行链接数据源查询的系统,方法和计算机程序产品。 本发明的实施例生成表示链接的数据源中的对象之间的关系并且接收包括至少第一和第二搜索项的查询的实例图。 然后在实例图上执行第一个搜索项,并使用执行步骤的结果生成汇总图。 然后在摘要图上执行第二个搜索项。