Patent search: ap:("International Business Machines Corporation") AND inv:"Kavitha Srinivas" (page 1)

1.

Published application
PERFORMING CODE COMPLETION USING DATA FLOW [Pending, published]

Publication No.: US20240126513A1

Publication date: 2024-04-18

Application No.: US17964961

Filing date: 2022-10-13

Applicant: International Business Machines Corporation

Inventors: Wenting Zhao, IBRAHIM ABDELAZIZ, Julian Timothy Dolby, Kavitha Srinivas

IPC classification: G06F8/33, G06F8/36, G06N5/04

CPC classification: G06F8/33, G06F8/36, G06N5/04

Abstract: A corpus of source code from a code database is accessed and a language prediction model is trained based on the corpus of source code. A given program is accessed and a completion of a given line of the given program is predicted by performing inferencing using the language prediction model and at least a portion of the given program. The given line is completed based upon the prediction.
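
Illustrative sketch (not part of the patent record): the train-then-complete flow in this abstract can be pictured with a toy bigram token model. The bigram model, the function names `train_bigram_model` and `complete_line`, and the sample corpus are all assumptions made for illustration; the abstract does not say which model is used, and the data-flow aspect named in the title is not modeled here.

```python
from collections import Counter, defaultdict


def train_bigram_model(corpus_lines):
    """Count, for every token, which tokens follow it in the corpus."""
    model = defaultdict(Counter)
    for line in corpus_lines:
        tokens = line.split()
        for prev, nxt in zip(tokens, tokens[1:]):
            model[prev][nxt] += 1
    return model


def complete_line(model, partial_line, max_tokens=5):
    """Greedily append the most frequent next token until none is known."""
    tokens = partial_line.split()
    for _ in range(max_tokens):
        if not tokens or tokens[-1] not in model:
            break
        next_token = model[tokens[-1]].most_common(1)[0][0]
        tokens.append(next_token)
    return " ".join(tokens)


# Hypothetical corpus standing in for the "code database".
corpus = [
    "import numpy as np",
    "import pandas as pd",
    "import numpy as np",
]
model = train_bigram_model(corpus)
completed = complete_line(model, "import numpy")
print(completed)
```

A real system would replace the bigram counts with a trained neural language model; only the overall shape (train on a corpus, infer on a partial program, fill in the line) is taken from the abstract.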

2.

Granted patent
Automatic domain annotation of structured data [In force]

Publication No.: US11954424B2

Publication date: 2024-04-09

Application No.: US17661619

Filing date: 2022-05-02

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Horst Cornelius Samulowitz, Kavitha Srinivas

IPC classification: G06F17/00, G06F16/245, G06F40/117, G06F40/169, G06F40/177, G06F40/20

CPC classification: G06F40/169, G06F16/245, G06F40/117, G06F40/177, G06F40/20

Abstract: A processor may receive structured data. The structured data may include one or more columns and associated column names. The processor may analyze the structured data. Analyzing the structured data may include gathering a requisite set of keywords from the associated column names across all columns and/or a sample of column cells. The processor may access a corpus of documents. Each of the documents in the corpus may be associated with a respective keyword. The processor may search the corpus of documents based on the requisite set of keywords. The processor may summarize one or more documents associated with the requisite set of keywords.
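
Illustrative sketch (not part of the patent record): the keyword-gathering, corpus-search, and summarization steps could look roughly like the following. The helper names, the keyword-to-document dictionary, and the "first sentence" summary are assumptions chosen to keep the example small; the abstract does not specify how keywords are tokenized or how documents are summarized.

```python
import re


def gather_keywords(column_names):
    """Split column names on non-letters and collect lower-cased keywords."""
    keywords = set()
    for name in column_names:
        for part in re.split(r"[^A-Za-z]+", name):
            if part:
                keywords.add(part.lower())
    return keywords


def search_corpus(corpus, keywords):
    """Return the documents whose associated keyword was gathered."""
    return {k: corpus[k] for k in sorted(keywords) if k in corpus}


def summarize(text):
    """Stand-in summarizer: keep only the first sentence."""
    return text.split(". ")[0].rstrip(".") + "."


# Hypothetical structured data (column names only) and keyword-indexed corpus.
columns = ["patient_id", "diagnosis_code", "admission date"]
corpus = {
    "diagnosis": "A diagnosis identifies a disease. It is recorded by a clinician.",
    "admission": "An admission is a hospital intake event. It has a date.",
    "invoice": "An invoice requests payment. It lists line items.",
}

matches = search_corpus(corpus, gather_keywords(columns))
summaries = {keyword: summarize(doc) for keyword, doc in matches.items()}
print(summaries)
```

The summaries for the matched keywords suggest a domain for the table (here, hospital records), which is the annotation the title refers to.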

3.

Granted patent
Question answering approach to semantic parsing of mathematical formulas [In force]

Publication No.: US11663251B2

Publication date: 2023-05-30

Application No.: US17447126

Filing date: 2021-09-08

Applicant: International Business Machines Corporation

Inventors: William Karol Lynch, Kavitha Srinivas, Horst Cornelius Samulowitz, Fabio Lorenzi

IPC classification: G06F16/335, G06V30/416, G06F40/205, G06F16/332

CPC classification: G06F16/3329, G06F16/335, G06V30/416, G06F40/205

Abstract: A method, system, and computer program product are disclosed. The method includes extracting at least one identifier from a formula in a document and extracting text passages in the document that contain the identifier(s). The method also includes selecting an identifier and extracted text passages containing the identifier, as well as generating identifier-passage pairs for the selected text passages and the identifier. Further, the method includes submitting the identifier-passage pairs to a question answering (QA) model, which generates candidate answers from the selected text passages. A definition of the identifier is then selected from the candidate answers.
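
Illustrative sketch (not part of the patent record): the pipeline in this abstract (extract identifiers, collect passages, form identifier-passage pairs, query a QA model, select a definition) can be outlined as below. The regex-based `toy_qa_model` is a hypothetical stand-in for a real QA model, single letters are assumed to be the identifiers, and "take the first candidate" is an assumed selection rule.

```python
import re


def extract_identifiers(formula):
    """Treat every distinct letter in the formula as an identifier."""
    return sorted(set(re.findall(r"[A-Za-z]", formula)))


def toy_qa_model(identifier, passage):
    """Hypothetical QA stand-in: match '<id> is/denotes/for the ...'."""
    pattern = rf"\b{re.escape(identifier)}\b (?:is|denotes|for) (the [a-z ]+)"
    match = re.search(pattern, passage)
    return match.group(1).strip() if match else None


def define_identifiers(formula, document, qa_model):
    """Build identifier-passage pairs and pick a definition per identifier."""
    sentences = [s.strip() for s in re.split(r"(?<=\.)\s+", document) if s.strip()]
    definitions = {}
    for ident in extract_identifiers(formula):
        # Passages are the sentences that mention the identifier as a token.
        passages = [s for s in sentences
                    if re.search(rf"\b{re.escape(ident)}\b", s)]
        pairs = [(ident, passage) for passage in passages]
        candidates = [a for a in (qa_model(i, p) for i, p in pairs) if a]
        if candidates:
            definitions[ident] = candidates[0]  # assumed selection rule
    return definitions


document = (
    "Here E is the energy of the body. "
    "The symbol m denotes the mass. "
    "We write c for the speed of light."
)
definitions = define_identifiers("E = m c^2", document, toy_qa_model)
print(definitions)
```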

4.

Granted patent
Knowledge aided feature engineering [In force]

Publication No.: US11599826B2

Publication date: 2023-03-07

Application No.: US16741084

Filing date: 2020-01-13

Applicant: International Business Machines Corporation

Inventors: Udayan Khurana, Sainyam Galhotra, Oktie Hassanzadeh, Kavitha Srinivas, Horst Cornelius Samulowitz

IPC classification: G06N20/00, G06F11/34

Abstract: Embodiments relate to a system, program product, and method for employing feature engineering to improve classifier performance. A first machine learning (ML) model with a first learning program is selected. The first selected ML model is operatively associated with a first structured dataset. First features in the first dataset directed at performance of the selected ML model are identified. A second structured dataset is assessed with respect to the identified features in the first dataset, and new features in the second dataset are identified, where the new features are semantically related to the identified features in the first dataset. The first dataset is dynamically augmented with the identified new features in the second dataset. The dynamically augmented first dataset is applied to the selected ML model to subject an embedded learning algorithm of the selected ML model to training using the augmented first dataset.

5.

Granted patent
Storing graph data in a relational database [In force]

Publication No.: US10387496B2

Publication date: 2019-08-20

Application No.: US14718147

Filing date: 2015-05-21

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Achille B. Fokoue-Nkoutche, Gang Hu, Anastasios Kementsietsidis, Kavitha Srinivas, Wen B. Sun, Guo Tong Xie

IPC classification: G06F16/22, G06F16/901, G06F16/28, G06F16/2452

Abstract: Embodiments include methods, systems and computer program products for storing graph data for a directed graph in a relational database. Aspects include creating a plurality of relational tables for the graph data, using a processor on a computer, the plurality of relational tables including adjacency tables and attribute tables. Each row of the attribute tables is dedicated to a subject of the graph data in the dataset and stores a JavaScript Object Notation (JSON) object corresponding to the subject. Each row of the adjacency tables includes a hashtable containing properties and values of the subject for that row.
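
Illustrative sketch (not part of the patent record): one way to picture the two table kinds is a SQLite database with one attribute row and one adjacency row per subject. The table and column names, the use of a JSON-serialized dictionary as the per-row "hashtable", and the split of predicates into literal-valued and edge-valued sets are all assumptions; the patent's actual table layout is not given in the abstract.

```python
import json
import sqlite3


def store_graph(conn, triples, literal_predicates):
    """Load (subject, predicate, object) triples into two per-subject tables."""
    attrs, edges = {}, {}
    for subject, predicate, obj in triples:
        target = attrs if predicate in literal_predicates else edges
        target.setdefault(subject, {}).setdefault(predicate, []).append(obj)

    # Attribute table: one JSON object of literal attributes per subject.
    conn.execute("CREATE TABLE attribute (subject TEXT PRIMARY KEY, doc TEXT)")
    # Adjacency table: one serialized property -> targets map per subject.
    conn.execute("CREATE TABLE adjacency (subject TEXT PRIMARY KEY, props TEXT)")
    conn.executemany("INSERT INTO attribute VALUES (?, ?)",
                     [(s, json.dumps(d)) for s, d in attrs.items()])
    conn.executemany("INSERT INTO adjacency VALUES (?, ?)",
                     [(s, json.dumps(d)) for s, d in edges.items()])


def neighbors(conn, subject, predicate):
    """Follow outgoing edges of one predicate from a subject."""
    row = conn.execute("SELECT props FROM adjacency WHERE subject = ?",
                       (subject,)).fetchone()
    return json.loads(row[0]).get(predicate, []) if row else []


# Hypothetical directed graph.
triples = [
    ("alice", "knows", "bob"),
    ("alice", "knows", "carol"),
    ("alice", "name", "Alice"),
    ("bob", "name", "Bob"),
]
conn = sqlite3.connect(":memory:")
store_graph(conn, triples, literal_predicates={"name"})
alice_neighbors = neighbors(conn, "alice", "knows")
alice_doc = json.loads(conn.execute(
    "SELECT doc FROM attribute WHERE subject = 'alice'").fetchone()[0])
print(alice_neighbors, alice_doc)
```

Keeping all edges of a subject in a single row means a lookup such as `neighbors` touches one row instead of scanning a triple table.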

6.

Granted patent
Finding optimal query plans [In force]

Publication No.: US09471635B2

Publication date: 2016-10-18

Application No.: US14993272

Filing date: 2016-01-12

Applicant: International Business Machines Corporation

Inventors: Mihaela A. Bornea, Julian Dolby, Achille B. Fokoue-Nkoutche, Anastasios Kementsietsidis, Kavitha Srinivas

IPC classification: G06F17/30, G06F7/00

CPC classification: G06F17/30469, G06F17/30442, G06F17/30477, G06F17/3053, G06F17/30935, G06F17/30958

Abstract: Systems and methods for optimizing a query, and more particularly, systems and methods for finding optimal plans for graph queries by casting the task of finding the optimal plan as an integer programming (ILP) problem. A method for optimizing a query, comprises building a data structure for a query, the data structure including a plurality of components, wherein each of the plurality of components corresponds to at least one graph pattern, determining a plurality of flows of query variables between the plurality of components, and determining a combination of the plurality of flows between the plurality of components that results in a minimum cost to execute the query.

7.

Patent application
Method and Apparatus for Identifying the Optimal Schema to Store Graph Data in a Relational Store [Pending, published]

Publication No.: US20160203236A1

Publication date: 2016-07-14

Application No.: US15078931

Filing date: 2016-03-23

Applicant: International Business Machines Corporation

Inventors: Mihaela Ancuta Bornea, Julian Timothy Dolby, Achille Belly Fokoue-Nkoutche, Anastasios Kementsietsidis, Kavitha Srinivas

IPC classification: G06F17/30

CPC classification: G06F16/9024, G06F16/211

Abstract: A system for identifying a schema for storing graph data includes a database containing a graph dataset of data and relationships between data pairs and a list of storage methods that each are a distinct structural arrangement of the data and relationships from the graph data set. An analyzer module collects statistics for the graph dataset, and a data classification module uses the collected statistics to calculate metrics describing the data and relationships in the graph dataset, uses the calculated metrics to group the data and relationships into a plurality of graph dataset subsets and associates each graph dataset subset with one of the plurality of storage methods. The resulting group of storage methods associated with the plurality of graph dataset subsets includes a unique storage method for each graph dataset subset. The data and relationships in each graph dataset subset are arranged in accordance with associated storage methods.

8.

Patent application
Method and Apparatus for Storing Sparse Graph Data as Multi-Dimensional Cluster [In force]

Publication No.: US20150052134A1

Publication date: 2015-02-19

Application No.: US13967261

Filing date: 2013-08-14

Applicant: International Business Machines Corporation

Inventors: Mihaela Ancuta Bornea, Julian Timothy Dolby, Achille Belly Fokoue-Nkoutche, Anastasios Kementsietsidis, Kavitha Srinivas

IPC classification: G06F17/30

CPC classification: G06F17/30592, G06F17/30312, G06F17/30339, G06F17/30424, G06F17/30598, G06F17/30958

Abstract: A system for storing graph data as a multi-dimensional cluster having a database with a graph dataset containing data and relationships between data pairs and a schema list of storage methods that use a table with columns and rows associated with data or relationships. An analyzer module to collect statistics of a graph dataset and a dimension identification module to identify a plurality of dimensions that each represent a column in the table. A schema creation and loading module creates a modified storage method and having a plurality of distinct table blocks and a plurality of table block indexes, one index for each table block and arranges the data and relationships in the given graph dataset in accordance with the modified storage method to create the multi-dimensional cluster.

9.

Patent application
Scalable Ontology Extraction [Lapsed]

Publication No.: US20130024406A1

Publication date: 2013-01-24

Application No.: US13625931

Filing date: 2012-09-25

Applicant: International Business Machines Corporation

Inventors: Achille Fokoue, Aditya Kalyanpur, Kavitha Srinivas

IPC classification: G06F15/18

CPC classification: G06N5/025, G06F19/00

Abstract: Techniques for facilitating learning of one or more ontological rules of a resource description framework database are provided. The techniques include obtaining ontology vocabulary from a resource description framework database, generating a rule hypothesis by incrementally building upon a previously learnt rule from the database by adding one or more predicates to the previously learnt rule, performing a constraint check on the generated rule hypothesis by determining compatibility with each previously learnt rule to ensure that a complete rule set including each previously learnt rule and the generated rule hypothesis is consistent, validating the rule hypothesis as a rule using one or more association rule mining techniques to determine validity of the rule hypothesis against the database, and applying the rule to the database to infer one or more facts from the database to facilitate learning of one or more additional ontological rules.
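
Illustrative sketch (not part of the patent record): the validation and inference steps of this abstract can be pictured with the support and confidence measures familiar from association rule mining. The rule shape (`body_pred(x, y)` implies `head_pred(x, y)`), the 0.6 confidence threshold, and the sample triples are assumptions; the incremental hypothesis construction and the consistency check described in the abstract are not modeled.

```python
def rule_support_confidence(triples, body_pred, head_pred):
    """Support and confidence of the rule body_pred(x, y) => head_pred(x, y)."""
    body = {(s, o) for s, p, o in triples if p == body_pred}
    head = {(s, o) for s, p, o in triples if p == head_pred}
    support = len(body & head)
    confidence = support / len(body) if body else 0.0
    return support, confidence


def apply_rule(triples, body_pred, head_pred):
    """Infer head facts implied by the rule that are not yet in the database."""
    known = set(triples)
    inferred = {(s, head_pred, o) for s, p, o in triples if p == body_pred}
    return inferred - known


# Hypothetical RDF-style triples.
triples = [
    ("ann", "motherOf", "bob"),
    ("ann", "parentOf", "bob"),
    ("eve", "motherOf", "dan"),
    ("eve", "parentOf", "dan"),
    ("sue", "motherOf", "tom"),
]

support, confidence = rule_support_confidence(triples, "motherOf", "parentOf")
new_facts = set()
if confidence >= 0.6:  # assumed acceptance threshold
    new_facts = apply_rule(triples, "motherOf", "parentOf")
print(support, confidence, new_facts)
```

The inferred facts are added back to the database, which is how a validated rule can make further rules learnable.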

10.

Granted patent
Reducing semantic errors in code generated by machine learning models [In force]

Publication No.: US11681510B2

Publication date: 2023-06-20

Application No.: US17935149

Filing date: 2022-09-26

Applicant: International Business Machines Corporation

Inventors: Julian Timothy Dolby, Martin Hirzel, Kiran A Kate, Louis Mandel, Avraham Ever Shinnar, Kavitha Srinivas

IPC classification: G06F9/44, G06F8/41, G06N3/08, G06F11/36, G06F8/35, G06N3/044

CPC classification: G06F8/436, G06F8/35, G06F11/3608, G06N3/08, G06N3/044

Abstract: Embodiments are disclosed for a method. The method includes identifying a prefix updated by a searcher of a machine learning model. The machine learning model is configured to generate source code in a programming language. The method also includes determining whether the prefix violates a semantic correctness property of the programming language. Additionally, the method includes instructing the searcher, in response to the determination, to prune the prefix from a set of prefixes under consideration by the searcher.
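
Illustrative sketch (not part of the patent record): the check-then-prune step can be pictured with a toy semantic property, "every variable is assigned before it is used". The property, the statement format (`name = expression` with space-separated tokens), and the function names are assumptions for illustration; the abstract does not name a specific property or searcher.

```python
def violates_def_before_use(prefix):
    """Return True if any statement reads a variable not yet assigned."""
    defined = set()
    for statement in prefix:
        target, _, expression = statement.partition("=")
        for token in expression.split():
            if token.isidentifier() and token not in defined:
                return True
        defined.add(target.strip())
    return False


def prune(prefixes, violates):
    """Drop every candidate prefix that violates the semantic property."""
    return [prefix for prefix in prefixes if not violates(prefix)]


# Hypothetical candidate prefixes held by a beam-style searcher.
candidates = [
    ["x = 1", "y = x + 1"],   # fine: x is assigned before use
    ["x = 1", "y = w + 1"],   # violates: w is never assigned
]
kept = prune(candidates, violates_def_before_use)
print(kept)
```

Pruning at the prefix level means the searcher spends no further effort extending a partial program that is already invalid.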
