Multi-layer graph-based categorization

    公开(公告)号:US12223264B2

    公开(公告)日:2025-02-11

    申请号:US18154601

    申请日:2023-01-13

    Abstract: A method may include a obtaining a first data model instance comprising an identifier string and. a set of attributes associated with a set of attribute name strings. The method may include obtaining an ontology graph that includes a first label, a second label, and an association between them. The method may include using a prediction model to select the first label based on the first data model instance and determining the second label based on the relationship. The method may include determining a selected set of labels that includes the first label and the second label to associate with the first data model instance. The method may include associating the selected set of labels with the first data model instance in a dataset that includes a plurality of records, where each record is associated with a different data model instance.

    Ontology customization for indexing digital content

    公开(公告)号:US12222974B2

    公开(公告)日:2025-02-11

    申请号:US17853086

    申请日:2022-06-29

    Abstract: A method for automatically classifying terms of a first ontology into categories of a classification scheme defined with respect to a second ontology includes generating, for each term in the first ontology and each term in the second ontology, an embedding encoding the term and a description of the term. The method further includes adding the generated embeddings to a transformer model and computing, for each pair of the embeddings consisting of a first term from the first ontology and a second term from the second ontology, a similarity metric quantifying a similarity of the first term and the second term. The method still further provides for determining a matching scheme based on the similarity metric computed with respect to each pair of the embeddings, where the matching scheme associates term of the first ontology with one or more relevant categories of the classification scheme defined with respect to the second ontology. The method further provides for returning the one or more relevant categories of the classification scheme that are matched, by the determined matching scheme, to a term of the second ontology received as an input.

    Correlate multiple notes in a database using bi-directional link

    公开(公告)号:US12198075B2

    公开(公告)日:2025-01-14

    申请号:US17136216

    申请日:2020-12-29

    Abstract: Disclosed are a method and device for calculating a correlation between notes using a database constructed on a basis of artificial intelligence, and supporting a service for the notes on a basis of the calculated correlation. A method by which a note providing device that interworks with a user terminal provides notes, includes: constructing a keyword DB by extracting a keyword from a note generated through the user terminal and reflecting a weight calculated through machine learning using the extracted keyword; and calculating a correlation score for each of a plurality of target notes correlated with a reference note using the keyword DB. Therefore, the method and device for providing the notes using the artificial intelligence-based correlation calculation can more accurately recommend the correlated notes by reflecting the interaction of the user.

    Methods and systems for reuse of data item fingerprints in generation of semantic maps

    公开(公告)号:US12197485B2

    公开(公告)日:2025-01-14

    申请号:US18138195

    申请日:2023-04-24

    Applicant: cortical.io AG

    Abstract: A method for using distributed representations of data items within a first set of data documents clustered in a first two-dimensional metric space to generate a cluster of distributed representations in a second two-dimensional metric space includes clustering in a first two-dimensional metric space, by a reference map generator, a set of data documents, generating a semantic map. A parser generates an enumeration of data items occurring in the set of data documents. A representation generator generates a distributed representation using occurrence information about each data item. A sparsifying module receives an identification of a maximum level of sparsity and reduces a total number of set bits within the distributed representation. The reference map generator clusters, in a second two-dimensional metric space, a set of SDRs retrieved from the SDR database and selected according to a second at least one criterion, generating a second semantic map.

    SYSTEM AND METHOD FOR DATA MANAGEMENT

    公开(公告)号:US20240419714A1

    公开(公告)日:2024-12-19

    申请号:US18695441

    申请日:2022-09-26

    Abstract: A system and method for data management is provided. The method includes obtaining a dataset from a data source by a processing unit. The dataset includes a plurality of datapoints, and each of the datapoints belongs to a column among a plurality of columns. Further, an ontology label for at least one column in the dataset is predicted using a machine learning model. The predicted ontology label is associated with an ontology comprising a plurality of ontology labels. Further, a mapping between the dataset and the ontology is generated based on the relation between the predicted ontology label and the column. Furthermore, the datapoints are classified with respect to the ontology labels based on the mapping generated. The classified datasets are outputted on a user interface.

    System and method for querying a data repository

    公开(公告)号:US12169524B2

    公开(公告)日:2024-12-17

    申请号:US18226156

    申请日:2023-07-25

    Abstract: The present disclosure relates to methods and systems for querying data in a data repository. According to a first aspect, this disclosure describes a method of querying a database, comprising: receiving, at a computing device, a plurality of keywords; determining, by the computer device, a plurality of datasets relating to the keywords; identifying, by the computer device, metadata for the plurality of datasets indicating a relationship between the datasets by examining an ontology associated with the datasets; providing, by the computer device, one or more suggested database queries in natural language form, the one or more suggested database queries constructed based on the plurality of keywords and the metadata; receiving, by the computing device, a selection of the one or more suggested database queries; and constructing, by the computer device, an object view for the plurality of datasets based on the selected query and the metadata.

    PROVIDING DATA FROM A DIRECTED GRAPH TO A LANGUAGE MODEL

    公开(公告)号:US20240411787A1

    公开(公告)日:2024-12-12

    申请号:US18632864

    申请日:2024-04-11

    Applicant: SAP SE

    Abstract: A method, a system and a computer program for providing data from a directed graph to a language model are provided. The method comprises defining a plurality of conditions and a plurality of patterns, wherein each of the conditions has at least one corresponding pattern. The method further comprises receiving a subset of the directed graph, wherein the subset of the directed graph includes a plurality of statements, wherein each of the statements includes a subject, an object and a predicate relating the subject to the object. The method further comprises for each of the statements in the subset of the directed graph, performing the following: when one of the conditions matches a respective statement and the pattern corresponding to the condition can be applied to the respective statement, computing a string from the respective statement using the pattern.

Patent Agency Ranking