INFORMATION PROCESSING APPARATUS AND STORAGE MEDIUM

    公开(公告)号:US20240104128A1

    公开(公告)日:2024-03-28

    申请号:US18275134

    申请日:2021-02-03

    Inventor: Masafumi Oyamada

    CPC classification number: G06F16/36 G06F40/40

    Abstract: In order to make it possible to correctly merge dissimilarly expressed character strings, an information processing apparatus (1) includes: a data acquisition section (11) that acquires a data set including a plurality of character string pairs in each of which whether or not character strings therein indicate the same object is known; and a conversion pattern decision section (12) that decides, based on results of trials to convert each of the plurality of character string pairs included in the data set, a conversion pattern that heightens accuracy in determining whether or not the character string pair included in the data set indicates the same object.

    System and method for identifying cyberthreats from unstructured social media content

    公开(公告)号:US11934535B2

    公开(公告)日:2024-03-19

    申请号:US18169627

    申请日:2023-02-15

    CPC classification number: G06F21/577 G06F16/338 G06F16/355 G06F16/36

    Abstract: A cyberthreat detection system queries a content database for unstructured content that contains a set of keywords, clusters the unstructured content into clusters based on topics, and determines a cybersecurity cluster utilizing a list of vetted cybersecurity phrases. The set of keywords represents a target of interest such as a newly discovered cyberthreat, an entity, a brand, or a combination thereof. The cybersecurity cluster thus determined is composed of unstructured content that has the set of keywords as well as some percentage of the vetted cybersecurity phrases. If the size of the cybersecurity cluster, as compared to the amount of unstructured content queried from the content database, meets or exceeds a predetermined threshold, the query is saved as a new classifier rule that can then be used by a cybersecurity classifier to automatically, dynamically and timely identify the target of interest in unclassified unstructured content.

    Persistance and linking of analytic products in big data environments

    公开(公告)号:US11836193B2

    公开(公告)日:2023-12-05

    申请号:US16318066

    申请日:2017-07-14

    CPC classification number: G06F16/94 G06F16/36 G06F16/9024 G06F2216/11

    Abstract: Persistence and linking of analytic products is provided. Information regarding a plurality of analytic methods is collected. A first process node is generated in a network. The first process node corresponds to a first analytic method. Information is collected regarding a plurality of executions of the first analytic method. A plurality of session nodes is generated in the network corresponding to the plurality of executions. Each of the plurality of session nodes is linked to the first process node. Metadata regarding the plurality of executions is associated with the plurality of session nodes. At least one product node is generated corresponding to a product. The product integrates a result value of at least one of the plurality of executions. The at least one product node is linked to the session node of the plurality of session nodes corresponding to the at least one of the plurality of executions.

    Mapping of topics within a domain based on terms associated with the topics

    公开(公告)号:US11797593B2

    公开(公告)日:2023-10-24

    申请号:US17855685

    申请日:2022-06-30

    Applicant: Intuit Inc.

    Inventor: Bei Huang Nhung Ho

    CPC classification number: G06F16/35 G06F16/338 G06F16/3331 G06F16/34 G06F16/36

    Abstract: The invention relates to a method for mapping topics. The method includes obtaining terms, obtaining tokens from each term, and identifying a first and a second set of topics. Each of the topics represents one or more of the terms. The method further includes identifying first and second topic names for the first and the second sets of topics. For each topic, the tokens associated with the terms assigned to the topic are analyzed for relevance, and a token with a high relevance is selected as the topic name. The method also includes selecting one of the first and one of the second sets of topics to obtain first and second selected topics, determining, based on the one or more terms, a similarity value between each of the first and the second selected topics, and establishing a mapping between similar first and second selected topics.

    METHODS AND SYSTEMS FOR REUSE OF DATA ITEM FINGERPRINTS IN GENERATION OF SEMANTIC MAPS

    公开(公告)号:US20230334079A1

    公开(公告)日:2023-10-19

    申请号:US18138195

    申请日:2023-04-24

    Applicant: cortical.io AG

    CPC classification number: G06F16/36 G06F16/334

    Abstract: A method for using distributed representations of data items within a first set of data documents clustered in a first two-dimensional metric space to generate a cluster of distributed representations in a second two-dimensional metric space includes clustering in a first two-dimensional metric space, by a reference map generator, a set of data documents, generating a semantic map. A parser generates an enumeration of data items occurring in the set of data documents. A representation generator generates a distributed representation using occurrence information about each data item. A sparsifying module receives an identification of a maximum level of sparsity and reduces a total number of set bits within the distributed representation. The reference map generator clusters, in a second two-dimensional metric space, a set of SDRs retrieved from the SDR database and selected according to a second at least one criterion, generating a second semantic map.

Patent Agency Ranking