METHOD, TERMINAL DEVICE AND STORAGE MEDIUM FOR MINING ENTITY DESCRIPTION TAG

    公开(公告)号:US20190197166A1

    公开(公告)日:2019-06-27

    申请号:US16164619

    申请日:2018-10-18

    Abstract: The present disclosure provides a method, a terminal device and a storage medium for mining an entity description tag. The method includes: acquiring a group of one or more core words corresponding to each field and a first syntax dependent template corresponding to each core word; performing a matching on each data in a first data source by using the first syntax dependent template to determine a first description tag set in each field; performing a recognition on each data in a second data source to determine an entity set; determining a second description tag set based on a matching degree between each description tag in the description tag set of each field and each data in the second data source; and determining an entity description tag set based on a correlation between each entity in the entity set and each description tag in the second descriptive tag set.

    METHOD AND APPARATUS FOR VERIFYING MEDICAL FACT

    公开(公告)号:US20210217504A1

    公开(公告)日:2021-07-15

    申请号:US17023998

    申请日:2020-09-17

    Abstract: The present disclosure relates to the field of medical data processing based on natural language processing. Embodiments of the present disclosure disclose a method and apparatus for verifying a medical fact. The method may include: acquiring a description text of the medical fact; selecting a relevant paragraph related to the description text of the medical fact from a medical document; and inputting the description text of the medical fact and the corresponding relevant paragraph into a trained discrimination model for authenticity judgment, to obtain a verification result of the medical fact, the discrimination model being pre-trained based on a medical text paragraph pair extracted from the medical document, and being iteratively adjusted using a medical fact sample set including authenticity labeling information after the pre-training.

    METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM FOR EXTRACTING SPO TRIPLES

    公开(公告)号:US20210216819A1

    公开(公告)日:2021-07-15

    申请号:US17149267

    申请日:2021-01-14

    Abstract: A method and an apparatus for extracting SPO triples, an electronic device, and a storage medium are related to the field of artificial intelligence technologies. The solution may include: inputting annotated training data into each of multiple extraction models; predicting SPO triples satisfying defined relations in the annotated training data through each of multiple extraction models; combining the predicted SPO triples corresponding to each of multiple extraction models; extracting SPO triples satisfying screening conditions from the combined SPO triples; mining SPO triples with missing annotations from the annotated training data based on the SPO triples satisfying screening conditions, in response to that the SPO triples satisfying screening conditions do not satisfy output conditions; supplementing the SPO triples with missing annotations into the annotated training data; repeating the inputting, predicting, combining, extracting, mining and supplementing until the SPO triples satisfying screening conditions satisfy the output conditions.

    METHOD AND APPARATUS FOR PROCESSING INFORMATION

    公开(公告)号:US20210216725A1

    公开(公告)日:2021-07-15

    申请号:US17147881

    申请日:2021-01-13

    Abstract: A method and an apparatus for processing information are provided. The method can include: acquiring a word sequence obtained by performing word segmentation on two paragraphs in a text; inputting the word sequence into a to-be-trained natural language processing model to generate a word vector corresponding to a word in the word sequence; inputting the word vector into a preset processing layer of the to-be-trained natural language processing model; predicting whether the two paragraphs are adjacent, and a replaced word in the two paragraphs; and acquiring reference information of the two paragraphs, and training the to-be-trained natural language processing model to obtain a trained natural language processing model, based on the prediction result and the reference information.

    METHOD, APPARATUS AND DEVICE FOR GENERATING ENTITY RELATIONSHIP DATA, AND STORAGE MEDIUM

    公开(公告)号:US20200057788A1

    公开(公告)日:2020-02-20

    申请号:US16539796

    申请日:2019-08-13

    Abstract: Embodiments of the present disclosure provide a method, an apparatus and a device for generating entity relationship data, and a storage medium. The method includes: obtaining webpage source data corresponding to a target webpage; identifying at least one key value block from the webpage source data, wherein the key value block comprises at least one key value pair; identifying body values corresponding to the at least one key value block from the webpage source data; and generating entity relationship data corresponding to the target webpage according to the key value blocks and the body values corresponding to the key value blocks. With the technical solution the present disclosure, the webpage universality may be improved, labor cost may be reduced, and output quantity of the entity relationship data may be increased.

Patent Agency Ranking