METHOD AND APPARATUS FOR MINING ENTITY FOCUS IN TEXT

    公开(公告)号:US20210216715A1

    公开(公告)日:2021-07-15

    申请号:US17023915

    申请日:2020-09-17

    Abstract: A method for mining an entity focus in a text may include: performing word and phrase feature extraction on an input text; inputting an extracted word and phrase feature into a text coding network for coding, to obtain a coding sequence of the input text; processing the coding sequence of the input text using a core entity labeling network to predict a position of a core entity in the input text; extracting a subsequence corresponding to the core entity in the input text from the coding sequence of the input text, based on the position of the core entity in the input text; and predicting a position of a focus corresponding to the core entity in the input text using a focus labeling network, based on the coding sequence of the input text and the subsequence corresponding to the core entity in the input text.

    METHOD AND APPARATUS FOR LABELING CORE ENTITY, AND ELECTRONIC DEVICE

    公开(公告)号:US20210216712A1

    公开(公告)日:2021-07-15

    申请号:US17149185

    申请日:2021-01-14

    Abstract: A method and an apparatus for labelling a core entity, and a related electronic device are proposed. A character vector sequence, a first word vector sequence and an entity vector sequence corresponding to a target text are obtained by performing character vector mapping, word vector mapping and entity vector mapping are performed on the target text, to obtain a target vector sequence corresponding to the target text. A first probability that each character of the target text is a starting character of a core entity and a second probability that each character of the target text is an ending character of a core entity are determined by encoding and decoding the target vector sequence. One or more core entities of the target text are determined based on the first probability and the second probability.

    METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM FOR EXTRACTING SPO TRIPLES

    公开(公告)号:US20210216819A1

    公开(公告)日:2021-07-15

    申请号:US17149267

    申请日:2021-01-14

    Abstract: A method and an apparatus for extracting SPO triples, an electronic device, and a storage medium are related to the field of artificial intelligence technologies. The solution may include: inputting annotated training data into each of multiple extraction models; predicting SPO triples satisfying defined relations in the annotated training data through each of multiple extraction models; combining the predicted SPO triples corresponding to each of multiple extraction models; extracting SPO triples satisfying screening conditions from the combined SPO triples; mining SPO triples with missing annotations from the annotated training data based on the SPO triples satisfying screening conditions, in response to that the SPO triples satisfying screening conditions do not satisfy output conditions; supplementing the SPO triples with missing annotations into the annotated training data; repeating the inputting, predicting, combining, extracting, mining and supplementing until the SPO triples satisfying screening conditions satisfy the output conditions.

    METHOD AND APPARATUS FOR PROCESSING INFORMATION

    公开(公告)号:US20210216725A1

    公开(公告)日:2021-07-15

    申请号:US17147881

    申请日:2021-01-13

    Abstract: A method and an apparatus for processing information are provided. The method can include: acquiring a word sequence obtained by performing word segmentation on two paragraphs in a text; inputting the word sequence into a to-be-trained natural language processing model to generate a word vector corresponding to a word in the word sequence; inputting the word vector into a preset processing layer of the to-be-trained natural language processing model; predicting whether the two paragraphs are adjacent, and a replaced word in the two paragraphs; and acquiring reference information of the two paragraphs, and training the to-be-trained natural language processing model to obtain a trained natural language processing model, based on the prediction result and the reference information.

Patent Agency Ranking