Method, apparatus, device, and storage medium for linking entity

    公开(公告)号:US11727216B2

    公开(公告)日:2023-08-15

    申请号:US17117553

    申请日:2020-12-10

    CPC classification number: G06F40/30 G06F17/16 G06F40/295

    Abstract: A method, apparatus, device, and storage medium for linking an entity, relates to the technical fields of knowledge graph and deep learning are provided. The method may include: acquiring a target text; determining at least one entity mention included in the target text and a candidate entity corresponding to each entity mention; determining an embedding vector of each candidate entity based on the each candidate entity and a preset entity embedding vector determination model; determining context semantic information of the target text based on the target text and each embedding vector; determining type information of the at least one entity mention; and determining an entity linking result of the at least one entity mention, based on the each embedding vector, the context semantic information, and each type information.

    Method, Device, Equipment, and Storage Medium for Mining Topic Concept

    公开(公告)号:US20210326535A1

    公开(公告)日:2021-10-21

    申请号:US17036609

    申请日:2020-09-29

    Abstract: The present disclosure provides a method, a device, an equipment and a storage medium for mining a topic concept. The method includes: acquiring a plurality of candidate topic concepts based on a query; performing word segmentation on the plurality of candidate topic concepts and performing part-of-speech tagging on words obtained after performing the word segmentation, to obtain a part-of-speech sequence of each of the plurality of candidate topic concepts; and filtering the plurality of candidate topic concepts based on the part-of-speech sequence, to filter out a topic concept corresponding to a target part-of-speech sequence among the plurality of candidate topic concepts, in which a proportion of accurate topic concepts in the target part-of-speech sequence is lower than or equal to a first preset threshold, or a proportion of inaccurate topic concepts in the target part-of-speech sequence is higher than or equal to a second preset threshold.

Patent Agency Ranking