-
Publication No.: US11003731B2
Publication date: 2021-05-11
Application No.: US16247023
Filing date: 2019-01-14
Inventor: Jiepeng Zheng, Miao Yu, Renkai Yang, Yilin Zhang, Jialin Wu
IPC: G06F16/9538, G06F16/35, G06F16/9535
Abstract: Embodiments of a method and apparatus for generating information are provided. An embodiment of the method can include: extracting search data in a preset period; determining target search statements from the search data; determining attributes of the entities in the target search statements; and, for each entity involved in the target search statements, clustering the target search statements that include the entity according to the attributes in those statements, and determining a target attribute of the entity based on the sum of the number of searches for the target search statements in each clustered group. The embodiment can achieve flexible information generation.
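The grouping step in this abstract can be illustrated with a minimal sketch. It assumes each target search statement has already been reduced to an (entity, attribute, search count) triple, that clustering means grouping by identical attribute, and that the target attribute is the group with the largest summed count. None of these choices is stated in the abstract, and the function name `target_attributes` is hypothetical.

```python
from collections import defaultdict


def target_attributes(search_log):
    """Pick one attribute per entity from (entity, attribute, count) triples.

    Assumption: statements are grouped by exact attribute match, and the
    group with the highest total search count wins.
    """
    totals = defaultdict(lambda: defaultdict(int))
    for entity, attribute, count in search_log:
        # Sum the number of searches inside each (entity, attribute) group.
        totals[entity][attribute] += count
    return {
        entity: max(groups, key=groups.get)
        for entity, groups in totals.items()
    }


# Toy search data, invented for illustration only.
log = [
    ("Everest", "height", 120),
    ("Everest", "location", 40),
    ("Everest", "height", 30),
    ("Paris", "population", 15),
    ("Paris", "weather", 60),
]
print(target_attributes(log))
```

A real system would likely cluster paraphrased attributes together rather than require exact matches; exact matching is used here only to keep the sketch short.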
-
2.
Publication No.: US11372942B2
Publication date: 2022-06-28
Application No.: US16691017
Filing date: 2019-11-21
Inventor: Miao Yu, Xinwei Feng, Huanyu Zhou, Xunchao Song, Songtai Dai
IPC: G06F16/9536, G06F16/9032, G06F16/903, G06F40/295
Abstract: Embodiments of the present disclosure provide a method, apparatus, computer device, and storage medium for verifying community question answer data. The method may include: acquiring a community question answer data set, and generating a plurality of question answer pairs based on the community question answer data set, a question answer pair including: a question, and a to-be-verified answer corresponding to the question; generating an authoritative data set based on data stored in at least one confidence source site; and performing an authority verification on the to-be-verified answer, based on a score of a similarity between the to-be-verified answer and authoritative data in the authoritative data set in at least one dimension.
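As a rough illustration of the verification step, the sketch below scores a to-be-verified answer against authoritative entries in a single dimension. It assumes token-level Jaccard similarity and a fixed acceptance threshold; the abstract names neither, and the helpers `jaccard` and `verify` are hypothetical.

```python
def jaccard(a, b):
    """Token-set Jaccard similarity between two strings (assumed metric)."""
    tokens_a = set(a.lower().split())
    tokens_b = set(b.lower().split())
    if not tokens_a or not tokens_b:
        return 0.0
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def verify(answer, authoritative, threshold=0.5):
    """Return (passed, best_score) for an answer against authoritative data.

    Assumption: the answer passes when its best similarity to any
    authoritative entry reaches the threshold.
    """
    best = max((jaccard(answer, ref) for ref in authoritative), default=0.0)
    return best >= threshold, best


# Toy authoritative data set, invented for illustration only.
trusted = ["water boils at 100 degrees celsius at sea level"]
print(verify("water boils at 100 degrees celsius", trusted))
print(verify("the moon is made of cheese", trusted))
```

Lexical overlap cannot catch an answer that reuses the right words with a wrong value, which may be why the abstract speaks of similarity "in at least one dimension"; additional dimensions would need their own scoring functions.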
-
Publication No.: US11216618B2
Publication date: 2022-01-04
Application No.: US16538589
Filing date: 2019-08-12
Inventor: Xinwei Feng, Xunchao Song, Miao Yu, Huanyu Zhou, Shaoshun Kang
IPC: G06F17/00, G06F40/30, G06F16/33, G06F40/295
Abstract: Embodiments of the present disclosure provide a query processing method and an apparatus, a server and a storage medium. The method includes: determining a word vector representation of a query sequence and an entity vector representation of the query sequence respectively based on respective words and respective entities included in the query sequence; determining a word vector representation of a paragraph and an entity vector representation of the paragraph respectively based on respective words and respective entities included in the paragraph; and determining a similarity between the query sequence and the paragraph according to the word vector representation of the query sequence, the entity vector representation of the query sequence, the word vector representation of the paragraph, and the entity vector representation of the paragraph.
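One way to picture the similarity computation is the sketch below. It assumes that each representation is the mean of per-token embeddings, that word-level and entity-level similarities are cosine scores, and that the two are blended with a fixed weight. The abstract does not specify any of this, and the lookup tables, `mean_vector`, `cosine`, and `similarity` are all hypothetical.

```python
import math


def mean_vector(tokens, table, dim):
    """Average the embeddings of known tokens; zero vector if none are known."""
    vecs = [table[t] for t in tokens if t in table]
    if not vecs:
        return [0.0] * dim
    return [sum(v[i] for v in vecs) / len(vecs) for i in range(dim)]


def cosine(u, v):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(x * x for x in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(u, v)) / (norm_u * norm_v)


def similarity(q_words, q_entities, p_words, p_entities,
               word_table, entity_table, dim=2, entity_weight=0.5):
    """Blend word-level and entity-level similarity (assumed weighting)."""
    word_sim = cosine(mean_vector(q_words, word_table, dim),
                      mean_vector(p_words, word_table, dim))
    entity_sim = cosine(mean_vector(q_entities, entity_table, dim),
                        mean_vector(p_entities, entity_table, dim))
    return (1 - entity_weight) * word_sim + entity_weight * entity_sim


# Toy 2-dimensional embeddings, invented for illustration only.
word_table = {"capital": [1.0, 0.0], "france": [0.0, 1.0]}
entity_table = {"France": [1.0, 0.0], "Nile": [0.0, 1.0]}

same = similarity(["capital", "france"], ["France"],
                  ["capital", "france"], ["France"],
                  word_table, entity_table)
different_entity = similarity(["capital", "france"], ["France"],
                              ["capital", "france"], ["Nile"],
                              word_table, entity_table)
print(same, different_entity)
```

The second call shows the point of a separate entity representation: identical wording with a different entity lowers the blended score.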
-
Publication No.: US20210390428A1
Publication date: 2021-12-16
Application No.: US17119651
Filing date: 2020-12-11
Inventor: Hongjian Shi, Wenbin Jiang, Xinwei Feng, Miao Yu, Huanyu Zhou, Meng Tian, Xueqian Wu, Xunchao Song
Abstract: The present disclosure discloses a method, apparatus, device, and storage medium for training a model, and relates to the technical fields of knowledge graphs, natural language processing, and deep learning. The method may include: acquiring a first annotation data set, the first annotation data set including sample data and an annotation classification result corresponding to the sample data; training a preset initial classification model based on the first annotation data set to obtain an intermediate model; performing prediction on the sample data in the first annotation data set using the intermediate model to obtain a prediction classification result corresponding to the sample data; generating a second annotation data set based on the sample data, the corresponding annotation classification result, and the corresponding prediction classification result; and training the intermediate model based on the second annotation data set to obtain a classification model.
-
Publication No.: US11775776B2
Publication date: 2023-10-03
Application No.: US17147881
Filing date: 2021-01-13
Inventor: Shuangjie Li, Miao Yu, Yabing Shi, Xuefeng Hao, Xunchao Song, Ye Jiang, Yang Zhang, Yong Zhu
IPC: G06F40/40, G06N20/00, G06F40/289
CPC classification number: G06F40/40, G06F40/289, G06N20/00
Abstract: A method and an apparatus for processing information are provided. The method can include: acquiring a word sequence obtained by performing word segmentation on two paragraphs in a text; inputting the word sequence into a to-be-trained natural language processing model to generate a word vector corresponding to a word in the word sequence; inputting the word vector into a preset processing layer of the to-be-trained natural language processing model; predicting whether the two paragraphs are adjacent, and a replaced word in the two paragraphs; and acquiring reference information of the two paragraphs, and training the to-be-trained natural language processing model to obtain a trained natural language processing model, based on the prediction result and the reference information.
-
Publication No.: US12175379B2
Publication date: 2024-12-24
Application No.: US17119651
Filing date: 2020-12-11
Inventor: Hongjian Shi, Wenbin Jiang, Xinwei Feng, Miao Yu, Huanyu Zhou, Meng Tian, Xueqian Wu, Xunchao Song
Abstract: The present disclosure discloses a method, apparatus, device, and storage medium for training a model, and relates to the technical fields of knowledge graphs, natural language processing, and deep learning. The method may include: acquiring a first annotation data set, the first annotation data set including sample data and an annotation classification result corresponding to the sample data; training a preset initial classification model based on the first annotation data set to obtain an intermediate model; performing prediction on the sample data in the first annotation data set using the intermediate model to obtain a prediction classification result corresponding to the sample data; generating a second annotation data set based on the sample data, the corresponding annotation classification result, and the corresponding prediction classification result; and training the intermediate model based on the second annotation data set to obtain a classification model.
-
7.
Publication No.: US11366819B2
Publication date: 2022-06-21
Application No.: US16812062
Filing date: 2020-03-06
Inventor: Songtai Dai, Xinwei Feng, Miao Yu, Huanyu Zhou, Xunchao Song, Pengcheng Yuan
IPC: G06F16/2457, G06N20/00, G09B7/02
Abstract: A method for obtaining an answer to a question is provided. The method may include: acquiring a question; determining at least a part of articles in a preset article database as candidate articles, and determining first scores of the candidate articles respectively, the first score of any of the candidate articles representing a matching degree between the candidate article and the question; determining at least a part of texts in each of the candidate articles as candidate texts, and determining second scores of the candidate texts respectively, the second score of any of the candidate texts representing a matching degree between the candidate text and the question; and determining at least a part of the candidate texts as the answer based on a score set of each of the candidate texts, the score set of any of the candidate texts including the second score and the first score.
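The two-stage scoring described here can be sketched as follows. The sketch assumes that both the first score (article versus question) and the second score (text versus question) are simple token-overlap ratios, that candidate texts are the paragraphs of the top-ranked articles, and that the score set is combined by addition. The abstract leaves all of these open, and `tokens`, `overlap`, and `answer` are hypothetical helpers.

```python
import re


def tokens(text):
    """Lowercased word tokens with punctuation removed."""
    return set(re.findall(r"\w+", text.lower()))


def overlap(question, text):
    """Fraction of question tokens that also appear in the text (assumed score)."""
    q = tokens(question)
    return len(q & tokens(text)) / len(q) if q else 0.0


def answer(question, articles, top_articles=2):
    """Return the best paragraph, or None if there are no candidates.

    articles maps a title to a list of paragraph strings.
    """
    # Stage 1: first score for each article, then keep the best few.
    first = {title: overlap(question, " ".join(paras))
             for title, paras in articles.items()}
    candidates = sorted(first, key=first.get, reverse=True)[:top_articles]

    # Stage 2: second score for each paragraph of each candidate article.
    scored = []
    for title in candidates:
        for para in articles[title]:
            second = overlap(question, para)
            # Assumption: combine the score set by simple addition.
            scored.append((first[title] + second, para))
    return max(scored, key=lambda item: item[0])[1] if scored else None


# Toy article database, invented for illustration only.
articles = {
    "france": ["Paris is the capital of France.", "France is in Europe."],
    "egypt": ["Cairo is the capital of Egypt."],
}
print(answer("What is the capital of France?", articles))
```

Carrying the article-level score into the final ranking lets a paragraph from a strongly matching article outrank a similar paragraph from a weaker one.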
-
8.
Publication No.: US11669690B2
Publication date: 2023-06-06
Application No.: US17149226
Filing date: 2021-01-14
Inventor: Songtai Dai, Xinwei Feng, Miao Yu, Huanyu Zhou, Xunchao Song, Pengcheng Yuan
IPC: G06F17/00, G06F40/30, G06F16/35, G06F40/295
CPC classification number: G06F40/30, G06F16/35, G06F40/295
Abstract: A method for processing a semantic description of a text entity is proposed. The method includes: acquiring a plurality of target texts containing a main entity, and extracting related entities describing the main entity from each target text; acquiring a sub-relation vector of a pair of the main entity and each related entity in each target text; calculating a similarity distance of the main entity between different target texts based on the sub-relation vector; and determining a semantic similarity of the main entity as described in different target texts based on the similarity distance.
-
Publication No.: US20210209309A1
Publication date: 2021-07-08
Application No.: US17212511
Filing date: 2021-03-25
Inventor: Meng Tian, Miao Yu, Wenbin Jiang, Xinwei Feng, Huanyu Zhou, Pengcheng Yuan, Xunchao Song, Xueqian Wu, Hongjian Shi
IPC: G06F40/30, G06F40/205, G06F40/242
Abstract: The disclosure discloses a semantics processing method, a semantics processing apparatus, an electronic device, and a medium, and relates to a field of knowledge graph technologies. The detailed implementation includes: determining a target semantic element rule matching a text to be parsed, and parsing the text to be parsed by employing the target semantic element rule to obtain a semantic element parsing result; generating a semantic tree based on the semantic element parsing result by employing a target structured rule associated with the target semantic element rule; and performing semantic understanding on the text to be parsed based on the semantic tree.
-
10.
Publication No.: US20200250248A1
Publication date: 2020-08-06
Application No.: US16691017
Filing date: 2019-11-21
Inventor: Miao Yu, Xinwei Feng, Huanyu Zhou, Xunchao Song, Songtai Dai
IPC: G06F16/9536, G06F16/9032, G06F16/903, G06F17/27
Abstract: Embodiments of the present disclosure provide a method, apparatus, computer device, and storage medium for verifying community question answer data. The method may include: acquiring a community question answer data set, and generating a plurality of question answer pairs based on the community question answer data set, a question answer pair including: a question, and a to-be-verified answer corresponding to the question; generating an authoritative data set based on data stored in at least one confidence source site; and performing an authority verification on the to-be-verified answer, based on a score of a similarity between the to-be-verified answer and authoritative data in the authoritative data set in at least one dimension.
-