Patent search ap:("BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO. Page LTD.") AND inv:"Zhen Li"

1.

发明授权
Method, apparatus, electronic device and storage medium for training semantic similarity model 有权

公开(公告)号：US12118063B2

公开(公告)日：2024-10-15

申请号：US17209051

申请日：2021-03-22

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Zhen Li , Yukun Li , Yu Sun

IPC: G06F40/30 , G06F18/214 , G06F18/2413

CPC classification number: G06F18/2148 , G06F18/24147 , G06F40/30

Abstract: The present disclosure provides a method, apparatus, electronic device and storage medium for training a semantic similarity model, which relates to the field of artificial intelligence. A specific implementation solution is as follows: obtaining a target field to be used by a semantic similarity model to be trained; calculating respective correlations between the target field and application fields corresponding to each of training datasets in known multiple training datasets; training the semantic similarity model with the training datasets in turn, according to the respective correlations between the target field and the application fields corresponding to each of the training datasets. According to the technical solution of the present disclosure, it is possible to, in the fine-tuning phase, more purposefully train the semantic similarity model with the training datasets with reference to the correlations between the target field and the application fields corresponding to the training datasets, thereby effectively improving the learning capability of the sematic similarity model and effectively improving the accuracy of the trained semantic similarity model.

2.

发明申请
METHOD, APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM FOR TRAINING SEMANTIC SIMILARITY MODEL 有权

公开(公告)号：US20230004753A9

公开(公告)日：2023-01-05

申请号：US17209051

申请日：2021-03-22

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO.,LTD.

Inventor： Zhen Li , Yukun Li , Yu Sun

IPC: G06K9/62 , G06F40/30

Abstract: The present disclosure provides a method, apparatus, electronic device and storage medium for training a semantic similarity model, which relates to the field of artificial intelligence. A specific implementation solution is as follows: obtaining a target field to be used by a semantic similarity model to be trained; calculating respective correlations between the target field and application fields corresponding to each of training datasets in known multiple training datasets; training the semantic similarity model with the training datasets in turn, according to the respective correlations between the target field and the application fields corresponding to each of the training datasets. According to the technical solution of the present disclosure, it is possible to, in the fine-tuning phase, more purposefully train the semantic similarity model with the training datasets with reference to the correlations between the target field and the application fields corresponding to the training datasets, thereby effectively improving the learning capability of the sematic similarity model and effectively improving the accuracy of the trained semantic similarity model.

3.

发明授权
Method and apparatus for obtaining word vectors based on language model, device and storage medium 有权

公开(公告)号：US11526668B2

公开(公告)日：2022-12-13

申请号：US17095955

申请日：2020-11-12

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Zhen Li , Yukun Li , Yu Sun

IPC: G06F40/279 , G06F16/9032 , G06N20/00 , G06F40/205 , G06K9/62

Abstract: A method and apparatus for obtaining word vectors based on a language model, a device and a storage medium are disclosed, which relates to the field of natural language processing technologies in artificial intelligence. An implementation includes inputting each of at least two first sample text language materials into the language model, and outputting a context vector of a first word mask in each first sample text language material via the language model; determining the word vector corresponding to each first word mask based on a first word vector parameter matrix, a second word vector parameter matrix and a fully connected matrix respectively; and training the language model and the fully connected matrix based on the word vectors corresponding to the first word masks in the at least two first sample text language materials, so as to obtain the word vectors.

4.

发明授权
Method for training language model based on various word vectors, device and medium 有权

公开(公告)号：US11556715B2

公开(公告)日：2023-01-17

申请号：US16951702

申请日：2020-11-18

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Zhen Li , Yukun Li , Yu Sun

IPC: G06F40/30 , G06N20/00 , G06F40/279

Abstract: A method for training a language model based on various word vectors, a device and a medium, which relate to the field of natural language processing technologies in artificial intelligence, are disclosed. An implementation includes inputting a first sample text language material including a first word mask into the language model, and outputting a context vector of the first word mask via the language model; acquiring a first probability distribution matrix of the first word mask based on the context vector of the first word mask and a first word vector parameter matrix, and a second probability distribution matrix of the first word mask based on the context vector of the first word mask and a second word vector parameter matrix; and training the language model based on a word vector corresponding to the first word mask.

5.

发明申请
METHOD, APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM FOR TRAINING SEMANTIC SIMILARITY MODEL 有权

公开(公告)号：US20220300763A1

公开(公告)日：2022-09-22

申请号：US17209051

申请日：2021-03-22

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO.,LTD.

Inventor： Zhen Li , Yukun Li , Yu Sun

IPC: G06K9/62 , G06F40/30

Abstract: The present disclosure provides a method, apparatus, electronic device and storage medium for training a semantic similarity model, which relates to the field of artificial intelligence. A specific implementation solution is as follows: obtaining a target field to be used by a semantic similarity model to be trained; calculating respective correlations between the target field and application fields corresponding to each of training datasets in known multiple training datasets; training the semantic similarity model with the training datasets in turn, according to the respective correlations between the target field and the application fields corresponding to each of the training datasets. According to the technical solution of the present disclosure, it is possible to, in the fine-tuning phase, more purposefully train the semantic similarity model with the training datasets with reference to the correlations between the target field and the application fields corresponding to the training datasets, thereby effectively improving the learning capability of the sematic similarity model and effectively improving the accuracy of the trained semantic similarity model.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification