Method and apparatus for training semantic representation model, device and computer storage medium

    公开(公告)号:US11914964B2

    公开(公告)日:2024-02-27

    申请号:US17209124

    申请日:2021-03-22

    CPC classification number: G06F40/30 G06N5/04 G06N20/00

    Abstract: The present application discloses a method and apparatus for training a semantic representation model, a device and a computer storage medium, which relates to the field of natural language processing technologies in artificial intelligence. An implementation includes: acquiring a semantic representation model which has been trained for a first language as a first semantic representation model; taking a bottom layer and a top layer of the first semantic representation model as trained layers, initializing the trained layers, keeping model parameters of other layers unchanged, and training the trained layers using training language materials of a second language until a training ending condition is met; successively bringing the untrained layers into the trained layers from bottom to top, and executing these layers respectively: keeping the model parameters of other layers than the trained layers unchanged, and training the trained layers using the training language materials of the second language until the training ending condition is met respectively; and obtaining a semantic representation model for the second language after all the layers are trained.

    Method, apparatus, electronic device and readable storage medium for translation

    公开(公告)号:US11574135B2

    公开(公告)日:2023-02-07

    申请号:US16861750

    申请日:2020-04-29

    Abstract: The present disclosure provides a method, apparatus, electronic device and readable storage medium for translation and relates to translation technologies. In the embodiments of the present disclosure, the at least one knowledge element is obtained according to associated information of content to be translated, and respective knowledge element in the at least one knowledge element comprise an element of the first language type and an element of the second language type so that the at least one knowledge element can be used to obtain a translation result of the content to be translated. Since the at least one knowledge element obtained in advance is taken as global information of the translation task of this time, it can be ensured that the translation result of the same content to be translated is consistent, thereby improving the quality of the translation result.

    Method and apparatus for generating dialogue model

    公开(公告)号:US11537798B2

    公开(公告)日:2022-12-27

    申请号:US16895297

    申请日:2020-06-08

    Abstract: Embodiments of the present disclosure relate to a method and apparatus for generating a dialogue model. The method may include: acquiring a corpus sample set, a corpus sample including input information and target response information; classifying corpus samples in the corpus sample set, setting discrete hidden variables for the corpus samples based on a classification result to generate a training sample set, a training sample including the input information, the target response information, and a discrete hidden variable; and training a preset neural network using the training sample set to obtain the dialogue model, the dialogue model being used to represent a corresponding relationship between inputted input information and outputted target response information.

    METHOD AND APPARATUS FOR TRAINING SEMANTIC REPRESENTATION MODEL, DEVICE AND COMPUTER STORAGE MEDIUM

    公开(公告)号:US20220004716A1

    公开(公告)日:2022-01-06

    申请号:US17209124

    申请日:2021-03-22

    Abstract: The present application discloses a method and apparatus for training a semantic representation model, a device and a computer storage medium, which relates to the field of natural language processing technologies in artificial intelligence. An implementation includes: acquiring a semantic representation model which has been trained for a first language as a first semantic representation model; taking a bottom layer and a top layer of the first semantic representation model as trained layers, initializing the trained layers, keeping model parameters of other layers unchanged, and training the trained layers using training language materials of a second language until a training ending condition is met; successively bringing the untrained layers into the trained layers from bottom to top, and executing these layers respectively: keeping the model parameters of other layers than the trained layers unchanged, and training the trained layers using the training language materials of the second language until the training ending condition is met respectively; and obtaining a semantic representation model for the second language after all the layers are trained.

    MACHINE TRANSLATION MODEL TRAINING METHOD, APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20210200963A1

    公开(公告)日:2021-07-01

    申请号:US17200588

    申请日:2021-03-12

    Abstract: The present disclosure provides a machine translation model training method, apparatus, electronic device and storage medium, which relates to the technical field of natural language processing. A specific implementation solution is as follows: selecting, from parallel corpuses, a set of samples whose translation quality satisfies a preset requirement and which have universal-field features and/or target-field features, to constitute a first training sample set; selecting, from the parallel corpuses, a set of samples whose translation quality satisfies a preset requirement and which do not have universal-field features and target-field features, to constitute a second training sample set; training an encoder in the machine translation model in the target field, a discriminator configured in encoding layers of the encoder, and the encoder and a decoder in the machine translation model in the target field in turn with the first training sample set and second training sample set, respectively. The training method according to the present disclosure is time-saving and effort-saving, and may effectively improve the training efficiency of the machine translation model in the target field.

    METHOD, APPARATUS, ELECTRONIC DEVICE AND READABLE STORAGE MEDIUM FOR TRANSLATION

    公开(公告)号:US20210192151A1

    公开(公告)日:2021-06-24

    申请号:US16861750

    申请日:2020-04-29

    Abstract: The present disclosure provides a method, apparatus, electronic device and readable storage medium for translation and relates to translation technologies. In the embodiments of the present disclosure, the at least one knowledge element is obtained according to associated information of content to be translated, and respective knowledge element in the at least one knowledge element comprise an element of the first language type and an element of the second language type so that the at least one knowledge element can be used to obtain a translation result of the content to be translated. Since the at least one knowledge element obtained in advance is taken as global information of the translation task of this time, it can be ensured that the translation result of the same content to be translated is consistent, thereby improving the quality of the translation result.

Patent Agency Ranking