Publication No.: US20220019744A1
Publication date: 2022-01-20
Application No.: US17319189
Filing date: 2021-05-13
Inventors: Fei YU, Jiji TANG, Weichong YIN, Yu SUN, Hao TIAN, Hua WU, Haifeng WANG
Abstract: A multi-modal pre-training model acquisition method, an electronic device, and a storage medium are disclosed, relating to the fields of deep learning and natural language processing. The method may include: determining, for each image-text pair used as training data, the to-be-processed fine-grained semantic words in the text; masking those fine-grained semantic words; and training the multi-modal pre-training model using the training data with the fine-grained semantic words masked.
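The masking step named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how the fine-grained semantic words are determined, so the sketch assumes they arrive as an already selected set. The function name `mask_fine_grained_words`, the `[MASK]` token, the whitespace tokenization, and the sample caption are all hypothetical.

```python
from typing import List, Set, Tuple

MASK_TOKEN = "[MASK]"  # assumption: the abstract does not name a mask token


def mask_fine_grained_words(
    tokens: List[str],
    fine_grained_words: Set[str],
    mask_token: str = MASK_TOKEN,
) -> Tuple[List[str], List[int]]:
    """Replace every token found in fine_grained_words with mask_token.

    Returns the masked token list and the indices that were masked,
    which a training loop could use as prediction targets.
    """
    masked: List[str] = []
    positions: List[int] = []
    for index, token in enumerate(tokens):
        # Case-insensitive match against the (lowercase) selected words.
        if token.lower() in fine_grained_words:
            masked.append(mask_token)
            positions.append(index)
        else:
            masked.append(token)
    return masked, positions


# Hypothetical image-text pair; the image is represented only by an ID here.
image_id = "img_0001"
caption = "A brown dog sits on the grass".split()
# Assumption: these words were already selected by some upstream step.
selected = {"brown", "dog", "on"}

masked_caption, masked_positions = mask_fine_grained_words(caption, selected)
print(image_id, masked_caption, masked_positions)
```

In a training setup, the masked caption would be paired with its image and the model would be asked to recover the original words at the returned positions; that training loop is outside the scope of this sketch.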