Publication number: US11928432B2
Publication date: 2024-03-12
Application number: US17319189
Filing date: 2021-05-13
Inventors: Fei Yu, Jiji Tang, Weichong Yin, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang
CPC classification numbers: G06F40/284, G06F40/30, G06N5/04, G06N20/00, G06V10/811, G06V20/30
Abstract: A multi-modal pre-training model acquisition method, an electronic device, and a storage medium are disclosed, relating to the fields of deep learning and natural language processing. The method may include: determining, for each image-text pair used as training data, the to-be-processed fine-grained semantic words in the text; masking the to-be-processed fine-grained semantic words; and training the multi-modal pre-training model using the training data with the fine-grained semantic words masked.
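
The first two steps of the abstract (determining the fine-grained semantic words in the text of an image-text pair, then masking them) can be illustrated with a minimal Python sketch. The abstract does not say how the words are determined or which mask symbol is used, so the fixed lexicon `FINE_GRAINED_WORDS`, the `[MASK]` token, the whitespace tokenization, and the names `ImageTextPair`, `find_fine_grained_positions`, and `mask_pair` are all hypothetical choices made for illustration only. The training step is not shown.

```python
from dataclasses import dataclass
from typing import List, Set, Tuple

# Assumed placeholder token; the abstract does not name one.
MASK_TOKEN = "[MASK]"

# Hypothetical lexicon standing in for whatever procedure the method
# uses to decide which words count as fine-grained semantic words.
FINE_GRAINED_WORDS: Set[str] = {"woman", "dress", "blue", "car"}


@dataclass
class ImageTextPair:
    image_id: str  # stand-in for the actual image data
    text: str


def find_fine_grained_positions(tokens: List[str], lexicon: Set[str]) -> List[int]:
    """Return the positions of tokens whose normalized form is in the lexicon."""
    return [
        i for i, tok in enumerate(tokens)
        if tok.lower().strip(".,;:!?") in lexicon
    ]


def mask_pair(pair: ImageTextPair, lexicon: Set[str]) -> Tuple[List[str], List[str]]:
    """Mask the selected words; return (masked tokens, original words as labels)."""
    tokens = pair.text.split()  # naive whitespace tokenization (assumption)
    positions = find_fine_grained_positions(tokens, lexicon)
    labels = [tokens[i] for i in positions]
    masked = list(tokens)
    for i in positions:
        masked[i] = MASK_TOKEN
    return masked, labels


if __name__ == "__main__":
    pair = ImageTextPair(image_id="img0", text="A woman in a blue dress")
    masked_tokens, labels = mask_pair(pair, FINE_GRAINED_WORDS)
    print(masked_tokens)
    print(labels)
```

In a training setup along these lines, the masked tokens together with the image would form the model input, and the returned labels would serve as prediction targets for the masked positions.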