-
1.
Publication No.: US20230115737A1
Publication Date: 2023-04-13
Application No.: US18080432
Application Date: 2022-12-13
Inventor: Shuai CHEN, Qi WANG, Zhifan FENG, Chunguang CHAI, Yong ZHU
IPC: G06F16/483, G06F16/43, G06F18/25, G06F18/22, G06N5/02
Abstract: A method of processing multimedia data, a device, and a medium are provided, which relate to the field of artificial intelligence technology, in particular to the fields of knowledge graphs and deep learning. The method of processing the multimedia data includes: recognizing the multimedia data so as to obtain at least one piece of key information of the multimedia data; querying a predetermined knowledge base according to the at least one piece of key information, so as to determine a multimedia name associated with the at least one piece of key information and an association degree between the multimedia name and the at least one piece of key information; and, in response to the association degree being less than a first threshold value, determining a name of the multimedia data from the multimedia name based on a similarity between alternative multimedia data for the multimedia name and the multimedia data.
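The decision flow in this abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: `KNOWLEDGE_BASE`, `association_degree`, `name_multimedia`, the overlap-based association measure, and the threshold value 0.8 are all hypothetical names and choices introduced here.

```python
from typing import Callable, Dict, Optional, Set

# Hypothetical knowledge base: multimedia name -> key information linked to it.
KNOWLEDGE_BASE: Dict[str, Set[str]] = {
    "Show A": {"actor x", "actor y", "city z"},
    "Show B": {"actor x", "actor w"},
}


def association_degree(key_info: Set[str], linked: Set[str]) -> float:
    """Assumed measure: share of the recognized key information linked to a name."""
    return len(key_info & linked) / len(key_info) if key_info else 0.0


def name_multimedia(
    key_info: Set[str],
    similarity: Callable[[str], float],
    first_threshold: float = 0.8,
) -> Optional[str]:
    """Return a name for the multimedia data, or None if nothing is associated."""
    degrees = {
        name: association_degree(key_info, linked)
        for name, linked in KNOWLEDGE_BASE.items()
    }
    candidates = [name for name, degree in degrees.items() if degree > 0]
    if not candidates:
        return None
    best = max(candidates, key=lambda name: degrees[name])
    if degrees[best] >= first_threshold:
        return best  # the knowledge base lookup alone is decisive
    # Association degree below the first threshold: fall back to comparing the
    # input with alternative multimedia data for each candidate name.
    # `similarity` is a caller-supplied stand-in for that comparison.
    return max(candidates, key=similarity)
```

Here `similarity` maps a candidate name to a score for how closely its alternative multimedia data matches the input; any real system would compute that from the media itself.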
-
2.
Publication No.: US20220284218A1
Publication Date: 2022-09-08
Application No.: US17502173
Application Date: 2021-10-15
Inventor: Hu YANG, Feng HE, Qi WANG, Zhifan FENG, Chunguang CHAI, Yong ZHU
Abstract: The present disclosure discloses a video classification method, an electronic device and a storage medium, and relates to the field of computer technologies, and particularly to the field of artificial intelligence technologies, such as knowledge graph technologies, computer vision technologies, deep learning technologies, or the like. The video classification method includes: extracting a keyword in a video according to multi-modal information of the video; acquiring background knowledge corresponding to the keyword, and determining a text to be recognized according to the keyword and the background knowledge; and classifying the text to be recognized to obtain a class of the video.
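The three steps of this abstract (keyword extraction, background-knowledge lookup, text classification) might look like the toy pipeline below. Everything concrete in it is an assumption made for illustration: `BACKGROUND_KNOWLEDGE`, `CLASS_CUES`, the function names, and the cue-counting stand-in for a trained text classifier.

```python
import re
from typing import Dict, List, Optional

# Hypothetical background knowledge for keywords.
BACKGROUND_KNOWLEDGE: Dict[str, str] = {
    "dunk": "a basketball shot",
    "risotto": "an Italian rice dish",
}
# Hypothetical cue words standing in for a trained text classifier.
CLASS_CUES: Dict[str, List[str]] = {
    "sports": ["basketball", "football"],
    "food": ["dish", "recipe"],
}


def extract_keywords(multimodal_info: Dict[str, str]) -> List[str]:
    """Collect known keywords from every modality (e.g. speech, OCR, title)."""
    tokens = re.findall(r"[a-z]+", " ".join(multimodal_info.values()).lower())
    return [t for t in dict.fromkeys(tokens) if t in BACKGROUND_KNOWLEDGE]


def build_text(keywords: List[str]) -> str:
    """Combine each keyword with its background knowledge into the text to classify."""
    return ". ".join(f"{k}: {BACKGROUND_KNOWLEDGE[k]}" for k in keywords)


def classify_video(multimodal_info: Dict[str, str]) -> Optional[str]:
    """Return a class label, or None when no keyword is found."""
    keywords = extract_keywords(multimodal_info)
    if not keywords:
        return None
    text = build_text(keywords).lower()
    scores = {
        label: sum(text.count(cue) for cue in cues)
        for label, cues in CLASS_CUES.items()
    }
    return max(scores, key=lambda label: scores[label])
```

Note that the keyword "dunk" alone carries no class cue in this toy setup; it is the attached background knowledge that makes the text classifiable.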
-
3.
Publication No.: US20230010160A1
Publication Date: 2023-01-12
Application No.: US17945415
Application Date: 2022-09-15
Inventor: Shuai CHEN, Qi WANG, Hu YANG, Feng HE, Zhifan FENG, Chunguang CHAI, Yong ZHU
Abstract: Disclosed are a method for processing multimodal data using a neural network, a device, and a medium, which relate to the field of artificial intelligence and, in particular, to multimodal data processing, video classification, and deep learning. The neural network includes: an input subnetwork configured to receive the multimodal data to output respective first features of a plurality of modalities; a plurality of cross-modal feature subnetworks, each of which is configured to receive respective first features of two corresponding modalities to output a cross-modal feature corresponding to the two modalities; a plurality of cross-modal fusion subnetworks, each of which is configured to receive at least one cross-modal feature corresponding to a corresponding target modality and other modalities to output a second feature of the target modality; and an output subnetwork configured to receive respective second features of the plurality of modalities to output a processing result of the multimodal data.
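The data flow between the four kinds of subnetwork can be traced with a toy sketch. It shows only the wiring described in the abstract: the learned subnetworks are replaced by fixed arithmetic stand-ins (identity, element-wise product, mean, sum), and all function names are hypothetical.

```python
from itertools import combinations
from typing import Dict, List, Tuple

Vector = List[float]


def input_subnetwork(data: Dict[str, Vector]) -> Dict[str, Vector]:
    """Stand-in: pass each modality through unchanged as its first feature."""
    return {modality: list(values) for modality, values in data.items()}


def cross_modal_feature(a: Vector, b: Vector) -> Vector:
    """Stand-in for a learned cross-modal feature subnetwork: element-wise product."""
    return [x * y for x, y in zip(a, b)]


def fuse(features: List[Vector]) -> Vector:
    """Stand-in for a cross-modal fusion subnetwork: element-wise mean."""
    return [sum(column) / len(features) for column in zip(*features)]


def process(data: Dict[str, Vector]) -> Vector:
    first = input_subnetwork(data)
    # One cross-modal feature per pair of modalities.
    cross: Dict[Tuple[str, str], Vector] = {
        (m, n): cross_modal_feature(first[m], first[n])
        for m, n in combinations(first, 2)
    }
    # Second feature of each target modality: fuse every pair that involves it.
    second = {
        target: fuse([feat for pair, feat in cross.items() if target in pair])
        for target in first
    }
    # Stand-in output subnetwork: element-wise sum of the second features.
    return [sum(column) for column in zip(*second.values())]
```

With three modalities there are three pairs, and each target modality fuses the two pairs it takes part in.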
-
4.
Publication No.: US20230153337A1
Publication Date: 2023-05-18
Application No.: US18157452
Application Date: 2023-01-20
Inventor: Wenbin JIANG, Yajuan LV, Chunguang CHAI, Yong ZHU
IPC: G06F16/332, G06F40/30
CPC classification number: G06F16/3329, G06F40/30
Abstract: A question answering method, a method of training a question answering model, a device, and a medium are provided, which relate to a field of artificial intelligence technology, in particular to fields of natural language processing technology, deep learning technology, and knowledge mapping technology. The question answering method includes: obtaining data to be processed, wherein the data to be processed includes a question and candidate answers; performing general semantic understanding on the data to be processed to obtain a general data feature; selecting a target question answering mode from candidate question answering modes based on the general data feature; and processing the general data feature by using the target question answering mode, to obtain a target answer for the question from the candidate answers.
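The mode-selection idea can be illustrated with a small dispatcher. This is a sketch under assumptions: the two modes (`arithmetic_mode`, `overlap_mode`), the feature dictionary, and the rule used to choose between modes are invented here and stand in for the learned components the abstract refers to.

```python
from typing import Callable, Dict, List

Feature = Dict[str, object]


def general_understanding(question: str, candidates: List[str]) -> Feature:
    """Stand-in for general semantic understanding: a small feature dict."""
    return {
        "tokens": question.lower().rstrip("?").split(),
        "candidates": candidates,
        "numeric": all(c.strip().isdigit() for c in candidates),
    }


def arithmetic_mode(feature: Feature) -> str:
    """Hypothetical mode: pick the candidate equal to the sum of the numbers asked about."""
    total = sum(int(t) for t in feature["tokens"] if t.isdigit())
    candidates = feature["candidates"]
    return next((c for c in candidates if int(c) == total), candidates[0])


def overlap_mode(feature: Feature) -> str:
    """Hypothetical mode: pick the candidate sharing the most words with the question."""
    tokens = set(feature["tokens"])
    return max(
        feature["candidates"],
        key=lambda c: len(tokens & set(c.lower().split())),
    )


CANDIDATE_MODES: Dict[str, Callable[[Feature], str]] = {
    "arithmetic": arithmetic_mode,
    "overlap": overlap_mode,
}


def answer(question: str, candidates: List[str]) -> str:
    if not candidates:
        raise ValueError("at least one candidate answer is required")
    feature = general_understanding(question, candidates)
    # Select the target mode from the general feature, then let it pick the answer.
    mode_name = "arithmetic" if feature["numeric"] else "overlap"
    return CANDIDATE_MODES[mode_name](feature)
```

The point of the structure is that the general feature is computed once and then shared by whichever mode is selected.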
-
5.
Publication No.: US20220350965A1
Publication Date: 2022-11-03
Application No.: US17864636
Application Date: 2022-07-14
Inventor: Tongyang LIU, Shu WANG, Wanli CHANG, Wei ZHENG, Zhifan FENG, Chunguang CHAI, Yong ZHU
IPC: G06F40/211, G06F40/30, G06F40/109, G06N3/08
Abstract: A method for generating a pre-trained language model includes: obtaining sample files; obtaining typography structure information and text information of the sample files by parsing the sample files; obtaining a plurality of task models of a pre-trained language model; obtaining a trained pre-trained language model by jointly training the pre-trained language model and the plurality of task models according to the typography structure information and the text information; and generating a target pre-trained language model by fine-tuning the trained pre-trained language model according to the typography structure information and the text information.
-
6.
Publication No.: US20210397980A1
Publication Date: 2021-12-23
Application No.: US17036160
Application Date: 2020-09-29
IPC: G06N5/02, G06N5/04, G06F40/279, G06K9/62
Abstract: The present disclosure provides an information recommendation method, which relates to the field of knowledge graphs. The method includes: acquiring request information; extracting a request entity word representing an entity from the request information; determining recommendation information based on the request entity word and a pre-constructed knowledge graph; and pushing the recommendation information, wherein the knowledge graph is constructed based on a text, and the knowledge graph indicates a first word representing a source of the text. The present disclosure further provides an information recommendation apparatus, an electronic device and a computer-readable storage medium.
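A toy version of the lookup makes the role of the source word concrete. The graph contents, the `source` field name, and the functions `extract_entity_word` and `recommend` are assumptions made for this sketch; the abstract does not specify how the graph is stored or how entity words are extracted.

```python
import re
from typing import Dict, List, Optional

# Hypothetical graph built from text; each edge keeps a word naming the text's source.
KNOWLEDGE_GRAPH: Dict[str, List[Dict[str, str]]] = {
    "hamlet": [
        {"relation": "written_by", "object": "shakespeare", "source": "encyclopedia"},
    ],
    "odyssey": [
        {"relation": "attributed_to", "object": "homer", "source": "anthology"},
    ],
}


def extract_entity_word(request: str) -> Optional[str]:
    """Return the first word of the request that names an entity in the graph."""
    for token in re.findall(r"[a-z]+", request.lower()):
        if token in KNOWLEDGE_GRAPH:
            return token
    return None


def recommend(request: str) -> List[str]:
    """Build recommendation strings for the request entity, citing each edge's source."""
    entity = extract_entity_word(request)
    if entity is None:
        return []
    return [
        f"{entity} {edge['relation']} {edge['object']} (source: {edge['source']})"
        for edge in KNOWLEDGE_GRAPH[entity]
    ]
```

Keeping the source on every edge lets the pushed recommendation say where its supporting text came from.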
-
7.
Publication No.: US20230016403A1
Publication Date: 2023-01-19
Application No.: US17934876
Application Date: 2022-09-23
Inventor: Zhaoji WANG, Fang HUANG, Ye JIANG, Yabing SHI, Chunguang CHAI, Yong ZHU
IPC: G06F16/9537, G06F40/226, G06F40/30
Abstract: The present disclosure provides a method of processing triple data, a method of training a triple data processing model, an electronic device, and a storage medium. A specific implementation solution includes: performing a triple data extraction on text data to obtain a plurality of field data; normalizing the plurality of field data to determine target triple data, wherein the target triple data contains entity data, entity relationship data, and association entity data; and verifying a confidence level of the target triple data to obtain a verification result.
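The extract, normalize, verify sequence can be sketched with plain string handling. The relation phrases, the trusted-set lookup used as a confidence score, and the 0.5 threshold are assumptions for illustration; the abstract implies trained models rather than these stand-ins.

```python
from typing import Dict, List, Set, Tuple

Triple = Tuple[str, str, str]  # (entity, entity relationship, association entity)

# Hypothetical surface phrases mapped to normalized relation names.
RELATION_PHRASES: Dict[str, str] = {
    "is the capital of": "capital_of",
    "was born in": "born_in",
}


def extract_fields(text: str) -> List[Dict[str, str]]:
    """Stand-in extraction: split each sentence around a known relation phrase."""
    fields = []
    for sentence in text.split("."):
        for phrase in RELATION_PHRASES:
            if phrase in sentence:
                left, right = sentence.split(phrase, 1)
                fields.append(
                    {"entity": left, "relation": phrase, "association_entity": right}
                )
    return fields


def normalize(field: Dict[str, str]) -> Triple:
    """Trim whitespace and map the relation phrase to its normalized name."""
    return (
        field["entity"].strip(),
        RELATION_PHRASES[field["relation"]],
        field["association_entity"].strip(),
    )


def verify(triple: Triple, trusted: Set[Triple], threshold: float = 0.5) -> bool:
    """Assumed confidence: 1.0 if the triple is in a trusted set, else 0.0."""
    confidence = 1.0 if triple in trusted else 0.0
    return confidence >= threshold


def process(text: str, trusted: Set[Triple]) -> List[Tuple[Triple, bool]]:
    triples = [normalize(field) for field in extract_fields(text)]
    return [(triple, verify(triple, trusted)) for triple in triples]
```

For example, given the text "Paris is the capital of France. Mars is the capital of Spain." and a trusted set containing only the first fact, `process` keeps both triples but marks only the first as verified.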
-
8.
Publication No.: US20230013796A1
Publication Date: 2023-01-19
Application No.: US17866104
Application Date: 2022-07-15
Inventor: Wenbin JIANG, Zhifan FENG, Xinwei FENG, Yajuan LYU, Yong ZHU
Abstract: The present disclosure provides a method and apparatus for acquiring a pre-trained model, an electronic device and a storage medium, and relates to fields such as deep learning, natural language processing, knowledge graph and intelligent voice. The method may include: acquiring a pre-training task set composed of M pre-training tasks, M being a positive integer greater than 1, the pre-training tasks including: N question-answering tasks corresponding to different question-answering forms, N being a positive integer greater than 1 and less than or equal to M; and jointly pre-training the pre-trained model according to the M pre-training tasks.
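The constraints on M and N can be checked mechanically before training starts. The sketch below validates a task set and produces a round-robin task order; `PretrainTask`, `validate_task_set`, and `joint_schedule` are hypothetical names, and round-robin interleaving is only one assumed way to realize joint pre-training, which the abstract does not specify.

```python
from dataclasses import dataclass
from itertools import cycle, islice
from typing import List, Optional


@dataclass(frozen=True)
class PretrainTask:
    name: str
    qa_form: Optional[str] = None  # e.g. "multiple_choice"; None for non-QA tasks


def validate_task_set(tasks: List[PretrainTask]) -> None:
    """Check M > 1, N > 1, and that the N question-answering forms all differ."""
    m = len(tasks)
    qa_forms = [task.qa_form for task in tasks if task.qa_form is not None]
    n = len(qa_forms)
    if m <= 1:
        raise ValueError("M must be greater than 1")
    if n <= 1:
        raise ValueError("N must be greater than 1")
    if len(set(qa_forms)) != n:
        raise ValueError("question-answering tasks must use different forms")
    # N <= M holds by construction, since the QA tasks are a subset of the task set.


def joint_schedule(tasks: List[PretrainTask], steps: int) -> List[str]:
    """Assumed joint training order: cycle through all M tasks, one per step."""
    validate_task_set(tasks)
    return [task.name for task in islice(cycle(tasks), steps)]
```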
-
9.
Publication No.: US20230008897A1
Publication Date: 2023-01-12
Application No.: US17932598
Application Date: 2022-09-15
Inventor: Wenbin JIANG, Yajuan LYU, Yong ZHU, Hua WU, Haifeng WANG
IPC: G06F16/735
Abstract: An information search method includes: obtaining search words at least including a question to be searched and obtaining an initial text vector representation of the search words; obtaining a video corresponding to the search words, and obtaining multi-modality vector representations of the video; starting from the initial text vector representation, performing N rounds of interaction between the video and the search words based on the multi-modality vector representations and a text vector representation of the search words of a current round, to generate a target fusion vector representation, where N is an integer greater than or equal to 1; and obtaining target video frames matching the question to be searched by annotating the video based on the target fusion vector representation.
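The N-round interaction loop can be sketched with small vectors. This is an illustration only: the softmax attention over modality vectors, the averaging update, the per-frame vectors, and the 0.5 score threshold are assumptions chosen to make the loop concrete, and `interact` and `search_frames` are hypothetical names.

```python
import math
from typing import Dict, List

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    return sum(x * y for x, y in zip(a, b))


def interact(text: Vector, modalities: Dict[str, Vector]) -> Vector:
    """One round: attend over the modality vectors, then mix the result into the text vector."""
    vectors = list(modalities.values())
    scores = [dot(text, v) for v in vectors]
    peak = max(scores)  # subtract the maximum for a numerically stable softmax
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    context = [
        sum(w / total * v[i] for w, v in zip(weights, vectors))
        for i in range(len(text))
    ]
    return [(t + c) / 2 for t, c in zip(text, context)]


def search_frames(
    initial_text: Vector,
    modalities: Dict[str, Vector],
    frames: List[Vector],
    n_rounds: int = 1,
    threshold: float = 0.5,
) -> List[int]:
    """Run N interaction rounds, then return indices of frames matching the fused vector."""
    if n_rounds < 1:
        raise ValueError("N must be an integer greater than or equal to 1")
    text = list(initial_text)
    for _ in range(n_rounds):
        text = interact(text, modalities)  # text vector of the current round
    return [i for i, frame in enumerate(frames) if dot(text, frame) >= threshold]
```

Each round starts from the text vector produced by the previous one, so the final vector reflects all N rounds of interaction with the video.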
-
10.
Publication No.: US20230092736A1
Publication Date: 2023-03-23
Application No.: US17872318
Application Date: 2022-07-25
Inventor: Wenbin JIANG, Yajuan LYU, Yong ZHU, Hua WU, Haifeng WANG
Abstract: The present disclosure provides a method for processing intelligent question-answering, an intelligent question-answering system, an electronic device and a storage medium, and relates to the field of artificial intelligence technologies, such as machine learning technologies, natural language processing technologies, or the like. An implementation includes: acquiring an input question and input data information; and based on the question, the data information and a plurality of knowledge bases, deciding an answer to the question by multilayer appreciation using a plurality of understanding module layers.
-