-
Publication No.: US20220147822A1
Publication date: 2022-05-12
Application No.: US17459066
Filing date: 2021-08-27
Inventor: Ying XIN, Yuan FENG, Guanzhong WANG, Pengcheng YUAN, Bin ZHANG, Xiaodi WANG, Xiang LONG, Yan PENG, Honghui ZHENG, Shumin HAN
IPC: G06N3/08, G06N3/04, G06V10/82, G06V10/766, G06V10/77
Abstract: Provided are a training method and apparatus for a target detection model, a device, and a storage medium. In the training method, a feature map of a sample image is processed through a classification network of an initial model to obtain a heat map and a classification prediction result for the feature map; a classification loss value is determined from the classification prediction result and the classification supervision data of the sample image; and a category probability of pixels in the feature map is determined from the heat map to obtain a probability distribution map of the feature map. The feature map is also processed through a regression network of the initial model to obtain a regression prediction result, and a regression loss value is determined.
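The step that turns the heat map into per-pixel category probabilities can be illustrated with a minimal sketch. The abstract does not say which function maps heat-map scores to probabilities, so the sigmoid used here is an assumption, and the names `sigmoid`, `heat_map_to_probability_map`, and the sample `heat_map` values are hypothetical.

```python
import math


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def heat_map_to_probability_map(heat_map):
    """Map each raw heat-map score to a per-pixel category probability.

    Assumption: a sigmoid is applied element-wise; the abstract does not
    name the function that is actually used.
    """
    return [[sigmoid(score) for score in row] for row in heat_map]


# Hypothetical 2x2 heat map for a single category.
heat_map = [[0.0, 2.0], [-2.0, 4.0]]
probability_map = heat_map_to_probability_map(heat_map)
```

Each entry of `probability_map` lies between 0 and 1 and has the same position as the corresponding heat-map pixel, which is the shape a probability distribution map over the feature map would need.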
-
Publication No.: US20210216783A1
Publication date: 2021-07-15
Application No.: US17144523
Filing date: 2021-01-08
Inventor: Xiang LONG, Dongliang HE, Fu LI, Xiang ZHAO, Tianwei LIN, Hao SUN, Shilei WEN, Errui DING
Abstract: A method includes screening, by a video-clip screening module in a video description model, a plurality of video proposal clips acquired from a video to be analyzed, to acquire a plurality of video clips suitable for description. Each screened video clip is then described by a video-clip describing module. This avoids describing all of the video proposal clips: only the screened clips, which correlate strongly with the video and are suitable for description, are described. Descriptions of unsuitable clips no longer interfere with the description of the video, guaranteeing the accuracy of the final descriptions of the video clips and improving their quality.
-
Publication No.: US20210334950A1
Publication date: 2021-10-28
Application No.: US17174002
Filing date: 2021-02-11
Inventor: Xiang LONG, Ping WANG, Zhichao ZHOU, Fu LI, Dongliang HE, Hao SUN
Abstract: Embodiments of the present disclosure provide a method and apparatus for processing an image, and relate to the field of computer vision technology. The method may include: acquiring a value to be processed, where the value to be processed is associated with an image to be processed; and processing the value to be processed by using a quality scoring model to generate a score of the image to be processed in a target scoring domain, where the score of the image to be processed in the target scoring domain is related to the image quality of the image to be processed.
-
Publication No.: US20210312240A1
Publication date: 2021-10-07
Application No.: US17348285
Filing date: 2021-06-15
Inventor: Xiaodi WANG, Shumin HAN, Yuan FENG, Ying XIN, Bin ZHANG, Shufei LIN, Pengcheng YUAN, Xiang LONG, Yan PENG, Honghui ZHENG
Abstract: A header model for instance segmentation includes a target box branch having a first branch and a second branch, where the first branch is configured to process an inputted first feature map to obtain class information and confidence of a target box, and the second branch is configured to process the first feature map to obtain location information of the target box. The header model also includes a mask branch configured to process an inputted second feature map to obtain mask information, wherein the second feature map is a feature map outputted by an ROI extraction module, and the first feature map is a feature map resulting from a pooling performed on the second feature map.
-
Publication No.: US20210350173A1
Publication date: 2021-11-11
Application No.: US17379428
Filing date: 2021-07-19
Inventor: Xiang LONG, Yan PENG, Shufei LIN, Ying XIN, Bin ZHANG, Pengcheng YUAN, Xiaodi WANG, Yuan FENG, Shumin HAN
Abstract: Provided are a method and apparatus for evaluating image relative definition, a device, and a medium, relating to technologies such as computer vision, deep learning, and intelligent healthcare. A specific implementation is: extracting a multi-scale feature of each image in an image set, where the multi-scale feature represents definition features of objects of different sizes in an image; and scoring the relative definition of each image in the image set according to the multi-scale feature by using a pre-trained relative definition scoring model, where the purpose of training the relative definition scoring model is to learn the features in the multi-scale feature that are related to image definition.
-
Publication No.: US20210279934A1
Publication date: 2021-09-09
Application No.: US17182604
Filing date: 2021-02-23
Inventor: Xiang LONG, Xin LI, Henan ZHANG, Hao SUN
Abstract: A method and apparatus for generating a virtual avatar are provided. The method may include: acquiring a first avatar, and determining an expression parameter of the first avatar, where the expression parameter of the first avatar includes an expression parameter of at least one of five sense organs; and determining, based on the expression parameter of the at least one of the five sense organs, a target virtual avatar that is associated with an attribute of the first avatar and has an expression of the first avatar.
-
Publication No.: US20220020175A1
Publication date: 2022-01-20
Application No.: US17489991
Filing date: 2021-09-30
Inventor: Xiaodi WANG, Shumin HAN, Yuan FENG, Ying XIN, Bin ZHANG, Xiang LONG, Honghui ZHENG, Yan PENG, Zhuang JIA
Abstract: An object detection model training method, an object detection method, and a related apparatus relate to the field of artificial intelligence technologies such as computer vision and deep learning. An implementation includes: obtaining training sample data including a first remote sensing image and position annotation information of an anchor box of a subject to be detected in the first remote sensing image, where the position annotation information includes angle information of the anchor box relative to a preset direction; obtaining an object feature map of the first remote sensing image based on an object detection model, performing object detection on the subject to be detected based on the object feature map to obtain an object bounding box, and determining loss information between the anchor box and the object bounding box based on the angle information; and updating a parameter of the object detection model based on the loss information.
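A loss that takes the annotated angle into account can be sketched as follows. The abstract does not give the loss formula, so everything here is an assumption for illustration: boxes are written as `(cx, cy, w, h, angle)` with the angle in radians, a smooth L1 penalty is used, and the angle difference is wrapped with period pi because a rectangle rotated by pi coincides with itself. The names `smooth_l1`, `wrap_angle`, and `rotated_box_loss` are hypothetical.

```python
import math


def smooth_l1(diff: float, beta: float = 1.0) -> float:
    """Smooth L1 penalty: quadratic near zero, linear for large errors."""
    magnitude = abs(diff)
    if magnitude < beta:
        return 0.5 * magnitude * magnitude / beta
    return magnitude - 0.5 * beta


def wrap_angle(delta: float) -> float:
    """Wrap an angle difference into [-pi/2, pi/2).

    Assumption: a box rotated by pi is the same box, so the period is pi.
    """
    return (delta + math.pi / 2) % math.pi - math.pi / 2


def rotated_box_loss(annotated, predicted) -> float:
    """Illustrative loss between two boxes given as (cx, cy, w, h, angle)."""
    *annotated_geometry, annotated_angle = annotated
    *predicted_geometry, predicted_angle = predicted
    geometry_term = sum(
        smooth_l1(p - a) for a, p in zip(annotated_geometry, predicted_geometry)
    )
    angle_term = smooth_l1(wrap_angle(predicted_angle - annotated_angle))
    return geometry_term + angle_term


# Hypothetical boxes that differ only in angle, by 0.2 rad.
annotated_box = (10.0, 20.0, 4.0, 2.0, 0.1)
predicted_box = (10.0, 20.0, 4.0, 2.0, 0.3)
loss = rotated_box_loss(annotated_box, predicted_box)
```

In this sketch, wrapping the angle difference keeps the penalty small when the predicted box has almost the same orientation as the annotated box but its angle is expressed one half-turn away.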