-
公开(公告)号:US20210365738A1
公开(公告)日:2021-11-25
申请号:US17444427
申请日:2021-08-04
Inventor: Zhuang Jia , Xiang Long , Honghui Zheng , Yan Peng , Yuan Feng , Bin Zhang , Xiaodi Wang , Pengcheng Yuan , Ying Xin , Shumin Han
Abstract: The present disclosure discloses a method and apparatus for training a model, a method and apparatus for predicting a mineral, a device and a storage medium, and relates to the fields of computer vision and deep learning technologies. An implementation of the method may include: acquiring a target hyperspectral image of a target area, the target hyperspectral image including at least one pixel point annotated with a mineral category; determining a mask image corresponding to the target hyperspectral image; determining a sample hyperspectral image according to the target hyperspectral image and the mask image; determining an annotation vector of each pixel point according to the at least one pixel point annotated with the mineral category; and training a model according to the sample hyperspectral image and the annotation vector of the each pixel point.
-
公开(公告)号:US20210319062A1
公开(公告)日:2021-10-14
申请号:US17182960
申请日:2021-02-23
Inventor: Xiang Long , Ping Wang , Fu Li , Dongliang He , Hao Sun , Shilei Wen
IPC: G06F16/783 , G06K9/00 , G06K9/62
Abstract: Embodiments of the present disclosure disclose a method and apparatus for searching a video segment, a device and a medium, and relate to the field of video data search. The method includes: sampling video frames from a target video and videos to be searched in a video library, and extracting features from the sampled frames; matching the target video and the videos to be searched according to the extracted features to determine a candidate video to be searched that matches the target video; determining at least one candidate video segment from the determined candidate video, and calculating a degree of matching between the target video and each candidate video segment based on the extracted features of each sampled frame; and determining a video segment matching the target video in the videos to be searched according to the calculated degree of matching between the target video and each candidate video segment.
-
公开(公告)号:US11256920B2
公开(公告)日:2022-02-22
申请号:US16830895
申请日:2020-03-26
Inventor: Xiang Long , Dongliang He , Fu Li , Zhizhen Chi , Zhichao Zhou , Xiang Zhao , Ping Wang , Hao Sun , Shilei Wen , Errui Ding
Abstract: A method and an apparatus for classifying a video are provided. The method may include: acquiring a to-be-classified video; extracting a set of multimodal features of the to-be-classified video; inputting the set of multimodal features into a post-fusion model corresponding to each modal respectively, to obtain multimodal category information of the to-be-classified video; and fusing the multimodal category information of the to-be-classified video, to obtain category information of the to-be-classified video. This embodiment improves the accuracy of video classification.
-
公开(公告)号:US20210383520A1
公开(公告)日:2021-12-09
申请号:US17412435
申请日:2021-08-26
Inventor: Mingyuan Mao , Yuan Feng , Ying Xin , Pengcheng Yuan , Bin Zhang , Xiaodi Wang , Xiang Long , Yan Peng , Honghui Zheng , Shumin Han
Abstract: The present disclosure discloses a method and apparatus for generating an image, a device, a storage medium and a program product, relates to the field of artificial intelligence, and particularly to computer vision and deep learning technologies, and may be applied in smart cloud and power grid inspection scenarios. A particular implementation of the method comprises: acquiring an original insulator image; performing an image transformation on the original insulator image to obtain a composite insulator image; and inputting the original insulator image and the composite insulator image into a pre-trained generative adversarial network to generate a target insulator image. According to the implementation, the image transformation is performed on the original insulator image, and then, massive target insulator images are generated through the generative adversarial network.
-
公开(公告)号:US20200374526A1
公开(公告)日:2020-11-26
申请号:US16797911
申请日:2020-02-21
Inventor: Zhichao Zhou , Dongliang He , Fu Li , Xiang Zhao , Xin Li , Zhizhen Chi , Xiang Long , Hao Sun
IPC: H04N19/14 , H04N19/12 , H04N19/625 , G06N3/08
Abstract: A method, device, apparatus for predicting a video coding complexity and a computer-readable storage medium are provided. The method includes: acquiring an attribute feature of a target video; extracting a plurality of first target image frames from the target video; performing a frame difference calculation on the plurality of the first target image frames, to acquire a plurality of first frame difference images; determining a histogram feature for frame difference images of the target video according to a statistical histogram of each first frame difference image; and inputting a plurality of features of the target video into a coding complexity prediction model to acquire a coding complexity prediction value of the target video. Through the above method, the BPP prediction value can be acquired intelligently.
-
公开(公告)号:US11625433B2
公开(公告)日:2023-04-11
申请号:US17182960
申请日:2021-02-23
Inventor: Xiang Long , Ping Wang , Fu Li , Dongliang He , Hao Sun , Shilei Wen
IPC: G06F16/30 , G06F16/783 , G06V20/40 , G06F18/22
Abstract: Embodiments of the present disclosure disclose a method and apparatus for searching a video segment, a device and a medium, and relate to the field of video data search. The method includes: sampling video frames from a target video and videos to be searched in a video library, and extracting features from the sampled frames; matching the target video and the videos to be searched according to the extracted features to determine a candidate video to be searched that matches the target video; determining at least one candidate video segment from the determined candidate video, and calculating a degree of matching between the target video and each candidate video segment based on the extracted features of each sampled frame; and determining a video segment matching the target video in the videos to be searched according to the calculated degree of matching between the target video and each candidate video segment.
-
公开(公告)号:US11455765B2
公开(公告)日:2022-09-27
申请号:US17182604
申请日:2021-02-23
Inventor: Xiang Long , Xin Li , Henan Zhang , Hao Sun
Abstract: A method and apparatus for generating a virtual avatar are provided. The method may include: acquiring a first avatar, and determining an expression parameter of the first avatar, where the expression parameter of the first avatar including an expression parameter of at least one of five sense organs; and determining, based on the expression parameter of at least one of the five sense organs, a target virtual avatar that is associated with an attribute of the first avatar and has an expression of the first avatar.
-
公开(公告)号:US11259029B2
公开(公告)日:2022-02-22
申请号:US16797911
申请日:2020-02-21
Inventor: Zhichao Zhou , Dongliang He , Fu Li , Xiang Zhao , Xin Li , Zhizhen Chi , Xiang Long , Hao Sun
IPC: H04N19/14 , H04N19/12 , H04N19/625 , G06N3/08
Abstract: A method, device, apparatus for predicting a video coding complexity and a computer-readable storage medium are provided. The method includes: acquiring an attribute feature of a target video; extracting a plurality of first target image frames from the target video; performing a frame difference calculation on the plurality of the first target image frames, to acquire a plurality of first frame difference images; determining a histogram feature for frame difference images of the target video according to a statistical histogram of each first frame difference image; and inputting a plurality of features of the target video into a coding complexity prediction model to acquire a coding complexity prediction value of the target video. Through the above method, the BPP prediction value can be acquired intelligently.
-
公开(公告)号:US20210390728A1
公开(公告)日:2021-12-16
申请号:US17412574
申请日:2021-08-26
Inventor: Yan PENG , Xiang Long , Shumin Han , Honghui Zheng , Zhuang Jia , Xiaodi Wang , Pengcheng Yuan , Yuan Feng , Bin Zhang , Ying Xin
Abstract: An object area measurement method and an apparatus are provided, relating to the computer vision and deep learning technology. The method includes acquiring an original image with a spatial resolution, the original image including a target object; acquiring an object identification model including at least two sets of classification models; generating one or more original image blocks based on the original image; performing operations on each original image block: scaling each original image block at at least two scaling levels to obtain scaled image blocks with at least two sizes, the scaled image blocks respectively corresponding to the at least two sets of classification models, and inputting the scaled image blocks into the object identification model to obtain an identification result of the target object; and determining an area of the target object based on the respective identification results of the one or more original image blocks and the spatial resolution.
-
公开(公告)号:US20210019531A1
公开(公告)日:2021-01-21
申请号:US16830895
申请日:2020-03-26
Inventor: Xiang Long , Dongliang He , Fu Li , Zhizhen Chi , Zhichao Zhou , Xiang Zhao , Ping Wang , Hao Sun , Shilei Wen , Errui Ding
Abstract: a method and an apparatus for classifying a video are provided. The method may include: acquiring a to-be-classified video; extracting a set of multimodal features of the to-be-classified video; inputting the set of multimodal features into a post-fusion model corresponding to each modal respectively, to obtain multimodal category information of the to-be-classified video; and fusing the multimodal category information of the to-be-classified video, to obtain category information of the to-be-classified video. This embodiment improves the accuracy of video classification.
-
-
-
-
-
-
-
-
-