-
公开(公告)号:US11657612B2
公开(公告)日:2023-05-23
申请号:US17194131
申请日:2021-03-05
Inventor: Dongliang He , Xiao Tan , Shilei Wen , Hao Sun
IPC: G06V20/40 , G06F18/2411
CPC classification number: G06V20/40 , G06F18/2411 , G06V20/41 , G06V20/46 , G06V20/48
Abstract: Embodiments of the present disclosure disclose a method and apparatus for identifying a video. A specific embodiment of the method includes: acquiring a predetermined number of video frames from a video to be identified to obtain a video frame sequence; performing the following processing step: importing the video frame sequence into a pre-trained video identification model to obtain a classification tag probability corresponding to the video frame sequence, wherein the classification tag probability is used to characterize a probability of identifying a corresponding tag category of the video to be identified; and setting, in response to the classification tag probability being greater than or equal to a preset identification accuracy threshold, a video tag for the video to be identified according to the classification tag probability, or else increasing the number of video frames in the video frame sequence and continuing to perform the above processing step.
-
公开(公告)号:US11625433B2
公开(公告)日:2023-04-11
申请号:US17182960
申请日:2021-02-23
Inventor: Xiang Long , Ping Wang , Fu Li , Dongliang He , Hao Sun , Shilei Wen
IPC: G06F16/30 , G06F16/783 , G06V20/40 , G06F18/22
Abstract: Embodiments of the present disclosure disclose a method and apparatus for searching a video segment, a device and a medium, and relate to the field of video data search. The method includes: sampling video frames from a target video and videos to be searched in a video library, and extracting features from the sampled frames; matching the target video and the videos to be searched according to the extracted features to determine a candidate video to be searched that matches the target video; determining at least one candidate video segment from the determined candidate video, and calculating a degree of matching between the target video and each candidate video segment based on the extracted features of each sampled frame; and determining a video segment matching the target video in the videos to be searched according to the calculated degree of matching between the target video and each candidate video segment.
-
13.
公开(公告)号:US11600069B2
公开(公告)日:2023-03-07
申请号:US17144205
申请日:2021-01-08
Inventor: Tianwei Lin , Xin Li , Dongliang He , Fu Li , Hao Sun , Shilei Wen , Errui Ding
Abstract: A method and apparatus for detecting a temporal action of a video, an electronic device and a storage medium are disclosed, which relates to the field of video processing technologies. An implementation includes: acquiring an initial temporal feature sequence of a video to be detected; acquiring, by a pre-trained video-temporal-action detecting module, implicit features and explicit features of a plurality of configured temporal anchor boxes based on the initial temporal feature sequence; and acquiring, by the video-temporal-action detecting module, the starting position and the ending position of a video clip containing a specified action, the category of the specified action and the probability that the specified action belongs to the category from the plural temporal anchor boxes according to the explicit features and the implicit features of the plural temporal anchor boxes.
-
公开(公告)号:US11410422B2
公开(公告)日:2022-08-09
申请号:US16905482
申请日:2020-06-18
Inventor: Dongliang He , Xiang Zhao , Jizhou Huang , Fu Li , Xiao Liu , Shilei Wen
Abstract: A method and an apparatus for grounding a target video clip in a video are provided. The method includes: determining a current video clip in the video based on a current position; acquiring descriptive information indicative of a pre-generated target video clip descriptive feature, and executing a target video clip determining step which includes: determining current state information of the current video clip, wherein the current state information includes information indicative of a feature of the current video clip; generating a current action policy based on the descriptive information and the current state information, the current action policy being indicative of a position change of the current video clip in the video; the method further comprises: in response to reaching a preset condition, using a video clip resulting from executing the current action policy on the current video clip as the target video clip.
-
公开(公告)号:US20210019531A1
公开(公告)日:2021-01-21
申请号:US16830895
申请日:2020-03-26
Inventor: Xiang Long , Dongliang He , Fu Li , Zhizhen Chi , Zhichao Zhou , Xiang Zhao , Ping Wang , Hao Sun , Shilei Wen , Errui Ding
Abstract: a method and an apparatus for classifying a video are provided. The method may include: acquiring a to-be-classified video; extracting a set of multimodal features of the to-be-classified video; inputting the set of multimodal features into a post-fusion model corresponding to each modal respectively, to obtain multimodal category information of the to-be-classified video; and fusing the multimodal category information of the to-be-classified video, to obtain category information of the to-be-classified video. This embodiment improves the accuracy of video classification.
-
16.
公开(公告)号:US10861133B1
公开(公告)日:2020-12-08
申请号:US16810986
申请日:2020-03-06
Inventor: Chao Li , Dongliang He , Xiao Liu , Yukang Ding , Shilei Wen , Errui Ding , Henan Zhang , Hao Sun
IPC: G06T3/40
Abstract: A super-resolution video reconstruction method, device, apparatus and a computer-readable storage medium are provided. The method includes: extracting a hypergraph from consecutive frames of an original video; inputting a hypergraph vector of the hypergraph into a residual convolutional neural network to obtain an output result of the residual convolutional neural network; and inputting the output result of the residual convolutional neural network into a spatial upsampling network to obtain a super-resolution frame, wherein a super-resolution video of the original video is formed by multiple super-resolution frames.
-
17.
公开(公告)号:US11763552B2
公开(公告)日:2023-09-19
申请号:US17116597
申请日:2020-12-09
Inventor: Shufei Lin , Jianfeng Zhu , Pengcheng Yuan , Bin Zhang , Shumin Han , Yingbo Xu , Yuan Feng , Ying Xin , Xiaodi Wang , Jingwei Liu , Shilei Wen , Hongwu Zhang , Errui Ding
IPC: G06V10/82 , G06N3/088 , G06T7/00 , G06F18/214 , G06N3/045 , G06V10/776 , G06V20/60
CPC classification number: G06V10/82 , G06F18/2148 , G06N3/045 , G06N3/088 , G06T7/0004 , G06V10/776 , G06V20/60 , G06T2207/20081 , G06T2207/20084 , G06T2207/30124
Abstract: A method for detecting a surface defect, a method for training model, an apparatus, a device, and a medium, are provided. The method includes: inputting a surface image of the article for detection into a defect detection model to perform a defect detection, and acquiring a defect detection result output by the defect detection model; inputting a surface image of a defective article determined to be defective into an image discrimination model based on the defect detection result to determine whether the surface image of the defective article is defective, wherein the image discrimination model is a trained generative adversarial networks model, and the generative adversarial networks model is obtained by training using a surface image of a defect-free good article; and adjusting the defect detection result of the surface image of the defective article according to a determination result of the image discrimination model.
-
公开(公告)号:US11610389B2
公开(公告)日:2023-03-21
申请号:US17201665
申请日:2021-03-15
Inventor: Jian Wang , Zipeng Lu , Hao Sun , Hongwu Zhang , Shilei Wen , Errui Ding
IPC: G06T7/73 , G06V10/46 , G06N3/04 , G06F18/213 , G06V10/764 , G06V10/82 , G06V40/20
Abstract: A method and apparatus for positioning a key point, a device, and a storage medium are provided. The method may include: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point.
-
公开(公告)号:US11514263B2
公开(公告)日:2022-11-29
申请号:US16869024
申请日:2020-05-07
Inventor: Wei Zhang , Xiao Tan , Hao Sun , Shilei Wen , Errui Ding
IPC: G06K9/62 , G06N3/04 , G06N3/08 , G06V10/44 , G06V10/80 , G06V10/82 , G06V40/16 , G06V10/26 , G06V10/22
Abstract: Embodiments of the present disclosure disclose a method and apparatus for processing an image. A specific embodiment of the method includes: acquiring a feature map of a target image, where the target image contains a target object; determining a local feature map of a target size in the feature map; combining features of different channels in the local feature map to obtain a local texture feature map; and obtaining location information of the target object based on the local texture feature map.
-
公开(公告)号:US20220270373A1
公开(公告)日:2022-08-25
申请号:US17743410
申请日:2022-05-12
Inventor: Xipeng Yang , Minyue Jiang , Xiao Tan , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding
IPC: G06V20/52 , G06V10/40 , G06V10/764 , G06V10/774 , G06V10/22
Abstract: A method, an electronic device and a storage medium are provided. The method may include: acquiring a to-be-inspected image; inputting the to-be-inspected image into a pre-established vehicle detection model to obtain a vehicle detection result, where the vehicle detection result includes category information, coordinate information, coordinate reliabilities, and coordinate error information of detection boxes, and the vehicle detection model is configured for characterizing a corresponding relationship between images and vehicle detection results; selecting, based on the coordinate reliabilities of the detection boxes, a detection box from the vehicle detection result for use as a to-be-processed detection box; and generating, based on coordinate information and coordinate error information of the to-be-processed detection box, coordinate information of a processed detection box.
-
-
-
-
-
-
-
-
-