-
1.
公开(公告)号:US20210216782A1
公开(公告)日:2021-07-15
申请号:US17144205
申请日:2021-01-08
Inventor: Tianwei LIN , Xin LI , Dongliang HE , Fu LI , Hao SUN , Shilei WEN , Errui DING
Abstract: A method and apparatus for detecting a temporal action of a video, an electronic device and a storage medium are disclosed, which relates to the field of video processing technologies. An implementation includes: acquiring an initial temporal feature sequence of a video to be detected; acquiring, by a pre-trained video-temporal-action detecting module, implicit features and explicit features of a plurality of configured temporal anchor boxes based on the initial temporal feature sequence; and acquiring, by the video-temporal-action detecting module, the starting position and the ending position of a video clip containing a specified action, the category of the specified action and the probability that the specified action belongs to the category from the plural temporal anchor boxes according to the explicit features and the implicit features of the plural temporal anchor boxes.
-
公开(公告)号:US20220207299A1
公开(公告)日:2022-06-30
申请号:US17460646
申请日:2021-08-30
Inventor: Chao LI , Dongliang HE , Wenling GAO , Fu LI , Hao SUN
Abstract: A method for building an image enhancement model includes obtaining training data; building a neural network model consisting of a feature extraction module, at least one channel dilated convolution module and a spatial upsampling module, where each channel dilated convolution module includes a spatial downsampling submodule, a channel dilation submodule and a spatial upsampling submodule; training the neural network model by using the video frames and the standard images corresponding to the video frames until the neural network model converges, to obtain an image enhancement model. In addition, a method for image enhancement includes obtaining a video frame to be processed; taking the video frame to be processed as an input of an image enhancement model, and taking an output result of the image enhancement model as an image enhancement result of the video frame to be processed.
-
公开(公告)号:US20210383120A1
公开(公告)日:2021-12-09
申请号:US17116578
申请日:2020-12-09
Inventor: Zhichao ZHOU , Dongliang HE , Fu LI , Hao SUN
Abstract: The present disclosure provides a method and apparatus for detecting a region of interest in a video, a device and a storage medium. The method may include: acquiring a current to-be-processed frame from a picture frame sequence of a video; detecting a region of interest (ROI) in the current to-be-processed frame, in response to determining that the current to-be-processed frame is a detection picture frame, to determine at least one ROI in the current to-be-processed frame; and updating a to-be-tracked ROI, based on the ROI in the current to-be-processed frame and a tracking result determined by a pre-order tracking picture frame; and tracking the current to-be-processed frame based on the existing to-be-tracked ROI, in response to determining that the current to-be-processed frame is a tracking picture frame, to determine at least one tracking result as the ROI of the current to-be-processed frame.
-
公开(公告)号:US20210360252A1
公开(公告)日:2021-11-18
申请号:US17125370
申请日:2020-12-17
Inventor: Chao LI , Yukang DING , Dongliang HE , Fu LI , Hao SUN , Shilei WEN , Hongwu ZHANG , Errui DING
IPC: H04N19/132 , H04N19/172 , G06N3/04 , G06K9/62
Abstract: A method for video frame interpolation, a related electronic device and a storage medium is disclosed. A video is obtained. An (i−1)th frame and an ith frame of the video are obtained. Visual semantic feature maps and depth maps of the (i−1)th frame and the ith frame are obtained. Frame interpolation information is obtained based on the visual semantic feature maps and the depth maps. An interpolated frame between the (i−1)th frame and the ith frame is generated based on the frame interpolation information and the (i−1)th frame and is inserted between the (i−1)th frame and the ith frame.
-
公开(公告)号:US20210304413A1
公开(公告)日:2021-09-30
申请号:US17344917
申请日:2021-06-10
Inventor: Hao SUN , Fu LI , Tianwei LIN , Dongliang HE
Abstract: An image processing method, an image processing device and an electronic device, all relate to computer vision and deep learning. The image processing method includes: acquiring a first image and a second image; performing semantic region segmentation on the first image and the second image to acquire a first segmentation image and a second segmentation image respectively; determining an association matrix between the first segmentation image and the second segmentation image; and processing the first image in accordance with the association matrix to acquire a target image.
-
公开(公告)号:US20210374415A1
公开(公告)日:2021-12-02
申请号:US17194131
申请日:2021-03-05
Inventor: Dongliang HE , Xiao TAN , Shilei WEN , Hao SUN
Abstract: Embodiments of the present disclosure disclose a method and apparatus for identifying a video. A specific embodiment of the method includes: acquiring a predetermined number of video frames from a video to be identified to obtain a video frame sequence; performing the following processing step: importing the video frame sequence into a pre-trained video identification model to obtain a classification tag probability corresponding to the video frame sequence, wherein the classification tag probability is used to characterize a probability of identifying a corresponding tag category of the video to be identified; and setting, in response to the classification tag probability being greater than or equal to a preset identification accuracy threshold, a video tag for the video to be identified according to the classification tag probability, or else increasing the number of video frames in the video frame sequence and continuing to perform the above processing step.
-
7.
公开(公告)号:US20210216783A1
公开(公告)日:2021-07-15
申请号:US17144523
申请日:2021-01-08
Inventor: Xiang LONG , Dongliang HE , Fu LI , Xiang ZHAO , Tianwei LIN , Hao SUN , Shilei WEN , Errui DING
Abstract: A method includes screening, by a video-clip screening module in a video description model, a plurality of video proposal clips acquired from a video to be analyzed, to acquire a plurality of video clips suitable for description. The plural video proposal clips acquired from the video to be analyzed may be screened by the video-clip screening module to acquire the plural video clips suitable for description; and then, each video clip is described by a video-clip describing module, thus avoiding description of all the video proposal clips, only describing the screened video clips which have strong correlation with the video and are suitable for description, removing the interference of the description of the video clips which are not suitable for description in the description of the video, guaranteeing the accuracy of the final descriptions of the video clips, and improving the quality of the descriptions of the video clips.
-
8.
公开(公告)号:US20200372609A1
公开(公告)日:2020-11-26
申请号:US16810986
申请日:2020-03-06
Inventor: Chao LI , Dongliang HE , Xiao LIU , Yukang DING , Shilei WEN , Errui DING , Henan ZHANG , Hao SUN
IPC: G06T3/40
Abstract: A super-resolution video reconstruction method, device, apparatus and a computer-readable storage medium are provided. The method includes: extracting a hypergraph from consecutive frames of an original video; inputting a hypergraph vector of the hypergraph into a residual convolutional neural network to obtain an output result of the residual convolutional neural network; and inputting the output result of the residual convolutional neural network into a spatial upsampling network to obtain a super-resolution frame, wherein a super-resolution video of the original video is formed by multiple super-resolution frames.
-
公开(公告)号:US20210334950A1
公开(公告)日:2021-10-28
申请号:US17174002
申请日:2021-02-11
Inventor: Xiang LONG , Ping WANG , Zhichao ZHOU , Fu LI , Dongliang HE , Hao SUN
Abstract: Embodiments of the present disclosure provide a method and apparatus for processing an image, and relates to the field of computer vision technology. The method may include: acquiring a value to be processed, where the value to be processed is associated with an image to be processed; and processing the value to be processed by using a quality scoring model to generate a score of the image to be processed in a target scoring domain, where the score of the image to be processed in the target scoring domain is related to an image quality of the image to be processed.
-
公开(公告)号:US20210227302A1
公开(公告)日:2021-07-22
申请号:US17026488
申请日:2020-09-21
Inventor: Fu LI , Dongliang HE , Hao SUN
IPC: H04N21/81 , H04N21/8549 , H04N21/658 , H04N21/234
Abstract: Embodiments of the present disclosure relate to a method and apparatus for selecting a video clip, a server and a medium. The method may include: determining at least two video clips from a video; for each video clip, perform following excitement determination steps: inputting a feature sequence of a video frame in the video clip and title information of the video into a pre-established prediction model to obtain a relevance between the inputted video frame and a title of the video; and determining an excitement of the video clip, based on the relevance between the video frame in the video clip and the title; and determining a target video clip from the video clips, based on the excitement of each of the video clips.
-
-
-
-
-
-
-
-
-