Patent search ap:("Beijing Baidu Netcom Science AND Technology Co. Page Ltd.") AND inv:"Dongliang HE"

1.

发明申请
METHOD AND APPARATUS FOR DETECTING TEMPORAL ACTION OF VIDEO, ELECTRONIC DEVICE AND STORAGE MEDIUM 有权

公开(公告)号：US20210216782A1

公开(公告)日：2021-07-15

申请号：US17144205

申请日：2021-01-08

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Tianwei LIN , Xin LI , Dongliang HE , Fu LI , Hao SUN , Shilei WEN , Errui DING

IPC: G06K9/00 , G06K9/62

Abstract: A method and apparatus for detecting a temporal action of a video, an electronic device and a storage medium are disclosed, which relates to the field of video processing technologies. An implementation includes: acquiring an initial temporal feature sequence of a video to be detected; acquiring, by a pre-trained video-temporal-action detecting module, implicit features and explicit features of a plurality of configured temporal anchor boxes based on the initial temporal feature sequence; and acquiring, by the video-temporal-action detecting module, the starting position and the ending position of a video clip containing a specified action, the category of the specified action and the probability that the specified action belongs to the category from the plural temporal anchor boxes according to the explicit features and the implicit features of the plural temporal anchor boxes.

2.

发明申请
METHOD AND APPARATUS FOR BUILDING IMAGE ENHANCEMENT MODEL AND FOR IMAGE ENHANCEMENT 有权

公开(公告)号：US20220207299A1

公开(公告)日：2022-06-30

申请号：US17460646

申请日：2021-08-30

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Chao LI , Dongliang HE , Wenling GAO , Fu LI , Hao SUN

IPC: G06K9/62 , G06T5/30 , G06N3/08

Abstract: A method for building an image enhancement model includes obtaining training data; building a neural network model consisting of a feature extraction module, at least one channel dilated convolution module and a spatial upsampling module, where each channel dilated convolution module includes a spatial downsampling submodule, a channel dilation submodule and a spatial upsampling submodule; training the neural network model by using the video frames and the standard images corresponding to the video frames until the neural network model converges, to obtain an image enhancement model. In addition, a method for image enhancement includes obtaining a video frame to be processed; taking the video frame to be processed as an input of an image enhancement model, and taking an output result of the image enhancement model as an image enhancement result of the video frame to be processed.

3.

发明申请
Method and Apparatus for Detecting Region of Interest in Video, Device and Medium 有权

公开(公告)号：US20210383120A1

公开(公告)日：2021-12-09

申请号：US17116578

申请日：2020-12-09

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Zhichao ZHOU , Dongliang HE , Fu LI , Hao SUN

IPC: G06K9/00 , G06K9/32 , G06K9/62 , G06T7/246

Abstract: The present disclosure provides a method and apparatus for detecting a region of interest in a video, a device and a storage medium. The method may include: acquiring a current to-be-processed frame from a picture frame sequence of a video; detecting a region of interest (ROI) in the current to-be-processed frame, in response to determining that the current to-be-processed frame is a detection picture frame, to determine at least one ROI in the current to-be-processed frame; and updating a to-be-tracked ROI, based on the ROI in the current to-be-processed frame and a tracking result determined by a pre-order tracking picture frame; and tracking the current to-be-processed frame based on the existing to-be-tracked ROI, in response to determining that the current to-be-processed frame is a tracking picture frame, to determine at least one tracking result as the ROI of the current to-be-processed frame.

4.

发明申请
METHOD FOR VIDEO FRAME INTERPOLATION, RELATED ELECTRONIC DEVICE AND STORAGE MEDIUM 有权

公开(公告)号：US20210360252A1

公开(公告)日：2021-11-18

申请号：US17125370

申请日：2020-12-17

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Chao LI , Yukang DING , Dongliang HE , Fu LI , Hao SUN , Shilei WEN , Hongwu ZHANG , Errui DING

IPC: H04N19/132 , H04N19/172 , G06N3/04 , G06K9/62

Abstract: A method for video frame interpolation, a related electronic device and a storage medium is disclosed. A video is obtained. An (i−1)th frame and an ith frame of the video are obtained. Visual semantic feature maps and depth maps of the (i−1)th frame and the ith frame are obtained. Frame interpolation information is obtained based on the visual semantic feature maps and the depth maps. An interpolated frame between the (i−1)th frame and the ith frame is generated based on the frame interpolation information and the (i−1)th frame and is inserted between the (i−1)th frame and the ith frame.

5.

发明申请
Image Processing Method and Device, and Electronic Device 有权

公开(公告)号：US20210304413A1

公开(公告)日：2021-09-30

申请号：US17344917

申请日：2021-06-10

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Hao SUN , Fu LI , Tianwei LIN , Dongliang HE

IPC: G06T7/11 , G06K9/46 , G06K9/62

Abstract: An image processing method, an image processing device and an electronic device, all relate to computer vision and deep learning. The image processing method includes: acquiring a first image and a second image; performing semantic region segmentation on the first image and the second image to acquire a first segmentation image and a second segmentation image respectively; determining an association matrix between the first segmentation image and the second segmentation image; and processing the first image in accordance with the association matrix to acquire a target image.

6.

发明申请
METHOD AND APPARATUS FOR IDENTIFYING VIDEO 有权

公开(公告)号：US20210374415A1

公开(公告)日：2021-12-02

申请号：US17194131

申请日：2021-03-05

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Dongliang HE , Xiao TAN , Shilei WEN , Hao SUN

IPC: G06K9/00 , G06K9/62

Abstract: Embodiments of the present disclosure disclose a method and apparatus for identifying a video. A specific embodiment of the method includes: acquiring a predetermined number of video frames from a video to be identified to obtain a video frame sequence; performing the following processing step: importing the video frame sequence into a pre-trained video identification model to obtain a classification tag probability corresponding to the video frame sequence, wherein the classification tag probability is used to characterize a probability of identifying a corresponding tag category of the video to be identified; and setting, in response to the classification tag probability being greater than or equal to a preset identification accuracy threshold, a video tag for the video to be identified according to the classification tag probability, or else increasing the number of video frames in the video frame sequence and continuing to perform the above processing step.

7.

发明申请
METHOD AND APPARATUS FOR DETECTING TEMPORAL ACTION OF VIDEO, ELECTRONIC DEVICE AND STORAGE MEDIUM 有权

公开(公告)号：US20210216783A1

公开(公告)日：2021-07-15

申请号：US17144523

申请日：2021-01-08

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Xiang LONG , Dongliang HE , Fu LI , Xiang ZHAO , Tianwei LIN , Hao SUN , Shilei WEN , Errui DING

IPC: G06K9/00 , G06K9/62

Abstract: A method includes screening, by a video-clip screening module in a video description model, a plurality of video proposal clips acquired from a video to be analyzed, to acquire a plurality of video clips suitable for description. The plural video proposal clips acquired from the video to be analyzed may be screened by the video-clip screening module to acquire the plural video clips suitable for description; and then, each video clip is described by a video-clip describing module, thus avoiding description of all the video proposal clips, only describing the screened video clips which have strong correlation with the video and are suitable for description, removing the interference of the description of the video clips which are not suitable for description in the description of the video, guaranteeing the accuracy of the final descriptions of the video clips, and improving the quality of the descriptions of the video clips.

8.

发明申请
SUPER-RESOLUTION VIDEO RECONSTRUCTION METHOD, DEVICE, APPARATUS AND COMPUTER-READABLE STORAGE MEDIUM 审中-公开

公开(公告)号：US20200372609A1

公开(公告)日：2020-11-26

申请号：US16810986

申请日：2020-03-06

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Chao LI , Dongliang HE , Xiao LIU , Yukang DING , Shilei WEN , Errui DING , Henan ZHANG , Hao SUN

IPC: G06T3/40

Abstract: A super-resolution video reconstruction method, device, apparatus and a computer-readable storage medium are provided. The method includes: extracting a hypergraph from consecutive frames of an original video; inputting a hypergraph vector of the hypergraph into a residual convolutional neural network to obtain an output result of the residual convolutional neural network; and inputting the output result of the residual convolutional neural network into a spatial upsampling network to obtain a super-resolution frame, wherein a super-resolution video of the original video is formed by multiple super-resolution frames.

9.

发明申请
METHOD AND APPARATUS FOR PROCESSING IMAGE 有权

公开(公告)号：US20210334950A1

公开(公告)日：2021-10-28

申请号：US17174002

申请日：2021-02-11

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Xiang LONG , Ping WANG , Zhichao ZHOU , Fu LI , Dongliang HE , Hao SUN

IPC: G06T7/00 , G06N3/04 , G06N3/08

Abstract: Embodiments of the present disclosure provide a method and apparatus for processing an image, and relates to the field of computer vision technology. The method may include: acquiring a value to be processed, where the value to be processed is associated with an image to be processed; and processing the value to be processed by using a quality scoring model to generate a score of the image to be processed in a target scoring domain, where the score of the image to be processed in the target scoring domain is related to an image quality of the image to be processed.

10.

发明申请
METHOD AND APPARATUS FOR SELECTING VIDEO CLIP, SERVER AND MEDIUM 有权

公开(公告)号：US20210227302A1

公开(公告)日：2021-07-22

申请号：US17026488

申请日：2020-09-21

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Fu LI , Dongliang HE , Hao SUN

IPC: H04N21/81 , H04N21/8549 , H04N21/658 , H04N21/234

Abstract: Embodiments of the present disclosure relate to a method and apparatus for selecting a video clip, a server and a medium. The method may include: determining at least two video clips from a video; for each video clip, perform following excitement determination steps: inputting a feature sequence of a video frame in the video clip and title information of the video into a pre-established prediction model to obtain a relevance between the inputted video frame and a title of the video; and determining an excitement of the video clip, based on the relevance between the video frame in the video clip and the title; and determining a target video clip from the video clips, based on the excitement of each of the video clips.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification