METHOD AND APPARATUS FOR DETERMINING TARGET ANCHOR, DEVICE AND STORAGE MEDIUM

    Publication No.: US20210357683A1

    Publication Date: 2021-11-18

    Application No.: US17338328

    Application Date: 2021-06-03

    Abstract: Embodiments of the present disclosure disclose a method and apparatus for determining a target anchor, a device and a storage medium. The method may include: extracting a plurality of feature maps of an original image using a feature extraction network; inputting the plurality of feature maps into a feature pyramid network to perform feature fusion, to obtain a plurality of fused feature maps; and using a region proposal network to perform the following operations: determining an initial anchor of a web header using the fused feature map, based on a size of each fused feature map; determining an offset parameter of the initial anchor, based on a ratio of the size of the fused feature map to the original image; and generating a plurality of candidate anchors in different directions, based on the offset parameter of the initial anchor.
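
The anchor steps in this abstract can be illustrated with a minimal sketch. It is one possible reading, not the claimed method: the function name `candidate_anchors`, the `base_scale` factor, the half-stride offset, and the five shift directions are all assumptions made for illustration. The only idea taken from the abstract is that the offset is derived from the ratio between the original image size and the fused feature map size.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (center_x, center_y, width, height)


def candidate_anchors(
    image_size: Tuple[int, int],
    fmap_size: Tuple[int, int],
    cell: Tuple[int, int],
    base_scale: float = 4.0,  # assumed anchor size in units of stride
) -> List[Box]:
    """Return an initial anchor for one feature-map cell plus shifted copies."""
    img_h, img_w = image_size
    f_h, f_w = fmap_size
    # Ratio of the original image to the fused feature map (the stride).
    stride_y = img_h / f_h
    stride_x = img_w / f_w

    # Initial anchor: centered on the cell, sized from the feature-map stride.
    row, col = cell
    cx = (col + 0.5) * stride_x
    cy = (row + 0.5) * stride_y
    w = base_scale * stride_x
    h = base_scale * stride_y

    # Assumed offset parameter: half a stride along each axis.
    dx, dy = stride_x / 2, stride_y / 2

    # Unshifted anchor first, then shifts to the left, right, up and down.
    directions = [(0, 0), (-1, 0), (1, 0), (0, -1), (0, 1)]
    return [(cx + sx * dx, cy + sy * dy, w, h) for sx, sy in directions]


anchors = candidate_anchors(image_size=(512, 512), fmap_size=(64, 64), cell=(0, 0))
```

With a 512x512 image and a 64x64 fused feature map the stride is 8, so the initial anchor for cell (0, 0) is centered at (4, 4) and each candidate is shifted by 4 pixels in one direction.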

    METHOD AND APPARATUS FOR DETECTING TEMPORAL ACTION OF VIDEO, ELECTRONIC DEVICE AND STORAGE MEDIUM

    Publication No.: US20210216782A1

    Publication Date: 2021-07-15

    Application No.: US17144205

    Application Date: 2021-01-08

    Abstract: A method and apparatus for detecting a temporal action of a video, an electronic device and a storage medium are disclosed, which relate to the field of video processing technologies. An implementation includes: acquiring an initial temporal feature sequence of a video to be detected; acquiring, by a pre-trained video-temporal-action detecting module, implicit features and explicit features of a plurality of configured temporal anchor boxes based on the initial temporal feature sequence; and acquiring, by the video-temporal-action detecting module, the starting position and the ending position of a video clip containing a specified action, the category of the specified action and the probability that the specified action belongs to the category from the plurality of temporal anchor boxes according to the explicit features and the implicit features of the plurality of temporal anchor boxes.

    IMAGE PROCESSING AND TRAINING FOR A NEURAL NETWORK

    Publication No.: US20220004801A1

    Publication Date: 2022-01-06

    Application No.: US17480053

    Application Date: 2021-09-20

    Abstract: The present disclosure provides an image processing method and apparatus, a training method and apparatus for a neural network, a device, and a medium. The implementation is: inputting a source domain image and a target domain image into a matching feature extraction network to extract a matching feature of the source domain image and a matching feature of the target domain image, wherein the matching feature of the source domain image and the matching feature of the target domain image are mutually matching features in the source domain image and the target domain image, the source domain image is a simulated image generated through rendering based on object pose parameters, and the target domain image is a real image that is actually shot and applicable to training of object pose estimation; and providing the matching feature of the source domain image for the training of the object pose estimation.

    METHOD, APPARATUS, ELECTRONIC DEVICE AND COMPUTER READABLE MEDIUM FOR CALIBRATING EXTERNAL PARAMETER OF CAMERA

    Publication No.: US20210358169A1

    Publication Date: 2021-11-18

    Application No.: US17339177

    Application Date: 2021-06-04

    Abstract: A method and an apparatus for calibrating an external parameter of a camera are provided. The method may include: acquiring a time-synchronized data set of three-dimensional point clouds and a two-dimensional image of a calibration reference object, the two-dimensional image being acquired by a camera with a to-be-calibrated external parameter; establishing a transformation relationship between a point cloud coordinate system and an image coordinate system, the transformation relationship including a transformation parameter; back-projecting the data set of the three-dimensional point clouds onto a plane where the two-dimensional image is located through the transformation relationship to obtain a set of projection points of the three-dimensional point clouds; adjusting the transformation parameter to map the set of the projection points onto the two-dimensional image; and obtaining an external parameter of the camera based on the adjusted transformation parameter and the data set of the three-dimensional point clouds.
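
The back-projection step can be sketched as follows, assuming a standard pinhole camera model, which the abstract does not specify. The rotation matrix and translation vector stand in for the transformation parameter, and the intrinsics `fx`, `fy`, `cx`, `cy` and both function names are hypothetical. The second function only measures how far the projected points lie from observed image points; adjusting the transformation parameter would typically mean reducing such an error, but the optimization itself is not shown.

```python
import math
from typing import List, Sequence, Tuple

Point3 = Sequence[float]
Pixel = Tuple[float, float]


def project_points(
    points: Sequence[Point3],
    rotation: Sequence[Sequence[float]],  # 3x3 matrix, point cloud -> camera
    translation: Sequence[float],         # 3-vector, point cloud -> camera
    fx: float, fy: float, cx: float, cy: float,
) -> List[Pixel]:
    """Project 3D points onto the image plane with a pinhole model."""
    pixels: List[Pixel] = []
    for p in points:
        # Camera-frame coordinates: R * p + t.
        cam = [
            sum(rotation[i][j] * p[j] for j in range(3)) + translation[i]
            for i in range(3)
        ]
        if cam[2] <= 0:
            continue  # skip points behind the camera
        pixels.append((fx * cam[0] / cam[2] + cx, fy * cam[1] / cam[2] + cy))
    return pixels


def mean_reprojection_error(projected: Sequence[Pixel], observed: Sequence[Pixel]) -> float:
    """Mean Euclidean pixel distance between matched point pairs."""
    distances = [math.hypot(u - x, v - y) for (u, v), (x, y) in zip(projected, observed)]
    return sum(distances) / len(distances)


identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
cloud = [(1.0, 2.0, 4.0), (0.0, 0.0, -1.0)]
projected = project_points(cloud, identity, [0.0, 0.0, 0.0], 100.0, 100.0, 50.0, 60.0)
error = mean_reprojection_error(projected, [(78.0, 114.0)])
```

Here the first point projects to pixel (75, 110), the second point is dropped because it lies behind the camera, and the error against the observed pixel (78, 114) is 5 pixels.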

    IMAGE PROCESSING METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM

    Publication No.: US20210286975A1

    Publication Date: 2021-09-16

    Application No.: US17332520

    Application Date: 2021-05-27

    Abstract: The disclosure provides an image processing method, an image processing apparatus, an electronic device and a storage medium, which belong to the field of computer technologies, and specifically relate to computer vision, image processing, face recognition, and deep learning technologies in artificial intelligence. The method includes: performing skin color recognition on a face image to be processed to determine a target skin color of a face contained in the face image; obtaining a reference transformation image corresponding to the face image by processing the face image using any style transfer model in response to determining that a style transfer model set does not comprise a style transfer model corresponding to the target skin color; and obtaining a target transformation image matching the target skin color by adjusting a hue value, a saturation value, and a lightness value of each pixel in the target region based on the target skin color.
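
The final adjustment step can be illustrated for a single pixel with Python's standard `colorsys` module. This is a sketch under assumptions: the abstract does not say how the hue, saturation and lightness values are adjusted, so the linear blend toward the target color and the `strength` parameter are hypothetical, as is the function name `adjust_pixel`. Selecting which pixels belong to the target region is not shown.

```python
import colorsys
from typing import Tuple

RGB = Tuple[int, int, int]


def adjust_pixel(rgb: RGB, target_rgb: RGB, strength: float = 0.5) -> RGB:
    """Move one pixel's hue, lightness and saturation toward a target color."""
    r, g, b = (c / 255.0 for c in rgb)
    tr, tg, tb = (c / 255.0 for c in target_rgb)

    # colorsys returns the components in (hue, lightness, saturation) order.
    h, l, s = colorsys.rgb_to_hls(r, g, b)
    th, tl, ts = colorsys.rgb_to_hls(tr, tg, tb)

    # Hue is circular, so blend along the shortest signed arc.
    dh = ((th - h + 0.5) % 1.0) - 0.5
    h = (h + strength * dh) % 1.0
    l = l + strength * (tl - l)
    s = s + strength * (ts - s)

    out = colorsys.hls_to_rgb(h, l, s)
    return tuple(round(c * 255) for c in out)


unchanged = adjust_pixel((200, 150, 120), (120, 80, 60), strength=0.0)
matched = adjust_pixel((200, 150, 120), (120, 80, 60), strength=1.0)
```

A `strength` of 0 leaves the pixel as it was and a `strength` of 1 reproduces the target color; values in between give a partial shift.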

    METHOD AND APPARATUS FOR SELECTING VIDEO CLIP, SERVER AND MEDIUM

    Publication No.: US20210227302A1

    Publication Date: 2021-07-22

    Application No.: US17026488

    Application Date: 2020-09-21

    Abstract: Embodiments of the present disclosure relate to a method and apparatus for selecting a video clip, a server and a medium. The method may include: determining at least two video clips from a video; for each video clip, performing the following excitement determination steps: inputting a feature sequence of a video frame in the video clip and title information of the video into a pre-established prediction model to obtain a relevance between the inputted video frame and a title of the video; and determining an excitement of the video clip, based on the relevance between the video frame in the video clip and the title; and determining a target video clip from the video clips, based on the excitement of each of the video clips.
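
The selection logic can be sketched once per-frame relevance scores are available. The prediction model itself is not reproduced here, so the scores below are made-up inputs. Taking the mean relevance as the excitement of a clip and picking the clip with the highest excitement are assumptions, since the abstract only says that excitement is based on relevance. The function names are hypothetical.

```python
from statistics import mean
from typing import Dict, Sequence


def clip_excitement(frame_relevances: Sequence[float]) -> float:
    """Assumed aggregation: excitement is the mean frame-to-title relevance."""
    return mean(frame_relevances)


def select_target_clip(clips: Dict[str, Sequence[float]]) -> str:
    """Return the name of the clip with the highest excitement."""
    return max(clips, key=lambda name: clip_excitement(clips[name]))


# Hypothetical relevance scores, as a prediction model might output them.
relevance_by_clip = {
    "clip_a": [0.2, 0.4],
    "clip_b": [0.9, 0.7],
    "clip_c": [0.5],
}
target = select_target_clip(relevance_by_clip)
```

In this example `clip_b` has the highest mean relevance and is selected as the target clip.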

    IMAGE FILLING METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM

    Publication No.: US20210201448A1

    Publication Date: 2021-07-01

    Application No.: US17203437

    Application Date: 2021-03-16

    Abstract: An image filling method and apparatus, a device and a storage medium are disclosed. The image filling method includes: performing multilevel encoding processing on features of an image to be filled to generate multilevel encoded feature layers, sizes of the multilevel encoded feature layers being reduced layer by layer; performing layer-by-layer decoding processing on the multilevel encoded feature layers to obtain multilevel decoded feature layers and a first image, there being no missing region in the first image, wherein the layer-by-layer decoding processing includes a concatenation operation on a decoded feature layer and an encoded feature layer with a same size; and performing up-sampling processing on the first image to obtain multilevel up-sampled feature layers and a second image optimized by the up-sampling processing, the up-sampling processing including a concatenation operation on an up-sampled feature layer and a decoded feature layer with a same size.
