DYNAMIC TRIPLET CONVOLUTION FOR CONVOLUTIONAL NEURAL NETWORKS

    公开(公告)号:US20250068891A1

    公开(公告)日:2025-02-27

    申请号:US18724510

    申请日:2022-02-18

    Abstract: Methods, apparatus, systems and articles of manufacture (e.g., physical storage media) to implement dynamic triplet convolution for convolutional neural networks are disclosed. An example apparatus disclosed herein for a convolutional neural network is to calculate one or more scalar kernels based on an input feature map applied to a layer of the convolutional neural network, ones of the one or more scalar kernels corresponding to respective dimensions of a static multidimensional convolutional filter associated with the layer of the convolutional neural network. The disclosed example apparatus is also to scale elements of the static multidimensional convolutional filter along a first one of the dimensions based on a first one of the one or more scalar kernels corresponding to the first one of the dimensions to determine a dynamic multidimensional convolutional filter associated with the layer of the convolutional neural network.

    METHOD AND SYSTEM OF IMAGE HASHING OBJECT DETECTION FOR IMAGE PROCESSING

    公开(公告)号:US20230368493A1

    公开(公告)日:2023-11-16

    申请号:US18030024

    申请日:2020-11-13

    CPC classification number: G06V10/764 G06V10/82 G06V10/776

    Abstract: A method and system of image hashing object detection for image processing are provided. The method comprises the following steps: obtaining image head class input data and image tail class input data differentiated from the head class input data and respectively of two images each of an object to be classified; respectively inputting the head and tail class input data into two separate parallel representation neural networks being trained to respectively generate head and tail features, wherein the representation neural networks share at least some representation weights used to form the head and tail features; inputting the head and tail features into at least one classifier neural network to generate class-related data; generating a class-balanced loss of at least one of the classes of the class-related data comprising factoring an effective number of samples of individual classes; and rebalancing an output sample distribution among the classes at the representation neural networks, classifier neural networks, or both by using the class-balanced loss.

    COMBINATORIAL SHAPE REGRESSION FOR FACE ALIGNMENT IN IMAGES

    公开(公告)号:US20180137383A1

    公开(公告)日:2018-05-17

    申请号:US15573631

    申请日:2015-06-26

    Abstract: Combinatorial shape regression is described as a technique for face alignment and facial landmark detection in images. As described stages of regression may be built for multiple ferns for a facial landmark detection system. In one example a regression is performed on a training set of images using face shapes, using facial component groups, and using individual face point pairs to learn shape increments for each respective image in the set of images. A fern is built based on this regression. Additional regressions are performed for building additional ferns. The ferns are then combined to build the facial landmark detection system.

Patent Agency Ranking