System and method for a unified architecture multi-task deep learning machine for object recognition

    公开(公告)号:US11645869B2

    公开(公告)日:2023-05-09

    申请号:US16808357

    申请日:2020-03-03

    Abstract: A system to recognize objects in an image includes an object detection network outputs a first hierarchical-calculated feature for a detected object. A face alignment regression network determines a regression loss for alignment parameters based on the first hierarchical-calculated feature. A detection box regression network determines a regression loss for detected boxes based on the first hierarchical-calculated feature. The object detection network further includes a weighted loss generator to generate a weighted loss for the first hierarchical-calculated feature, the regression loss for the alignment parameters and the regression loss of the detected boxes. A backpropagator backpropagates the generated weighted loss. A grouping network forms, based on the first hierarchical-calculated feature, the regression loss for the alignment parameters and the bounding box regression loss, at least one of a box grouping, an alignment parameter grouping, and a non-maximum suppression of the alignment parameters and the detected boxes.

    Base station and control method therefor in wireless communication system

    公开(公告)号:US11528081B2

    公开(公告)日:2022-12-13

    申请号:US17265733

    申请日:2019-08-08

    Abstract: The present disclosure relates to a communication method and system for converging a 5th-Generation (5G) communication system for supporting higher data rates beyond a 4th-Generation (4G) system with a technology for Internet of Things (IoT). The present disclosure may be applied to intelligent services based on the 5G communication technology and the IoT-related technology, such as smart home, smart building, smart city, smart car, connected car, health care, digital education, smart retail, security and safety services. According to an embodiment of the present invention, a method for controlling a base station supporting a multi-antenna system may comprise the steps of: generating a plurality of test signals for a plurality of antennas in a modem; controlling the plurality of generated test signals to be fed back to the modem through a plurality of feedback paths which are formed for the plurality of antennas, respectively, and do not affect each other; and identifying the plurality of test signals fed back to the modem, on the basis of the plurality of generated test signals.

    SYSTEM AND METHOD FOR DEEP MACHINE LEARNING FOR COMPUTER VISION APPLICATIONS

    公开(公告)号:US20220391632A1

    公开(公告)日:2022-12-08

    申请号:US17889883

    申请日:2022-08-17

    Abstract: A computer vision (CV) training system, includes: a supervised learning system to estimate a supervision output from one or more input images according to a target CV application, and to determine a supervised loss according to the supervision output and a ground-truth of the supervision output; an unsupervised learning system to determine an unsupervised loss according to the supervision output and the one or more input images; a weakly supervised learning system to determine a weakly supervised loss according to the supervision output and a weak label corresponding to the one or more input images; and a joint optimizer to concurrently optimize the supervised loss, the unsupervised loss, and the weakly supervised loss.

    System and method for acoustic echo cancelation using deep multitask recurrent neural networks

    公开(公告)号:US11521634B2

    公开(公告)日:2022-12-06

    申请号:US17015931

    申请日:2020-09-09

    Abstract: A method for performing echo cancellation includes: receiving a far-end signal from a far-end device at a near-end device; recording a microphone signal at the near-end device including: a near-end signal; and an echo signal corresponding to the far-end signal; extracting far-end features from the far-end signal; extracting microphone features from the microphone signal; computing estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including: an echo estimator including a first stack of a recurrent neural network configured to compute estimated echo features based on the far-end features; and a near-end estimator including a second stack of the recurrent neural network configured to compute the estimated near-end features based on an output of the first stack and the microphone signal; computing an estimated near-end signal from the estimated near-end features; and transmitting the estimated near-end signal to the far-end device.

    METHOD AND APPARATUS BASED ON SCENE DEPENDENT LENS SHADING CORRECTION

    公开(公告)号:US20220375044A1

    公开(公告)日:2022-11-24

    申请号:US17572223

    申请日:2022-01-10

    Abstract: A method of performing scene-dependent lens shading correction (SD-LSC) is provided. The method includes collecting scene information from a Bayer thumbnail of an input image; generating a standard red green blue (sRGB) thumbnail by processing the Bayer thumbnail of the input image to simulate white balance (WB) and pre-gamma blocks; determining a representative color channel ratio of the input image based on the scene information and the sRGB thumbnail; determining an ideal grid gain of the input image based on the representative color channel ratio and a grid gain of the input image; merging the ideal grid gain and the grid gain of the input image to generate a new grid gain; and applying the new grid gain to the input image.

    System and method for boundary aware semantic segmentation

    公开(公告)号:US11461998B2

    公开(公告)日:2022-10-04

    申请号:US16777734

    申请日:2020-01-30

    Abstract: Some aspects of embodiments of the present disclosure relate to using a boundary aware loss function to train a machine learning model for computing semantic segmentation maps from input images. Some aspects of embodiments of the present disclosure relate to deep convolutional neural networks (DCNNs) for computing semantic segmentation maps from input images, where the DCNNs include a box filtering layer configured to box filter input feature maps computed from the input images before supplying box filtered feature maps to an atrous spatial pyramidal pooling (ASPP) layer. Some aspects of embodiments of the present disclosure relate to a selective ASPP layer configured to weight the outputs of an ASPP layer in accordance with attention feature maps.

    SYSTEM AND METHOD FOR ACOUSTIC ECHO CANCELATION USING DEEP MULTITASK RECURRENT NEURAL NETWORKS

    公开(公告)号:US20220293120A1

    公开(公告)日:2022-09-15

    申请号:US17827424

    申请日:2022-05-27

    Abstract: A system for performing echo cancellation includes: a processor configured to: receive a far-end signal; record a microphone signal including: a near-end signal; and an echo signal corresponding to the far-end signal; extract far-end features from the far-end signal; extract microphone features from the microphone signal; compute estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including a recurrent neural network including: an encoder including a plurality of gated recurrent units; and a decoder including a plurality of gated recurrent units; compute an estimated near-end signal from the estimated near-end features; and transmit the estimated near-end signal to the far-end device. The recurrent neural network may include a contextual attention module; and the recurrent neural network may take, as input, a plurality of error features computed based on the far-end features, the microphone features, and acoustic path parameters.

    System and method for providing dolly zoom view synthesis

    公开(公告)号:US11423510B2

    公开(公告)日:2022-08-23

    申请号:US16814184

    申请日:2020-03-10

    Abstract: A method and an apparatus are provided for providing a dolly zoom effect by an electronic device. A first image with a first depth map and a second image with a second depth map are obtained. A first synthesized image and a corresponding first synthesized depth map are generated using the first image and the first depth map respectively. A second synthesized image and a corresponding second synthesized depth map are generated using the second image and the second depth map respectively. A fused image is generated from the first synthesized image and the second synthesized image. A fused depth map is generated from the first synthesized depth map and the second synthesized depth map. A final synthesized image is generated based on processing the fused image and the fused depth map.

Patent Agency Ranking