DYNAMIC TEMPORAL NORMALIZATION FOR DEEP LEARNING IN VIDEO UNDERSTANDING APPLICATIONS

    公开(公告)号:US20240273873A1

    公开(公告)日:2024-08-15

    申请号:US18563305

    申请日:2021-09-01

    CPC classification number: G06V10/7715 G06V10/32 G06V10/82

    Abstract: Techniques related to application of deep neural networks to video for video recognition and understanding are discussed. A feature map of a deep neural network for a current time stamp of input video is standardized to a standardized feature map and pooled to a feature vector. The feature vector and transform parameters for a prior time stamp are used to generate transform parameters for the current time stamp based on application of a meta temporal relay. The resultant current time stamp transform parameters, such as a hidden state and a cell state of the meta temporal relay, are used to transform the standardized feature map to a normalized feature map for use by a subsequent layer of the deep neural network.

    Methods and systems for boosting deep neural networks for deep learning

    公开(公告)号:US11790223B2

    公开(公告)日:2023-10-17

    申请号:US16475076

    申请日:2017-04-07

    Abstract: Methods and systems are disclosed for boosting deep neural networks for deep learning. In one example, in a deep neural network including a first shallow network and a second shallow network, a first training sample is processed by the first shallow network using equal weights. A loss for the first shallow network is determined based on the processed training sample using equal weights. Weights for the second shallow network are adjusted based on the determined loss for the first shallow network. A second training sample is processed by the second shallow network using the adjusted weights. In another example, in a deep neural network including a first weak network and a second weak network, a first subset of training samples is processed by the first weak network using initialized weights. A classification error for the first weak network on the first subset of training samples is determined. The second weak network is boosted using the determined classification error of the first weak network with adjusted weights. A second subset of training samples is processed by the second weak network using the adjusted weights.

    METHODS AND APPARATUS TO DYNAMICALLY NORMALIZE DATA IN NEURAL NETWORKS

    公开(公告)号:US20230274132A1

    公开(公告)日:2023-08-31

    申请号:US18005804

    申请日:2020-08-26

    CPC classification number: G06N3/08

    Abstract: Methods, apparatus, systems, and articles of manufacture to dynamically normalize data in neural networks are disclosed. An apparatus for use with a machine learning model includes at least one normalization calculator to generate a plurality of alternate normalized outputs associated with input data for the machine learning model. Different ones of the alternate normalized outputs based on different normalization techniques. The apparatus further includes a soft weighting engine to generate a plurality of soft weights based on the input data. The apparatus also includes a normalized output generator to generate a final normalized output based on the plurality of alternate normalized outputs and the plurality of soft weights.

    Visual question answering using visual knowledge bases

    公开(公告)号:US11663249B2

    公开(公告)日:2023-05-30

    申请号:US16650853

    申请日:2018-01-30

    CPC classification number: G06F16/3329 G06N3/045 G06N3/049 G06N5/025

    Abstract: An example apparatus for visual question answering includes a receiver to receive an input image and a question. The apparatus also includes an encoder to encode the input image and the question into a query representation including visual attention features. The apparatus includes a knowledge spotter to retrieve a knowledge entry from a visual knowledge base pre-built on a set of question-answer pairs. The apparatus further includes a joint embedder to jointly embed the visual attention features and the knowledge entry to generate visual-knowledge features. The apparatus also further includes an answer generator to generate an answer based on the query representation and the visual-knowledge features.

    METHOD AND APPARATUS FOR DYNAMIC NORMALIZATION AND RELAY IN A NEURAL NETWORK

    公开(公告)号:US20220207359A1

    公开(公告)日:2022-06-30

    申请号:US17485406

    申请日:2021-09-25

    Abstract: Embodiments are generally directed to methods and apparatuses for dynamic normalization and relay in a neural network. An embodiment of an apparatus for dynamic normalization and relay in a neural network including a hyper normalization layer comprises: a compute engine to: generate a hidden state and a cell state for the hyper normalization layer based on an input feature map for the hyper normalization layer as well as a previous hidden state and a previous cell state; and normalize the input feature map in the hyper normalization layer with the hidden state and the cell state for the hyper normalization layer.

Patent Agency Ranking