Method and apparatus for identifying salient events by analyzing salient video segments identified by sensor information
    22.
    发明授权
    Method and apparatus for identifying salient events by analyzing salient video segments identified by sensor information 有权
    通过分析由传感器信息识别的突出视频段来识别突出事件的方法和装置

    公开(公告)号:US09384400B2

    公开(公告)日:2016-07-05

    申请号:US14325827

    申请日:2014-07-08

    Abstract: A method, apparatus and computer program product are provided to identify one or more salient events from an analysis of one or more images in an efficient and accurate manner. In this regard, the method, apparatus and computer program product may limit the visual analysis of the images to only a subset of the images that are determined to be potentially relevant based upon sensor information provided by one or more sensors carried by the image capturing device. In the context of a method, one or more images that are captured by an image capturing device are identified to be a salient video segment based upon sensor information provided by one or more sensors carried by the image capturing device. The method also includes identifying one or more salient events based upon an analysis of the one or more images of the salient video segment.

    Abstract translation: 提供了一种方法,装置和计算机程序产品,用于以有效和准确的方式从一个或多个图像的分析中识别一个或多个显着事件。 在这方面,方法,装置和计算机程序产品可以基于由图像捕获装置承载的一个或多个传感器提供的传感器信息将图像的视觉分析限制为仅被确定为潜在相关的图像的子集 。 在方法的上下文中,基于由图像捕获装置携带的一个或多个传感器提供的传感器信息,将由图像捕获装置捕获的一个或多个图像识别为显着视频片段。 该方法还包括基于对显着视频段的一个或多个图像的分析来识别一个或多个突出事件。

    Graph diffusion for structured pruning of neural networks

    公开(公告)号:US12242969B2

    公开(公告)日:2025-03-04

    申请号:US17354398

    申请日:2021-06-22

    Abstract: An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: estimate an importance of parameters of a neural network based on a graph diffusion process over at least one layer of the neural network; determine the parameters of the neural network that are suitable for pruning or sparsification; remove neurons of the neural network to prune or sparsify the neural network; and provide at least one syntax element for signaling the pruned or sparsified neural network over a communication channel, wherein the at least one syntax element comprises at least one neural network representation syntax element.

    Compression Framework for Distributed or Federated Learning with Predictive Compression Paradigm

    公开(公告)号:US20220335269A1

    公开(公告)日:2022-10-20

    申请号:US17717729

    申请日:2022-04-11

    Abstract: An apparatus includes circuitry configured to: receive a plurality of compressed residual local weight updates from a plurality of respective institutes with a plurality of a respective first parameter, the first parameter used to determine a plurality of respective predicted local weight updates; determine a plurality of local weight updates or a plurality of adjusted local weight updates based on the plurality of compressed residual local weight updates and the plurality of respective predicted local weight updates; aggregate the plurality of determined local weight updates or the plurality of adjusted local weight updates to generate an intended global weight update, and update a model on a server based at least on the intended global weight update, the model used to perform a task; and transfer a compressed residual global weight update to the institutes with a second parameter, the second parameter used to determine a predicted global weight update.

    Caching and Clearing Mechanism for Deep Convolutional Neural Networks

    公开(公告)号:US20220191524A1

    公开(公告)日:2022-06-16

    申请号:US17549039

    申请日:2021-12-13

    Abstract: An apparatus includes circuitry configured to: partition an input tensor into one or more block tensors; partition at least one of the block tensors into one or more continuation bands, the one or more continuation bands being associated with a caching counter having a value; store the one or more continuation bands in a cache managed using a cache manager; retrieve, prior to a convolution or pooling operation on a current block tensor, the one or more continuation bands of a previous block tensor from the cache that are adjacent to a current block tensor; concatenate the retrieved continuation bands with the current block tensor; apply the convolution or pooling operation on the current block tensor after the concatenation; decrease the respective caching counter value of the retrieved continuation bands; and clear the continuation bands from the cache when its respective caching counter reaches a value of zero.

    High Level Syntax for Compressed Representation of Neural Networks

    公开(公告)号:US20210211733A1

    公开(公告)日:2021-07-08

    申请号:US17140512

    申请日:2021-01-04

    Abstract: An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: encode or decode a high-level bitstream syntax for at least one neural network; wherein the high-level bitstream syntax comprises at least one information unit having metadata or compressed neural network data of a portion of the at least one neural network; and wherein a serialized bitstream comprises one or more of the at least one information unit.

    METHODS, APPARATUS, SYSTEMS, COMPUTER PROGRAMS FOR ENABLING MEDIATED REALITY

    公开(公告)号:US20210081051A1

    公开(公告)日:2021-03-18

    申请号:US16954390

    申请日:2018-12-18

    Abstract: A method, an apparatus and a computer program are provided. The method comprises defining a first mapping that maps a virtual space to a real space; using a tracked real point of view of a user in the real space and the first mapping to determining a virtual point of view of the user within the virtual space defined by the user's virtual location and virtual orientation; causing rendering to the user of at least part of a virtual scene determined by the virtual point of view of the user within the virtual space, while the virtual space is mapped to the real space using the first mapping; altering, based on a grasp and move gesture performed by the user on a virtual location in the virtual space that differs from the virtual location partially defining the user's virtual point of view, the mapping of the virtual space to the real space from the first mapping to a second mapping; using a tracked real point of view of the user in the real space and the second mapping to determine a virtual point of view of the user within the virtual space defined by the user's virtual location and virtual orientation; and causing rendering to the user of at least part of a virtual scene determined by the virtual point of view of the user within the virtual space, while the virtual space is mapped to the real space using the second mapping.

    Monitoring
    28.
    发明授权

    公开(公告)号:US10776631B2

    公开(公告)日:2020-09-15

    申请号:US15741571

    申请日:2016-07-05

    Abstract: A method comprising: recognizing a first two-handed gesture and a second two-handed gesture in a monitored scene space to at least partially define a shape and position of a computer-implemented virtual boundary in a corresponding monitoring space, wherein the first two-handed gesture in the monitored scene space specifies a first two points in the monitoring space and the second two-handed gesture in the monitored scene space specifies a second two points in the monitoring space; causing implementation of the computer-implemented virtual boundary in the monitoring space corresponding to the monitored scene space, wherein a shape and position of the computer-implemented virtual boundary is at least partially defined by the first two points in the monitoring space and the second two points in the monitoring space; and processing received data to generate a response event when there is a change in a portion of the monitored scene space relative to the computer-implemented virtual boundary in the corresponding monitoring space.

    Intelligent audio rendering
    29.
    发明授权

    公开(公告)号:US10524074B2

    公开(公告)日:2019-12-31

    申请号:US15777718

    申请日:2016-11-22

    Abstract: A method comprising: automatically applying a selection criterion or criteria to a sound object; if the sound object satisfies the selection criterion or criteria then performing one of correct or incorrect rendering of the sound object; and if the sound object does not satisfy the selection criterion or criteria then performing the other of correct or incorrect rendering of the sound object, wherein correct rendering of the sound object comprises at least rendering the sound object at a correct position within a rendered sound scene compared to a recorded sound scene and wherein incorrect rendering of the sound object comprises at least rendering of the sound object at an incorrect position in a rendered sound scene compared to a recorded sound scene or not rendering the sound object in the rendered sound scene.

    Content Discovery
    30.
    发明申请
    Content Discovery 审中-公开

    公开(公告)号:US20190187954A1

    公开(公告)日:2019-06-20

    申请号:US16326306

    申请日:2017-08-22

    CPC classification number: G06F3/167 G06F3/011 G06F3/013

    Abstract: A method, apparatus and computer program code is provided. The method comprises: causing display of a virtual object at a first position in virtual space, the virtual object having a visual position and an aural position at the first position; processing positional audio data based on the aural position of the virtual object being at the first position; causing positional audio to be output to a user based on the processed positional audio data; changing the aural position of the virtual object from the first position to a second position in the virtual space, while maintaining the visual position of the virtual object at the first position; further processing positional audio data based on the aural position of the virtual object being at the second position; and causing positional audio to be output to the user based on the further processed positional audio data, while maintaining the visual position of the virtual object at the first position.

Patent Agency Ranking