Graph Diffusion for Structured Pruning of Neural Networks

    公开(公告)号:US20210397965A1

    公开(公告)日:2021-12-23

    申请号:US17354398

    申请日:2021-06-22

    Abstract: An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: estimate an importance of parameters of a neural network based on a graph diffusion process over at least one layer of the neural network; determine the parameters of the neural network that are suitable for pruning or sparsification; remove neurons of the neural network to prune or sparsify the neural network; and provide at least one syntax element for signaling the pruned or sparsified neural network over a communication channel, wherein the at least one syntax element comprises at least one neural network representation syntax element.

    Distributed Audio Capture and Mixing
    2.
    发明申请

    公开(公告)号:US20180295463A1

    公开(公告)日:2018-10-11

    申请号:US15767422

    申请日:2016-10-07

    Abstract: A spatial audio signal is received that is associated with a microphone array configured to provide spatial audio capture and additional audio signal(s) associated with an additional microphone, the additional audio signal having been delayed by a variable delay determined such that common components of the spatial audio signal and the additional audio signal(s) are time aligned. A relative position is received between a first position associated with the microphone array and a second position associated with the additional microphone Source parameter(s) are received classifying an audio source associated with the common components and/or space parameter(s) identifying an environment within which the audio source is located Processing effect ruleset is determined based on the source parameter(s) and/or the space parameter(s). Multiple output audio channel signals are generated by mixing and applying processing effect(s) to the spatial audio signal and the additional audio signal(s) based on the processing effect ruleset(s).

    Mediated Reality
    3.
    发明公开
    Mediated Reality 审中-公开

    公开(公告)号:US20230343022A1

    公开(公告)日:2023-10-26

    申请号:US17787761

    申请日:2020-12-15

    CPC classification number: G06T15/205 G02B27/01 G06F3/013 G06F3/017 A63F13/5255

    Abstract: An apparatus including circuitry configured for: in a first-person perspective mediated reality state, rendering mediated reality content as content distributed across a first area of a user's field of view, wherein a point of view of a user determines a point of view within a three-dimensional virtual space and determines at least part of the content distributed across the first area of a user's field of view as a virtual scene; responding to at least one user gesture to enter a spatially consolidated state; and in the spatially consolidated state, rendering the mediated reality content as content distributed across a second area of a user's field of view, wherein the second area is smaller than the first area and the point of view of the user does not determine the content distributed across the second area of a user's field of view.

    Distributed Audio Capture and Mixing Controlling

    公开(公告)号:US20200015021A1

    公开(公告)日:2020-01-09

    申请号:US16464913

    申请日:2017-11-20

    Abstract: An apparatus for identifying which sound sources are associated with which microphone audio signals, the apparatus comprising including a processor configured to: determine/receive a position/orientation of at least one sound source relative to a microphone array; receive at least one microphone audio signal, each microphone audio signal received from a microphone; receive an audio-focussed audio signal from the microphone array, wherein the audio-focussed audio signal is directed from the microphone array towards the one of the at least one sound source so as to enhance the audio-focussed audio signal; compare the audio-focussed audio signal against each microphone audio signal to identify a match between one of the at least one microphone audio signal and the audio focussed audio signal; and associate the one of the at least one microphone with the at least one sound source, based on the identified match.

    Distributed Audio Capture and Mixing
    5.
    发明申请

    公开(公告)号:US20190313174A1

    公开(公告)日:2019-10-10

    申请号:US16464743

    申请日:2017-11-20

    Abstract: An apparatus for controlling a controllable position/orientation of at least one audio source within an audio scene, the audio scene including the at least one audio source; a capture device, the apparatus including a processor configured to: receive a physical position/orientation of the at least one audio source relative to a capture device capture orientation; receive an earlier physical position/orientation of the at least one audio source relative to the capture device capture orientation; receive at least one control parameter; and control a controllable position/orientation of the at least one audio source, the controllable position being between the physical position/orientation of the at least one audio source relative to the capture device capture orientation and the earlier physical position/orientation of the at least one audio source relative to the capture device capture orientation and based on the control parameter.

    APPARATUS AND ASSOCIATED METHODS
    6.
    发明申请

    公开(公告)号:US20190058861A1

    公开(公告)日:2019-02-21

    申请号:US16078746

    申请日:2017-02-22

    Abstract: An apparatus comprising: at least one processor; and at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following: in respect of virtual reality content comprising video imagery configured to provide a virtual reality space for viewing in virtual reality and based on a plurality of indicated highlight portions, each highlight portion comprising a spatial portion of the video imagery that forms the virtual reality space and being smaller in spatial extent than the spatial extent of the virtual reality space, and further based on a viewing direction in the virtual reality space of each of the plurality highlight portions, provide for one or more of generation or display of virtual reality summary content comprising a plurality of clips, each clip comprising the video imagery associated with one of the highlight portions, the virtual reality summary content configured to provide for display of the clips in a time consecutive manner and to provide for display of consecutive clips with a modified spatial separation such that the angular separation between a clip viewing direction of at least one clip and a clip viewing direction of an immediately preceding clip is less than the angular separation between the viewing directions of the highlight portions associated with said at least one clip and said immediately preceding clip.

    Apparatus, Methods and Computer Programs for Enabling Audio Rendering

    公开(公告)号:US20240121570A1

    公开(公告)日:2024-04-11

    申请号:US18275238

    申请日:2022-01-18

    CPC classification number: H04S7/304 G06N3/08

    Abstract: Example apparatus include circuitry for: obtaining audio content representing at least one audio space; enabling at least one digital signal processing operation to render the audio content such that the rendered audio content includes at least one target response for the at least one audio space wherein the enabling of the at least one digital signal processing operation to render the audio content is controlled based on obtaining the at least one target response for the at least one audio space. When the obtained target response is known the circuitry obtains at least one parameter for the at least one digital signal processing operation. When the obtained target response is unknown the circuitry obtains at least one parameter for a neural network and determines at least one parameter for the at least one digital signal processing operation.

Patent Agency Ranking