Abstract:
This specification describes a method comprising causing provision of a first version of virtual reality content to a first user via first portable user equipment located at a first location and having a first orientation, the virtual reality content being associated with a second location and a second orientation, the first version of the virtual reality content being rendered for provision via the first user equipment in dependence on the first location relative to the second location and the first orientation relative to the second orientation.
Abstract:
A method, apparatus and computer program product are provided to identify one or more salient events from an analysis of one or more images in an efficient and accurate manner. In this regard, the method, apparatus and computer program product may limit the visual analysis of the images to only a subset of the images that are determined to be potentially relevant based upon sensor information provided by one or more sensors carried by the image capturing device. In the context of a method, one or more images that are captured by an image capturing device are identified to be a salient video segment based upon sensor information provided by one or more sensors carried by the image capturing device. The method also includes identifying one or more salient events based upon an analysis of the one or more images of the salient video segment.
Abstract:
An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: estimate an importance of parameters of a neural network based on a graph diffusion process over at least one layer of the neural network; determine the parameters of the neural network that are suitable for pruning or sparsification; remove neurons of the neural network to prune or sparsify the neural network; and provide at least one syntax element for signaling the pruned or sparsified neural network over a communication channel, wherein the at least one syntax element comprises at least one neural network representation syntax element.
Abstract:
An apparatus includes circuitry configured to: receive a plurality of compressed residual local weight updates from a plurality of respective institutes with a plurality of a respective first parameter, the first parameter used to determine a plurality of respective predicted local weight updates; determine a plurality of local weight updates or a plurality of adjusted local weight updates based on the plurality of compressed residual local weight updates and the plurality of respective predicted local weight updates; aggregate the plurality of determined local weight updates or the plurality of adjusted local weight updates to generate an intended global weight update, and update a model on a server based at least on the intended global weight update, the model used to perform a task; and transfer a compressed residual global weight update to the institutes with a second parameter, the second parameter used to determine a predicted global weight update.
Abstract:
An apparatus includes circuitry configured to: partition an input tensor into one or more block tensors; partition at least one of the block tensors into one or more continuation bands, the one or more continuation bands being associated with a caching counter having a value; store the one or more continuation bands in a cache managed using a cache manager; retrieve, prior to a convolution or pooling operation on a current block tensor, the one or more continuation bands of a previous block tensor from the cache that are adjacent to a current block tensor; concatenate the retrieved continuation bands with the current block tensor; apply the convolution or pooling operation on the current block tensor after the concatenation; decrease the respective caching counter value of the retrieved continuation bands; and clear the continuation bands from the cache when its respective caching counter reaches a value of zero.
Abstract:
An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: encode or decode a high-level bitstream syntax for at least one neural network; wherein the high-level bitstream syntax comprises at least one information unit having metadata or compressed neural network data of a portion of the at least one neural network; and wherein a serialized bitstream comprises one or more of the at least one information unit.
Abstract:
A method, an apparatus and a computer program are provided. The method comprises defining a first mapping that maps a virtual space to a real space; using a tracked real point of view of a user in the real space and the first mapping to determining a virtual point of view of the user within the virtual space defined by the user's virtual location and virtual orientation; causing rendering to the user of at least part of a virtual scene determined by the virtual point of view of the user within the virtual space, while the virtual space is mapped to the real space using the first mapping; altering, based on a grasp and move gesture performed by the user on a virtual location in the virtual space that differs from the virtual location partially defining the user's virtual point of view, the mapping of the virtual space to the real space from the first mapping to a second mapping; using a tracked real point of view of the user in the real space and the second mapping to determine a virtual point of view of the user within the virtual space defined by the user's virtual location and virtual orientation; and causing rendering to the user of at least part of a virtual scene determined by the virtual point of view of the user within the virtual space, while the virtual space is mapped to the real space using the second mapping.
Abstract:
A method comprising: recognizing a first two-handed gesture and a second two-handed gesture in a monitored scene space to at least partially define a shape and position of a computer-implemented virtual boundary in a corresponding monitoring space, wherein the first two-handed gesture in the monitored scene space specifies a first two points in the monitoring space and the second two-handed gesture in the monitored scene space specifies a second two points in the monitoring space; causing implementation of the computer-implemented virtual boundary in the monitoring space corresponding to the monitored scene space, wherein a shape and position of the computer-implemented virtual boundary is at least partially defined by the first two points in the monitoring space and the second two points in the monitoring space; and processing received data to generate a response event when there is a change in a portion of the monitored scene space relative to the computer-implemented virtual boundary in the corresponding monitoring space.
Abstract:
A method comprising: automatically applying a selection criterion or criteria to a sound object; if the sound object satisfies the selection criterion or criteria then performing one of correct or incorrect rendering of the sound object; and if the sound object does not satisfy the selection criterion or criteria then performing the other of correct or incorrect rendering of the sound object, wherein correct rendering of the sound object comprises at least rendering the sound object at a correct position within a rendered sound scene compared to a recorded sound scene and wherein incorrect rendering of the sound object comprises at least rendering of the sound object at an incorrect position in a rendered sound scene compared to a recorded sound scene or not rendering the sound object in the rendered sound scene.
Abstract:
A method, apparatus and computer program code is provided. The method comprises: causing display of a virtual object at a first position in virtual space, the virtual object having a visual position and an aural position at the first position; processing positional audio data based on the aural position of the virtual object being at the first position; causing positional audio to be output to a user based on the processed positional audio data; changing the aural position of the virtual object from the first position to a second position in the virtual space, while maintaining the visual position of the virtual object at the first position; further processing positional audio data based on the aural position of the virtual object being at the second position; and causing positional audio to be output to the user based on the further processed positional audio data, while maintaining the visual position of the virtual object at the first position.