Distributed audio capture and mixing

    Publication No.: US10708679B2

    Publication date: 2020-07-07

    Application No.: US16464743

    Filing date: 2017-11-20

    Abstract: An apparatus for controlling a controllable position/orientation of at least one audio source within an audio scene, the audio scene including the at least one audio source and a capture device, the apparatus including a processor configured to: receive a physical position/orientation of the at least one audio source relative to a capture device capture orientation; receive an earlier physical position/orientation of the at least one audio source relative to the capture device capture orientation; receive at least one control parameter; and control a controllable position/orientation of the at least one audio source, the controllable position being between the physical position/orientation of the at least one audio source relative to the capture device capture orientation and the earlier physical position/orientation of the at least one audio source relative to the capture device capture orientation and based on the control parameter.
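One way to read "a controllable position between the physical position and the earlier physical position, based on the control parameter" is as an interpolation. The sketch below is only an illustration under that assumption; the `Pose` type, the azimuth/distance representation, and the meaning of `control` (0 keeps the earlier pose, 1 follows the current physical pose) are hypothetical and not taken from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pose:
    """Hypothetical source pose relative to the capture orientation."""
    azimuth_deg: float
    distance_m: float


def controlled_pose(earlier: Pose, physical: Pose, control: float) -> Pose:
    """Interpolate between the earlier and the current physical pose.

    Assumption: control is a blend factor, clamped to [0, 1].
    """
    t = min(1.0, max(0.0, control))
    # Shortest signed angular difference, in [-180, 180) degrees.
    delta = (physical.azimuth_deg - earlier.azimuth_deg + 180.0) % 360.0 - 180.0
    azimuth = (earlier.azimuth_deg + t * delta) % 360.0
    distance = earlier.distance_m + t * (physical.distance_m - earlier.distance_m)
    return Pose(azimuth, distance)
```

Interpolating along the shortest arc keeps a source that crosses the 0/360 degree boundary from swinging the long way around the listener.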

    Method for analysing media content
    Granted invention patent

    Publication No.: US10242289B2

    Publication date: 2019-03-26

    Application No.: US15374010

    Filing date: 2016-12-09

    Inventor: Francesco Cricri

    Abstract: A method for operating a computer graphic system, the method comprising: inputting a media content object (MCO) into a feature extractor comprising semantic abstraction levels; extracting feature maps from the MCO on each of the semantic layers; selecting at least a portion of the MCO to be analyzed; determining, based on the analysis of the feature maps from the portion of the MCO and the analysis of a previous state of a recognition unit, one or more feature maps selected from the feature maps of the semantic layers; determining a weight for each feature map; repeating the determining steps N times, each time processing, based on the analysis, each feature map by applying the corresponding weight; inputting the processed feature maps to the recognition unit; and analyzing a number of the processed feature maps until a prediction about the portion of the MCO is output.

    Searching Image Content
    Invention patent application

    Publication No.: US20180225290A1

    Publication date: 2018-08-09

    Application No.: US15750292

    Filing date: 2016-08-10

    Abstract: A method, an apparatus and computer program code are provided. The method comprises: responding to user input by making at least one alteration to a recording of a real scene in a first image content item; determining at least one altered characteristic of the recording of the real scene; determining whether one or more further image content items, different from the first image content item, have a recording of a real scene comprising the at least one determined altered characteristic; and causing at least one further image content item, having a recording of a real scene comprising the at least one determined altered characteristic, to be indicated to a user.

    Method and apparatus for sensor aided extraction of spatio-temporal features
    Granted invention patent (in force)

    Publication No.: US09471993B2

    Publication date: 2016-10-18

    Application No.: US14155936

    Filing date: 2014-01-15

    Abstract: A method, apparatus and computer program product are provided for extracting spatio-temporal features with the aid of sensor information. An exemplary method comprises receiving video data and auxiliary sensor data and associating the two with timestamp information. The method may also include segmenting an input data stream into stable segments and extracting temporal features from the associated video data. The method may further include extracting temporal features either from the whole video or only from the video data where little or no stable segments are detected and performing camera view motion compensation by using information provided by the auxiliary sensors for modifying the feature-descriptors.
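The abstract does not say how video and sensor data are associated through timestamps. A minimal sketch, assuming nearest-timestamp matching over a sorted list of sensor timestamps (the function names and the millisecond units are hypothetical), could look like this:

```python
from bisect import bisect_left


def nearest_sensor_index(sensor_times_ms, frame_time_ms):
    """Return the index of the sensor sample closest in time to a frame.

    Assumption: sensor_times_ms is sorted ascending and non-empty.
    Ties go to the earlier sample.
    """
    if not sensor_times_ms:
        raise ValueError("no sensor samples to associate with")
    i = bisect_left(sensor_times_ms, frame_time_ms)
    if i == 0:
        return 0
    if i == len(sensor_times_ms):
        return len(sensor_times_ms) - 1
    before, after = sensor_times_ms[i - 1], sensor_times_ms[i]
    return i if after - frame_time_ms < frame_time_ms - before else i - 1


def associate(frame_times_ms, sensor_times_ms):
    """Map each video frame timestamp to its nearest sensor sample index."""
    return [nearest_sensor_index(sensor_times_ms, t) for t in frame_times_ms]
```

For example, `associate([0, 40, 260], [0, 100, 200, 300])` pairs the three frames with sensor samples 0, 0 and 3.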


    Method, an apparatus and a computer program product for video encoding and video decoding

    Publication No.: US12142014B2

    Publication date: 2024-11-12

    Application No.: US17430987

    Filing date: 2020-01-29

    Abstract: The embodiments relate to a method comprising compressing input data (I) by means of at least a neural network (E, 310); determining a compression rate for data compression; running the neural network (E, 310) with the input data (I) to produce an output data (c); removing a number of elements from the output data (c) according to the compression rate to result in a reduced form of the output data (me); and providing the reduced form of the output data (me) and the compression rate to a decoder (D, 320). The embodiments also relate to a method comprising receiving input data (me) for decompression; decompressing the input data (me) by means of at least a neural network (D, 320); determining a decompression rate for decompressing the input data (me); running the neural network (D, 320) with input data (me) to produce a decompressed output data (ĩ); padding a number of elements to the compressed input data (me) according to the decompression rate to produce an output data (ĩ); and providing the output data (ĩ).
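The remove-and-pad step described in this abstract can be illustrated without the neural networks themselves. The sketch below assumes the encoder output (c) is a plain list of numbers, that the rate is expressed as the fraction of elements kept, that trailing elements are the ones removed, and that padding uses zeros; all of these are illustrative assumptions rather than details from the patent.

```python
def reduce_code(code, keep_fraction):
    """Drop trailing elements of the encoder output c.

    Assumption: keep_fraction in (0, 1] is the share of elements kept;
    at least one element always survives.
    """
    keep = max(1, int(len(code) * keep_fraction))
    return list(code[:keep])


def pad_code(reduced, full_length, fill=0.0):
    """Pad the reduced code back to the length the decoder expects.

    Assumption: missing elements are replaced by a constant fill value.
    """
    return list(reduced) + [fill] * (full_length - len(reduced))
```

With `c = [0.5, -1.0, 2.0, 0.25]` and `keep_fraction = 0.5`, the reduced code is `[0.5, -1.0]`, and padding it back to length 4 gives `[0.5, -1.0, 0.0, 0.0]` as the decoder input.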

    Apparatus, a method and a computer program for video coding and decoding

    Publication No.: US11831867B2

    Publication date: 2023-11-28

    Application No.: US17430893

    Filing date: 2020-01-29

    Abstract: A method comprising: obtaining a configuration of at least one neural network comprising a plurality of intra-prediction mode agnostic layers and one or more intra-prediction mode specific layers, the one or more intra-prediction mode specific layers corresponding to different intra-prediction modes; obtaining at least one input video frame comprising a plurality of blocks; determining to encode one or more blocks using intra prediction; determining an intra-prediction mode for each of said one or more blocks; grouping blocks having same intra-prediction mode into groups, each group being assigned with a computation path among the plurality of intra-prediction mode agnostic and the one or more intra-prediction mode specific layers; training the plurality of intra-prediction mode agnostic and/or the one or more intra-prediction mode specific layers of the neural networks based on a training loss between an output of the neural networks relating to a group of blocks and ground-truth blocks, wherein the ground-truth blocks are either blocks of the input video frame or reconstructed blocks; and encoding a block using a computation path assigned to an intra-prediction mode for the block.
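The grouping step in this abstract (blocks with the same intra-prediction mode share one computation path) can be sketched as follows. The function names, the string layer labels, and the ordering of shared layers before the mode-specific layer are assumptions made for illustration; the abstract does not fix them.

```python
from collections import defaultdict


def group_blocks_by_mode(block_modes):
    """Group block indices by their chosen intra-prediction mode."""
    groups = defaultdict(list)
    for block_index, mode in enumerate(block_modes):
        groups[mode].append(block_index)
    return dict(groups)


def computation_path(mode, shared_layers, mode_specific_layers):
    """Build the layer sequence for one mode.

    Assumption: mode-agnostic layers run first, followed by the single
    layer specific to this mode.
    """
    return list(shared_layers) + [mode_specific_layers[mode]]
```

For block modes `["DC", "planar", "DC"]`, the groups are `{"DC": [0, 2], "planar": [1]}`, so blocks 0 and 2 would be processed together along the "DC" path.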

    Apparatus, a method and a computer program for video coding and decoding

    Publication No.: US11622119B2

    Publication date: 2023-04-04

    Application No.: US17575946

    Filing date: 2022-01-14

    Abstract: A method includes maintaining a set of parameters or weights derived through online learning for a neural net; transmitting an update of the parameters or weights to a decoder; deriving a first prediction block based on an output of the neural net using the parameters or weights; deriving a first encoded prediction error block through encoding a difference of the first prediction block and a first input block; encoding the first encoded prediction error block into a bitstream; deriving a reconstructed prediction error block based on the first encoded prediction error block; deriving a second prediction block based on an output of the neural net using the parameters or weights and the reconstructed prediction error block; deriving a second encoded prediction error block through encoding a difference of the second prediction block and a second input block; and encoding the second encoded prediction error block into a bitstream.
