-
公开(公告)号:US20230209301A1
公开(公告)日:2023-06-29
申请号:US18112082
申请日:2023-02-21
摘要: An apparatus (131) comprising means for: obtaining media content (122), wherein the media content (122) comprises at least one object data; obtaining priority content information (126), the priority content information (126) comprising a priority identification identifying and classifying the at least one object; rendering the at least one object based on the priority content information (126).
-
公开(公告)号:US10770081B2
公开(公告)日:2020-09-08
申请号:US16479916
申请日:2018-01-03
发明人: Adriana Vasilache
IPC分类号: G10L19/24 , G10L19/008 , G10L19/035
摘要: A method comprising: receiving at least two audio channel signals; determining, for a first frame, at least two parameters representing a difference between the at least two channel audio signals; scalar quantising the at least two parameters to generate at least two index values; adaptively encoding an initial scalar quantized parameter of the at least two parameters; determining whether the initial scalar quantized parameter has a value different from a predetermined value; adaptively encoding any unencoded scalar quantized parameters where the initial scalar quantized parameter has a value different from the predetermined value; determining whether all of the at least two scalar quantized parameters have values equal to the predetermined value where the initial scalar quantized parameter has a value equal to the predetermined value; adaptively encoding any unencoded scalar quantized parameters and generating an indicator that an output is one of fixed or variable rate coding where the initial scalar quantized parameter has a value equal to the predetermined value and at least one of the at least two scalar quantized parameters have values different from the predetermined value; generating an indicator that the output is the other of the one of fixed or variable rate coding where the initial scalar quantized parameter has a value equal to the predetermined value and all of the at least two scalar quantized parameters have values equal to the predetermined value; generating a single channel representation of the at least two audio channel signals dependent on the at least two parameters; and encoding the single channel representation.
-
公开(公告)号:US20200209952A1
公开(公告)日:2020-07-02
申请号:US16622442
申请日:2018-06-11
发明人: Kari Juhani Järvinen , Adriana Vasilache , Henri Toukomaa , Sujeet Shyamsundar Mate , Antti Eronen
IPC分类号: G06F3/01
摘要: In respect of virtual-or-augmented reality content comprising data to provide a first sensory scene and a second, different, sensory scene to a user, based on a user-lock-input to lock the first sensory scene; provide for, based on one or both of a user-input to change user orientation in a virtual space and a user-input to change user location in said virtual space, i) presentation of the second sensory scene with a corresponding change in the second sensory scene relative to the user to account for the change in one or both of orientation and location of the user in the virtual space; and ii) presentation of the first, locked, sensory scene with no corresponding change in the first sensory scene relative to the user to account for the change in one or both of orientation and location of the user in the virtual space.
-
公开(公告)号:US20200154231A1
公开(公告)日:2020-05-14
申请号:US16620142
申请日:2018-06-11
发明人: Antti Eronen , Adriana Vasilache , Henri Toukomaa , Kari Juhani Järvinen , Sujeet Shyamsundar Mate
摘要: An apparatus, based on a first audio track of at least one audio track, the first audio track audibly presented to the user as spatial audio such that it is perceived to originate from a particular location and based on the user being within a predetermined distance of the particular location;configured to provide for a change in the audible presentation of the first audio track to the user from presentation as spatial audio to presentation as at least one of monophonic and stereophonic audio.
-
公开(公告)号:US10109287B2
公开(公告)日:2018-10-23
申请号:US14439132
申请日:2012-10-30
IPC分类号: G10L19/07 , G10L19/038 , G10L19/22 , H03M7/30 , G10L19/005 , G10L19/00
摘要: It is inter alia disclosed to quantize a vector of a plurality of coefficients using a predictive mode of operation of a vector quantizer, wherein the vector quantizer can operate in either a predictive mode of operation or a non-predictive mode of operation, determine a vector of a plurality of recovered coefficients; compare the vector of the plurality of coefficients to the vector of the plurality of recovered coefficients; and determine the mode of operation of the vector quantizer for a vector of a plurality of coefficients associated with a subsequent frame of audio samples, wherein the mode of operation is dependent on the comparison.
-
公开(公告)号:US10026413B2
公开(公告)日:2018-07-17
申请号:US15127143
申请日:2015-03-13
发明人: Lasse Laaksonen , Anssi Rämö , Adriana Vasilache
IPC分类号: H04J3/16 , G10L19/24 , G10L19/008 , G10L19/16
摘要: It is disclosed inter alia a method for forming an audio payload frame, wherein the audio payload frame comprises: an encoded audio data frame with a first marker bit at the front of the encoded audio data frame, wherein the first marker is set to a first value, and wherein the first value denotes a type of encoded audio data in the encoded audio data frame; an extension encoded audio data frame; and a second marker bit in front of the first marker bit, wherein the second marker bit is set to a second value; and wherein the second value denotes a type of encoded audio data other than the type of encoded audio data in the encoded audio data frame.
-
公开(公告)号:US12046250B2
公开(公告)日:2024-07-23
申请号:US17642288
申请日:2020-09-09
发明人: Adriana Vasilache
IPC分类号: G10L19/032 , G10L19/00 , G10L19/008
CPC分类号: G10L19/032 , G10L19/0017 , G10L19/008
摘要: An apparatus comprising means configured to: generate spatial audio signal directional metadata parameters for a block of time-frequencies; generate encoded spatial audio signal directional metadata parameters (108) for a block of time-frequencies based on a first quantization resolution (203); compare a number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution against a determined number of bits; output or store the encoded spatial audio signal directional metadata parameters for a block of time-frequencies (108) based on a first quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies (108) based on the first quantization resolution is less than a determined number of bits (217); generate encoded spatial audio signal directional metadata parameters (108) for the block of time-frequencies based on a second quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies (108) based on the first quantization resolution is more than the determined number of bits and a difference between the determined number of bits and the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is less than a determined number of bits is within a determined threshold (217); generate encoded spatial audio signal directional metadata parameters (108) for the block of time-frequencies based on a third quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is more than the determined number of bits and the difference between the determined number of bits and the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is greater than the determined threshold, wherein the third quantization resolution is determined such that a number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies based on the third quantization resolution is always equal to or less than the determined number of bits (217).
-
公开(公告)号:US11676612B2
公开(公告)日:2023-06-13
申请号:US17257813
申请日:2019-06-20
发明人: Adriana Vasilache , Anssi Rämö , Lasse Laaksonen
IPC分类号: G10L19/02 , G10L19/002 , G10L19/008 , G10L19/038
CPC分类号: G10L19/0204 , G10L19/002 , G10L19/008 , G10L19/038
摘要: An apparatus comprising means for: receiving values for sub-bands of a frame of an audio signal, the values comprising at least one azimuth value, at least one elevation value and at least one energy ratio value for each sub-band; determining an allocation of first number of bits to encode the values of the frame, wherein the first number of bits are fixed; encoding the at least one energy ratio value for a frame based on a defined allocation of a second number of bits from the first number of bits; encoding the at least one azimuth value and/or at least one elevation value of the frame based on a defined allocation of a third number of bits from the first number of bits, wherein the third number of bits is variably distributed on a sub-band-by-sub-band basis.
-
公开(公告)号:US11570569B2
公开(公告)日:2023-01-31
申请号:US16960723
申请日:2019-01-14
发明人: Sujeet Shyamsundar Mate , Adriana Vasilache , Lasse Laaksonen , Kari Jarvinen , Antti Eronen , Jussi Leppanen
摘要: An apparatus including at least one processor and at least one memory including computer code for one or more programs, the at least one memory and the computer code configured, with the at least one processor, to cause the apparatus at least to: generate content lock information for a content lock, wherein the content lock information enables control of audio signal processing associated with audio signals related to one or more audio sources based on a position and/or orientation input.
-
公开(公告)号:US20210407525A1
公开(公告)日:2021-12-30
申请号:US17290053
申请日:2019-10-01
IPC分类号: G10L19/02 , G10L19/008
摘要: An apparatus comprising means for: receiving values for sub-bands of a frame of an audio signal, the values comprising at least one azimuth value, at least one elevation value at least one energy ratio value and at least one spread and/or surround coherence value for each sub-band; determining a codebook for encoding at least one spread and/or surround coherence value for each sub-band based on the at least one energy ratio value and at least one azimuth value for each sub-band for a frame; discrete cosine transforming at least one vector, the at least one vector comprising the at least one spread and/or surround coherence value for a sub-band for the frame; and encoding a first number of components of the discrete cosine transformed vector based on the determined codebook.
-
-
-
-
-
-
-
-
-