Method for determining for the compression and decompression of an HOA data frame representation

    Publication No.: US10224044B2

    Publication date: 2019-03-05

    Application No.: US15891066

    Filing date: 2018-02-07

    Abstract: When compressing an HOA data frame representation, a gain control (15, 151) is applied for each channel signal before it is perceptually encoded (16). The gain values are transferred in a differential manner as side information. However, for starting decoding of such a streamed compressed HOA data frame representation, absolute gain values are required, which should be coded with a minimum number of bits. For determining such lowest integer number (βe) of bits, the HOA data frame representation (C(k)) is rendered in the spatial domain to virtual loudspeaker signals lying on a unit sphere, followed by normalization of the HOA data frame representation (C(k)). Then the lowest integer number of bits is set to βe = ⌈log2(⌈log2(√(KMAX) · O)⌉ + 1)⌉.
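
The closing formula of this abstract can be evaluated directly. The following is a minimal Python sketch, not the patented procedure itself: it only computes βe from given values of KMAX and O. The function name `lowest_gain_bits`, its parameter names, and the sample values are assumptions chosen for illustration; the abstract does not state how KMAX is obtained or which values are typical.

```python
import math


def lowest_gain_bits(k_max: float, num_coeffs: int) -> int:
    """Evaluate beta_e = ceil(log2(ceil(log2(sqrt(K_MAX) * O)) + 1)).

    k_max      -- the quantity written K_MAX in the abstract (assumed given)
    num_coeffs -- the quantity written O in the abstract (assumed given)
    """
    scale = math.sqrt(k_max) * num_coeffs if k_max > 0 else 0.0
    if num_coeffs < 1 or scale < 1.0:
        # Below 1 the inner logarithm turns negative and the outer one
        # would be undefined, so this sketch rejects such inputs.
        raise ValueError("sqrt(k_max) * num_coeffs must be at least 1")
    inner = math.ceil(math.log2(scale))      # inner ceiling: exponent range
    return math.ceil(math.log2(inner + 1))   # bits needed to index 0..inner


# Hypothetical values, for illustration only.
print(lowest_gain_bits(1.5, 16))  # 3
```

With these sample inputs the inner ceiling is 5, so six distinct exponent values (0 to 5) have to be distinguished, which takes 3 bits.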

    Method and apparatus for encoding/decoding of directions of dominant directional signals within subbands of a HOA signal representation

    Publication No.: US10194257B2

    Publication date: 2019-01-29

    Application No.: US15320071

    Filing date: 2015-07-02

    Abstract: Encoding of Higher Order Ambisonics (HOA) signals commonly results in high data rates. For data rate reduction, a method (100) for encoding direction information for frames of an input HOA signal comprises determining (s101) active candidate directions (MDIR(k)) among predefined global directions having global direction indices, dividing (s102) the input HOA signal into frequency subbands (f1, ..., fF), determining (s103) for each frequency subband active subband directions among the active candidate directions, assigning (s104) a relative direction index to each direction per subband, assembling (s105) direction information for the frame, the direction information comprising the active candidate directions (MDIR(k)), for each subband and each active candidate direction a bit indicating whether or not the active candidate direction is an active subband direction for the respective frequency subband, and for each frequency subband the relative direction indices of active subband directions in the second set of subband directions, and transmitting (s106) the assembled direction information.
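
The assembly step (s105) can be pictured with a small Python sketch. It is an illustration under stated assumptions, not the claimed method: the abstract does not define how relative direction indices are numbered, so the sketch assumes a relative index is simply the position of a direction in the frame's list of active candidate directions. The function name, the dictionary keys, and the sample direction indices are all hypothetical.

```python
def assemble_direction_info(candidates, subband_directions):
    """Assemble per-frame direction side information.

    candidates         -- global indices of the frame's active candidate directions
    subband_directions -- one collection of global indices per frequency subband
    Assumption: a relative index is the position within `candidates`.
    """
    position = {d: i for i, d in enumerate(candidates)}
    flags, relative = [], []
    for dirs in subband_directions:
        dirs = set(dirs)
        if dirs - set(position):
            raise ValueError("subband direction is not an active candidate")
        # One bit per candidate: is it active in this subband?
        flags.append([1 if d in dirs else 0 for d in candidates])
        # Small relative indices instead of large global indices.
        relative.append([position[d] for d in candidates if d in dirs])
    return {
        "candidates": list(candidates),
        "flags": flags,
        "relative_indices": relative,
    }


# Hypothetical frame: three candidates, three subbands.
info = assemble_direction_info([12, 407, 880], [{12, 880}, {407}, set()])
print(info["flags"])             # [[1, 0, 1], [0, 1, 0], [0, 0, 0]]
print(info["relative_indices"])  # [[0, 2], [1], []]
```

The point of the relative numbering is that an index into a handful of candidates needs far fewer bits than an index into the full set of predefined global directions.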

    Method for decoding a higher order ambisonics (HOA) representation of a sound or soundfield

    Publication No.: US10165384B2

    Publication date: 2018-12-25

    Application No.: US15702471

    Filing date: 2017-09-12

    Abstract: When compressing an HOA data frame representation, a gain control (15, 151) is applied for each channel signal before it is perceptually encoded (16). The gain values are transferred in a differential manner as side information. However, for starting decoding of such streamed compressed HOA data frame representation absolute gain values are required, which should be coded with a minimum number of bits. For determining such lowest integer number (βe) of bits the HOA data frame representation (C(k)) is rendered in spatial domain to virtual loudspeaker signals lying on a unit sphere, followed by normalisation of the HOA data frame representation (C(k)). Then the lowest integer number of bits is set to βe = ⌈log2(⌈log2(√(KMAX) · O)⌉ + 1)⌉.

    Method and apparatus for coding or decoding subband configuration data for subband groups

    Publication No.: US10102864B2

    Publication date: 2018-10-16

    Application No.: US15508444

    Filing date: 2015-08-19

    Abstract: For an efficient encoding of subband configuration data, the first, penultimate and last subband groups are treated differently from the other subband groups. Further, subband group bandwidth difference values are used in the encoding. The number of subband groups NSB is coded using a fixed number of bits representing NSB−1. The bandwidth value BSB[1] of the first subband group is coded using a unary code representing BSB[1]−1. No bandwidth value BSB[g] is coded for the last subband group g=NSB. For subband groups g=2, ..., NSB−2, bandwidth difference values ΔBSB[g]=BSB[g]−BSB[g−1] are coded using a unary code, and the bandwidth difference value ΔBSB[NSB−1] for subband group g=NSB−1 is coded using a fixed number of bits.
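
The coding order described in this abstract can be sketched in Python as follows. Several details are assumptions because the abstract leaves them open: the unary code is taken to be a run of ones terminated by a zero, the two fixed field widths (`nsb_bits`, `last_diff_bits`) are free parameters, bandwidth differences are assumed non-negative, and configurations with fewer than three subband groups are rejected rather than guessed at. All function names are hypothetical.

```python
def unary(value: int) -> str:
    """Unary code, assumed here to be `value` ones followed by a zero."""
    if value < 0:
        raise ValueError("unary code needs a non-negative value")
    return "1" * value + "0"


def fixed(value: int, width: int) -> str:
    """Plain binary code of `value` using exactly `width` bits."""
    if width < 1 or not 0 <= value < (1 << width):
        raise ValueError("value does not fit the fixed-width field")
    return format(value, f"0{width}b")


def encode_subband_config(bandwidths, nsb_bits, last_diff_bits):
    """Encode BSB[1..NSB]; bandwidths[g - 1] holds BSB[g]."""
    nsb = len(bandwidths)
    if nsb < 3:
        raise ValueError("this sketch only handles NSB >= 3")
    bits = [fixed(nsb - 1, nsb_bits)]        # NSB - 1, fixed width
    bits.append(unary(bandwidths[0] - 1))    # BSB[1] - 1, unary
    for g in range(2, nsb - 1):              # g = 2 .. NSB - 2, unary differences
        bits.append(unary(bandwidths[g - 1] - bandwidths[g - 2]))
    g = nsb - 1                              # penultimate group, fixed width
    bits.append(fixed(bandwidths[g - 1] - bandwidths[g - 2], last_diff_bits))
    return "".join(bits)                     # nothing is coded for g = NSB


# Hypothetical configuration with five subband groups.
print(encode_subband_config([1, 2, 4, 8, 17], nsb_bits=4, last_diff_bits=3))
# 0100010110100
```

Reading the output field by field: `0100` is NSB−1 = 4, `0` is BSB[1]−1 = 0, `10` and `110` are the unary differences 1 and 2, and `100` is the fixed-width difference 4 for the penultimate group. The last bandwidth (17) is never written, presumably because a decoder can infer it from the remaining bandwidth.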

    Methods and apparatus for decompressing a compressed HOA signal

    Publication No.: US20180108362A1

    Publication date: 2018-04-19

    Application No.: US15713174

    Filing date: 2017-09-22

    Abstract: A method for compressing a HOA signal being an input HOA representation with input time frames (C(k)) of HOA coefficient sequences comprises spatial HOA encoding of the input time frames and subsequent perceptual encoding and source encoding. Each input time frame is decomposed (802) into a frame of predominant sound signals (XPS(k−1)) and a frame of an ambient HOA component (C̃AMB(k−1)). The ambient HOA component (C̃AMB(k−1)) comprises, in a layered mode, first HOA coefficient sequences of the input HOA representation (cn(k−1)) in lower positions and second HOA coefficient sequences (cAMB,n(k−1)) in remaining higher positions. The second HOA coefficient sequences are part of an HOA representation of a residual between the input HOA representation and the HOA representation of the predominant sound signals.
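
The layered-mode layout of the ambient HOA component can be illustrated with a short Python sketch. It is a simplification under stated assumptions: frames are plain lists of coefficient sequences, the HOA representation of the predominant sound signals is assumed to be already available as an input, the one-frame delay (k−1) is ignored, and the parameter `num_low` (how many lower positions carry input coefficient sequences) is a hypothetical name that does not appear in the abstract.

```python
def layered_ambient_frame(c_input, c_predominant, num_low):
    """Build an ambient HOA frame in layered mode.

    c_input       -- input HOA frame: list of coefficient sequences (lists of samples)
    c_predominant -- HOA representation of the predominant sounds, same shape
    num_low       -- number of lower positions that carry input sequences unchanged
    """
    if len(c_input) != len(c_predominant):
        raise ValueError("both frames need the same number of sequences")
    if not 0 <= num_low <= len(c_input):
        raise ValueError("num_low is out of range")
    frame = []
    for n, (row_in, row_ps) in enumerate(zip(c_input, c_predominant)):
        if n < num_low:
            # Lower positions: coefficient sequences of the input representation.
            frame.append(list(row_in))
        else:
            # Higher positions: residual between input and predominant sounds.
            frame.append([a - b for a, b in zip(row_in, row_ps)])
    return frame


# Hypothetical 3-sequence frame with two samples per sequence.
c_in = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
c_ps = [[0.5, 0.5], [1.0, 1.0], [2.0, 2.0]]
print(layered_ambient_frame(c_in, c_ps, num_low=1))
# [[1.0, 2.0], [2.0, 3.0], [3.0, 4.0]]
```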

    Method and apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an ambisonics representation of the sound field
    Granted invention patent (status: in force)

    Publication No.: US09503818B2

    Publication date: 2016-11-22

    Application No.: US14356185

    Filing date: 2012-10-31

    Abstract: Spherical microphone arrays capture a three-dimensional sound field (P(Ωc, t)) for generating an Ambisonics representation (Anm(t)), where the pressure distribution on the surface of the sphere is sampled by the capsules of the array. The impact of the microphones on the captured sound field is removed using the inverse microphone transfer function. Equalizing the transfer function of the microphone array is difficult because the reciprocal of the transfer function produces high gains wherever the transfer function takes small values, and those small values are affected by transducer noise. The present principles minimize that noise by Wiener filter processing (34) in the frequency domain, which is automatically controlled (33) per wave number by the signal-to-noise ratio of the microphone array.
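
The noise problem of a plain inverse, and the way an SNR-controlled filter limits it, can be shown with a small Python sketch. This uses the textbook Wiener deconvolution form, conj(b) / (|b|² + 1/SNR), which is an assumption made for illustration; the abstract does not give the patent's actual filter expression. The function name and the sample values are hypothetical.

```python
def wiener_inverse(b: complex, snr: float) -> complex:
    """Regularised inverse of a transfer-function value `b`.

    Textbook Wiener deconvolution form (an assumption, not the patent's
    exact filter): conj(b) / (|b|^2 + 1 / snr), with `snr` a linear
    power ratio evaluated per wave number.
    """
    if snr <= 0:
        raise ValueError("snr must be a positive linear ratio")
    return b.conjugate() / (abs(b) ** 2 + 1.0 / snr)


# Hypothetical values: a small transfer-function value at moderate SNR.
b = 0.01
plain_gain = abs(1 / b)                       # plain inverse amplifies by 100
wiener_gain = abs(wiener_inverse(b, 100.0))   # regularised gain stays below 1
print(plain_gain, wiener_gain)
```

Where the transfer function is large relative to the noise, the regularising term is negligible and the filter approaches the plain inverse 1/b; where it is small, the gain is held down instead of amplifying transducer noise.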