Abstract:
In general, techniques are described for transforming spherical harmonic coefficients. A device comprising one or more processors may perform the techniques. The processors may be configured to parse a bitstream to determine transformation information describing how a sound field was transformed to reduce a number of a plurality of hierarchical elements that provide information relevant in describing the sound field. The processors may further be configured to, when reproducing the sound field based on those of the plurality of hierarchical elements that provide information relevant in describing the sound field, transform the sound field based on the transformation information to reverse the transformation performed to reduce the number of the plurality of hierarchical elements.
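The idea of transforming a sound field so that fewer hierarchical elements carry information, and then reversing that transformation at playback, can be illustrated with a minimal sketch. It assumes a first-order representation (w, x, y, z), a rotation about the vertical axis as the only transformation, and a single angle as the transformation information; the function names `rotate_yaw`, `encode`, and `decode` are hypothetical and not taken from the abstract.

```python
import math

def rotate_yaw(foa, angle):
    """Rotate first-order coefficients (w, x, y, z) about the vertical axis."""
    w, x, y, z = foa
    c, s = math.cos(angle), math.sin(angle)
    return (w, c * x - s * y, s * x + c * y, z)

def encode(foa, tol=1e-9):
    """Rotate so horizontal energy lands on x, then keep only non-negligible elements.

    Returns the kept elements (index -> value) and the rotation angle, which
    plays the role of the transformation information (an assumption here).
    """
    _, x, y, _ = foa
    angle = math.atan2(y, x)
    rotated = rotate_yaw(foa, -angle)
    kept = {i: v for i, v in enumerate(rotated) if abs(v) > tol}
    return kept, angle

def decode(kept, angle):
    """Refill dropped elements with zeros and reverse the encoder's rotation."""
    rotated = tuple(kept.get(i, 0.0) for i in range(4))
    return rotate_yaw(rotated, angle)

# A single horizontal source at 60 degrees azimuth.
source = (1.0, math.cos(math.pi / 3), math.sin(math.pi / 3), 0.0)
kept, angle = encode(source)
restored = decode(kept, angle)
```

In this toy case the rotation leaves only two of the four elements non-zero, and applying the inverse rotation at the decoder recovers the original coefficients up to floating-point error.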
Abstract:
A device comprising one or more processors is configured to determine a plurality of segments for each of a plurality of binaural room impulse response filters, wherein each of the plurality of binaural room impulse response filters comprises a residual room response segment and at least one direction-dependent segment for which a filter response depends on a location within a sound field; transform each of at least one direction-dependent segment of the plurality of binaural room impulse response filters to a domain corresponding to a domain of a plurality of hierarchical elements to generate a plurality of transformed binaural room impulse response filters, wherein the plurality of hierarchical elements describe a sound field; and perform a fast convolution of the plurality of transformed binaural room impulse response filters and the plurality of hierarchical elements to render the sound field.
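The transform-then-convolve step for the direction-dependent segments can be sketched as follows, for one ear only. The sketch assumes that "transforming to the domain of the hierarchical elements" means folding a speakers-by-coefficients rendering matrix into per-loudspeaker filter segments, and it leaves out the residual room response segment entirely. All function names are hypothetical, and the radix-2 FFT is a plain textbook implementation used only to make the fast convolution concrete.

```python
import cmath

def fft(x, inverse=False):
    """Recursive radix-2 FFT; len(x) must be a power of two (unnormalized inverse)."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out

def fast_convolve(a, b):
    """Linear convolution via zero-padded FFTs and a pointwise product."""
    size = len(a) + len(b) - 1
    n = 1
    while n < size:
        n *= 2
    fa = fft([complex(v) for v in a] + [0j] * (n - len(a)))
    fb = fft([complex(v) for v in b] + [0j] * (n - len(b)))
    y = fft([p * q for p, q in zip(fa, fb)], inverse=True)
    return [v.real / n for v in y[:size]]

def to_sh_domain(segments, rendering_matrix):
    """Combine per-speaker filter segments into one filter per hierarchical element."""
    speakers = range(len(segments))
    taps = range(len(segments[0]))
    coeffs = range(len(rendering_matrix[0]))
    return [[sum(rendering_matrix[l][n] * segments[l][t] for l in speakers)
             for t in taps] for n in coeffs]

def render_ear(shc, sh_filters):
    """Convolve each coefficient signal with its filter and sum the results."""
    out = [0.0] * (len(shc[0]) + len(sh_filters[0]) - 1)
    for signal, filt in zip(shc, sh_filters):
        for i, v in enumerate(fast_convolve(signal, filt)):
            out[i] += v
    return out

# Two speakers, two coefficients, two-tap filter segments (toy values).
matrix = [[1.0, 0.5], [1.0, -0.5]]
segments = [[1.0, 0.0], [0.0, 1.0]]
ear = render_ear([[1.0, 0.0], [2.0, 0.0]], to_sh_domain(segments, matrix))
```

With these toy values the result matches what rendering to the two speakers first and then filtering each speaker feed would produce, which is the point of moving the filters into the coefficient domain.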
Abstract:
In general, techniques are described for coding a number of code vectors for an independent frame of higher-order ambisonic coefficients. An audio decoding device comprising a memory and a processor may perform the techniques. The memory may store a first frame of a bitstream and a second frame of the bitstream. The processor may extract, from the first frame, one or more bits indicative of whether the first frame is an independent frame that includes information specifying a number of code vectors to be used when performing vector dequantization with respect to a vector. The processor may also extract, from the first frame without referencing the second frame, the information specifying the number of code vectors.
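A decoder following this description might read the independence bit first and only then decide where the number of code vectors comes from. The sketch below is illustrative only: the one-bit flag, the three-bit count field, and the names `BitReader` and `parse_frame_header` are assumptions, not the syntax of any actual bitstream format.

```python
class BitReader:
    """Reads bits most-significant-bit first from a bytes object."""

    def __init__(self, data: bytes):
        self.data = data
        self.pos = 0  # current bit position

    def read(self, nbits: int) -> int:
        value = 0
        for _ in range(nbits):
            byte = self.data[self.pos // 8]
            bit = (byte >> (7 - self.pos % 8)) & 1
            value = (value << 1) | bit
            self.pos += 1
        return value

def parse_frame_header(frame: bytes, previous_num_code_vectors=None):
    """Return (independent, num_code_vectors) for one frame.

    Hypothetical layout: 1 bit independence flag, then, for independent
    frames only, a 3-bit field holding the number of code vectors minus one.
    """
    reader = BitReader(frame)
    independent = reader.read(1) == 1
    if independent:
        # Everything needed is in this frame; no other frame is consulted.
        num_code_vectors = reader.read(3) + 1
    else:
        if previous_num_code_vectors is None:
            raise ValueError("dependent frame needs state from an earlier frame")
        num_code_vectors = previous_num_code_vectors
    return independent, num_code_vectors

# 0b1101_0000: flag = 1, count field = 0b101.
first = parse_frame_header(bytes([0xD0]))
# 0b0000_0000: flag = 0, so the count is carried over.
second = parse_frame_header(bytes([0x00]), previous_num_code_vectors=first[1])
```

The design choice worth noting is that an independent frame can serve as a random-access point, because decoding it never depends on state from another frame.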
Abstract:
In general, techniques are described for coding higher-order ambisonic (HOA) coefficients during multiple transitions. A device comprising a processor and a memory coupled to the processor may be configured to perform the techniques. The processor may be configured to obtain a multi-transition indication of whether an ambient HOA coefficient is in transition during the same frame of a bitstream in which a foreground audio signal is in transition. The processor may also be configured to obtain, based on the multi-transition indication, a vector that describes a spatial characteristic of a corresponding foreground audio signal, both the vector and the corresponding HOA audio signal being decomposed from the HOA audio data. The memory may be configured to store the vector.
Abstract:
In general, techniques are described for indicating reuse of a syntax element that indicates a quantization mode used when compressing a vector. A device comprising a processor and a memory may perform the techniques. The processor may be configured to obtain a bitstream comprising a vector in a spherical harmonics domain. The bitstream may further comprise an indicator for whether to reuse, from a previous frame, at least one syntax element indicative of a quantization mode used when compressing the vector. The memory may be configured to store the bitstream.
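The reuse indicator can be illustrated with a small round trip between an encoder and a decoder. This is a sketch under stated assumptions: bits are modeled as a string of "0" and "1", the quantization mode is a two-bit value (0 to 3), and the functions `encode_modes` and `decode_modes` are hypothetical names rather than part of any specified syntax.

```python
def encode_modes(modes):
    """Write a reuse flag per frame, followed by a 2-bit mode only when it changes."""
    bits = []
    previous = None
    for mode in modes:
        if mode == previous:
            bits.append("1")  # reuse the mode from the previous frame
        else:
            bits.append("0")  # a new mode follows explicitly
            bits.append(format(mode, "02b"))
        previous = mode
    return "".join(bits)

def decode_modes(bits, frame_count):
    """Recover the per-frame quantization modes from the bit string."""
    modes = []
    pos = 0
    previous = None
    for _ in range(frame_count):
        reuse = bits[pos] == "1"
        pos += 1
        if reuse:
            if previous is None:
                raise ValueError("first frame cannot reuse a previous mode")
            mode = previous
        else:
            mode = int(bits[pos:pos + 2], 2)
            pos += 2
        modes.append(mode)
        previous = mode
    return modes

frame_modes = [2, 2, 2, 1, 1]
stream = encode_modes(frame_modes)
decoded = decode_modes(stream, len(frame_modes))
```

When several syntax elements share one reuse flag, as the wording "at least one syntax element" allows, a single bit can stand in for all of them whenever consecutive frames use the same settings.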
Abstract:
A device comprises one or more processors configured to apply a binaural room impulse response filter to spherical harmonic coefficients representative of a sound field in three dimensions so as to render the sound field.
Abstract:
A device comprising one or more processors is configured to obtain transformation information, the transformation information describing how a sound field was transformed to reduce a number of a plurality of hierarchical elements to a reduced plurality of hierarchical elements; and perform binaural audio rendering with respect to the reduced plurality of hierarchical elements based on the transformation information.
Abstract:
In general, techniques are described for inserting audio channels into descriptions of soundfields. A device comprising a processor may be configured to perform the techniques. The processor may be configured to obtain an audio channel separate from a higher-order ambisonic representation of a soundfield. The processor may further be configured to insert the audio channel at a spatial location within the soundfield such that the audio channel is able to be extracted from the soundfield.
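One way to picture inserting a separate audio channel at a spatial location is to encode it as a point source in a first-order representation and later recover it with a beam aimed at the same direction. The sketch assumes ACN channel order (W, Y, Z, X) with SN3D normalization and hypothetical helper names. Extraction is exact here only because the rest of the soundfield is silent; with other content present, a simple beam like this would also pick up leakage.

```python
import math

def foa_gains(azimuth, elevation):
    """First-order encoding gains in ACN order (W, Y, Z, X), SN3D normalization."""
    return (
        1.0,
        math.sin(azimuth) * math.cos(elevation),
        math.sin(elevation),
        math.cos(azimuth) * math.cos(elevation),
    )

def insert_channel(soundfield, channel, azimuth, elevation):
    """Add a mono channel to each coefficient signal, weighted by its direction gain."""
    gains = foa_gains(azimuth, elevation)
    return [
        [s + g * c for s, c in zip(coeff, channel)]
        for coeff, g in zip(soundfield, gains)
    ]

def extract_channel(soundfield, azimuth, elevation):
    """Project the soundfield onto the direction gains to recover the channel."""
    gains = foa_gains(azimuth, elevation)
    norm = sum(g * g for g in gains)
    samples = range(len(soundfield[0]))
    return [
        sum(g * coeff[t] for g, coeff in zip(gains, soundfield)) / norm
        for t in samples
    ]

# Insert a three-sample channel to the left of the listener in a silent soundfield.
silent = [[0.0, 0.0, 0.0] for _ in range(4)]
channel = [0.5, -0.25, 1.0]
mixed = insert_channel(silent, channel, math.pi / 2, 0.0)
recovered = extract_channel(mixed, math.pi / 2, 0.0)
```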
Abstract:
In general, techniques are described for signaling an order format for higher-order ambisonic audio data. An audio decoding device including a memory and a processor may perform the techniques. The memory may be configured to store a bitstream indicative of a coded higher-order ambisonic (HOA) audio signal. The processor may be configured to obtain, from the bitstream, a harmonic coefficient ordering format indicator indicative of a symmetric harmonic coefficient ordering format for a source set of HOA coefficients from which the coded HOA audio signal is generated. The processor may further be configured to decode the coded HOA audio signal based on the symmetric harmonic coefficient ordering format indicator.
Abstract:
In general, techniques are described for coding an ambient higher order ambisonic coefficient. An audio decoding device comprising a memory and a processor may perform the techniques. The memory may store a first frame of a bitstream and a second frame of the bitstream. The processor may obtain, from the first frame, one or more bits indicative of whether the first frame is an independent frame that includes additional reference information to enable the first frame to be decoded without reference to the second frame. The processor may further obtain, in response to the one or more bits indicating that the first frame is not an independent frame, prediction information for first channel side information data of a transport channel. The prediction information may be used to decode the first channel side information data of the transport channel with reference to second channel side information data of the transport channel.