Abstract:
In general, techniques are described for obtaining audio rendering information in a bitstream. A device configured to render higher order ambisonic coefficients comprising a processor and a memory may perform the techniques. The processor may be configured to obtain sign symmetry information indicative of sign symmetry of a matrix used to render the higher order ambisonic coefficients to generate a plurality of speaker feeds. The memory may be configured to store the sign symmetry information.
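The abstract does not say what kind of sign symmetry is signaled, so the following is only a minimal sketch under stated assumptions: the coefficients are in ACN order (index = n^2 + n + m), and the symmetry in question is the one between rows of the rendering matrix for a left/right mirrored speaker pair, where only the columns with sub-order m < 0 change sign. The function names (`acn_to_nm`, `mirror_signs`, `mirror_row`) are hypothetical and do not come from the source.

```python
import math

def acn_to_nm(acn):
    """Map an ACN channel index to (order n, sub-order m). Assumes ACN ordering."""
    n = math.isqrt(acn)
    m = acn - n * n - n
    return n, m

def mirror_signs(num_coeffs):
    """Per-column sign under a left/right mirror: harmonics with m < 0 flip sign."""
    return [-1 if acn_to_nm(k)[1] < 0 else 1 for k in range(num_coeffs)]

def mirror_row(row):
    """Derive the rendering-matrix row of the mirrored speaker from a stored row."""
    return [s * v for s, v in zip(mirror_signs(len(row)), row)]

# Hypothetical first-order row for a left speaker (ACN order: W, Y, Z, X).
left_row = [0.5, 0.3, 0.1, 0.2]
right_row = mirror_row(left_row)
```

Under these assumptions, a renderer that knows the symmetry holds would need to store or receive only one row per mirrored pair plus the sign pattern.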
Abstract:
In general, techniques are described for transforming spherical harmonic coefficients. A device comprising one or more processors may perform the techniques. The processors may be configured to parse a bitstream to determine transformation information describing how a sound field was transformed to reduce the number of a plurality of hierarchical elements that provide information relevant in describing the sound field. The processors may further be configured to, when reproducing the sound field based on those hierarchical elements that provide information relevant in describing the sound field, transform the sound field based on the transformation information to reverse the transformation performed to reduce the number of hierarchical elements.
Abstract:
A device comprising one or more processors is configured to determine a plurality of segments for each of a plurality of binaural room impulse response filters, wherein each of the plurality of binaural room impulse response filters comprises a residual room response segment and at least one direction-dependent segment for which a filter response depends on a location within a sound field; transform each of the at least one direction-dependent segment of the plurality of binaural room impulse response filters to a domain corresponding to a domain of a plurality of hierarchical elements to generate a plurality of transformed binaural room impulse response filters, wherein the plurality of hierarchical elements describe the sound field; and perform a fast convolution of the plurality of transformed binaural room impulse response filters and the plurality of hierarchical elements to render the sound field.
Abstract:
A device comprises one or more processors configured to apply a binaural room impulse response filter to spherical harmonic coefficients representative of a sound field in three dimensions so as to render the sound field.
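As a rough illustration of what applying a binaural room impulse response filter to spherical harmonic coefficients can look like, the sketch below assumes the filter has already been expressed in the spherical harmonic domain, with one impulse response per coefficient channel and per ear. Each ear signal is then the sum, over channels, of the channel signal convolved with its filter. The function names and the toy data are hypothetical, and plain time-domain convolution is used only to keep the example short.

```python
def convolve(x, h):
    """Direct (time-domain) linear convolution of two sequences."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y

def render_binaural(shc, brir_left, brir_right):
    """Render two ear signals from per-channel SHC signals.

    shc:        list of K coefficient signals (one list of samples per channel)
    brir_left:  list of K SH-domain impulse responses for the left ear
    brir_right: list of K SH-domain impulse responses for the right ear
    """
    def render_ear(filters):
        length = max(len(x) + len(h) - 1 for x, h in zip(shc, filters))
        out = [0.0] * length
        # Filter each coefficient channel and accumulate into one ear signal.
        for x, h in zip(shc, filters):
            for t, v in enumerate(convolve(x, h)):
                out[t] += v
        return out
    return render_ear(brir_left), render_ear(brir_right)

# Toy two-channel example with made-up filters.
shc = [[1.0, 0.0], [0.0, 1.0]]
left, right = render_binaural(shc, [[1.0, 0.5], [0.5]], [[1.0, 0.5], [-0.5]])
```

For filters of realistic length, the direct convolution here would normally be replaced by an FFT-based one; the summation over channels stays the same.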
Abstract:
A device comprising one or more processors is configured to obtain transformation information, the transformation information describing how a sound field was transformed to reduce a number of a plurality of hierarchical elements to a reduced plurality of hierarchical elements; and perform binaural audio rendering with respect to the reduced plurality of hierarchical elements based on the transformation information.
Abstract:
In general, techniques are described for signaling an order format for higher-order ambisonic audio data. An audio decoding device including a memory and a processor may perform the techniques. The memory may be configured to store a bitstream indicative of a coded higher-order ambisonic (HOA) audio signal. The processor may be configured to obtain, from the bitstream, a harmonic coefficient ordering format indicator indicative of a symmetric harmonic coefficient ordering format for a source set of HOA coefficients from which the coded HOA audio signal is generated. The processor may further be configured to decode the coded HOA audio signal based on the harmonic coefficient ordering format indicator.
Abstract:
In general, techniques are described for obtaining an indication of whether spherical harmonic coefficients are representative of a synthetic audio object. In accordance with the techniques, a device comprising one or more processors may be configured to obtain an indication of whether spherical harmonic coefficients representative of a sound field are generated from a synthetic audio object.
Abstract:
In general, techniques are described by which to perform spatial masking with respect to spherical harmonic coefficients. As one example, an audio encoding device comprising a processor may perform various aspects of the techniques. The processor may be configured to perform spatial analysis based on the spherical harmonic coefficients describing a three-dimensional sound field to identify a spatial masking threshold. The processor may further be configured to render multi-channel audio data from the spherical harmonic coefficients, and compress the multi-channel audio data based on the identified spatial masking threshold to generate a bitstream.
Abstract:
In general, techniques are described for specifying spherical harmonic coefficients in a bitstream. A device comprising one or more processors may perform the techniques. The processors may be configured to identify, from the bitstream, a plurality of hierarchical elements describing a sound field that are included in the bitstream. The processors may further be configured to parse the bitstream to determine the identified plurality of hierarchical elements.
Abstract:
In general, techniques are described for obtaining audio rendering information in a bitstream. A device configured to render higher order ambisonic coefficients comprising a processor and a memory may perform the techniques. The processor may be configured to obtain sparseness information indicative of a sparseness of a matrix used to render the higher order ambisonic coefficients to a plurality of speaker feeds. The memory may be configured to store the sparseness information.
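To make the idea of sparseness concrete, here is a minimal sketch that measures the fraction of zero entries in a rendering matrix and, when the matrix is sparse, renders speaker feeds while skipping the zero entries. The abstract does not describe how the sparseness information is encoded or used, so the function names, the tolerance parameter `tol`, and the example matrix are all assumptions made for illustration.

```python
def sparseness(matrix, tol=0.0):
    """Fraction of matrix entries whose magnitude is at most tol."""
    total = sum(len(row) for row in matrix)
    zeros = sum(1 for row in matrix for v in row if abs(v) <= tol)
    return zeros / total

def to_sparse_rows(matrix, tol=0.0):
    """Keep only (column, value) pairs for the non-zero entries of each row."""
    return [[(j, v) for j, v in enumerate(row) if abs(v) > tol] for row in matrix]

def render(sparse_rows, hoa_frame):
    """One output sample per speaker: dot product over non-zero entries only."""
    return [sum(v * hoa_frame[j] for j, v in row) for row in sparse_rows]

# Hypothetical 2-speaker, 4-coefficient rendering matrix and one HOA frame.
matrix = [[0.5, 0.0, 0.5, 0.0],
          [0.5, 0.0, -0.5, 0.0]]
frame = [1.0, 2.0, 3.0, 4.0]
feeds = render(to_sparse_rows(matrix), frame)
```

In this toy case half of the entries are zero, so the sparse form does half the multiplications of a dense matrix-vector product.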