Abstract:
A device comprising one or more processors is configured to determine a plurality of segments for each of a plurality of binaural room impulse response filters, wherein each of the plurality of binaural room impulse response filters comprises a residual room response segment and at least one direction-dependent segment for which a filter response depends on a location within a sound field; transform each of the at least one direction-dependent segment of the plurality of binaural room impulse response filters to a domain corresponding to a domain of a plurality of hierarchical elements to generate a plurality of transformed binaural room impulse response filters, wherein the plurality of hierarchical elements describe a sound field; and perform a fast convolution of the plurality of transformed binaural room impulse response filters and the plurality of hierarchical elements to render the sound field.
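The fast convolution step can be illustrated as a multiplication in the frequency domain. The following is a minimal sketch for one ear, not the claimed implementation: the function name `fast_convolve`, the array layouts, and the use of NumPy's real FFT are all assumptions made for illustration.

```python
import numpy as np

def fast_convolve(shc, brir_sh):
    """Convolve each channel with its filter via FFT and sum over channels.

    shc:     (num_coeffs, num_samples) hierarchical elements (assumed layout)
    brir_sh: (num_coeffs, filter_len) transformed filters for one ear (assumed layout)
    """
    n_out = shc.shape[1] + brir_sh.shape[1] - 1
    n_fft = 1 << (n_out - 1).bit_length()  # next power of two >= n_out
    # Per-channel spectra; multiplication here equals convolution in time.
    spec = np.fft.rfft(shc, n_fft, axis=1) * np.fft.rfft(brir_sh, n_fft, axis=1)
    # Sum over coefficient channels, return to the time domain, drop padding.
    return np.fft.irfft(spec.sum(axis=0), n_fft)[:n_out]
```

Zero-padding to at least `num_samples + filter_len - 1` keeps the circular convolution implied by the FFT from wrapping around, so the result matches a direct linear convolution.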
Abstract:
A device comprises one or more processors configured to apply a binaural room impulse response filter to spherical harmonic coefficients representative of a sound field in three dimensions so as to render the sound field.
Abstract:
A device comprising one or more processors is configured to obtain transformation information, the transformation information describing how a sound field was transformed to reduce a number of a plurality of hierarchical elements to a reduced plurality of hierarchical elements; and perform binaural audio rendering with respect to the reduced plurality of hierarchical elements based on the transformation information.
Abstract:
In general, techniques are described for signaling an order format for higher-order ambisonic audio data. An audio decoding device including a memory and a processor may perform the techniques. The memory may be configured to store a bitstream indicative of a coded higher-order ambisonic (HOA) audio signal. The processor may be configured to obtain, from the bitstream, a harmonic coefficient ordering format indicator indicative of a symmetric harmonic coefficient ordering format for a source set of HOA coefficients from which the coded HOA audio signal is generated. The processor may further be configured to decode the coded HOA audio signal based on the symmetric harmonic coefficient ordering format indicator.
Abstract:
In general, techniques are described for obtaining an indication of whether spherical harmonic coefficients are representative of a synthetic audio object. In accordance with the techniques, a device comprising one or more processors may be configured to obtain an indication of whether spherical harmonic coefficients representative of a sound field are generated from a synthetic audio object.
Abstract:
In general, techniques are described by which to perform spatial masking with respect to spherical harmonic coefficients. As one example, an audio encoding device comprising a processor may perform various aspects of the techniques. The processor may be configured to perform spatial analysis based on the spherical harmonic coefficients describing a three-dimensional sound field to identify a spatial masking threshold. The processor may further be configured to render multi-channel audio data from the spherical harmonic coefficients, and compress the multi-channel audio data based on the identified spatial masking threshold to generate a bitstream.
Abstract:
Systems and techniques for rendering audio data are generally disclosed. An example device for rendering a higher order ambisonic (HOA) audio signal includes a memory configured to store the HOA audio signal, and one or more processors coupled to the memory. The one or more processors are configured to perform a loudness compensation process as part of generating an effect matrix. The one or more processors are further configured to render the HOA audio signal based on the effect matrix.
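The abstract does not say how the loudness compensation is computed. As one hedged illustration, the sketch below rescales an effect matrix so that the combined rendering matrix keeps the same Frobenius norm as the renderer alone. The function name `loudness_compensated_effect`, the matrix shapes, and the choice of the Frobenius norm as an energy measure are all assumptions.

```python
import numpy as np

def loudness_compensated_effect(effect, renderer):
    """Scale `effect` so that `renderer @ effect` keeps the renderer's energy.

    effect:   (num_coeffs, num_coeffs) effect matrix (assumed shape)
    renderer: (num_speakers, num_coeffs) rendering matrix (assumed shape)
    """
    before = np.linalg.norm(renderer)          # Frobenius norm without the effect
    after = np.linalg.norm(renderer @ effect)  # Frobenius norm with the effect
    return effect * (before / after)
```

A single scalar gain preserves the spatial behaviour of the effect and changes only its overall level.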
Abstract:
In general, techniques are described for determining renderers used for rendering spherical harmonic coefficients to generate one or more loudspeaker signals. A device comprising one or more processors may perform the techniques. The one or more processors may be configured to determine a local speaker geometry of one or more speakers used for playback of spherical harmonic coefficients representative of a sound field, and configure the device to operate based on the local speaker geometry.
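One common way to derive a renderer from a speaker geometry is mode matching: evaluate the spherical harmonic basis at each speaker direction and take the pseudo-inverse. The sketch below does this for first order only. It is an illustration under stated assumptions (ACN channel order, SN3D normalization, angles in radians, hypothetical function names), not necessarily the method the abstract describes.

```python
import numpy as np

def first_order_sh(azimuth, elevation):
    """Real first-order SH per direction, ACN order (W, Y, Z, X), SN3D (assumed)."""
    az = np.asarray(azimuth, dtype=float)
    el = np.asarray(elevation, dtype=float)
    return np.stack([
        np.ones_like(az),         # W: omnidirectional
        np.sin(az) * np.cos(el),  # Y: left-right
        np.sin(el),               # Z: up-down
        np.cos(az) * np.cos(el),  # X: front-back
    ])                            # shape (4, num_speakers)

def renderer_for_geometry(azimuth, elevation):
    """Mode-matching renderer: speaker_feeds = D @ sh_coefficients."""
    basis = first_order_sh(azimuth, elevation)
    return np.linalg.pinv(basis)  # shape (num_speakers, 4)
```

When the layout has at least four speakers that are not confined to a plane, the basis matrix has full row rank, and re-encoding the speaker feeds reproduces the original first-order coefficients.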
Abstract:
In general, techniques are described for audio editing of higher-order ambisonic audio data. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store a higher order ambisonic (HOA) representation of an audio object. The one or more processors may be configured to add a source tail to the HOA representation of the audio object by storing one or more spherical harmonic (SH) basis functions associated with the audio object to a buffer.
Abstract:
In general, techniques for compression and decoding of audio data are disclosed. An example device for compressing audio data includes one or more processors configured to apply a decorrelation transform to ambient ambisonic coefficients to obtain a decorrelated representation of the ambient ambisonic coefficients, the ambient ambisonic coefficients having been extracted from a plurality of higher order ambisonic coefficients and representative of a background component of a soundfield described by the plurality of higher order ambisonic coefficients, wherein at least one of the plurality of higher order ambisonic coefficients is associated with a spherical basis function having an order greater than one.
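The abstract does not specify which decorrelation transform is applied. To illustrate the general idea, the sketch below swaps in a data-dependent Karhunen-Loeve transform (the eigenvectors of the channel covariance), which produces channels that are uncorrelated with one another and can be inverted exactly. The function name `decorrelate` and the array layout are assumptions; a practical codec might instead use a fixed transform matrix.

```python
import numpy as np

def decorrelate(ambient):
    """Apply a KLT to ambient coefficients (illustrative choice of transform).

    ambient: (num_coeffs, num_samples) ambient ambisonic channels (assumed layout)
    Returns (decorrelated, transform), where decorrelated = transform @ ambient.
    """
    cov = ambient @ ambient.T / ambient.shape[1]  # channel covariance estimate
    _, vecs = np.linalg.eigh(cov)                 # orthonormal eigenvectors
    transform = vecs.T                            # orthogonal matrix
    return transform @ ambient, transform
```

Because the transform is orthogonal, a decoder that knows it can recover the original channels with `transform.T @ decorrelated`.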