Abstract:
A device comprising one or more processors is configured to apply adaptively determined weights to a plurality of channels of an audio signal to generate a plurality of adaptively weighted channels of the audio signal. The processors are further configured to combine at least two of the plurality of adaptively weighted channels of the audio signal to generate a combined signal. The processors are further configured to apply a binaural room impulse response filter to the combined signal to generate a binaural audio signal.
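The three steps above (weight, combine, filter) can be sketched as follows. This is a minimal illustration, not the claimed method: the abstract does not say how the weights are determined, so the energy-share rule in `adaptive_weights` is an assumption, and all function names are hypothetical. The binaural room impulse response is modeled as one short impulse response per ear, applied by direct convolution.

```python
def adaptive_weights(channels):
    """Hypothetical rule (an assumption): weight each channel by its share of total energy."""
    energies = [sum(s * s for s in ch) for ch in channels]
    total = sum(energies)
    if total == 0.0:
        # Silent input: fall back to equal weights.
        return [1.0 / len(channels)] * len(channels)
    return [e / total for e in energies]


def combine(channels, weights):
    """Sum the weighted channels sample by sample into one combined signal."""
    length = len(channels[0])
    return [sum(w * ch[n] for w, ch in zip(weights, channels)) for n in range(length)]


def convolve(signal, impulse_response):
    """Direct-form linear convolution."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def binauralize(channels, brir_left, brir_right):
    """Weight, combine, then filter once per ear to obtain a binaural pair."""
    weights = adaptive_weights(channels)
    combined = combine(channels, weights)
    return convolve(combined, brir_left), convolve(combined, brir_right)
```

Filtering the combined signal once per ear, rather than filtering every channel separately, is what keeps the number of convolutions independent of the channel count in this sketch.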
Abstract:
In general, techniques are directed to intermediate compression of higher-order ambisonic audio data. For example, a device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store intermediately formatted audio data generated as a result of an intermediate compression of higher-order ambisonic audio data. The one or more processors may be configured to process the intermediately formatted audio data.
Abstract:
In general, techniques are described for performing codebook selection when coding vectors decomposed from higher-order ambisonic coefficients. A device comprising a memory and a processor may perform the techniques. The memory may be configured to store a plurality of codebooks to use when performing vector dequantization with respect to a vector quantized spatial component of a soundfield. The vector quantized spatial component may be obtained through application of a decomposition to a plurality of higher-order ambisonic coefficients. The processor may be configured to select one of the plurality of codebooks.
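A rough sketch of codebook selection during vector dequantization follows. The abstract does not state how the selection is signaled or how the spatial component is reconstructed, so this assumes a codebook identifier is available to the decoder and that the component is a weighted sum of code vectors. The `CODEBOOKS` table, its contents, and the function names are all hypothetical.

```python
# Hypothetical stored codebooks: identifier -> list of code vectors.
CODEBOOKS = {
    0: [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]],
    1: [[0.5, 0.5, 0.5, 0.5], [0.5, -0.5, 0.5, -0.5]],
}


def select_codebook(codebooks, codebook_id):
    """Pick one of the stored codebooks; the identifier is assumed to be signaled."""
    if codebook_id not in codebooks:
        raise ValueError(f"unknown codebook id: {codebook_id}")
    return codebooks[codebook_id]


def vector_dequantize(codebooks, codebook_id, entries):
    """Rebuild a spatial component from (code vector index, weight) pairs."""
    codebook = select_codebook(codebooks, codebook_id)
    dim = len(codebook[0])
    out = [0.0] * dim
    for index, weight in entries:
        for k in range(dim):
            out[k] += weight * codebook[index][k]
    return out
```

For example, `vector_dequantize(CODEBOOKS, 1, [(0, 2.0), (1, 1.0)])` selects codebook 1 and combines its two code vectors with weights 2.0 and 1.0.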
Abstract:
In general, techniques are described for coding of vectors decomposed from higher-order ambisonic (HOA) coefficients. A device comprising a memory and a processor may perform the techniques. The memory may be configured to store audio data. The processor may be configured to determine whether to perform vector dequantization or scalar dequantization with respect to a decomposed version of a plurality of HOA coefficients.
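The decision between the two dequantization paths can be illustrated with a small dispatcher. The abstract does not say what the determination is based on, so this sketch assumes a `mode` field accompanies each coded vector. The payload layout and every name below are hypothetical.

```python
def scalar_dequantize(levels, step):
    """Uniform scalar dequantization: each integer level is scaled by the step size."""
    return [level * step for level in levels]


def vector_dequantize(codebook, index):
    """Vector dequantization: look up a whole code vector by its index."""
    return list(codebook[index])


def dequantize(payload):
    """Choose the dequantization path from an assumed 'mode' field in the payload."""
    mode = payload["mode"]
    if mode == "vector":
        return vector_dequantize(payload["codebook"], payload["index"])
    if mode == "scalar":
        return scalar_dequantize(payload["levels"], payload["step"])
    raise ValueError(f"unsupported dequantization mode: {mode}")
```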
Abstract:
In general, techniques are described for closed-loop quantization of higher-order ambisonic (HOA) coefficients that provide a three-dimensional representation of a sound field. An audio encoding device may perform closed-loop quantization of an audio object based at least in part on a result of performing quantization of directional information associated with the audio object. An audio decoding device may obtain an audio object that has been closed-loop quantized based at least in part on a result of performing quantization of directional information associated with the audio object, and may dequantize the audio object.
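One way to picture the closed-loop idea is to quantize the directional information first and then derive the audio object from the already quantized direction, so that the object absorbs the direction's quantization error. The sketch below does this with a least-squares projection and a uniform quantizer. Both choices are assumptions made for illustration, and the function names and parameters are hypothetical.

```python
def quantize(values, step):
    """Uniform quantizer: snap each value to the nearest multiple of step."""
    return [round(v / step) * step for v in values]


def closed_loop_quantize(frame, direction, dir_step, obj_step):
    """frame: list of per-sample coefficient vectors; direction: one spatial vector.

    The direction is quantized first. The audio object is then re-estimated
    against the quantized direction before being quantized itself.
    """
    q_dir = quantize(direction, dir_step)
    energy = sum(d * d for d in q_dir)
    if energy == 0.0:
        raise ValueError("quantized direction is all zeros")
    # Least-squares projection of each sample vector onto the quantized direction.
    obj = [sum(x * d for x, d in zip(sample, q_dir)) / energy for sample in frame]
    return quantize(obj, obj_step), q_dir
```

An open-loop encoder would instead quantize the object and the direction independently of each other.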
Abstract:
In general, techniques are described for low frequency rendering of higher-order ambisonic audio data. As an example, a device comprising a memory and a processor may perform the techniques. The memory may be configured to store higher-order ambisonic coefficients. The processor may be configured to obtain a renderer to be used when rendering the higher-order ambisonic coefficients into a speaker feed for a low frequency speaker.
Abstract:
A device comprising one or more processors is configured to determine a plurality of segments for each of a plurality of binaural room impulse response filters, wherein each of the plurality of binaural room impulse response filters comprises a residual room response segment and at least one direction-dependent segment for which a filter response depends on a location within a sound field. The processors are further configured to transform each of the at least one direction-dependent segment of the plurality of binaural room impulse response filters to a domain corresponding to a domain of a plurality of hierarchical elements to generate a plurality of transformed binaural room impulse response filters, wherein the plurality of hierarchical elements describe a sound field. The processors are further configured to perform a fast convolution of the plurality of transformed binaural room impulse response filters and the plurality of hierarchical elements to render the sound field.
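The final rendering step can be sketched for a single ear as below. This assumes the direction-dependent segments have already been transformed into the domain of the hierarchical elements, so that each element channel has one matching filter. "Fast convolution" is taken here to mean multiplication in the frequency domain via a radix-2 FFT, which is a common reading but still an assumption. All names are hypothetical, and every channel and every filter is assumed to have the same length as its peers.

```python
import cmath


def fft(x, invert=False):
    """Recursive radix-2 FFT; len(x) must be a power of two. The inverse is unnormalized."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], invert)
    odd = fft(x[1::2], invert)
    sign = 1 if invert else -1
    out = [0j] * n
    for k in range(n // 2):
        twiddled = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddled
        out[k + n // 2] = even[k] - twiddled
    return out


def fast_convolve(a, b):
    """Linear convolution via zero-padded FFTs and a pointwise product."""
    size = len(a) + len(b) - 1
    n = 1
    while n < size:
        n *= 2
    fa = fft([complex(v) for v in a] + [0j] * (n - len(a)))
    fb = fft([complex(v) for v in b] + [0j] * (n - len(b)))
    product = [p * q for p, q in zip(fa, fb)]
    result = fft(product, invert=True)
    return [v.real / n for v in result[:size]]


def render_ear(element_channels, transformed_filters):
    """Convolve each hierarchical-element channel with its filter and sum the results."""
    size = len(element_channels[0]) + len(transformed_filters[0]) - 1
    out = [0.0] * size
    for channel, filt in zip(element_channels, transformed_filters):
        for i, v in enumerate(fast_convolve(channel, filt)):
            out[i] += v
    return out
```

A full renderer would run this once per ear and would handle the residual room response segment separately, which this sketch leaves out.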
Abstract:
In general, techniques are described for obtaining one or more first vectors describing distinct components of a soundfield and one or more second vectors describing background components of the soundfield, both the one or more first vectors and the one or more second vectors generated at least by performing a transformation with respect to a plurality of spherical harmonic coefficients.