Abstract:
Systems, methods, and apparatus for backward-compatible coding of a set of basis function coefficients that describe a sound field are presented.
Abstract:
In general, techniques are described for signaling channels for scalable coding of higher order ambisonic audio data. A device comprising a memory and a processor may be configured to perform the techniques. The memory may be configured to store the bitstream. The processor may be configured to obtain, from the bitstream, an indication of a number of channels specified in one or more layers in the bitstream, and obtain the channels specified in the one or more layers in the bitstream based on the indication of the number of channels.
Abstract:
In general, techniques are described for obtaining audio rendering information in a bitstream. A device configured to render higher order ambisonic coefficients comprising a processor and a memory may perform the techniques. The processor may be configured to obtain sign symmetry information indicative of sign symmetry of a matrix used to render the higher order ambisonic coefficients to generate a plurality of speaker feeds. The memory may be configured to store the sparseness information.
Abstract:
In general, techniques are described for coding of spherical harmonic coefficients representative of a three dimensional soundfield. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store a plurality of spherical harmonic coefficients. The one or more processors may be configured to perform an energy analysis with respect to the plurality of spherical harmonic coefficients to determine a reduced version of the plurality of spherical harmonic coefficients.
Abstract:
In general, techniques are described for determining quantization step sizes for compression of spatial components of a sound field. A device comprising one or more processors may be configured to perform the techniques. In other words, the one or more processors may be configured to determine a quantization step size to be used when compressing a spatial component of a sound field, where the spatial component generated by performing a vector based synthesis with respect to a plurality of spherical harmonic coefficients.
Abstract:
In general, techniques are described for identifying distinct audio objects from spherical harmonic coefficients (which may also be denotes as higher order ambisonic coefficients). A device comprising one or more processors may perform the techniques so as to identify the distinct audio objects from the spherical harmonic coefficients (SHC) associated with the audio objects based on a directionality determined for one or more of the audio objects.
Abstract:
A device comprises one or more processors configured to apply a binaural room impulse response filter to spherical harmonic coefficients representative of a sound field in three dimensions so as to render the sound field.
Abstract:
In general, techniques are described for determining renderers used for rendering spherical harmonic coefficients to generate one or more loudspeaker signals. A device comprising one or more processors may perform the techniques. The one or more processors may be configured to determine a local speaker geometry of one or more speakers used for playback of spherical harmonic coefficients representative of a sound field, and configure the device to operate based on the local speaker geometry.
Abstract:
In general, techniques are described for grouping audio objects into clusters. In some examples, a device for audio signal processing comprises a cluster analysis module configured to, based on a plurality of audio objects, produce a first grouping of the plurality of audio objects into L clusters, wherein the first grouping is based on spatial information from at least N among the plurality of audio objects and L is less than N. The device also includes an error calculator configured to calculate an error of the first grouping relative to the plurality of audio objects, wherein the error calculator is further configured to, based on the calculated error, produce a plurality L of audio streams according to a second grouping of the plurality of audio objects into L clusters that is different from the first grouping.