Abstract:
A method includes determining, at a decoder of a first device, an offset value corresponding to an offset between a first particular packet and a second particular packet. The first device includes a de-jitter buffer. The method also includes transmitting the offset value to an encoder of a second device to enable the second device to send packets to the first device based on the offset value.
Abstract:
A method of performing gain adjustment in an electronic device includes determining a first set of spectral frequency values and determining a second set of spectral frequency values. The first set of spectral frequency values corresponds to a high-band portion of an audio signal received at the electronic device. The second set of spectral frequency values approximates the first set of spectral frequency values in the high band portion of the audio signal. The method includes estimating a spectral distortion corresponding to a difference between the first set of spectral frequency values and the second set of spectral frequency values and adjusting, based on the spectral distortion, a gain value corresponding to at least a portion of the audio signal. The method also includes transmitting an encoded bitstream that includes information corresponding to the adjusted gain value and the second set of spectral frequency values.
Abstract:
A device includes a receiver, a buffer, and an analyzer. The receiver is configured to receive packets that correspond to at least a subset of a sequence of packets and that include error correction data. The error correction data of a first packet of the packets includes a partial copy of a second packet. The buffer is configured to store the packets. The analyzer is configured to determine whether a first particular packet of the sequence is missing from the buffer, to determine whether a partial copy of the first particular packet is stored in the buffer as error correction data in a second particular packet, to update a value based at least in part on whether the first particular packet is missing from the buffer and the partial copy of the first particular packet is stored in the buffer, and to adjust an error recovery parameter based at least in part on the value.
Abstract:
In speech processing systems, compensation is made for sudden changes in the background noise in the average signal-to-noise ratio (SNR) calculation. SNR outlier filtering may be used, alone or in conjunction with weighting the average SNR. Adaptive weights may be applied on the SNRs per band before computing the average SNR. The weighting function can be a function of noise level, noise type, and/or instantaneous SNR value. Another weighting mechanism applies a null filtering or outlier filtering which sets the weight in a particular band to be zero. This particular band may be characterized as the one that exhibits an SNR that is several times higher than the SNRs in other bands.
Abstract:
A method includes filtering, at a speech encoder, an audio signal into a first group of sub-bands within a first frequency range and a second group of sub-bands within a second frequency range. The method also includes generating a harmonically extended signal based on the first group of sub-bands. The method further includes generating a third group of sub-bands based, at least in part, on the harmonically extended signal. The third group of sub-bands corresponds to the second group of sub-bands. The method also includes determining a first adjustment parameter for a first sub-band in the third group of sub-bands or a second adjustment parameter for a second sub-band in the third group of sub-bands. The first adjustment parameter is based on a metric of a first sub-band in the second group of sub-bands, and the second adjustment parameter is based on a metric of a second sub-band in the second group of sub-bands.
Abstract:
A method for mitigating potential frame instability by an electronic device is described. The method includes obtaining a frame subsequent in time to an erased frame. The method also includes determining whether the frame is potentially unstable. The method further includes applying a substitute weighting value to generate a stable frame parameter if the frame is potentially unstable.
Abstract:
A method for determining pitch pulse period signal boundaries by an electronic device is described. The method includes obtaining a signal. The method also includes determining a first averaged curve based on the signal. The method further includes determining at least one first averaged curve peak position based on the first averaged curve and a threshold. The method additionally includes determining pitch pulse period signal boundaries based on the at least one first averaged curve peak position. The method also includes synthesizing a speech signal.
Abstract:
A method for quantizing phase information on an electronic device is described. The method includes obtaining a speech signal. The method also includes determining a prototype pitch period signal based on the speech signal and transforming the prototype pitch period signal into a first frequency-domain signal. The method additionally includes mapping the first frequency-domain signal into a plurality of subbands. The method also includes determining a global alignment based on the first frequency-domain signal and quantizing the global alignment utilizing scalar quantization to obtain a quantized global alignment. The method additionally includes determining a plurality of band alignments corresponding to the plurality of subbands. The method also includes quantizing the plurality of band alignments utilizing vector quantization to obtain a quantized plurality of band alignments. The method further includes transmitting the quantized global alignment and the quantized plurality of band alignments.
Abstract:
A method for determining an interpolation factor set by an electronic device is described. The method includes determining a value based on a current frame property and a previous frame property. The method also includes determining whether the value is outside of a range. The method further includes determining an interpolation factor set based on the value and a prediction mode indicator if the value is outside of the range. The method additionally includes synthesizing a speech signal.
Abstract:
A particular method includes determining, based on spectral information corresponding to an audio signal that includes a low-band portion and a high-band portion, that the audio signal includes a component corresponding to an artifact-generating condition. The method also includes filtering the high-band portion of the audio signal and generating an encoded signal. Generating the encoded signal includes determining gain information based on a ratio of a first energy corresponding to filtered high-band output to a second energy corresponding to the low-band portion to reduce an audible effect of the artifact-generating condition.