Abstract:
A multi-party control unit (MCU) generates, based on audio data streams that represent sounds associated terminal devices, a mixed audio data stream. In addition, the MCU modifies the mixed mono audio data to steganographically embed sub-streams that include representations of the mono audio data streams. A terminal device receives the modified mixed audio data stream. When the terminal device is configured for stereo playback, the terminal device performs an inverse steganographic process to extract, from the mixed audio data stream, the sub-streams. The terminal device generates and outputs multi-channel audio data based on the extracted sub-streams and the mixed audio data stream. When the terminal device is not configured for stereo playback, the terminal device outputs sound based on the mixed audio data stream without extracting the embedded sub-streams.
Abstract:
A particular method includes determining, at a device, a voicing classification of an input signal. The input signal corresponds to an audio signal. The method also includes controlling an amount of an envelope of a representation of the input signal based on the voicing classification. The method further includes modulating a white noise signal based on the controlled amount of the envelope. The method also includes generating a high band excitation signal based on the modulated white noise signal.
Abstract:
A method includes determining, at a speech encoder, frequency domain gain shape parameters. The frequency domain gain shape parameters are based on a second signal associated with an audio signal. The method further includes adjusting a first signal based on the frequency domain gain shape parameters. The first signal is associated with the audio signal. The method also includes inserting the frequency domain gain shape parameters into an encoded version of the audio signal to enable gain adjustment during reproduction of the audio signal from the encoded version of the audio signal.
Abstract:
A device includes a decoder configured to receive an encoded audio signal at a decoder and to generate a synthesized signal based on the encoded audio signal. The device further includes a classifier configured to classify the synthesized signal based on at least one parameter determined from the encoded audio signal.
Abstract:
A device includes a decoder configured to receive an encoded audio signal at a decoder and to generate a synthesized signal based on the encoded audio signal. The device further includes a classifier configured to classify the synthesized signal based on at least one parameter determined from the encoded audio signal.
Abstract:
A device includes a decoder that includes an extractor, a predictor, a selector, and a switch. The extractor is configured to extract a first plurality of parameters from a received input signal. The input signal corresponds to an encoded audio signal. The predictor is configured to perform blind bandwidth extension by generating a second plurality of parameters independent of high band information in the input signal. The second plurality of parameters corresponds to a high band portion of the encoded audio signal. The selector is configured to select a particular mode from multiple high band modes including a first mode using the first plurality of parameters and a second mode using the second plurality of parameters. The switch is configured to output the first plurality of parameters or the second plurality of parameters based on the selected particular mode.
Abstract:
Systems and methods of performing blind bandwidth extension are disclosed. In an embodiment, a method includes determining, based on a set of low-band parameters of an audio signal, a first set of high-band parameters and a second set of high-band parameters. The method further includes generating a predicted set of high-band parameters based on a weighted combination of the first set of high-band parameters and the second set of high-band parameters.
Abstract:
A particular method includes determining, based on spectral information corresponding to an audio signal that includes a low-band portion and a high-band portion, that the audio signal includes a component corresponding to an artifact-generating condition. The method also includes filtering the high-band portion of the audio signal and generating an encoded signal. Generating the encoded signal includes determining gain information based on a ratio of a first energy corresponding to filtered high-band output to a second energy corresponding to the low-band portion to reduce an audible effect of the artifact-generating condition.
Abstract:
A method includes determining a first modeled high-band signal based on a low-band excitation signal of an audio signal, where the audio signal includes a high-band portion and a low-band portion. The method also includes determining scaling factors based on energy of sub-frames of the first modeled high-band signal and energy of corresponding sub-frames of the high-band portion of the audio signal. The method includes applying the scaling factors to a modeled high-band excitation signal to determine a scaled high-band excitation signal and determining a second modeled high-band signal based on the scaled high-band excitation signal. The method includes determining gain parameters based on the second modeled high-band signal and the high-band portion of the audio signal.
Abstract:
A method for quantizing phase information on an electronic device is described. The method includes obtaining a speech signal. The method also includes determining a prototype pitch period signal based on the speech signal and transforming the prototype pitch period signal into a first frequency-domain signal. The method additionally includes mapping the first frequency-domain signal into a plurality of subbands. The method also includes determining a global alignment based on the first frequency-domain signal and quantizing the global alignment utilizing scalar quantization to obtain a quantized global alignment. The method additionally includes determining a plurality of band alignments corresponding to the plurality of subbands. The method also includes quantizing the plurality of band alignments utilizing vector quantization to obtain a quantized plurality of band alignments. The method further includes transmitting the quantized global alignment and the quantized plurality of band alignments.