摘要:
A speech decoding method which generates an excitation signal and a synthesis filter from coded data and which obtains a speech signal based on the excitation signal and the synthesis filter. The method includes acquiring identification information used for determining whether the speech signal to be decoded is a narrowband signal or a wideband signal; and modifying the excitation signal based on the identification information by controlling strength or presence of emphasis of pitch periodicity with respect to the excitation signal generated from the coded data, so as to generate the speech signal by use of the modified excitation signal and the synthesis filter.
摘要:
According to one embodiment, an acoustic signal corrector includes: an output module; a selection receiver; and a holder. The output module is configured to output a plurality of acoustic signals. Amplitude values of frequencies within a frequency band of each of the acoustic signals are emphasized as emphasized amplitude values. A plurality of amplitude values among the emphasized amplitude values are corrected as corrected amplitude values at some of frequencies within the frequency band. Resonance is possibly induced in the frequency band by sealing an ear canal. The selection receiver is configured to receive a selection of one of the acoustic signals output by the output module. The holder is configured to hold, as a configuration for sound quality correction, a configuration corresponding to the correction of the one of the acoustic signals at the some of the frequencies.
摘要:
A content reproducing apparatus includes a display unit configured to display a play list and candidate contents able to be added to the play list, a selection unit configured to select, from the candidate contents, an undesired content which a user does not want to add to the play list, a calculation unit configured to calculate a first retrieval statistical quantity based on first characteristic quantity of the undesired content or a second retrieval statistical quantity based on second characteristic quantity of a desired content which the user wants to add to the play list, and a retrieve unit configured to retrieve the candidate contents to prepare the play list, in accordance with similarity which has been calculated by using the first or second retrieval statistical quantity and which shows to which a given content having third characteristic quantity is similar, the desired content or the undesired content.
摘要:
A signal bandwidth extension apparatus includes a determination unit which determines whether or not a peak component of the input signal is lacked in the band to be extended, and a control unit which controls to extend the bandwidth when the determination unit determines that the peak component of the input signal is lacked in the band to be extended, and not to extend the bandwidth when the determination unit determines that the peak component is not lacked.
摘要:
According to one embodiment, a signal processing apparatus includes a speaker configured to output the received input signal on which a delay detection signal which has a frequency component of an inaudible frequency on a received input signal is superposed to an acoustic space, an extracting section configured to extract the delay detection signal from the sending input signal outputted from microphone configured to collect sound in the acoustic space a calculating section configured to calculate a delay time between the received input signal and an acoustic echo component contained in the sending input signal, a delay section configured to delay the received input signal by a time corresponding to the delay time and generate a delayed received input signal, and an echo suppression processing section configured to suppress the acoustic echo component contained in the sending input signal by use of the delayed received input signal.
摘要:
A wideband speech coding method comprising identifying whether an input speech signal is a narrowband signal or a wideband signal, and coding the input speech signal by controlling a predetermined parameter of a wideband speech coding process based on the identification result.
摘要:
In a background noise/speech classification method, whether a digital input signal input through an input terminal is background noise or speech is decided by a background noise/speech decision section on the basis of calculated frame power and a calculated LSP coefficient which are obtained by supplying the input signal to a feature amount calculation section and estimated frame power and an estimated LSP coefficient obtained by an estimated feature amount update section. Thereafter, the estimated feature amount update section updates the estimated frame power and the estimated LSP coefficient by using the frame power and the LSP coefficient obtained by the feature amount calculation section to prepare for the next frame.
摘要:
A speech signal is input to an excitation signal generating section, a prediction filter and a prediction parameter calculator. The prediction parameter calculator calculates a predetermined number of prediction parameters (LPC parameter or reflection coefficient) by an autocorrelation method or covariance method, and supplies the acquired prediction parameters to a prediction parameter coder. The codes of the prediction parameters are sent to a decoder and a multiplexer. The decoder sends decoded values of the codes of the prediction parameters to the prediction filter and the excitation signal generating section. The prediction filter calculates a prediction residual signal, which is the difference between the input speech signal and the decoded prediction parameter, and sends it to the excitation signal generating section. The excitation signal generating section calculates the pulse interval and amplitude for each of a predetermined number of subframes based on the input speech signal, the prediction residual signal and the quantized value of the prediction parameter, and sends them to the multiplexer. The multiplexer combines these codes and the codes of the prediction parameters, and send the results as an output signal of a coding apparatus to a transmission path or the like.
摘要:
Adjusting the shape of a spectrum of a speech signal includes the steps of using a first filter with pole-zero transfer function A(z)/B(z) for subjecting a speech signal to a spectrum envelop emphasis and a second filter cascade-connected with the first filter, for compensating for a spectral tilt due to the first filter, independently deriving two filter coefficients used in the second filter for compensating for the spectral tilt from the pole-zero transfer function, and compensating for the spectral tilt corresponding to the pole-zero transfer function according to the derived filter coefficients.
摘要:
A vector quantizing apparatus includes a first search section for obtaining an approximate vector X1 which is approximated to a desired vector R, a residual vector calculator for calculating a residual vector Rv from the desired vector R and the approximate vector X1, a weighting section for obtaining weighted vectors X2 to XN of code vectors x2 to xN, and a second search section for calculating an estimation value which is the magnitude of a projection vector of the residual vector Rv with respect to the vector space formed by the approximate vector X1 and the weighted vectors X2 to XN, and searching a code vector which maximizes this estimation value.