摘要:
A method and system for enhancing the frequency response of speech signals are provided. An average speech spectral shape estimate is calculated over time based on the input speech signal. The average speech spectral shape estimate may be calculated in the frequency domain using a first order IIR filtering or “leaky integrators.” Thus, the average speech spectral shape estimate adapts over time to changes in the acoustic characteristics of the voice path or any changes in the electrical audio path that may affect the frequency response of the system. A spectral correction factor may be determined by comparing the average speech spectral shape estimate to a desired target spectral shape. The spectral correction factor may be added (in units of dB) to the spectrum of the input speech signal in order to enhance or adjust the spectrum of the input speech signal toward the desired spectral shape, and an enhanced speech signal re-synthesized from the corrected spectrum.
摘要:
A system and methods are provided for extending the frequency bandwidth of a harmonic signal. Harmonic content of a band-limited signal is extended to frequencies outside the signal's passband by performing a non-linear transformation on the complex spectrum of the band-limited signal in the frequency domain. The non-linear transformation may be accomplishes by a linear convolution of the complex spectrum with itself. A system for extending the frequency bandwidth of a harmonic signal includes a signal processor with a forward transform module for transforming a time domain signal into the frequency domain, a non-linear transform module for performing the non-linear transformation on the complex spectrum of the harmonic signal, and a reverse transform module for transforming the extended spectrum of the harmonic signal back into the time domain.
摘要:
A speech enhancement system that improves the intelligibility and the perceived quality of processed speech includes a frequency transformer and a spectral compressor. The frequency transformer converts speech signals from the time domain to the frequency domain. The spectral compressor compresses a pre-selected portion of the high frequency band and maps the compressed high frequency band to a lower band limited frequency range. The speech enhancement system may be built into, may be a unitary part of, or may be configured to interface other systems that process audio or high frequency signals.
摘要:
A system and method are provided for improving the quality and intelligibility of speech signals. The system and method apply frequency compression to the higher frequency components of speech signals while leaving lower frequency components substantially unchanged. This preserves higher frequency information related to consonants which is typically lost to filtering and bandpass constraints. This information is preserved without significantly altering the fundamental pitch of the speech signal so that when the speech signal is reproduced its overall tone qualities are preserved. The system and method further apply frequency expansion to speech signals. Like the compression, only the upper frequencies of a received speech signal are expanded. When the frequency expansion is applied to a speech signal that has been compressed according to the invention, the speech signal is substantially returned to its pre-compressed state. However, frequency compression according to the invention provides improved intelligibility even when the speech signal is not subsequently re-expanded. Likewise, speech signals may be expanded even though the original signal was not compressed, without significant degradation of the speech signal quality. Thus, a transmitter may include the system for applying high frequency compression without regard to whether a receiver will be capable of re-expanding the signal. Likewise, a receiver may expand a received speech signal without regard to whether the signal was previously compressed.
摘要:
A signal enhancement system reinforces signal content and improves the signal-to-noise ratio of a signal. The system detects, tracks, and reinforces non-stationary periodic signal components of a signal. The periodic signal components may represent vowel sounds or other voiced sounds. The system may detect, track, and attenuate quasi-stationary signal components in the signal.
摘要:
A voice reinforcement system extracts a portion of a converted speech signal and redirects it towards a listening area where it may be added with the original signal. The system includes a speech input, a filter, and a converter. The speech input generates an intermediate signal from a speech signal. The filter extracts a portion of the signal extending above a cutoff frequency. The converter converts the filtered signal to an aural signal directed towards a listening area.
摘要:
An automatic gain control system maintains desired signal content level, such as voice, in an output signal. The system includes automatic gain control over an input signal, and compensates the output signal based on input signal content. When the input signal level exceeds an upper or lower processing threshold level, or is distorted (e.g., clipped), the system applies a gain to the input signal level. The system may compensate for the gain in the output signal when the input signal includes desired signal content.
摘要:
A speech enhancement system that improves the intelligibility and the perceived quality of processed speech includes a frequency transformer and a spectral compressor. The frequency transformer converts speech signals from the time domain to the frequency domain. The spectral compressor compresses a pre-selected portion of the high frequency band and maps the compressed high frequency band to a lower band limited frequency range.
摘要:
A voice enhancement system is provided for improving the perceptual quality of a processed voice signal. The system improves the perceptual quality of a received voice signal by removing unwanted noise from a voice signal recorded by a microphone or from some other source. Specifically, the system removes sounds that occur within the environment of the signal source but which are unrelated to speech. The system is especially well adapted for removing transient road noises from speech signals recorded in moving vehicles. Transient road noises include common temporal and spectral characteristics that can be modeled. A transient road noise detector employs such models to detect the presence of transient road noises in a voice signal. If transient road noises are found to be present, a transient road noise attenuator is provided to remove them from the signal.
摘要:
A signal enhancement system improves the understandability of speech or other audio signals. The system reinforces selected parts of the signal, may attenuate selected parts of the signal, and may increase SNR. The system includes delay logic, a partitioned adaptive filter, and signal reinforcement logic. The partitioned adaptive filter may track and enhance the fundamental frequency and harmonics in the input signal. The partitioned filter output signals may approximately reproduce the input signal, delayed by an integer multiple of the period of the fundamental frequency of the input signal. The reinforcement logic combines the input signal and the filtered signals to produce an enhanced output signal.