Abstract:
A system, a method, logic embodied in a computer-readable medium, and a computer-readable medium comprising instructions that when executed carry out a method. The method processes: (a) a plurality of input signals, e.g., signals from a plurality of spatially separated microphones; and, for echo suppression, (b) one or more reference signals, e.g., signals from or to be rendered by one or more loudspeakers and that can cause echoes. The method processes the input signals and one or more reference signals to carry out in an integrated manner simultaneous noise suppression and out-of-location signal suppression, and in some versions, echo suppression.
Abstract:
One or more audio signals are processed using a multi-stage (hierarchical) voice and/or signal activity detector (VAD/SAD). A first stage is capable of reducing the workload bandwidth by employing an inexpensive VAD/SAD processor. One or more subsequent stages may further process the audio signals from the first stage. Other implementations may include a first stage that also performs continuity preservation between last blocks of audio signal and the first blocks of audio after it is detected that relevant audio signals are resumed. In yet other implementations, the first stage may extract features from audio signals when they are presented in their coded domain, and possibly with little or no decoding of the audio signal.
Abstract:
One or more audio signals are processed using a multi-stage (hierarchical) voice and/or signal activity detector (VAD/SAD). A first stage is capable of reducing the workload bandwidth by employing an inexpensive VAD/SAD processor. One or more subsequent stages may further process the audio signals from the first stage. Other implementations may include a first stage that also performs continuity preservation between last blocks of audio signal and the first blocks of audio after it is detected that relevant audio signals are resumed. In yet other implementations, the first stage may extract features from audio signals when they are presented in their coded domain, and possibly with little or no decoding of the audio signal.
Abstract:
A system, a method, logic embodied in a computer-readable medium, and a computer-readable medium comprising instructions that when executed carry out a method. The method processes: (a) a plurality of input signals, e.g., signals from a plurality of spatially separated microphones; and, for echo suppression, (b) one or more reference signals, e.g., signals from or to be rendered by one or more loudspeakers and that can cause echoes. The method processes the input signals and one or more reference signals to carry out in an integrated manner simultaneous noise suppression and out-of-location signal suppression, and in some versions, echo suppression.
Abstract:
A plurality of acoustic sensors in a non-anechoic environment are calibrated with the aim of removing manufacturing tolerances and degradation over time but preserving position-dependent differences between the sensors, The sensors are excited by an acoustic stimulus which has either time-dependent characteristics or finite duration. The calibration is to be based on diffuse-field excitation only, in which indirect propagation (including single or multiple reflections) dominate over any direct-path excitation. For this purpose, the calibration process considers only a non-initial portion of sensor outputs and/or of an impulse response derived there-from. Based on these data, a frequency-dependent magnitude response function is estimated and compared with a target response function, from which a calibration function is derived.
Abstract:
A plurality of acoustic sensors in a non-anechoic environment are calibrated with the aim of removing manufacturing tolerances and degradation over time but preserving position-dependent differences between the sensors, The sensors are excited by an acoustic stimulus which has either time-dependent characteristics or finite duration. The calibration is to be based on diffuse-field excitation only, in which indirect propagation (including single or multiple reflections) dominate over any direct-path excitation. For this purpose, the calibration process considers only a non-initial portion of sensor outputs and/or of an impulse response derived there-from. Based on these data, a frequency-dependent magnitude response function is estimated and compared with a target response function, from which a calibration function is derived.