摘要:
Described is modeling a room to obtain estimates for walls and a ceiling, and using the model to improve sound source localization by incorporating reflection (reverberation) data into the location estimation computations. In a calibration step, reflections of a known sound are detected at a microphone array, with their corresponding signals processed to estimate wall (and ceiling) locations. In a sound source localization step, when an actual sound (including reverberations) is detected, the signals are processed into hypotheses that include reflection data predictions based upon possible locations, given the room model. The location corresponding to the hypothesis that matches (maximum likelihood) the actual sound data is the estimated location of the sound source.
摘要:
The sound source tracking device of the present invention comprises a plurality of differential microphones having bidirectionality, and a support member adapted to support the plurality of differential microphones such that the plurality of differential microphones are disposed in an array within a given plane. The plurality of differential microphones are supported on the support member such that their principal axes of directionality are approximately orthogonal to the given plane.
摘要:
Based on phase differences between corresponding frequency components of different channels of a multichannel signal, a measure of directional coherency is calculated. Application of such a measure to voice activity detection and noise reduction are also disclosed.
摘要:
A direction-finding method and apparatus for detection and tracking of successive bearing angles of sound-emitting targets, wherein intensity plots of successive clock cycles in a waterfall plot show bearing traces of successive bearing angles, and preferred bearing traces are marked by a tracker. In order to automate the setting and deletion of trackers, starting from trace state vectors, which are determined at the time t=k−1, are each associated with one bearing trace and each have a bearing angle as well as its time derivative, which is referred to as the bearing rate, and possibly an intensity and its time derivative, which is referred to as the intensity rate, and trace errors associated with the trace state vectors for the time t=k, predicted state vectors are predicted together with predicted estimation errors. Bearing traces are displayed as a function of a trace quality
摘要:
A method and apparatus for robust speaker localization and a camera control system employing the same are provided. The apparatus for speaker localization includes: a difference spectrum obtaining section which obtains a difference spectrum of a first pseudo-power spectrum for a speech section and a second pseudo-power spectrum for a non-speech section detected in a voice signal output from a microphone array; and a speaker direction estimation section which detects a peak value in any one of the difference spectrum and the first pseudo-power spectrum, and estimates the direction of a speaker based on the direction angle corresponding to the detected peak value.
摘要:
A method for utilizing a matched field-processing algorithm employing a number of sensors wherein the sensor output is the measured acoustic data as the first input and is translated to a frequency by applying a Fourier transform to a set of time samples as a data vector output. A replica vector is the second data input as a predicted quantity which is computed by an acoustic model with an assumed acoustic location. The output is an ambiguity surface ranging between zero and one with the highest values indicating the likely position of an acoustic location. The matched field response is generalized by averaging the response over multiple frequencies. A response for an array may be computed by forming beams and then combining them by multiplying each by an eigenray factor before summing. The computation of the response may be further defined by voxel interpolation.
摘要:
In one aspect, a method to determine multipath angles of arrival includes performing an autocorrelation on a first signal received at a first received beam from a signal source, performing a cross-correlation between the first signal and a second signal received at a second receive beam from the signal source, and determining an angle of arrival for a first path from the signal source and an angle of arrival for a second path from the signal source based on the autocorrelation and the cross-correlation.
摘要:
A signal processing apparatus includes: a learning processing unit that finds a separating matrix for separating mixed signals in which outputs from a plurality of sound sources are mixed, by a learning process that applies ICA (Independent Component Analysis) to observed signals including the mixed signals; a separation processing unit that applies the separating matrix to the observed signals to separate the mixed signals and generate separated signals corresponding to each of the sound sources; and a sound source direction estimating unit that computes a sound source direction of each of the generated separated signals. The sound source direction estimating unit calculates cross-covariance matrices between the observed signals and the separated signals in corresponding time segments in time-frequency domain, computes phase differences between elements of the cross-covariance matrices, and computes a sound source direction corresponding to each of the separated signals by applying the computed phase differences.
摘要:
An autonomous sonar system and method provide an arrangement capable of beamforming in three dimensions, detecting loud targets, adaptively beamforming in three dimensions to avoid the loud targets, detecting quiet targets, localizing the loud or quiet targets in range, bearing, and depth, detecting modulation of noise associated with propellers of the loud or quiet targets, generating three dimensional tracks of the loud or quiet targets in bearing, range and depth, making classification of the loud or quiet targets, assigning probabilities to the classifications, and generating classification reports according to the classifications for communication to a receiving station, all without human assistance.
摘要:
In order to locate electromagnetic or acoustic signal sources of a sensor configuration (1 a through 1 c) fitted with at least two electric outputs; where the incidence-dependent transfer functions between the acoustic signals incident on the input(s) of the sensor configuration (1 a through 1 c) and the electric output signals are different, the ratio (7X through 7XX) of the output signal is formed and the result then is correlated with the previously determined ratio function (11).