Abstract:
In one embodiment, a pickup system includes a wind detector and a wind suppressor. The wind detector has a plurality of analyzers each configured to analyze first and second input signals, and a combiner configured to combine outputs of the plurality of analyzers and issue, based on the combined outputs, a wind level indication signal indicative of wind activity. The analyzers can be selected from a group of analyzers including a spectral slope analyzer, a ratio analyzer, a coherence analyzer, a phase variance analyzer and the like. The wind suppressor has a ratio calculator configured to generate a ratio of the first and second input signals, and a mixer configured to select one of the first or second input signals and to apply to the selected input signal one of first or second panning coefficients based on the wind level indication signal and on the ratio.
Abstract:
In a game, multiple players are communicatively coupled with a network. A progression of game action states is tracked. The action states relate to events that occur during the game and a situation of each of the players that corresponds to the events. Upon reaching a first action state, in which a player is expected to utter a first vociferation based on that player's situation with respect to a game event that occurs in association with the first action state, it is detected whether the vociferation is uttered. The first vociferation is captured. The vociferation may be sent to the other players asynchronously with respect to its capture where it may be rendered locally.
Abstract:
Sound source localization apparatuses and methods are described. A frame amplitude difference vector is calculated based on short time frame data acquired through an array of microphones. The frame amplitude difference vector reflects differences between amplitudes captured by microphones of the array while recording the short time frame data. Similarity between the frame amplitude difference vector and each of a plurality of reference frame amplitude difference vectors is evaluated. Each of the plurality of reference frame amplitude difference vectors reflects differences between amplitudes captured by microphones of the array while recording sound from one of a plurality of candidate locations. A desired location of the sound source is estimated based at least on the candidate locations and associated similarity. The sound source localization can be performed based at least on amplitude difference.
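The matching step described above can be illustrated with a minimal sketch. It rests on assumptions the abstract does not state: the amplitude difference vector is built from all pairwise differences between per-microphone amplitudes, similarity is measured with cosine similarity, and the estimate is simply the candidate location with the highest similarity. The function names and the three-microphone data are hypothetical.

```python
import math

def amplitude_difference_vector(amplitudes):
    """Pairwise differences between per-microphone amplitudes (assumed layout)."""
    n = len(amplitudes)
    return [amplitudes[i] - amplitudes[j] for i in range(n) for j in range(i + 1, n)]

def cosine_similarity(a, b):
    """Cosine similarity; an assumed choice, the abstract names no measure."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0

def estimate_location(frame_amplitudes, references):
    """Return the best-matching candidate location and all similarity scores.

    references maps each candidate location to its reference
    amplitude difference vector.
    """
    fadv = amplitude_difference_vector(frame_amplitudes)
    scores = {loc: cosine_similarity(fadv, ref) for loc, ref in references.items()}
    best = max(scores, key=scores.get)
    return best, scores

# Hypothetical reference amplitudes for three microphones and three locations.
references = {
    "left": amplitude_difference_vector([1.0, 0.6, 0.3]),
    "center": amplitude_difference_vector([0.6, 1.0, 0.6]),
    "right": amplitude_difference_vector([0.3, 0.6, 1.0]),
}
best, scores = estimate_location([0.9, 0.5, 0.2], references)
print(best)
```

A practical system might instead weight or interpolate several high-scoring candidates rather than pick a single maximum; the abstract leaves that open.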
Abstract:
This invention relates to reformatting a plurality of audio input signals from a first format to a second format by applying them to a dynamically-varying transformatting matrix. In particular, this invention obtains information attributable to the direction and intensity of one or more directional signal components, calculates the transformatting matrix based on first and second rules, and applies the audio input signals to the transformatting matrix to produce output signals.
Abstract:
In a class of embodiments, a method and system for calibrating a display using feedback indicative of measurements of light, emitted from the display (typically during display of a test pattern), by a camera device whose camera has a sensitivity function that is unknown a priori but which is operable to measure light emitted by a display in a manner emulating at least one measurement by a reference camera having a known sensitivity function. Typically, the camera device is a handheld camera device including an inexpensive, uncalibrated camera. In another class of embodiments, a system including a display (to be recalibrated), a video preprocessor coupled to the display, and a feedback subsystem including a camera device operable to measure light emitted by the display. The feedback subsystem is coupled and configured to generate preprocessor control parameters in response to measurement data (indicative of measurements by the camera device) and to assert the preprocessor control parameters as calibration feedback to the preprocessor. The preprocessor is operable to calibrate (e.g., recalibrate) the display in response to the control parameters by filtering input image data (e.g., input video data) to be displayed, for example to automatically and dynamically correct for variations in calibration of the display.
Abstract:
An auditory event boundary detector employs down-sampling of the input digital audio signal without an anti-aliasing filter, resulting in a narrower bandwidth intermediate signal with aliasing. Spectral changes of that intermediate signal, indicating event boundaries, may be detected using an adaptive filter to track a linear predictive model of the samples of the intermediate signal. Changes in the magnitude or power of the filter error correspond to changes in the spectrum of the input audio signal. The adaptive filter converges at a rate consistent with the duration of auditory events, so filter error magnitude or power changes indicate event boundaries. The detector is much less complex than methods employing time-to-frequency transforms for the full bandwidth of the audio signal.
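A rough sketch of this idea follows, under stated assumptions: the down-sampling simply keeps every fourth sample with no filtering, the adaptive filter is a two-tap normalized LMS (NLMS) linear predictor, and a boundary is taken to be the largest prediction error after an initial settling period. The abstract does not specify the adaptation algorithm, filter order, step size, or decision rule, so all of these (and the function names) are illustrative choices.

```python
import math

def decimate_without_antialias(samples, factor):
    """Keep every factor-th sample; aliasing is deliberately left in."""
    return samples[::factor]

def nlms_prediction_error(x, order=2, mu=0.5, eps=1e-8):
    """Track a linear predictor of x with NLMS and return the error per sample."""
    w = [0.0] * order
    errors = [0.0] * len(x)
    for n in range(order, len(x)):
        past = [x[n - 1 - k] for k in range(order)]  # most recent sample first
        prediction = sum(wk * pk for wk, pk in zip(w, past))
        e = x[n] - prediction
        energy = sum(p * p for p in past) + eps
        # Normalized LMS update: step scaled by the energy of the past samples.
        w = [wk + mu * e * pk / energy for wk, pk in zip(w, past)]
        errors[n] = e
    return errors

def largest_error_index(errors, settle=100):
    """Index of the largest error magnitude after an assumed settling period."""
    return max(range(settle, len(errors)), key=lambda n: abs(errors[n]))

# Test tone whose frequency changes at input sample 2000 (decimated index 500).
signal = [math.sin(2 * math.pi * (0.01 if n < 2000 else 0.06) * n) for n in range(4000)]
intermediate = decimate_without_antialias(signal, 4)
errors = nlms_prediction_error(intermediate)
print(largest_error_index(errors))
```

In this toy case the predictor settles on the first tone, so the prediction error jumps when the spectrum changes and then decays again as the filter re-converges. A real detector would more likely threshold a smoothed error power than take a single maximum.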
Abstract:
Derivation of a fingerprint includes generating feature matrices based on one or more training images, generating projection matrices based on the feature matrices in a training process, and deriving a fingerprint for one or more images by, at least in part, projecting a feature matrix based on the one or more images onto the projection matrices generated in the training process.
Abstract:
A method of processing at least one input signal by a set of binaural filters such that the outputs are playable over headphones to provide a sense of listening to sound in a listening room via one or more virtual speakers, with the further property that a monophonic mix down sounds good. Also an apparatus for processing the at least one input signal. Also a method of modifying a pair of binaural filters to achieve the property that a monophonic mix down sounds good, while still providing spatialization when listening through headphones.
Abstract:
In some embodiments, a method for processing output of at least one microphone of a device (e.g., a headset) to identify at least one touch gesture exerted by a user on the device, including by distinguishing the gesture from input to the microphone other than a touch gesture intended by the user, and by distinguishing between a tap exerted by the user on the device and at least one dynamic gesture exerted by the user on the device, where the output of the at least one microphone is also indicative of ambient sound (e.g., voice utterances). Other embodiments are systems for detecting ambient sound (e.g., voice utterances) and touch gestures, each including a device including at least one microphone and a processor coupled and configured to process output of each microphone to identify at least one touch gesture exerted by a user on the device.
Abstract:
A method of post-processing banded gains for applying to an audio signal, an apparatus to post-process banded gains, and a tangible computer-readable storage medium comprising instructions that when executed carry out the method. The banded gains are determined by input processing one or more input audio signals. The method includes post-processing the banded gains to generate post-processed gains, generating a particular post-processed gain for a particular frequency band including percentile filtering using gain values from one or more previous frames of the one or more input audio signals and from gain values for frequency bands adjacent to the particular frequency band.
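As an illustration only, percentile filtering over time and frequency could look like the sketch below. The window shape is an assumption: for each band it pools the current frame's gains in that band and its immediate neighbours with the same band's gains from the previous frames, then takes a percentile (the median for q = 0.5). The percentile indexing rule, the window radius, and the function names are hypothetical and are not taken from the abstract.

```python
def percentile(values, q):
    """Value at position int(q * n) of the sorted list, clamped to the last item."""
    ordered = sorted(values)
    index = min(len(ordered) - 1, int(q * len(ordered)))
    return ordered[index]

def post_process_gains(current, previous_frames, q=0.5, band_radius=1):
    """Percentile-filter banded gains across adjacent bands and previous frames.

    current: list of gains for the current frame, one per band.
    previous_frames: list of earlier gain lists with the same band layout.
    """
    n_bands = len(current)
    out = []
    for b in range(n_bands):
        lo = max(0, b - band_radius)
        hi = min(n_bands, b + band_radius + 1)
        window = list(current[lo:hi])                      # adjacent bands, current frame
        window.extend(frame[b] for frame in previous_frames)  # same band, earlier frames
        out.append(percentile(window, q))
    return out

# An isolated gain dip in band 1 that is not supported by neighbours or history.
current = [1.0, 0.1, 1.0, 1.0]
previous = [[1.0, 0.9, 1.0, 1.0], [1.0, 0.8, 1.0, 1.0]]
print(post_process_gains(current, previous))  # [1.0, 0.9, 1.0, 1.0]
```

With q = 0.5 the isolated dip in band 1 is replaced by a value consistent with its neighbours and recent history, which is the kind of outlier smoothing that percentile filtering provides; other values of q bias the result toward higher or lower gains.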