摘要:
Disclosed is an operation device including an interaction member, a microphone, a control circuit, and an audio signal processing circuit. The interaction member is used for interacting with a user. The control circuit periodically acquires scan data indicating the acting status of the interaction member. The audio signal processing circuit executes a noise removal process of removing noise from a collected audio signal collected by the microphone. The control circuit periodically transmits previously acquired scan data to the audio signal processing circuit. The audio signal processing circuit executes the noise removal process by using the scan data transmitted from the control circuit.
摘要:
A method, computer program, and computer system is provided for an all-deep-learning based AEC system by recurrent neural networks. The model consists of two stages, echo estimation stage and echo suppression stage, respectively. Two different schemes for echo estimation are presented herein: linear echo estimation by multi-tap filtering on far-end reference signal and non-linear echo estimation by single-tap masking on microphone signal. A microphone signal waveform and a far-end reference signal waveform are received. An echo signal waveform is estimated based on the microphone signal waveform and a far-end reference signal waveform. A near-end speech signal waveform is output based on subtracting the estimated echo signal waveform from the microphone signal waveform, and echoes are suppressed within the near-end speech signal waveform.
摘要:
Provided are apparatuses and associated methods for video communications and related features. In one embodiment, a big-screen video communications apparatus is provided that includes a projector and speaker for projecting received images and sounds and includes a camera and microphone for capturing images and sounds for transmission.
摘要:
An example implementation may involve driving an audio output module of a wearable device with a first audio signal and then receiving, via at least one microphone of wearable device, a second audio signal comprising first ambient noise. The device may determine that the first ambient noise is indicative of user speech and responsively duck the first audio signal. While the first audio signal is ducked, the device may detect, in a subsequent portion of the second audio signal, second ambient noise, and determine that the second ambient noise is indicative of ambient speech. Responsive to the determination that the second ambient noise is indicative of ambient speech, the device may continue the ducking of the first audio signal.
摘要:
The invention includes a method, apparatus, and computer program to selectively suppress wind noise while preserving narrow-band signals in acoustic data. Sound from one or several microphones is digitized into binary data. A time-frequency transform is applied to the data to produce a series of spectra. The spectra are analyzed to detect the presence of wind noise and narrow band signals. Wind noise is selectively suppressed while preserving the narrow band signals. The narrow band signal is interpolated through the times and frequencies when it is masked by the wind noise. A time series is then synthesized from the signal spectral estimate that can be listened to. This invention overcomes prior art limitations that require more than one microphone and an independent measurement of wind speed. Its application results in good-quality speech from data severely degraded by wind noise.
摘要:
The disclosure relates to a microphone assembly comprising a multibit analog-to-digital converter configured to receive, sample, and quantize a microphone signal to generate N-bit digital microphone samples representative of the microphone signal at a first sampling frequency. The microphone assembly also comprises a first digital-to-digital converter configured to receive, quantize, and noise-shape the N-bit digital microphone samples to generate a corresponding M-bit Pulse Density Modulated (PDM) signal, wherein N and M are positive integers, and N>M. The microphone assembly may comprise a SoundWire compliant data interface configured to repeatedly receive samples of the M-bit PDM signal and write bits of the M-bit PDM signal to a SoundWire data frame.
摘要:
A noise attenuation apparatus receives an audio signal comprising a desired and a noise signal component. Two codebooks (109, 111) comprise respectively desired signal candidates representing a possible desired signal component and noise signal contribution candidates representing possible noise contributions. A segmenter (103) segments the audio signal into time segments and for each time segment a noise attenuator (105) generates estimated signal candidates by for each of the desired signal candidates generating an estimated signal candidate as a combination of a scaled version of the desired signal candidate and a weighted combination of the noise signal contribution candidates. The noise attenuator (105) minimizes a cost function indicative of a difference between the estimated signal candidate and the audio signal in the time segment. A signal candidate is then determined for the time segment from the estimated signal candidates and the audio signal is noise compensated based on this signal candidate.
摘要:
The disclosure relates to a microphone assembly comprising a multibit analog-to-digital converter configured to receive, sample, and quantize a microphone signal to generate N-bit digital microphone samples representative of the microphone signal at a first sampling frequency. The microphone assembly also comprises a first digital-to-digital converter configured to receive, quantize, and noise-shape the N-bit digital microphone samples to generate a corresponding M-bit Pulse Density Modulated (PDM) signal, wherein N and M are positive integers, and N>M. The microphone assembly may comprise a SoundWire compliant data interface configured to repeatedly receive samples of the M-bit PDM signal and write bits of the M-bit PDM signal to a SoundWire data frame.
摘要:
Methods of providing for filtering noise and/or restoring attenuated spectral components in acoustic signals, are provided. An exemplary embodiment of a method includes dynamically filtering each of a plurality of raw FFT data samples of a record to remove or attenuate background noise contained therein to thereby produce a corresponding plurality of cleaned FFT data samples. The sample-specific background noise is removed or attenuated by a record-specific dynamic filter to produce the corresponding cleaned FFT data samples. The method can also include restoring the attenuated high-frequency components of the cleaned data samples through application of a record-specific Restoring Processor at least partially defined by a portion of the cleaned data samples and a Gain Function to thereby produce cleaned and restored data samples, and applying an inverse transformation to convert the cleaned and restored data samples into cleaned and restored data samples in time domain data.
摘要:
A system including first and second gain modules, an operator module, and a priori and posteriori modules. The first gain module applies a non-linear function to generate a gain signal based on an amplitude of a first speech signal and an estimated a priori variance of noise included in the first speech signal. The operator module generates an operator based on the gain signal and the estimated a priori variance of noise. The a priori module determines an a priori signal-to-noise ratio based on the operator. The posteriori module determines a posteriori signal-to-noise ratio based on the amplitude of the first speech signal and (ii) the estimated a priori variance of noise. The second gain module: determines a gain value based on the a priori signal-to-noise ratio and the a posteriori signal-to-noise ratio; and generates, based on the amplitude of the first speech signal and the gain value, a second speech signal that corresponds to an estimate of an amplitude of the first speech signal, where the second speech signal is substantially void of music noise.