Binary and multi-class classification systems and methods using one spike connectionist temporal classification

    公开(公告)号:US11087213B2

    公开(公告)日:2021-08-10

    申请号:US16723974

    申请日:2019-12-20

    Abstract: A classification training system for binary and multi-class classification comprises a neural network operable to perform classification of input data, a training dataset including pre-segmented, labeled training samples, and a classification training module operable to train the neural network using the training dataset. The classification training module includes a forward pass processing module, and a backward pass processing module. The backward pass processing module is operable to determine whether a current frame is in a region of target (ROT), determine ROT information such as beginning and length of the ROT and update weights and biases using a cross-entropy cost function and One Spike Connectionist Temporal Classification (OSCTC) cost function. The backward pass module further computes a soft target value using ROT information and computes a signal output error using the soft target value and network output value.

    ACTIVE NOISE CANCELLING EARBUD DEVICES

    公开(公告)号:US20210104217A1

    公开(公告)日:2021-04-08

    申请号:US17063656

    申请日:2020-10-05

    Abstract: Systems and methods for audio listening devices, comprise a speaker coupled to a first housing, a sound port having a first end and a second end, wherein the first end is coupled to the first housing, and the second end is configured to be inserted in an ear canal of a person such that sound waves emitted from the speaker propagates via a secondary path to the ear canal through the sound port, active noise cancellation (ANC) components configured to generate anti-noise signals through the micro-speakers to cancel external noise, and a first microphone disposed within the sound port at the second end of the sound port such that the first microphone is configured to detect the anti-noise signal that propagates through the sound port via the secondary path and the external noise that propagates via a primary path.

    Selective audio source enhancement

    公开(公告)号:US10123113B2

    公开(公告)日:2018-11-06

    申请号:US15595854

    申请日:2017-05-15

    Abstract: A selective audio source enhancement system includes a processor and a memory, and a pre-processing unit configured to receive audio data including a target audio signal, and to perform sub-band domain decomposition of the audio data to generate buffered outputs. In addition, the system includes a target source detection unit configured to receive the buffered outputs, and to generate a target presence probability corresponding to the target audio signal, as well as a spatial filter estimation unit configured to receive the target presence probability, and to transform frames buffered in each sub-band into a higher resolution frequency-domain. The system also includes a spectral filtering unit configured to retrieve a multichannel image of the target audio signal and noise signals associated with the target audio signal, and an audio synthesis unit configured to extract an enhanced mono signal corresponding to the target audio signal from the multichannel image.

    NON-LINEAR FEEDBACK CONTROL FOR TEMPERATURE AND POWER PROTECTION OF LOUDSPEAKERS

    公开(公告)号:US20180279044A1

    公开(公告)日:2018-09-27

    申请号:US15933347

    申请日:2018-03-22

    Abstract: A system and a method provide for protecting a loudspeaker from thermal and/or mechanical failure by monitoring for over-temperature and over-power conditions. The system generates a first gain from a first speaker protection controller in response to a driving voltage and/or a driving current of a loudspeaker, and generates a second gain from a second speaker protection controller in response to the driving voltage and/or a driving current of the loudspeaker, if the temperature exceeds a thermal limit or if the power exceeds a maximum power. The system applies the second gain to an audio signal to lower the audio signal if the first speaker protection controller fails.

    EFFICIENT CONNECTIONIST TEMPORAL CLASSIFICATION FOR BINARY CLASSIFICATION

    公开(公告)号:US20180232632A1

    公开(公告)日:2018-08-16

    申请号:US15894872

    申请日:2018-02-12

    Abstract: A classification system and method for training a neural network includes receiving a stream of segmented, labeled training data having a sequence of frames, computing a stream of input features data for the sequence of frames, and generating neural network outputs for the sequence of frames in a forward pass through the training data and in accordance weights and biases. The weights and biases are updated in a backward pass through the training data, including determining Region of Target (ROT) information from the segmented, labeled training data, computing modified forward and backward variables based on the neural network outputs and the ROT information, deriving a signal error for each frame within the sequence of frames based on the modified forward and backward variables, and updating the weights and biases based on the derived signal error. An adaptive learning module is provided to improve a convergence rate of the neural network.

    System and method for suppressing transient noise in a multichannel system

    公开(公告)号:US10049678B2

    公开(公告)日:2018-08-14

    申请号:US15088073

    申请日:2016-03-31

    Abstract: Methods for processing a multichannel audio signal that includes transient noise signals are provided. The method includes buffering the multichannel audio signal in a subband domain, and estimating the subband frames for transient noise likelihood. A probability of transient noise for the buffered subband frames is determined and a multichannel spatial filter is applied to decompose the subband frames to transient attenuated target source and noise estimation cancelled of the target source signal. A spectral filter is applied to the target source frame to enhance the target source frame and the subband frames that are determined to have a probability of the transient noise greater than a first threshold and a probability of target source less than a second threshold are muted.

    LOW DELAY DECIMATOR AND INTERPOLATOR FILTERS

    公开(公告)号:US20190132679A1

    公开(公告)日:2019-05-02

    申请号:US16177308

    申请日:2018-10-31

    Abstract: Systems and methods for low latency adaptive noise cancellation include an audio sensor to sense environmental noise and generate a noise signal, an audio processing path to receive an audio signal, process the audio signal through an interpolation filter, and generate a primary audio signal having a first sample frequency, an adaptive noise cancellation processor to receive the noise signal and generate an anti-noise signal, a direct interpolator to receive the anti-noise signal and generate an anti-noise signal having the first sample frequency, and a limiter to provide clipping to reduce a number of bits in the anti-noise signal, an adder operable to combine the primary audio signal and the anti-noise signal and generate a combined output signal, and a low latency filter to process the combined output signal.

    Robust speech boundary detection system and method

    公开(公告)号:US09886968B2

    公开(公告)日:2018-02-06

    申请号:US14197149

    申请日:2014-03-04

    CPC classification number: G10L25/84

    Abstract: A system for audio processing comprising an initial background statistical model system configured to generate an initial background statistical model using a predetermined sample size of audio data. A parameter computation system configured to generate parametric data for the audio data including cepstral and energy parameters. A background statistics computation system configured to generate preliminary background statistics for determining whether speech has been detected. A first speech detection system configured to determine whether speech was present in the initial sample of audio data. An adaptive background statistical model system configured to provide an adaptive background statistical model for use in continuous processing of audio data for speech detection. A parameter computation system configured to calculate cepstral parameters, energy parameters and other suitable parameters for speech detection. A speech/non-speech classification system configured to classify individual frames as speech frames or non-speech frames, based on the computed parameters and the adaptive background statistical model data. A background statistics update system configured to update the background statistical model based on detected speech and non-speech frames. A second speech detection system configured to perform speech detection processing and to generate a suitable indicator for use in processing audio data that is determined to include speech signals.

Patent Agency Ranking