-
公开(公告)号:US10672414B2
公开(公告)日:2020-06-02
申请号:US15952353
申请日:2018-04-13
发明人: Ivan Jelev Tashev , Shuayb M Zarar , Yan-Hui Tu , Chin-Hui Lee , Han Zhao
摘要: Systems, methods, and computer-readable storage devices are disclosed for improved real-time audio processing. One method including: receiving audio data including a plurality of frames having a plurality of frequency bins; calculating, for each frequency bin, an approximate speech signal estimation based on the plurality of frames; calculating, for each approximate speech signal estimation, a clean speech estimation and at least one additional target including an ideal ratio mask using a trained neural network model; and calculating, for each frequency bin, a final clean speech estimation using the calculated at least one additional target including the calculated ideal ratio mask and the calculated clean speech estimation.
-
公开(公告)号:US20180254050A1
公开(公告)日:2018-09-06
申请号:US15626016
申请日:2017-06-16
发明人: Ivan Jelev Tashev , Shuayb Zarar
IPC分类号: G10L21/02 , G10L21/0208 , G06F17/27
CPC分类号: G06F17/2735 , G10L21/0208 , G10L21/0232
摘要: A system is provided that employs a statistical approach to semi-supervised speech enhancement with a low-order non-negative matrix factorization (“NMF”). The system enhances noisy speech based on multiple dictionaries with dictionary atoms derived from the same clean speech samples and generates an enhanced speech representation of the noisy speech by combining, for each dictionary, a clean speech representation of the noisy speech generated based on a NMF using the dictionary atoms of the dictionary. The system generates frequency-domain (“FD”) clean speech sample representations of the clean speech samples, for example, using a Fourier transform. To generate each dictionary, the system generates a dictionary-unique initialization of the dictionary atoms and the activations and performs a NMF of the FD clean speech samples.
-
公开(公告)号:US11847261B2
公开(公告)日:2023-12-19
申请号:US17815532
申请日:2022-07-27
发明人: Andrew D. Wilson , Hakim Si Mohammed , Christian Holz , Adrian Kuo Ching Lee , Ivan Jelev Tashev , Hannes Gamper , Edward Bryan Cutrell , David Emerson Johnston , Dimitra Emmanouilidou , Mihai R. Jalobeanu
CPC分类号: G06F3/015 , A61B5/378 , G01R19/0084 , G02B27/0176 , G06F1/163
摘要: A computer device is provided that includes a display device, and a sensor system configured to be mounted adjacent to a user's head and to measure an electrical potential near one or more electrodes of the sensor system. The computer device further includes a processor configured to present a periodic motion-based visual stimulus having a changing motion that is frequency-modulated for a target frequency or code-modulated for a target code, detect changes in the electrical potential via the one or more electrodes, identify a corresponding visual evoked potential feature in the detected changes in electrical potential that corresponds to the periodic motion-based visual stimulus, and recognize a user input to the computing device based on identifying the corresponding visual evoked potential feature.
-
公开(公告)号:US10984315B2
公开(公告)日:2021-04-20
申请号:US15582456
申请日:2017-04-28
发明人: Shuayb M Zarar , Ivan Jelev Tashev
摘要: A facility for processing output from a network of mechanical sensors is described. The facility accesses time-series data outputted by the network of sensors. The facility applies to the accessed time-series data a trained autoencoder to obtain a version of the accessed time-series data in which noise present in the accessed time-series data is at least partially suppressed. The facility stores the obtained version of the accessed time-series data, such as in order to perform human activity recognition against the obtained version of the accessed time-series data.
-
公开(公告)号:US11409361B2
公开(公告)日:2022-08-09
申请号:US16780173
申请日:2020-02-03
发明人: Andrew D. Wilson , Hakim Si Mohammed , Christian Holz , Adrian Kuo Ching Lee , Ivan Jelev Tashev , Hannes Gamper , Edward Bryan Cutrell , David Emerson Johnston , Dimitra Emmanouilidou , Mihai R. Jalobeanu
摘要: A computer device is provided that includes a display device, and a sensor system configured to be mounted adjacent to a user's head and to measure an electrical potential near one or more electrodes of the sensor system. The computer device further includes a processor configured to present a periodic motion-based visual stimulus having a changing motion that is frequency-modulated for a target frequency or code-modulated for a target code, detect changes in the electrical potential via the one or more electrodes, identify a corresponding visual evoked potential feature in the detected changes in electrical potential that corresponds to the periodic motion-based visual stimulus, and recognize a user input to the computing device based on identifying the corresponding visual evoked potential feature.
-
公开(公告)号:US20170156017A1
公开(公告)日:2017-06-01
申请号:US15428965
申请日:2017-02-09
CPC分类号: H04S7/307 , G02B27/017 , G02B2027/0178 , H04R1/1075 , H04R3/04 , H04R5/033 , H04R5/0335 , H04R29/00 , H04R29/001 , H04S1/00 , H04S7/304 , H04S2420/01
摘要: Systems and methods of providing an audio signal are disclosed herein. In one embodiment, a method of delivering an audio signal from a device toward a user's ear includes, for example, transmitting a filtered audio signal from a transducer positioned at a location on the device that is longitudinally spaced apart from an entrance of an auditory canal of the user's ear when the device is worn on the user's head.
-
公开(公告)号:US11012802B2
公开(公告)日:2021-05-18
申请号:US16459918
申请日:2019-07-02
发明人: Christoph Felix Hold , Hannes Gamper , Ville Topias Pulkki , Nikunj Raghuvanshi , Ivan Jelev Tashev
摘要: A computing system that facilitates decoding a spherical harmonics (SH) representation of a three-dimensional sound signal to a binaural sound signal is described herein. The computing system generates a binaural sound signal based upon the SH representation, a tapering window function that is selected based on an SH encoding order of the SH representation, and a coloration compensation filter that incorporates the tapering window function. The computing system causes the binaural sound signal to be played over at least two speakers.
-
公开(公告)号:US10528147B2
公开(公告)日:2020-01-07
申请号:US15640327
申请日:2017-06-30
发明人: Ivan Jelev Tashev , Shuayb Zarar , Amit Das
摘要: An ultrasonic gesture recognition system is provided that recognizes gestures based on analysis of return signals of an ultrasonic pulse that is reflected from a gesture. The system transmits an ultrasonic chirp and samples a microphone array at sample intervals to collect a return signal for each microphone. The system then applies a beamforming technique to frequency domain representations of the return signals to generate an acoustic image with a beamformed return signal for multiple directions. The system then generates a feature image from the acoustic images to identify, for example, distance or depth from the microphone array to the gesture for each direction. The system then submits the feature image to a deep learning system to classify the gesture.
-
公开(公告)号:US10129684B2
公开(公告)日:2018-11-13
申请号:US15428965
申请日:2017-02-09
摘要: Systems and methods of providing an audio signal are disclosed herein. In one embodiment, a method of delivering an audio signal from a device toward a user's ear includes, for example, transmitting a filtered audio signal from a transducer positioned at a location on the device that is longitudinally spaced apart from an entrance of an auditory canal of the user's ear when the device is worn on the user's head.
-
公开(公告)号:US12019808B2
公开(公告)日:2024-06-25
申请号:US18075786
申请日:2022-12-06
发明人: Raymond Michael Winters, IV , Tan Gemicioglu , Thomas Matthew Gable , Yu-Te Wang , Ivan Jelev Tashev
摘要: This document relates to employing tongue gestures to control a computing device, and training machine learning models to detect tongue gestures. One example relates to a method or technique that can include receiving one or more motion signals from an inertial sensor. The method or technique can also include detecting a tongue gesture based at least on the one or more motion signals, and outputting the tongue gesture.
-
-
-
-
-
-
-
-
-