-
Publication No.: US11302306B2
Publication date: 2022-04-12
Application No.: US16452760
Filing date: 2019-06-26
Inventors: Zhenyong Zhang, Wei Ma
IPC classification: G10L15/02, G10L21/0264, G10L21/0224, G10L15/32, G10L15/22, G10L25/21, G10L25/09, G10L15/30
Abstract: A sound recognition system including time-dependent analog filtered feature extraction and sequencing. An analog front end (AFE) in the system receives input analog signals, such as signals representing an audio input to a microphone. Features in the input signal are extracted by measuring such attributes as zero-crossing events and total energy in filtered versions of the signal with different frequency characteristics at different times during the audio event. In one embodiment, a tunable analog filter is controlled to change its frequency characteristics at different times during the event. In another embodiment, multiple analog filters with different filter characteristics filter the input signal in parallel, and signal features are extracted from each filtered signal; a multiplexer selects the desired features at different times during the event.
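Note: the two measurements named in the abstract, zero-crossing events and total energy, can be illustrated with a minimal digital sketch. It is only a stand-in for the analog circuitry described: the filtering stage is omitted, and the function names, the fixed frame length, and the `schedule` list that plays the role of the multiplexer are all assumptions made for illustration.

```python
def zero_crossings(frame):
    """Count sign changes between consecutive samples."""
    return sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))


def energy(frame):
    """Total energy of a frame: the sum of squared samples."""
    return sum(x * x for x in frame)


def extract_features(signal, frame_len):
    """Return one (zero_crossings, energy) pair per non-overlapping frame."""
    feats = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        feats.append((zero_crossings(frame), energy(frame)))
    return feats


def sequence_features(features_by_band, schedule):
    """Hypothetical multiplexer: schedule[t] names the band kept for frame t."""
    return [features_by_band[band][t] for t, band in enumerate(schedule)]
```

In this sketch, `features_by_band` would hold the output of `extract_features` for each separately filtered copy of the signal, and `schedule` decides which band's features are kept at each point in the audio event.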
-
Publication No.: US10665222B2
Publication date: 2020-05-26
Application No.: US16022376
Filing date: 2018-06-28
Applicant: Intel Corporation
Inventors: Suyoung Bang, Muhammad Khellah, Somnath Paul, Charles Augustine, Turbo Majumder, Wootaek Lim, Tobias Bocklet, David Pearce
Abstract: A system, article, and method provide temporal-domain feature extraction for automatic speech recognition.
-
Publication No.: US10297268B2
Publication date: 2019-05-21
Application No.: US15802379
Filing date: 2017-11-02
Applicant: Acer Incorporated
Inventors: Po-Jen Tu, Jia-Ren Chang, Kai-Meng Tzeng
IPC classification: G10L21/02, G10L21/0272, G10L19/26, G10L15/02, G10L25/93, H04R25/00, G10L21/003, G10L25/09
Abstract: A voice signal processing apparatus and a voice signal processing method are provided. A consonant signal judgment condition of a target voice frame is adjusted according to whether the original voice sampling signal corresponding to the previous voice frame adjacent to the target voice frame is a consonant signal, so as to improve listening comfort and recognition of the voice signal.
-
Publication No.: US20180033442A1
Publication date: 2018-02-01
Application No.: US15221937
Filing date: 2016-07-28
Applicant: MediaTek Inc.
Inventors: Chi-Peng CHANG, Sung-Han WEN, Chieh-Tsen LIN
CPC classification: G10L25/09, G10L19/00, G10L25/21, H03G3/3089, H03G7/007
Abstract: An audio codec system uses a memory to buffer frames of audio. Signal power levels of the buffered frames are detected to generate a signal power look-forward value, and zero-crossing points of the buffered frames are detected to obtain available calibration points for gain control in response to a change of the signal power look-forward value. The gain control is divided so that it is performed at the available calibration points.
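Note: the idea of dividing a gain change across zero-crossing points can be sketched as follows. This is an illustrative assumption, not the codec's actual design: the function names are hypothetical, the gain change is split into equal steps, and the power look-forward detection that would choose `target_gain` is left out.

```python
def zero_crossing_indices(samples):
    """Indices where the sign differs from the previous sample."""
    return [i for i in range(1, len(samples))
            if (samples[i - 1] >= 0) != (samples[i] >= 0)]


def apply_stepped_gain(samples, start_gain, target_gain):
    """Move from start_gain to target_gain in equal steps at zero crossings."""
    points = zero_crossing_indices(samples)
    if not points:
        # No calibration point available: keep the starting gain (assumption).
        return [s * start_gain for s in samples]
    step = (target_gain - start_gain) / len(points)
    out = []
    gain = start_gain
    next_point = 0
    for i, s in enumerate(samples):
        if next_point < len(points) and i == points[next_point]:
            gain += step  # change gain only where the waveform crosses zero
            next_point += 1
        out.append(s * gain)
    return out
```

Changing the gain only near zero amplitude keeps each step from introducing an audible discontinuity, which is the motivation for using zero crossings as calibration points.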
-
Publication No.: US08874440B2
Publication date: 2014-10-28
Application No.: US12761489
Filing date: 2010-04-16
Applicant: Chi-youn Park, Nam-hoon Kim, Jeong-mi Cho
Inventors: Chi-youn Park, Nam-hoon Kim, Jeong-mi Cho
Abstract: A speech detection apparatus and method are provided. The speech detection apparatus and method determine whether a frame is speech or not using feature information extracted from an input signal. The speech detection apparatus may estimate a situation related to an input frame and determine which feature information is required for speech detection for the input frame in the estimated situation. The speech detection apparatus may detect a speech signal using dynamic feature information that may be more suitable to the situation of a particular frame, instead of using the same feature information for each and every frame.
-
Publication No.: US11942107B2
Publication date: 2024-03-26
Application No.: US17183288
Filing date: 2021-02-23
IPC classification: G10L25/78, G06N5/01, G06N20/10, G10L25/09, G10L25/30, G10L25/51, H04R25/00, G10L15/16, G10L19/26
CPC classification: G10L25/78, G06N5/01, G06N20/10, G10L25/09, G10L25/30, G10L25/51, H04R25/40, H04R25/505, H04R25/604, G10L15/16, G10L19/26
Abstract: The present disclosure is directed to a device and method for detecting presence or absence of human speech. The device and method utilize a low-power accelerometer. The device and method generate an acceleration signal using the accelerometer, filter the acceleration signal with a band pass filter or a high pass filter, determine at least one calculation of the filtered acceleration signal, detect a presence or absence of a voice based on the at least one calculation, and output a detection signal that indicates the presence or absence of the voice. The device and method are well suited for portable audio devices, such as true wireless stereo headphones, that have a limited power supply.
-
Publication No.: US20220215829A1
Publication date: 2022-07-07
Application No.: US17702253
Filing date: 2022-03-23
Inventors: Zhenyong Zhang, Wei Ma
IPC classification: G10L15/02, G10L21/0264, G10L21/0224, G10L15/32, G10L15/22, G10L25/21, G10L25/09, G10L15/30
Abstract: A sound recognition system including time-dependent analog filtered feature extraction and sequencing. An analog front end (AFE) in the system receives input analog signals, such as signals representing an audio input to a microphone. Features in the input signal are extracted by measuring such attributes as zero-crossing events and total energy in filtered versions of the signal with different frequency characteristics at different times during the audio event. In one embodiment, a tunable analog filter is controlled to change its frequency characteristics at different times during the event. In another embodiment, multiple analog filters with different filter characteristics filter the input signal in parallel, and signal features are extracted from each filtered signal; a multiplexer selects the desired features at different times during the event.
-
Publication No.: US11080006B2
Publication date: 2021-08-03
Application No.: US16665906
Filing date: 2019-10-28
Applicant: Digimarc Corporation
Inventors: Ravi K. Sharma, Shankar Thagadur Shivappa, Osama M. Alattar, Brett A. Bradley, Scott M. Long, Ajith M. Kamath, Vojtech Holub, Hugh L. Brunk, Robert G. Lyons, Aparna R. Gurijala
Abstract: Methods and arrangements involving electronic devices, such as smartphones, tablet computers, wearable devices, etc., are disclosed. One arrangement involves a low-power processing technique for discerning cues from audio input. Another involves a technique for detecting audio activity based on the Kullback-Leibler divergence (KLD) (or a modified version thereof) of the audio input. Still other arrangements concern techniques for managing the manner in which policies are embodied on an electronic device. Others relate to distributed computing techniques. A great variety of other features are also detailed.
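Note: the abstract does not say which distributions the Kullback-Leibler divergence is computed between. The sketch below assumes, purely for illustration, that an amplitude histogram of the current frame is compared with a histogram of background audio and that a large divergence indicates activity. The function names, bin count, amplitude range, and threshold are all hypothetical.

```python
import math


def kl_divergence(p, q):
    """D(P || Q) = sum_i p_i * log(p_i / q_i) for discrete distributions."""
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0:
            continue  # terms with p_i = 0 contribute nothing
        if qi == 0:
            return math.inf  # P puts mass where Q has none
        total += pi * math.log(pi / qi)
    return total


def histogram(samples, bins=8, lo=-1.0, hi=1.0):
    """Normalized amplitude histogram; out-of-range samples are clamped."""
    counts = [0] * bins
    for s in samples:
        idx = int((s - lo) / (hi - lo) * bins)
        counts[min(max(idx, 0), bins - 1)] += 1
    return [c / len(samples) for c in counts]


def audio_active(frame, background, threshold=0.5):
    """Flag activity when the frame diverges from the background (assumed rule)."""
    return kl_divergence(histogram(frame), histogram(background)) > threshold
```

A practical detector would likely smooth the histograms so that empty background bins do not force an infinite divergence; that refinement is omitted here.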
-
Publication No.: US10482920B2
Publication date: 2019-11-19
Application No.: US15763295
Filing date: 2015-09-28
Inventors: Kazutaka Kanari, Michifumi Kojima
Abstract: A digital content reproduction control signal for controlling a reproduction of digital content includes: a first control signal (SG1) having a time code signal recorded therein and having a predetermined frequency; and a second control signal (SG2) having a 2n-fold frequency of the frequency of the first control signal, n representing a natural number, in which the first control signal (SG1) and the second control signal (SG2) are combined such that zero-cross points of a waveform of the first control signal (SG1) are aligned with zero-cross points of a waveform of the second control signal (SG2) on a time axis.
-
Publication No.: US20190325900A1
Publication date: 2019-10-24
Application No.: US16460651
Filing date: 2019-07-02
Abstract: In described examples, a method for detecting voice activity includes: receiving a first input signal containing noise; sampling the first input signal to form noise samples; determining a first value corresponding to the noise samples; subsequently receiving a second input signal; sampling the second input signal to form second signal samples; determining a second value corresponding to the second signal samples; forming a ratio of the second value to the first value; comparing the ratio to a predetermined threshold value; and responsive to the comparing, indicating whether voice activity is detected in the second input signal.
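Note: the steps listed in the abstract can be sketched in a few lines. The abstract does not say what the "value" computed from each set of samples is; mean power is assumed here, and the function names and the default threshold of 2.0 are likewise hypothetical.

```python
def mean_power(samples):
    """Assumed 'value' for a set of samples: the mean of the squared samples."""
    return sum(x * x for x in samples) / len(samples)


def voice_activity(noise_samples, signal_samples, threshold=2.0):
    """Compare the signal-to-noise value ratio with a predetermined threshold."""
    first_value = mean_power(noise_samples)     # from the noise-only input
    second_value = mean_power(signal_samples)   # from the later input
    if first_value == 0:
        # Avoid dividing by zero: any energy at all counts as activity.
        return second_value > 0
    return second_value / first_value > threshold
```

For example, `voice_activity([0.1, -0.1, 0.1, -0.1], [0.5, -0.5, 0.5, -0.5])` reports activity because the second input carries about 25 times the power of the noise reference.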
-