Abstract:
An improved hearing aid that outputs a microphone input signal and an external input signal includes a directional microphone, an external input terminal, a hearing aid processor that receives sound signals from the microphone and the external input terminal, and a receiver that outputs the sound signals that have undergone hearing aid processing by the hearing aid processor. The hearing aid processor has a mixer that mixes the sound signal from the microphone with the sound signal from the external input terminal and outputs the mixed sound signal to the receiver, a mixing ratio decider that decides the mixing ratio between the sound signal from the microphone and the sound signal from the external input terminal, a front sound detector that is connected to the mixing ratio decider, and a similarity calculator that determines whether or not sound collected by the directional microphone is that of an external device.
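The mixer and mixing ratio decider described above can be illustrated with a minimal sketch. The abstract does not say how the ratio is chosen or how similarity is measured, so everything here is an assumption made for illustration: the convex mix, the zero-lag normalized correlation used as the similarity measure, the decision rule, and the values 0.8 and 0.2. The function names `mix`, `similarity`, and `decide_ratio` are hypothetical.

```python
import math


def mix(mic, ext, ratio):
    """Mix two equal-length sample lists.

    `ratio` is the weight of the microphone signal and (1 - ratio) the weight
    of the external input signal (a convex mix; an assumption).
    """
    return [ratio * m + (1.0 - ratio) * e for m, e in zip(mic, ext)]


def similarity(a, b):
    """Zero-lag normalized correlation in [-1, 1] (hypothetical measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0.0 else 0.0


def decide_ratio(front_sound_detected, sim, sim_threshold=0.8):
    """Hypothetical rule: favor the microphone only when a front sound is
    detected and it does not resemble the external device's sound."""
    if front_sound_detected and sim < sim_threshold:
        return 0.8  # illustrative value, not from the source
    return 0.2      # illustrative value, not from the source
```

For example, `mix(mic, ext, decide_ratio(True, similarity(mic, ext)))` would favor the microphone when the sound from the front differs from the external input.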
Abstract:
The noise removal device includes plural microphones, a time axis adjustment unit, an FFT analysis unit, and a noise removal processing unit. It determines the frequency signals of a to-be-extracted sound by performing a threshold judgment on each of the phase distances of the mixed sounds, each received through a corresponding one of the microphones, where the phases are expressed as ψ′(t) = mod 2π(ψ(t) − 2πft) (f denotes a reference frequency).
Abstract:
A speech recognition apparatus is equipped with a garbage acoustic model storage unit that stores a garbage acoustic model trained on a collection of unnecessary words. A feature value calculation unit calculates the feature parameters necessary for recognition by acoustically analyzing the unidentified input speech, which includes non-language speech, frame by frame, where a frame is the unit of speech analysis. A garbage acoustic score calculation unit calculates the garbage acoustic score by comparing the feature parameters with the garbage acoustic model, and a garbage acoustic score correction unit corrects the garbage acoustic score calculated by the garbage acoustic score calculation unit so as to raise it in frames where non-language speech is input. A recognition result output unit outputs, as the recognition result of the unidentified input speech, the word string with the highest cumulative score of the language score, the word acoustic score, and the garbage acoustic score corrected by the garbage acoustic score correction unit.
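The score correction and the selection of the best word string can be sketched as follows. This is only an illustration under stated assumptions: scores are treated as log-likelihoods that add up, the correction is a fixed additive bonus, and the hypothesis layout (`language`, `word_acoustic`, `garbage_frames`) is hypothetical. The abstract specifies none of these details.

```python
def correct_garbage_scores(garbage_scores, nonlanguage_flags, bonus):
    """Raise the per-frame garbage acoustic score in frames flagged as
    non-language speech (additive bonus; an assumption)."""
    return [s + bonus if flag else s
            for s, flag in zip(garbage_scores, nonlanguage_flags)]


def cumulative_score(hyp, garbage_scores):
    """Language score + word acoustic score + garbage acoustic score summed
    over the frames this hypothesis assigns to the garbage model."""
    return (hyp["language"] + hyp["word_acoustic"]
            + sum(garbage_scores[i] for i in hyp["garbage_frames"]))


def best_hypothesis(hypotheses, garbage_scores):
    """Return the hypothesis with the highest cumulative score."""
    return max(hypotheses, key=lambda h: cumulative_score(h, garbage_scores))


# Illustrative numbers only: two frames of non-language speech (e.g. a cough).
raw = [-5.0, -5.0]
corrected = correct_garbage_scores(raw, [True, True], bonus=2.0)
hyps = [
    # Treats the two frames as garbage.
    {"words": "open door", "language": -2.0, "word_acoustic": -10.0,
     "garbage_frames": [0, 1]},
    # Forces the two frames into an extra word.
    {"words": "oh open door", "language": -4.0, "word_acoustic": -16.0,
     "garbage_frames": []},
]
```

With the raw scores the hypothesis containing the spurious word wins; after correction the hypothesis that absorbs the non-language frames into the garbage model wins.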
Abstract:
A sound determination device (100) includes: an FFT unit (2402) which receives a mixed sound including a to-be-extracted sound and a noise, and obtains a frequency signal of the mixed sound for each of a plurality of times included in a predetermined duration; and a to-be-extracted sound determination unit (101 (j)) which determines, when the number of the frequency signals at the plurality of times included in the predetermined duration is equal to or larger than a first threshold value and a phase distance between the frequency signals out of the frequency signals at the plurality of times is equal to or smaller than a second threshold value, each of the frequency signals with the phase distance as a frequency signal of the to-be-extracted sound. The phase distance is a distance between phases of the frequency signals when a phase of a frequency signal at a time t is ψ(t) (radian) and the phase is represented by ψ′(t)=mod 2π(ψ(t)−2πft) (where f is an analysis-target frequency).
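The phase correction ψ′(t) = mod 2π(ψ(t) − 2πft) and the two threshold tests can be sketched as below. The abstract does not define the distance measure, so the circular distance used here is an assumption, as is the choice to count, for each frequency signal, the other signals whose corrected phase lies within the second threshold. The function names are hypothetical.

```python
import math

TWO_PI = 2.0 * math.pi


def corrected_phase(psi, t, f):
    """psi'(t) = mod 2*pi (psi(t) - 2*pi*f*t), returned in [0, 2*pi)."""
    return (psi - TWO_PI * f * t) % TWO_PI


def phase_distance(a, b):
    """Circular distance between two phases, in [0, pi] (an assumption)."""
    d = abs(a - b) % TWO_PI
    return min(d, TWO_PI - d)


def select_target(phases, times, f, min_count, max_dist):
    """Flag each frequency signal as belonging to the to-be-extracted sound
    when at least `min_count` other signals in the duration have a corrected
    phase within `max_dist` of it."""
    corrected = [corrected_phase(p, t, f) for p, t in zip(phases, times)]
    flags = []
    for i, a in enumerate(corrected):
        close = sum(1 for j, b in enumerate(corrected)
                    if j != i and phase_distance(a, b) <= max_dist)
        flags.append(close >= min_count)
    return flags
```

For a steady tone at the analysis frequency, ψ(t) advances by 2πft, so the corrected phases coincide and every frame is flagged; phases that do not advance at that rate are scattered after correction and fail the test.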
Abstract:
A mixed audio separation system (100) which separates a specific audio from among a mixed audio (S100) includes a local frequency information generation unit (105) which obtains pieces of local frequency information (S103) corresponding to local reference waveforms (S102), based on the local reference waveforms (S102) and an analysis waveform which is the waveform of the mixed audio (S100). Each of the local reference waveforms (S102) (i) constitutes a part of a reference waveform for analyzing a predetermined frequency, (ii) has a predetermined temporal/spatial resolution and (iii) includes at least one of an amplification spectrum and a phase spectrum in the predetermined frequency. The system includes: a specific audio's frequency feature value extraction unit (106) which performs pattern matching between a first set which is the pieces of local frequency information and a second set of pieces of frequency information (S103) of a predetermined specific audio, and extracts the first set of the pieces of local frequency information (S103), based on a result of the pattern matching; and an audio signal generation unit which generates a signal of the specific audio, based on the first set of the pieces of local frequency information (S103) extracted by the specific audio's frequency feature value extraction unit.
Abstract:
The speech recognition apparatus (1) is equipped with: a garbage acoustic model storage unit (110) that stores a garbage acoustic model trained on a collection of unnecessary words; a feature value calculation unit (101) that calculates the feature parameters necessary for recognition by acoustically analyzing the unidentified input speech, which includes non-language speech, frame by frame, where a frame is the unit of speech analysis; a garbage acoustic score calculation unit (111) that calculates the garbage acoustic score by comparing the feature parameters with the garbage acoustic model; a garbage acoustic score correction unit (113) that corrects the garbage acoustic score calculated by the garbage acoustic score calculation unit (111) so as to raise it in frames where non-language speech is input; and a recognition result output unit (105) that outputs, as the recognition result of the unidentified input speech, the word string with the highest cumulative score of the language score, the word acoustic score, and the garbage acoustic score corrected by the garbage acoustic score correction unit (113).
Abstract:
With respect to audio signal coding and decoding apparatuses, there is provided a coding apparatus that enables a decoding apparatus to reproduce an audio signal even though it does not use all of the data from the coding apparatus, and a decoding apparatus corresponding to the coding apparatus. A quantization unit constituting the coding apparatus includes a first sub-quantization unit comprising sub-quantization units for the low band, intermediate band, and high band; a second sub-quantization unit for quantizing quantization errors from the first sub-quantization unit; and a third sub-quantization unit for quantizing quantization errors which have been processed by the first sub-quantization unit and the second sub-quantization unit.
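The idea of each later stage quantizing the error left by the earlier stages, so that a decoder can reproduce the signal from only some of the data, can be sketched as below. This is a simplified illustration, not the apparatus itself: it assumes uniform scalar quantizers with hypothetical step sizes and omits the low-, intermediate-, and high-band split of the first stage.

```python
def quantize(values, step):
    """Uniform scalar quantizer: map each value to an integer index."""
    return [round(v / step) for v in values]


def dequantize(indices, step):
    """Inverse of `quantize`: map indices back to reconstruction levels."""
    return [i * step for i in indices]


def encode(values, steps):
    """Each stage quantizes the error left over by the previous stages."""
    layers = []
    residual = list(values)
    for step in steps:
        indices = quantize(residual, step)
        layers.append(indices)
        recon = dequantize(indices, step)
        residual = [r - q for r, q in zip(residual, recon)]
    return layers


def decode(layers, steps, use_layers=None):
    """Reconstruct from the first `use_layers` stages (all by default)."""
    n = len(layers) if use_layers is None else use_layers
    out = [0.0] * len(layers[0])
    for indices, step in zip(layers[:n], steps[:n]):
        out = [o + i * step for o, i in zip(out, indices)]
    return out


def max_error(a, b):
    """Largest absolute difference between two equal-length lists."""
    return max(abs(x - y) for x, y in zip(a, b))
```

A decoder that receives only the first layer still produces a coarse reconstruction, and each additional layer reduces the error.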
Abstract:
In an apparatus for expanding the bandwidth of speech signals, a narrowband speech signal is input and digitized, and spectral envelope information and residual information are extracted from the digitized signal by linear predictive coding analysis. The spectral envelope information is expanded into wideband information by a spectral envelope converter, and the residual information is expanded into wideband information by a residual converter. The converted spectral envelope information and residual information are combined to produce a wideband speech signal. A filter extracts from the obtained wideband speech signal the frequency information not contained in the input signal, the resulting signal is added to the original digitized input signal, and the sum is converted into an analog signal as the output signal of the apparatus. The apparatus comprises a linear mapping function codebook used for converting spectral parameters, and a weights calculator and an adder for weighting and summing the function outputs.
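The weighting and summing of linear mapping function outputs can be sketched as follows. The abstract does not say how the weights are calculated, so the softmax over negative squared distance to a per-entry centroid is purely an assumption, as is the codebook layout (`centroid`, `matrix`, `offset`). Only the final weighted sum of function outputs comes from the text.

```python
import math


def apply_linear(matrix, offset, x):
    """One linear mapping function: y = matrix @ x + offset."""
    return [sum(a * v for a, v in zip(row, x)) + b
            for row, b in zip(matrix, offset)]


def calc_weights(x, centroids):
    """Hypothetical weights calculator: softmax over negative squared
    distance between the input parameters and each entry's centroid."""
    dists = [sum((xi - ci) ** 2 for xi, ci in zip(x, c)) for c in centroids]
    exps = [math.exp(-d) for d in dists]
    total = sum(exps)
    return [e / total for e in exps]


def convert_envelope(x, codebook):
    """Map narrowband spectral parameters to wideband ones as the weighted
    sum of the outputs of every linear mapping function in the codebook."""
    weights = calc_weights(x, [entry["centroid"] for entry in codebook])
    outputs = [apply_linear(entry["matrix"], entry["offset"], x)
               for entry in codebook]
    dim = len(outputs[0])
    return [sum(w * out[i] for w, out in zip(weights, outputs))
            for i in range(dim)]
```

In this sketch a two-dimensional narrowband parameter vector can be mapped to a three-dimensional wideband one by giving each codebook entry a 3×2 matrix and a length-3 offset.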
Abstract:
A circuit generates a pulse-like timing signal for driving a stepping motor, which is used, for example, to run a magnetic tape in an audio tape or video tape recording and reproducing apparatus. A rewritable memory stores output pattern data and time information for generating the timing signal. The timing signal generating circuit comprises a rewrite control circuit that, in response to an operation mode instructing signal, performs predetermined operational processing on basic pattern data and basic time information and stores the resulting output pattern data and uniquely related time information in the rewritable memory. The timing signal generating circuit further comprises a circuit for generating a clock signal having a predetermined cycle, a counter for counting this clock signal, and an output circuit for sequentially reading out the time information from the rewritable memory, detecting a match between the read-out time information and the count value of the counter, and outputting, as the timing signal, the output data stored in the rewritable memory corresponding to the time information for which the match is detected. The output circuit comprises a circuit for continuously outputting the present output data until another match is detected between the subsequently read-out time information and the count value of the counter.
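The counter-match behavior can be simulated in software as a rough sketch. The rewritable memory is modeled as a list of (time, pattern) pairs sorted by time. The abstract does not specify the "predetermined operational processing", so the `rewrite_table` function below, which scales the basic times by an integer factor, is a hypothetical example of it, and the four-bit patterns are illustrative.

```python
def rewrite_table(basic_table, time_scale):
    """Hypothetical rewrite control: derive the stored table from the basic
    pattern data and basic time information by scaling the times."""
    return [(time * time_scale, pattern) for time, pattern in basic_table]


def run_timing_generator(table, n_ticks, initial=0):
    """Simulate the counter and the match/hold output logic.

    `table` holds (time, pattern) entries sorted by time. On each clock tick
    the counter value is compared with the currently read-out time; on a
    match the corresponding pattern becomes the output and the next entry is
    read. Between matches the present output is held.
    """
    outputs = []
    current = initial
    index = 0
    for count in range(n_ticks):
        if index < len(table) and table[index][0] == count:
            current = table[index][1]
            index += 1
        outputs.append(current)
    return outputs


# Illustrative basic table: one motor phase pattern every 3 clock ticks.
basic = [(0, 0b0001), (3, 0b0010), (6, 0b0100)]
```

Running `run_timing_generator(basic, 8)` holds each pattern for three ticks, while `run_timing_generator(rewrite_table(basic, 2), 8)` holds each pattern twice as long, which corresponds to a slower stepping rate.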