Abstract:
An input voice detect is detected after starting a voice input waiting state; the detected voice is recognized; an elapsed time from the start of the voice input waiting state is counted; an informative sound which urges a user to input the voice is outputted when the elapsed time reaches a preset output set time; and the output of the informative sound is stopped when the elapsed time at the time of inputting the voice is shorter than the output set timedetect.
Abstract:
A noise estimation unit estimates a noise signal in an input signal. A section decision unit distinguishes a target signal section from a noise signal section in the input signal. A noise suppression unit suppresses the noise signal based on a first suppression coefficient from the input signal. A noise excess suppression unit suppresses the noise signal based on a second suppression coefficient from the input signal. The second suppression coefficient is larger than the first suppression coefficient. A switching unit switches between an output signal from the noise suppression unit and an output signal from the noise excess suppression unit based on a decision result of the section decision unit.
Abstract:
An input voice detect is detected after starting a voice input waiting state; the detected voice is recognized; an elapsed time from the start of the voice input waiting state is counted; an informative sound which urges a user to input the voice is outputted when the elapsed time reaches a preset output set time; and the output of the informative sound is stopped when the elapsed time at the time of inputting the voice is shorter than the output set timedetect.
Abstract:
An audible signal process method includes preparing, in at least one dictionary, a plurality of weighting factors each learned to optimize evaluation function established by a weighted learning audible signal and a target speech signal corresponding to the learning audible signal and used for weighting, estimating a noise component included in the input audible signal, calculating a feature quantity depending upon the noise component of the input audible signal, selecting a weighting factor corresponding to the feature quantity from the dictionary, and weighting the input audible signal using the selected weighting factor to generate a processed output audible signal.
Abstract:
A speech processing apparatus includes a plurality of microphones which receive speech produced by a first sound source to obtain first speech signals for a plurality of channels having one-to-one correspondence with the plurality of microphones, a calculation unit configured to calculate a first characteristic amount indicative of an inter-channel correlation of the first speech signals, a storage unit configured to store in advance a second characteristic amount indicative of an inter-channel correlation of second speech signals for the plurality of channels obtained by receiving speech produced by a second sound source by the plurality of microphones, and a collation unit configured to collate the first characteristic amount with the second characteristic amount to determine whether the first sound source matches with the second sound source.
Abstract:
An audio signal processing method for processing input audio signals of plural channels includes calculating at least one feature quantity representing a difference between channels of input audio signals, selecting at least one weighting factor according to the feature quantity from at least one weighting factor dictionary prepared by learning beforehand, and subjecting the input audio signals of plural channels to signal processing including noise suppression and weighting addition using the selected weighting factor to generate output an output audio signal.
Abstract:
A speech encoding method and apparatus including analyzing, using a codebook expressing speech parameters within a predetermined search range, an input speech signal in an audibility weighting filter corresponding to a pitch period longer than the search range of the codebook, and searching, from the codebook, on the basis of the analysis result, a combination of speech parameters by which the distortion of the input speech signal is minimized, and encoding the combination. The apparatus uses an adaptive codebook of pitch and a noise codebook. The codebooks search a group formed by extracting vectors of predetermined length from one original code vector, while sequentially shifting position so that the vectors overlap each other. The search group is further restricted and another preselection is made before the final search. Search is based on inversely convoluted, orthogonally transformed vectors.
Abstract:
In one embodiment, a signal processing method is disclosed. The method can perform filter processing of convoluting a tap coefficient in a first signal sequence to generate a second signal sequence. The method can subtract the second signal sequence from a third signal sequence to generate a fourth signal sequence. The third signal sequence includes an echo signal of the first signal sequence. The method can correct the tap coefficient in accordance with an amount of correction determined using a function. The function includes at least one of a first region and a second region, and has values limited. The first region is included in a negative value region of the fourth signal sequence. The second region is included in a positive value region of the fourth signal sequence.
Abstract:
In one embodiment, there is provided an audio signal processor. The processor includes: a person position detector configured to detect each position of one or more persons present in a specific space; a grouping module configured to allocate the detected persons to one or more groups, wherein the number of the groups is less than a given number; a plurality of directionality controllers configured to control directionality of a microphone array; and a directionality setting module configured to set directionality of each of the groups in a corresponding one of the directionality controllers.
Abstract:
According to one embodiment, a pickup signal processing apparatus includes microphones, a sound determining unit, a signal level calculating unit, a setting unit, and a calculating unit. The sound determining unit determines whether pickup signals picked up by the microphones are signals from a neighboring sound source or a background noise signal. The signal level calculating unit calculates the signal levels for the microphones. The setting unit sets a gain value of at least one microphone and reduces a difference between the signal levels for the microphones on the basis of the signal levels for the microphones, when determined that the pickup signal is the background noise signal. The calculating unit multiplies the pickup signal of the at least one microphone by the gain value set by the setting unit.