Abstract:
A speech processing device includes a distance acquisition unit configured to acquire a distance between a sound collection unit configured to record speech from a sound source and the sound source, a reverberation characteristic estimation unit configured to estimate a reverberation characteristic based on the distance acquired by the distance acquisition unit, a correction data generation unit configured to generate correction data indicating a contribution of a reverberation component from the reverberation characteristic estimated by the reverberation characteristic estimation unit; and a dereverberation unit configured to remove the reverberation component from the speech by correcting the amplitude of the speech based on the correction data.
Abstract:
A speech-processing apparatus includes: a representative transfer function estimation unit that uses a sound signal which is collected by using a microphone array of which the arrangement is unknown, which has a plurality of channels, and of which the number of sound sources is unknown and that estimates a transfer function with respect to a sound source.
Abstract:
A speech processing device includes a speech recognition unit configured to sequentially recognize recognition segments from an input speech, a reverberation influence storage unit configured to store a degree of reverberation influence indicating an influence of a reverberation based on a preceding speech to a subsequent speech subsequent to the preceding speech and a recognition segment group including a plurality of recognition segments in correlation with each other, a reverberation influence selection unit configured to select the degree of reverberation influence corresponding to the recognition segment group which includes the plurality of recognition segments recognized by the speech recognition unit from the reverberation influence storage unit, and a reverberation reduction unit configured to remove a reverberation component weighted with the degree of reverberation influence from the speech from which at least a part of recognition segments of the recognition segment group is recognized.
Abstract:
A navigation system includes an evaluation unit for evaluating a degree of a health state quality for a driver of a moving body, and a route guidance unit for searching a route from a current position of the moving body to a destination, and for performing guidance of the searched route, the route guidance unit determines a search condition of a route to the destination based on an evaluation result of a health state of the driver in the evaluation unit.
Abstract:
A reception system includes: a visitor recognition unit that recognizes a visitor; a receiving person recognition unit that recognizes a receiving person that corresponds to the visitor; a receiving person contact information storage unit that stores contact information of the receiving person; a notification unit that notifies the receiving person of a visit of the visitor at the contact information of the receiving person stored by the receiving person contact information storage unit; and a receiving person selection unit that selects a substitute receiving person associated with the receiving person in a case where the receiving person is absent when the notification unit notifies the receiving person at the contact information of the receiving person, wherein the notification unit notifies the substitute receiving person selected by the receiving person selection unit when the receiving person is absent.
Abstract:
A sound direction estimation device includes a transfer function storage unit configured to store transfer functions of sound sources in correlation with directions of the sound sources, a calculation unit configured to calculate the number of classes to be searched and a search interval for each class based on a desired search range and a desired spatial resolution for searching for the directions of the sound sources, and a sound source localization unit configured to search the search range for every search interval using the transfer function, to estimate the direction of the sound source based on the search result, to update the search range and the search interval based on the estimated direction of the sound source until the number of classes calculated by the calculation unit is reached, and to estimate the direction of the sound source.
Abstract:
An information processing device includes a first information processing unit, a communication unit, and a control unit. The first information processing unit performs predetermined information processing on input data to generate first processing result data. The communication unit is capable of receiving second processing result data generated by a second information processing unit capable of executing the same kind of information processing as the information processing on the input data under a condition with higher versatility. The control unit selects either the first processing result data or the second processing result data according to the use environment of the device.
Abstract:
A voice processing apparatus includes: a sound input unit configured to acquire an audio signal; a voice recognition unit configured to perform voice recognition on the audio signal acquired by the sound input unit; an intention understanding unit configured to understand a user's intention on the basis of a recognition result recognized by the voice recognition unit; and a question unit configured to question the user on the basis of an understood result understood by the intention understanding unit. The question unit changes question content for the user according to the understood result and a predetermined priority.
Abstract:
A voice processing apparatus includes: a feature amount acquisition unit configured to acquire a spectrum of an audio signal for each frame; an utterance state determination unit configured to determine an utterance state for each frame on the basis of the audio signal; and a spectrum normalization unit configured to calculate a normalized spectrum in a current utterance by normalizing a spectrum for each frame in the current utterance using at least an average spectrum acquired until the present time.