Abstract:
A method for speech recognition in accordance with the present invention includes: extracting a speech feature from an inputted speech signal; estimating a noise component of the speech signal; compensating the extracted speech feature by use of the estimated noise component; transforming a given acoustic model based on the extracted speech feature, the compensated speech feature, and the noise component; and performing speech recognition by use of the compensated speech feature and the transformed acoustic model.
Abstract:
Disclosed herein are a multimodal unsupervised meta-learning method and apparatus. The multimodal unsupervised meta-learning method includes training, by a multimodal unsupervised feature representation learning unit, an encoder configured to extract features of individual single-modal signals from a source multimodal dataset, generating, by a multimodal unsupervised task generation unit, a source task based on the features of individual single-modal signals, deriving, by a multimodal unsupervised learning method derivation unit, a learning method from the source task using the encoder, and training, by a target task performance unit, a model based on the learning method and features extracted from a small number of target datasets by the encoder, thus performing the target task.
Abstract:
Disclosed herein is a voice-based CAPTCHA method and apparatus which can perform a CAPTCHA procedure using the voice of a human being. In the voice-based CAPTCHA) method, a plurality of uttered sounds of a user are collected. A start point and an end point of a voice from each of the collected uttered sounds are detected and then speech sections are detected. Uttered sounds of the respective detected speech sections are compared with reference uttered sounds, and then it is determined whether the uttered sounds are correctly uttered sounds. It is determined whether the uttered sounds have been made by an identical speaker if it is determined that the uttered sounds are correctly uttered sounds. Accordingly, a CAPTCHA procedure is performed using the voice of a human being, and thus it can be easily checked whether a human being has personally made a response using a voice online