-
Publication No.: US20230009771A1
Publication date: 2023-01-12
Application No.: US17856377
Filing date: 2022-07-01
Inventor: Byung-Ok KANG, Jeon-Gue PARK, Hyung-Bae JEON
Abstract: Disclosed herein is a method for data augmentation, which includes pretraining latent variables using first data corresponding to target speech and second data corresponding to general speech, training data augmentation parameters by receiving the first data and the second data as input, and augmenting target data using the first data and the second data through the pretrained latent variables and the trained parameters.
-
Publication No.: US20230134942A1
Publication date: 2023-05-04
Application No.: US17961830
Filing date: 2022-10-07
Inventor: Hoon CHUNG, Byung-Ok KANG, Jeom-Ja KANG, Yun-Kyung LEE, Hyung-Bae JEON
IPC: G10L15/06, G10L15/187
Abstract: Disclosed herein are an apparatus and method for self-supervised training of an end-to-end speech recognition model. The apparatus includes memory in which at least one program is recorded and a processor for executing the program. The program trains an end-to-end speech recognition model, including an encoder and a decoder, using untranscribed speech data. The program may add predetermined noise to the input signal of the end-to-end speech recognition model, and may calculate loss by reflecting a predetermined constraint based on the output of the encoder of the end-to-end speech recognition model.
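The abstract above does not say which constraint is applied to the encoder output. As a minimal sketch, assume the constraint is that the encoder should produce similar outputs for a clean input and a noise-perturbed copy of it, so the loss needs no transcript. The toy `encode` function, the Gaussian noise, and the mean-squared-difference loss are all illustrative assumptions, not the method claimed in the patent.

```python
import math
import random


def encode(signal, weights):
    """Toy stand-in for an encoder (assumption): one tanh unit per weight row."""
    return [math.tanh(sum(w * x for w, x in zip(row, signal))) for row in weights]


def add_noise(signal, scale, rng):
    """Add zero-mean Gaussian noise to every input sample."""
    return [x + rng.gauss(0.0, scale) for x in signal]


def consistency_loss(signal, weights, noise_scale, rng):
    """Mean squared difference between clean and noisy encoder outputs.

    No transcript is needed: the loss depends only on the speech signal.
    """
    clean = encode(signal, weights)
    noisy = encode(add_noise(signal, noise_scale, rng), weights)
    return sum((c - n) ** 2 for c, n in zip(clean, noisy)) / len(clean)


rng = random.Random(0)
weights = [[0.5, -0.2, 0.1], [0.3, 0.8, -0.5]]  # hypothetical encoder weights
signal = [0.2, -0.1, 0.4]                        # hypothetical untranscribed input
loss = consistency_loss(signal, weights, noise_scale=0.1, rng=rng)
print(loss)
```

In a real training loop this value would be minimized with respect to the encoder parameters; with `noise_scale=0.0` the two outputs coincide and the loss is zero.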
-
Publication No.: US20140163986A1
Publication date: 2014-06-12
Application No.: US14095622
Filing date: 2013-12-03
Inventor: Sung-Joo LEE, Ho-Young JUNG, Hwa-Jeon SONG, Eui-Sok CHUNG, Byung-Ok KANG, Hoon CHUNG, Jeon-Gue PARK, Hyung-Bae JEON, Yoo-Rhee OH, Yun-Keun LEE
IPC: G10L15/02
CPC classification number: G10L15/00, G06F21/31, G06F2221/2133, G10L17/00
Abstract: Disclosed herein are a voice-based CAPTCHA method and apparatus that perform a CAPTCHA procedure using a human voice. In the voice-based CAPTCHA method, a plurality of sounds uttered by a user are collected. A start point and an end point of the voice are detected in each collected sound, and speech sections are detected from them. The sound in each detected speech section is compared with a reference sound to determine whether it was uttered correctly. If the sounds were uttered correctly, it is then determined whether they were made by the same speaker. Because the CAPTCHA procedure relies on a human voice, it can easily be checked online whether a human being has personally made the spoken response.
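The abstract does not state how the start and end points of the voice are found. As a minimal sketch, assume a simple frame-energy threshold: a frame counts as speech when its mean squared amplitude exceeds a fixed value. The function name `detect_endpoints`, the frame length, and the threshold are hypothetical choices for illustration only.

```python
def detect_endpoints(samples, frame_len=4, threshold=0.01):
    """Return (start, end) sample indices of the speech section, or None.

    Assumption: speech is any non-overlapping frame whose mean energy
    exceeds `threshold`; `end` is exclusive.
    """
    voiced = []
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        energy = sum(x * x for x in frame) / len(frame)
        if energy > threshold:
            voiced.append(i)
    if not voiced:
        return None
    start = voiced[0]
    end = min(voiced[-1] + frame_len, len(samples))
    return start, end


# Hypothetical utterance: silence, eight voiced samples, silence.
signal = [0.0] * 8 + [0.5, -0.5, 0.4, -0.4] * 2 + [0.0] * 8
print(detect_endpoints(signal))  # → (8, 16)
```

The later steps in the abstract (comparison with a reference sound and the same-speaker check) would operate on the slice `signal[start:end]`; they are not sketched here because the abstract gives no detail on them.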
-
-