-
Publication No.: US10923106B2
Publication date: 2021-02-16
Application No.: US16256835
Filing date: 2019-01-24
Applicant: Korea Electronics Technology Institute
Inventor: Jong Yeol Yang, Young Han Lee, Choong Sang Cho, Hye Dong Jung
IPC: G10L13/00, G10L13/10, H04N21/233, G06K9/00
Abstract: An audio synthesis method adapted to video characteristics is provided. The audio synthesis method according to an embodiment includes: extracting characteristics x from a video in a time-series way; extracting characteristics p of phonemes from a text; and generating an audio spectrum characteristic S_t used to generate an audio to be synthesized with a video at a time t, based on correlations between an audio spectrum characteristic S_{t−1}, which is used to generate an audio to be synthesized with a video at a time t−1, and the characteristics x. Accordingly, an audio can be synthesized according to video characteristics, and speech according to a video can be easily added.
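Illustrative sketch (not taken from the patent): the abstract does not say how the correlation between the previous spectrum characteristic and the video characteristics x is computed. One common reading is a soft attention step, sketched below under the assumptions that the correlation is a dot product, that the previous spectrum characteristic and each x share the same dimension, and that a learned model (not shown) would combine the resulting context with the phoneme characteristics p to produce the next spectrum characteristic. The function name `attention_context` and its parameters are hypothetical.

```python
import math
from typing import List


def attention_context(s_prev: List[float], video_feats: List[List[float]]) -> List[float]:
    """Blend video features, weighted by their similarity to the previous spectrum feature.

    Assumption: "correlation" is modelled as a dot product followed by a softmax.
    """
    # One score per video time step: dot product with the previous spectrum feature.
    scores = [sum(a * b for a, b in zip(s_prev, x)) for x in video_feats]
    # Numerically stable softmax over the scores.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Context vector: weighted sum of the video features.
    dim = len(video_feats[0])
    return [sum(w * x[d] for w, x in zip(weights, video_feats)) for d in range(dim)]
```

With a previous spectrum feature that is orthogonal to every video feature, all time steps receive equal weight and the context is the plain average of x.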
-
Publication No.: US10978049B2
Publication date: 2021-04-13
Application No.: US16256563
Filing date: 2019-01-24
Applicant: Korea Electronics Technology Institute
Inventor: Young Han Lee, Jong Yeol Yang, Choong Sang Cho, Hye Dong Jung
Abstract: An audio segmentation method based on an attention mechanism is provided. The audio segmentation method according to an embodiment obtains a mapping relationship between an “inputted text” and an “audio spectrum feature vector for generating an audio signal”, the audio spectrum feature vector being automatically synthesized by using the inputted text, and segments an inputted audio signal by using the mapping relationship. Accordingly, high quality can be guaranteed and the effort, time, and cost can be noticeably reduced through audio segmentation utilizing the attention mechanism.
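Illustrative sketch (not taken from the patent): one way to turn a text-to-spectrum mapping into segment boundaries is to assign each spectrum frame to the text token it attends to most strongly, then cut the audio wherever that token changes. The function name `segment_by_attention`, the row-per-frame layout of the attention matrix, and the `hop_length` parameter (audio samples per spectrum frame) are assumptions made for this example.

```python
from typing import List, Tuple


def segment_by_attention(
    attention: List[List[float]],
    hop_length: int,
) -> List[Tuple[int, int, int]]:
    """Return (token_index, start_sample, end_sample) for each run of frames.

    attention[f][k] is the weight that spectrum frame f places on text token k.
    """
    segments: List[Tuple[int, int, int]] = []
    run_token = None
    run_start = 0
    for frame, weights in enumerate(attention):
        # Hard-assign the frame to its most strongly attended token.
        token = max(range(len(weights)), key=weights.__getitem__)
        if token != run_token:
            # The token changed: close the previous run, if any.
            if run_token is not None:
                segments.append((run_token, run_start * hop_length, frame * hop_length))
            run_token, run_start = token, frame
    # Close the final run at the end of the audio.
    if run_token is not None:
        segments.append((run_token, run_start * hop_length, len(attention) * hop_length))
    return segments
```

The returned sample ranges can be used to slice the input waveform directly, for example `audio[start:end]` for each segment.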
-