-
Publication No.: US12159640B2
Publication date: 2024-12-03
Application No.: US17884364
Filing date: 2022-08-09
Inventor: Jongmo Sung, Seung Kwon Beack, Tae Jin Lee, Woo-taek Lim, Inseon Jang
Abstract: Provided are an encoding method according to various example embodiments and an encoder performing the method. The encoding method includes outputting a linear prediction (LP) coefficients bitstream and a residual signal by performing a linear prediction analysis on an input signal, outputting a first latent signal obtained by encoding a periodic component of the residual signal, using a first neural network module, outputting a first bitstream obtained by quantizing the first latent signal, using a quantization module, outputting a second latent signal obtained by encoding an aperiodic component of the residual signal, using the first neural network module, and outputting a second bitstream obtained by quantizing the second latent signal, using the quantization module, wherein the aperiodic component of the residual signal is calculated based on a periodic component of the residual signal decoded from the quantized first latent signal output by de-quantizing the first bitstream.
-
Publication No.: US11978465B2
Publication date: 2024-05-07
Application No.: US17507746
Filing date: 2021-10-21
Inventor: Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Woo-taek Lim, Inseon Jang
IPC: G10L19/13, G10L19/032, G10L19/06
CPC classification number: G10L19/13, G10L19/032, G10L19/06
Abstract: A method of generating a residual signal performed by an encoder includes identifying an input signal including an audio sample, generating a first residual signal from the input signal using linear predictive coding (LPC), generating a second residual signal containing less information than the first residual signal by transforming the first residual signal, transforming the second residual signal into a frequency domain, and generating a third residual signal containing less information than the second residual signal from the transformed second residual signal using frequency-domain prediction (FDP) coding.
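The first step of this abstract, obtaining a residual from the input by LPC, can be illustrated with a minimal sketch. It assumes the textbook autocorrelation method with the Levinson-Durbin recursion, which the abstract does not specify; the function names `autocorrelation`, `levinson_durbin`, and `lpc_residual` are hypothetical, and the later transform and FDP stages are not shown.

```python
def autocorrelation(x, max_lag):
    """Return r[0..max_lag], the autocorrelation of the sequence x."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations for a[1..order].

    The predictor is s_hat[n] = sum_k a[k] * s[n - k].
    """
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # silent or perfectly predictable input: stop early
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err  # reflection coefficient for this order
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k  # prediction error energy shrinks each order
    return a[1:]


def lpc_residual(x, coeffs):
    """e[n] = x[n] - sum_k a[k] * x[n - k], with zero history before n = 0."""
    residual = []
    for n in range(len(x)):
        pred = sum(c * x[n - k]
                   for k, c in enumerate(coeffs, start=1) if n - k >= 0)
        residual.append(x[n] - pred)
    return residual
```

For a strongly correlated input such as a decaying exponential, the residual carries far less energy than the input, which is the sense in which each stage in the abstract leaves less information to encode.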
-
13.
Publication No.: US11804230B2
Publication date: 2023-10-31
Application No.: US17711908
Filing date: 2022-04-01
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, Gwangju Institute of Science and Technology
Inventor: Inseon Jang, Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Woo-taek Lim, Jongwon Shin, Youngju Cheon, Sangwook Han, Soojoong Hwang
IPC: G10L19/02, G10L19/038, G06N3/04
CPC classification number: G10L19/038, G06N3/04, G10L19/02
Abstract: An audio encoding/decoding apparatus and method using vector quantized residual error features are disclosed. An audio signal encoding method includes outputting a bitstream of a main codec by encoding an original signal, decoding the bitstream of the main codec, determining a residual error feature vector from a feature vector of a decoded signal and a feature vector of the original signal, and outputting a bitstream of additional information by encoding the residual error feature vector.
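As a rough illustration of the residual error feature idea, the sketch below takes the element-wise difference between the original and decoded feature vectors and codes it as the index of the nearest codeword. The nearest-neighbour codebook search, the toy codebook, and the decoder-side `apply_side_info` step are assumptions made for illustration; the abstract does not say how the feature vectors are computed or how the residual is encoded.

```python
def residual_error_feature(orig_feat, decoded_feat):
    """Element-wise difference between original and decoded features."""
    return [o - d for o, d in zip(orig_feat, decoded_feat)]


def vq_encode(vec, codebook):
    """Return the index of the codeword closest to vec (squared Euclidean)."""
    def dist(codeword):
        return sum((v - c) ** 2 for v, c in zip(vec, codeword))
    return min(range(len(codebook)), key=lambda i: dist(codebook[i]))


def apply_side_info(decoded_feat, index, codebook):
    """Hypothetical decoder step: add the selected codeword back."""
    return [d + c for d, c in zip(decoded_feat, codebook[index])]


# Toy values, not taken from the patent.
codebook = [[0.0, 0.0], [0.5, -0.5], [-1.0, 1.0]]
orig_feat = [1.0, 2.0]      # feature vector of the original signal
decoded_feat = [0.6, 2.4]   # feature vector of the main-codec output

index = vq_encode(residual_error_feature(orig_feat, decoded_feat), codebook)
enhanced = apply_side_info(decoded_feat, index, codebook)
```

Only `index` would need to travel in the additional-information bitstream, so the side-information cost is set by the codebook size rather than by the feature dimension.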
-
14.
Publication No.: US20220020385A1
Publication date: 2022-01-20
Application No.: US17377157
Filing date: 2021-07-15
Inventor: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Woo-taek Lim, Inseon Jang, Jin Soo Choi
IPC: G10L19/06, G10L19/032
Abstract: An audio signal encoding method performed by an encoder includes identifying a time-domain audio signal in a unit of blocks, quantizing a linear prediction coefficient extracted, using frequency-domain linear predictive coding (LPC), from a combined block in which a current original block of the audio signal and a previous original block chronologically adjacent to the current original block are combined, generating a temporal envelope by dequantizing the quantized linear prediction coefficient, extracting a residual signal from the combined block based on the temporal envelope, quantizing the residual signal by one of time-domain quantization and frequency-domain quantization, and transforming the quantized residual signal and the quantized linear prediction coefficient into a bitstream.
-
Publication No.: US12223970B2
Publication date: 2025-02-11
Application No.: US18103993
Filing date: 2023-01-31
Inventor: Jongmo Sung, Seung Kwon Beack, Tae Jin Lee, Woo-taek Lim, Inseon Jang, Byeongho Cho
IPC: G10L19/087, G10L19/038, G10L19/13, G10L25/30, G10L19/02
Abstract: An encoding method, a decoding method, an encoder for performing the encoding method, and a decoder for performing the decoding method are provided. The encoding method includes outputting an LP coefficients bitstream and a residual signal by performing an LP analysis on an input signal, outputting a first latent signal obtained by encoding a periodic component of the residual signal, a second latent signal obtained by encoding a non-periodic component of the residual signal, and a weight vector for each of the first latent signal and the second latent signal, using a first neural network module, and outputting a first bitstream obtained by quantizing the first latent signal, a second bitstream obtained by quantizing the second latent signal, and a weight bitstream obtained by quantizing the weight vector, using a quantization module.
-
16.
Publication No.: US12223426B2
Publication date: 2025-02-11
Application No.: US18166407
Filing date: 2023-02-08
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, YONSEI UNIVERSITY WONJU INDUSTRY-ACADEMIC COOPERATION FOUNDATION
Inventor: Jongmo Sung, Seung Kwon Beack, Tae Jin Lee, Woo-taek Lim, Inseon Jang, Byeongho Cho, Young Cheol Park, Joon Byun, Seungmin Shin
IPC: G10L19/00, G06N3/08, G10L19/028, G10L19/038, G10L25/30, G10L25/60, G10L25/69, G06N3/084, G10L15/00, G10L19/22
Abstract: Provided are a method and an apparatus for designing and testing an audio codec using quantization based on white noise modeling. A neural network-based audio encoder design method includes generating a quantized latent vector and a reconstructed signal corresponding to an input signal by using a white noise modeling-based quantization process, computing a total loss for training a neural network-based audio codec, based on the input signal, the reconstructed signal, and the quantized latent vector, training the neural network-based audio codec by using the total loss, and validating the trained neural network-based audio codec to select the best neural network-based audio codec.
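The abstract does not define its white noise modeling-based quantization process. One common stand-in from the learned-compression literature, assumed here purely for illustration, replaces hard rounding during training with additive uniform noise of the same width as the quantization step, then uses ordinary rounding at test time. The names `noisy_quantize` and `hard_quantize` and the step size are hypothetical.

```python
import random


def noisy_quantize(latent, step, rng):
    """Training-time proxy: add uniform noise in [-step/2, step/2].

    This keeps the operation differentiable with respect to the latent,
    unlike rounding. It is an assumed technique, not the patent's definition.
    """
    return [z + rng.uniform(-step / 2.0, step / 2.0) for z in latent]


def hard_quantize(latent, step):
    """Test-time quantizer: snap each value to the nearest multiple of step."""
    return [step * round(z / step) for z in latent]


rng = random.Random(0)           # fixed seed so runs are repeatable
latent = [0.3, -1.7, 2.49]       # toy latent vector
step = 0.5

train_view = noisy_quantize(latent, step, rng)
test_view = hard_quantize(latent, step)
```

Both views deviate from the latent by at most half a step, which is why the noise model is a reasonable surrogate for the rounding error the codec sees after training.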
-
Publication No.: US11694703B2
Publication date: 2023-07-04
Application No.: US17672041
Filing date: 2022-02-15
Inventor: Woo-taek Lim, Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Inseon Jang
Abstract: An audio signal encoding and decoding method using a learning model, a training method of the learning model, and an encoder and decoder that perform the method, are disclosed. The audio signal decoding method may include extracting a first residual signal and a first linear prediction coefficient by decoding a bitstream received from an encoder, generating a first audio signal from the first residual signal using the first linear prediction coefficient, generating a second linear prediction coefficient and a second residual signal from the first audio signal, obtaining a third linear prediction coefficient by inputting the second linear prediction coefficient into a trained learning model, and generating a second audio signal from the second residual signal using the third linear prediction coefficient.
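The decoding flow in this abstract can be traced with a small sketch. It assumes a standard all-pole synthesis filter and its inverse; `estimate_lp` and `refine` are hypothetical callables standing in for the decoder-side LP analysis and the trained learning model, neither of which the abstract specifies.

```python
def lp_residual(signal, coeffs):
    """Analysis filter: e[n] = s[n] - sum_k a[k] * s[n - k]."""
    out = []
    for n, s in enumerate(signal):
        pred = sum(c * signal[n - k]
                   for k, c in enumerate(coeffs, start=1) if n - k >= 0)
        out.append(s - pred)
    return out


def lp_synthesize(residual, coeffs):
    """Synthesis filter: s[n] = e[n] + sum_k a[k] * s[n - k]."""
    out = []
    for n, e in enumerate(residual):
        pred = sum(c * out[n - k]
                   for k, c in enumerate(coeffs, start=1) if n - k >= 0)
        out.append(e + pred)
    return out


def decode_with_refinement(residual_1, coeffs_1, estimate_lp, refine):
    """Follow the five decoding steps listed in the abstract."""
    audio_1 = lp_synthesize(residual_1, coeffs_1)   # first audio signal
    coeffs_2 = estimate_lp(audio_1)                 # second LP coefficients
    residual_2 = lp_residual(audio_1, coeffs_2)     # second residual signal
    coeffs_3 = refine(coeffs_2)                     # learning-model output
    return lp_synthesize(residual_2, coeffs_3)      # second audio signal
```

If `refine` returns its input unchanged, the second audio signal equals the first up to rounding, so any quality gain in this arrangement comes entirely from how the learning model alters the coefficients.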
-
18.
Publication No.: US11651778B2
Publication date: 2023-05-16
Application No.: US17520895
Filing date: 2021-11-08
Inventor: Woo-taek Lim, Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Inseon Jang, Jong-Won Seok, Yunsu Kim
IPC: G10L19/16, G10L19/02, G10L25/30, G10L19/038
CPC classification number: G10L19/038, G10L19/02, G10L19/167, G10L25/30
Abstract: Disclosed are methods of encoding and decoding an audio signal, and an encoder and a decoder for performing the methods. The method of encoding an audio signal includes identifying an input signal corresponding to a low frequency band of the audio signal, windowing the input signal, generating a first latent vector by inputting the windowed input signal to a first encoding model, transforming the windowed input signal into a frequency domain, generating a second latent vector by inputting the transformed input signal to a second encoding model, generating a final latent vector by combining the first latent vector and the second latent vector, and generating a bitstream corresponding to the final latent vector.
-
Publication No.: US10552711B2
Publication date: 2020-02-04
Application No.: US16203668
Filing date: 2018-11-29
Inventor: Woo-taek Lim, Seung Kwon Beack
IPC: G06K9/62, G06N3/02, G06F16/683, G10L19/008
Abstract: Disclosed are an apparatus and a method for extracting a sound source from a multi-channel audio signal. A sound source extracting method includes transforming a multi-channel audio signal into two-dimensional (2D) data, extracting a plurality of feature maps by inputting the 2D data into a convolutional neural network (CNN) including at least one layer, and extracting a sound source from the multi-channel audio signal using the feature maps.
-
Publication No.: US10271137B1
Publication date: 2019-04-23
Application No.: US16018359
Filing date: 2018-06-26
Inventor: Young Ho Jeong, Sang Won Suh, Jae-hyoun Yoo, Tae Jin Lee, Woo-taek Lim, Hui Yong Kim
Abstract: A method of detecting a sound event includes receiving sound signals using one or more directional microphones, extracting a time interval of each of the sound signals, extracting time information and an azimuth of a sound event included in the sound signals during the extracted time interval, mixing the sound signals received from the directional microphones using the extracted time interval, and determining a direction of the sound event generated at a specific time from a mixed sound signal obtained through the mixing using the extracted time information and azimuth of the sound event.