-
公开(公告)号:US11562757B2
公开(公告)日:2023-01-24
申请号:US17377157
申请日:2021-07-15
Inventor: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Woo-taek Lim , Inseon Jang , Jin Soo Choi
IPC: G10L19/06 , G10L19/032
Abstract: An audio signal encoding method performed by an encoder includes identifying a time-domain audio signal in a unit of blocks, quantizing a linear prediction coefficient extracted from a combined block in which a current original block of the audio signal and a previous original block chronologically adjacent to the current original block using frequency-domain linear predictive coding (LPC), generating a temporal envelope by dequantizing the quantized linear prediction coefficient, extracting a residual signal from the combined block based on the temporal envelope, quantizing the residual signal by one of time-domain quantization and frequency-domain quantization, and transforming the quantized residual signal and the quantized linear prediction coefficient into a bitstream.
-
公开(公告)号:US11456001B2
公开(公告)日:2022-09-27
申请号:US16814103
申请日:2020-03-10
Applicant: Electronics and Telecommunications Research Institute , Kwangwoon University Industry-Academic Collaboration Foundation
Inventor: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Hochong Park
IPC: G10L19/02 , G06N3/04 , G10L21/038 , G10L19/032
Abstract: Disclosed are a method of encoding a high band of an audio, a method of decoding a high band of an audio, and an encoder and a decoder for performing the methods. The method of decoding a high band of an audio, the method performed by a decoder, includes identifying a parameter extracted through a first neural network, identifying side information extracted through a second neural network, and restoring a high band of an audio by applying the parameter and the side information to a third neural network.
-
公开(公告)号:US11133015B2
公开(公告)日:2021-09-28
申请号:US16180298
申请日:2018-11-05
Inventor: Seung Kwon Beack , Woo-taek Lim , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Hui Yong Kim
IPC: G10L19/04 , G10L25/30 , G10L19/008
Abstract: A method of predicting a channel parameter of an original signal from a downmix signal is disclosed. The method may include generating an input feature map to be used to predict a channel parameter of the original signal based on a downmix signal of an original signal, determining an output feature map including a predicted parameter to be used to predict the channel parameter by applying the input feature map to a neural network, generating a label map including information associated with the channel parameter of the original signal, and predicting the channel parameter of the original signal by comparing the output feature map and the label map.
-
14.
公开(公告)号:US11823688B2
公开(公告)日:2023-11-21
申请号:US17390753
申请日:2021-07-30
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , The Trustees of Indiana University
Inventor: Woo-taek Lim , Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Inseon Jang , Minje Kim
IPC: G10L19/008 , G06N3/04 , G10L19/032 , G10L19/24
CPC classification number: G10L19/008 , G06N3/04 , G10L19/032
Abstract: Disclosed are a method of encoding and decoding an audio signal and an encoder and a decoder performing the method. The method of encoding an audio signal includes identifying an input signal, and generating a bitstring of each encoding layer by applying, to the input signal, an encoding model including a plurality of successive encoding layers that encodes the input signal, in which a current encoding layer among the encoding layers is trained to generate a bitstring of the current encoding layer by encoding an encoded signal which is a signal encoded in a previous encoding layer and quantizing an encoded signal which is a signal encoded in the current encoding layer.
-
公开(公告)号:US11783844B2
公开(公告)日:2023-10-10
申请号:US17527351
申请日:2021-11-16
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , Gwangju Institute of Science and Technology
Inventor: Woo-taek Lim , Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Inseon Jang , Jong Won Shin , Soojoong Hwang , Youngju Cheon , Sangwook Han
Abstract: Disclosed are methods of encoding and decoding an audio signal using side information, and an encoder and a decoder for performing the methods. The method of encoding an audio signal using side information includes identifying an input signal, the input signal being an original audio signal, extracting side information from the input signal using a learning model trained to extract side information from a feature vector of the input signal, encoding the input signal, and generating a bitstream by combining the encoded input signal and the side information.
-
16.
公开(公告)号:US11581000B2
公开(公告)日:2023-02-14
申请号:US17105835
申请日:2020-11-27
Inventor: Woo-Taek Lim , Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee
IPC: G10L19/00 , G10L25/30 , G10L19/16 , G06N3/08 , G10L19/038
Abstract: Disclosed is an apparatus and method for encoding/decoding an audio signal using information of a previous frame. An audio signal encoding method includes: generating a current latent vector by reducing dimension of a current frame of an audio signal; generating a concatenation vector by concatenating a previous latent vector generated by reducing dimension of a previous frame of the audio signal with the current latent vector; and encoding and quantizing the concatenation vector.
-
17.
公开(公告)号:US11580999B2
公开(公告)日:2023-02-14
申请号:US17331416
申请日:2021-05-26
Inventor: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Woo-taek Lim , Inseon Jang
IPC: G10L19/022 , G10L19/06 , G10L19/16 , G10L19/035
Abstract: An audio signal encoding method performed by an encoder includes identifying an audio signal of a time domain in units of a block, generating a combined block by combining i) a current original block of the audio signal and ii) a previous original block chronologically adjacent to the current original block, extracting a first residual signal of a frequency domain from the combined block using linear predictive coding of a time domain, overlapping chronologically adjacent first residual signals among first residual signals converted into a time domain, and quantizing a second residual signal of a time domain extracted from the overlapped first residual signal by converting the second residual signal of the time domain into a frequency domain using linear predictive coding of a frequency domain.
-
18.
公开(公告)号:US11276413B2
公开(公告)日:2022-03-15
申请号:US16543095
申请日:2019-08-16
Applicant: Electronics and Telecommunications Research Institute , THE TRUSTEES OF INDIANA UNIVERSITY
Inventor: Mi Suk Lee , Jongmo Sung , Minje Kim , Kai Zhen
Abstract: Disclosed are an audio signal encoding method and audio signal decoding method, and an encoder and decoder performing the same. The audio signal encoding method includes applying an audio signal to a training model including N autoencoders provided in a cascade structure, encoding an output result derived through the training model, and generating a bitstream with respect to the audio signal based on the encoded output result.
-
公开(公告)号:US12205605B2
公开(公告)日:2025-01-21
申请号:US17670172
申请日:2022-02-11
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
Inventor: Inseon Jang , Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Woo-Taek Lim , Hong-Goo Kang , Jihyun Lee , Chanwoo Lee , Hyungseob Lim
IPC: G10L19/038 , G10L19/00 , G10L25/30
Abstract: An audio signal encoding and decoding method using a neural network model, and an encoder and decoder for performing the same are disclosed. A method of encoding an audio signal using a neural network model, the method may include identifying an input signal, generating a quantized latent vector by inputting the input signal into a neural network model encoding the input signal, and generating a bitstream corresponding to the quantized latent vector, wherein the neural network model may include i) a feature extraction layer generating a latent vector by extracting a feature of the input signal, ii) a plurality of downsampling blocks downsampling the latent vector, and iii) a plurality of quantization blocks performing quantization of a downsampled latent vector.
-
公开(公告)号:US12159640B2
公开(公告)日:2024-12-03
申请号:US17884364
申请日:2022-08-09
Inventor: Jongmo Sung , Seung Kwon Beack , Tae Jin Lee , Woo-taek Lim , Inseon Jang
Abstract: Provided is an encoding method according to various example embodiments and an encoder performing the method. The encoding method includes outputting a linear prediction (LP) coefficients bitstream and a residual signal by performing a linear prediction analysis on an input signal, outputting a first latent signal obtained by encoding a periodic component of the residual signal, using a first neural network module, outputting a first bitstream obtained by quantizing the first latent signal, using a quantization module, outputting a second latent signal obtained by encoding an aperiodic component of the residual signal, using the first neural network module, and outputting a second bitstream obtained by quantizing the second latent signal, using the quantization module, wherein the aperiodic component of the residual signal is calculated based on a periodic component of the residual signal decoded from the quantized first latent signal output by de-quantizing the first bitstream.
-
-
-
-
-
-
-
-
-