-
公开(公告)号:US11922962B2
公开(公告)日:2024-03-05
申请号:US17895256
申请日:2022-08-25
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
Inventor: Seungkwon Beack , Tae Jin Lee , Min Je Kim , Kyeongok Kang , Dae Young Jang , Jeongil Seo , Jin Woo Hong , Chieteuk Ahn , Ho Chong Park , Young-cheol Park
IPC: G10L19/022 , G10L19/06 , G10L19/18 , G10L19/22
CPC classification number: G10L19/22 , G10L19/022 , G10L19/06 , G10L19/18
Abstract: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.
-
42.
公开(公告)号:US11804230B2
公开(公告)日:2023-10-31
申请号:US17711908
申请日:2022-04-01
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , Gwangju Institute of Science and Technology
Inventor: Inseon Jang , Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Woo-taek Lim , Jongwon Shin , Youngju Cheon , Sangwook Han , Soojoong Hwang
IPC: G10L19/02 , G10L19/038 , G06N3/04
CPC classification number: G10L19/038 , G06N3/04 , G10L19/02
Abstract: An audio encoding/decoding apparatus and method using vector quantized residual error features are disclosed. An audio signal encoding method includes outputting a bitstream of a main codec by encoding an original signal, decoding the bitstream of the main codec, determining a residual error feature vector from a feature vector of a decoded signal and a feature vector of the original signal, and outputting a bitstream of additional information by encoding the residual error feature vector.
-
公开(公告)号:US11508385B2
公开(公告)日:2022-11-22
申请号:US16686859
申请日:2019-11-18
Inventor: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Hui Yong Kim
IPC: G06N3/04 , G06N3/08 , G10L19/032 , G10L19/02
Abstract: Disclosed is a method of processing a residual signal for audio coding and an audio coding apparatus. The method learns a feature map of a reference signal through a residual signal learning engine including a convolutional layer and a neural network and performs learning based on a result obtained by mapping a node of an output layer of the neural network and a quantization level of index of the residual signal.
-
公开(公告)号:US11488613B2
公开(公告)日:2022-11-01
申请号:US17098090
申请日:2020-11-13
Applicant: Electronics and Telecommunications Research Institute , The Trustees of Indiana University
Inventor: Minje Kim , Kai Zhen , Mi Suk Lee , Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Jin Soo Choi
IPC: G10L19/08 , G10L19/032 , G10L19/26 , G06N3/08 , G10L25/30 , G10L13/02 , G10L21/0208
Abstract: Disclosed are a method for coding a residual signal of LPC coefficients based on collaborative quantization and a computing device for performing the method. The residual signal coding method includes: generating encoded LPC coefficients and LPC residual signals by performing LPC analysis and quantization on an input speech; Determining a predicted LPC residual signal by applying the LPC residual signal to cross module residual learning; Performing LPC synthesis using the coded LPC coefficients and the predicted LPC residual signal; It may include the step of determining an output speech that is a synthesized output according to a result of performing the LPC synthesis.
-
公开(公告)号:US11430458B2
公开(公告)日:2022-08-30
申请号:US16835728
申请日:2020-03-31
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
Inventor: Seungkwon Beack , Tae Jin Lee , Min Je Kim , Kyeongok Kang , Dae Young Jang , Jeongil Seo , Jin Woo Hong , Chieteuk Ahn , Ho Chong Park , Young-cheol Park
IPC: G10L19/022 , G10L19/22 , G10L19/06 , G10L19/18
Abstract: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.
-
46.
公开(公告)号:US20220020385A1
公开(公告)日:2022-01-20
申请号:US17377157
申请日:2021-07-15
Inventor: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Woo-taek Lim , Inseon Jang , Jin Soo Choi
IPC: G10L19/06 , G10L19/032
Abstract: An audio signal encoding method performed by an encoder includes identifying a time-domain audio signal in a unit of blocks, quantizing a linear prediction coefficient extracted from a combined block in which a current original block of the audio signal and a previous original block chronologically adjacent to the current original block using frequency-domain linear predictive coding (LPC), generating a temporal envelope by dequantizing the quantized linear prediction coefficient, extracting a residual signal from the combined block based on the temporal envelope, quantizing the residual signal by one of time-domain quantization and frequency-domain quantization, and transforming the quantized residual signal and the quantized linear prediction coefficient into a bitstream.
-
公开(公告)号:US20210005208A1
公开(公告)日:2021-01-07
申请号:US16686859
申请日:2019-11-18
Inventor: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Hui Yong Kim
IPC: G10L19/02 , G10L19/032 , G06N3/08 , G06N3/04
Abstract: Disclosed is a method of processing a residual signal for audio coding and an audio coding apparatus. The method learns a feature map of a reference signal through a residual signal learning engine including a convolutional layer and a neural network and performs learning based on a result obtained by mapping a node of an output layer of the neural network and a quantization level of index of the residual signal.
-
公开(公告)号:US10783893B2
公开(公告)日:2020-09-22
申请号:US16126964
申请日:2018-09-10
Inventor: Seung Kwon Beack , Tae Jin Lee , Jong Mo Sung , Jeong Il Seo , Kyeong Ok Kang , Dae Young Jang , Jin Woong Kim
IPC: H04S3/00 , G10L19/008
Abstract: An encoder and an encoding method for a multi-channel signal, and a decoder and a decoding method for a multi-channel signal are disclosed. A multi-channel signal may be efficiently processed by consecutive downmixing or upmixing.
-
公开(公告)号:US10777212B2
公开(公告)日:2020-09-15
申请号:US16179120
申请日:2018-11-02
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
Inventor: Tae Jin Lee , Seung-Kwon Baek , Min Je Kim , Dae Young Jang , Jeongil Seo , Kyeongok Kang , Jin-Woo Hong , Hochong Park , Young-cheol Park
Abstract: Provided are an apparatus and a method for integrally encoding and decoding a speech signal and a audio signal. The encoding apparatus may include: an input signal analyzer to analyze a characteristic of an input signal; a first conversion encoder to convert the input signal to a frequency domain signal, and to encode the input signal when the input signal is a audio characteristic signal; a Linear Predictive Coding (LPC) encoder to perform LPC encoding of the input signal when the input signal is a speech characteristic signal; and a bitstream generator to generate a bitstream using an output.
-
公开(公告)号:US10580419B2
公开(公告)日:2020-03-03
申请号:US16126964
申请日:2018-09-10
Inventor: Seung Kwon Beack , Tae Jin Lee , Jong Mo Sung , Jeong Il Seo , Kyeong Ok Kang , Dae Young Jang , Jin Woong Kim
IPC: H04S3/00 , G10L19/008
Abstract: An encoder and an encoding method for a multi-channel signal, and a decoder and a decoding method for a multi-channel signal are disclosed. A multi-channel signal may be efficiently processed by consecutive downmixing or upmixing.
-
-
-
-
-
-
-
-
-