-
11.
Publication No.: US11195537B2
Publication Date: 2021-12-07
Application No.: US16747533
Filing Date: 2020-01-21
Applicant: INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY , WILUS INSTITUTE OF STANDARDS AND TECHNOLOGY INC. , ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
Inventor: Taegyu Lee , Hyunoh Oh , Youngcheol Park , Daehee Youn , Jeongil Seo , Yongju Lee , Seungkwon Beack , Kyeongok Kang , Daeyoung Jang
IPC: G10L19/008 , H04S3/00 , H04R5/033 , H04R3/00
Abstract: The present invention relates to a method and an apparatus for binaural rendering of an audio signal using variable order filtering in the frequency domain. To this end, provided are a method for processing an audio signal, including: receiving an input audio signal; receiving a set of truncated subband filter coefficients for filtering each subband signal of the input audio signal, the set of truncated subband filter coefficients being constituted by one or more FFT filter coefficients generated by performing an FFT with a predetermined block size; generating at least one subframe for each subband; generating at least one filtered subframe for each subband; performing an inverse FFT on the filtered subframe for each subband; and generating a filtered subband signal by overlap-adding the transformed subframe for each subband, and an apparatus for processing an audio signal using the same.
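The block-wise filtering steps named in this abstract (subframe generation, FFT-domain multiplication, inverse FFT, overlap-add) can be illustrated with a minimal Python sketch for one subband. It is not the patented method: it assumes a single filter block no longer than the block size, a power-of-two block size, and hypothetical names (`fft`, `ifft`, `overlap_add_filter`) chosen only for illustration.

```python
import cmath


def fft(x, inverse=False):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    sign = 1 if inverse else -1
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    out = [0j] * n
    for k in range(n // 2):
        tw = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + tw
        out[k + n // 2] = even[k] - tw
    return out


def ifft(spectrum):
    """Inverse FFT, scaled by 1/n."""
    n = len(spectrum)
    return [v / n for v in fft(spectrum, inverse=True)]


def overlap_add_filter(signal, taps, block_size):
    """Filter one subband signal with FIR `taps` via block FFT and overlap-add.

    Assumption: len(taps) <= block_size and block_size is a power of two.
    """
    if len(taps) > block_size:
        raise ValueError("this sketch handles a single filter block only")
    fft_len = 2 * block_size  # zero-padding avoids circular wrap-around
    coeffs = fft(list(taps) + [0.0] * (fft_len - len(taps)))
    out = [0j] * (len(signal) + len(taps) - 1)
    for start in range(0, len(signal), block_size):
        # Generate a zero-padded subframe from the next block of samples.
        frame = list(signal[start:start + block_size])
        frame += [0.0] * (fft_len - len(frame))
        # Filter the subframe by multiplying spectra, then transform back.
        filtered = ifft([a * b for a, b in zip(fft(frame), coeffs)])
        # Overlap-add the filtered subframe into the output.
        for i, v in enumerate(filtered):
            if start + i < len(out):
                out[start + i] += v
    return out
```

For example, `overlap_add_filter([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0], [0.5, -0.25, 0.125], 4)` should match direct convolution up to rounding error; the output is complex-valued, so real inputs come back with near-zero imaginary parts.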
-
Publication No.: US10410646B2
Publication Date: 2019-09-10
Application No.: US15625623
Filing Date: 2017-06-16
Inventor: Jeongil Seo , Seungkwon Beack , Kyeongok Kang , Jin Woo Hong , Jinwoong Kim , Chieteuk Ahn , Kwangki Kim , Minsoo Hahn
IPC: G10L19/20 , G10L19/008
Abstract: A multi-object audio encoding and decoding apparatus supporting a post downmix signal may be provided. The multi-object audio encoding apparatus may include: an object information extraction and downmix generation unit to generate object information and a downmix signal from input object signals; a parameter determination unit to determine a downmix information parameter using the extracted downmix signal and the post downmix signal; and a bitstream generation unit to combine the object information and the downmix information parameter, and to generate an object bitstream.
-
Publication No.: US09860668B2
Publication Date: 2018-01-02
Application No.: US15300277
Filing Date: 2015-04-02
Applicant: WILUS INSTITUTE OF STANDARDS AND TECHNOLOGY INC. , ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
Inventor: Hyun Oh Oh , Taegyu Lee , Jeongil Seo
CPC classification number: H04S7/307 , G10L19/008 , H04R3/04 , H04R2430/03 , H04R2499/11 , H04R2499/15 , H04S3/008 , H04S7/303 , H04S7/306 , H04S2400/01 , H04S2400/03 , H04S2400/11 , H04S2420/01 , H04S2420/07
Abstract: The present invention relates to a method and an apparatus for processing an audio signal, and more particularly, to a method and an apparatus for processing an audio signal that synthesize an object signal and a channel signal and effectively binaural-render the synthesized signal. To this end, the present invention provides a method for processing an audio signal, including: receiving an input audio signal including a multi-channel signal; receiving filter order information variably determined for each subband of a frequency domain; receiving block length information for each subband based on a fast Fourier transform length for each subband of filter coefficients for binaural filtering of the input audio signal; receiving Variable Order Filtering in Frequency-domain (VOFF) coefficients corresponding to each subband and each channel of the input audio signal per block of the corresponding subband, a total sum of lengths of the VOFF coefficients corresponding to the same subband and the same channel being determined based on the filter order information of the corresponding subband; and filtering each subband signal of the input audio signal by using the received VOFF coefficients to generate a binaural output signal, and an apparatus for processing an audio signal by using the same.
-
Publication No.: US11430457B2
Publication Date: 2022-08-30
Application No.: US16846272
Filing Date: 2020-04-10
Inventor: Seung Kwon Beack , Tae Jin Lee , Min Je Kim , Kyeongok Kang , Dae Young Jang , Jin Woo Hong , Jeongil Seo , Chieteuk Ahn , Hochong Park , Young-Cheol Park
IPC: G10L19/087 , G10L19/22 , G10L19/125 , G10L19/26
Abstract: Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT-based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encodes the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
-
Publication No.: US11222645B2
Publication Date: 2022-01-11
Application No.: US16562921
Filing Date: 2019-09-06
Inventor: Jeongil Seo , Seungkwon Beack , Kyeongok Kang , Jin Woo Hong , Jinwoong Kim , Chieteuk Ahn , Kwangki Kim , Minsoo Hahn
IPC: G10L19/20 , G10L19/008
Abstract: A multi-object audio encoding and decoding apparatus supporting a post downmix signal may be provided. The multi-object audio encoding apparatus may include: an object information extraction and downmix generation unit to generate object information and a downmix signal from input object signals; a parameter determination unit to determine a downmix information parameter using the extracted downmix signal and the post downmix signal; and a bitstream generation unit to combine the object information and the downmix information parameter, and to generate an object bitstream.
-
Publication No.: US10714103B2
Publication Date: 2020-07-14
Application No.: US16557238
Filing Date: 2019-08-30
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , Kwangwoon University Industry-Academic Collaboration Foundation
Inventor: Tae Jin Lee , Seung-Kwon Baek , Min Je Kim , Dae Young Jang , Jeongil Seo , Kyeongok Kang , Jin-Woo Hong , Hochong Park , Young-Cheol Park
Abstract: Provided is an encoding apparatus for integrally encoding and decoding a speech signal and an audio signal, which may include: an input signal analyzer to analyze a characteristic of an input signal; a stereo encoder to down mix the input signal to a mono signal when the input signal is a stereo signal, and to extract stereo sound image information; a frequency band expander to expand a frequency band of the input signal; a sampling rate converter to convert a sampling rate; a speech signal encoder to encode the input signal using a speech encoding module when the input signal is a speech characteristic signal; an audio signal encoder to encode the input signal using an audio encoding module when the input signal is an audio characteristic signal; and a bitstream generator to generate a bitstream.
-
17.
Publication No.: US10580417B2
Publication Date: 2020-03-03
Application No.: US15031275
Filing Date: 2014-10-22
Applicant: INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY , WILUS INSTITUTE OF STANDARDS AND TECHNOLOGY INC. , ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
Inventor: Taegyu Lee , Hyunoh Oh , Youngcheol Park , Daehee Youn , Jeongil Seo , Yongju Lee , Seungkwon Beack , Kyeongok Kang , Daeyoung Jang
IPC: G10L19/008 , H04S3/00 , H04R5/033 , H04R3/00
Abstract: The present invention relates to a method and an apparatus for binaural rendering of an audio signal using variable order filtering in the frequency domain. To this end, provided are a method for processing an audio signal, including: receiving an input audio signal; receiving a set of truncated subband filter coefficients for filtering each subband signal of the input audio signal, the set of truncated subband filter coefficients being constituted by one or more FFT filter coefficients generated by performing an FFT with a predetermined block size; generating at least one subframe for each subband; generating at least one filtered subframe for each subband; performing an inverse FFT on the filtered subframe for each subband; and generating a filtered subband signal by overlap-adding the transformed subframe for each subband, and an apparatus for processing an audio signal using the same.
-
18.
Publication No.: US10558881B2
Publication Date: 2020-02-11
Application No.: US15662616
Filing Date: 2017-07-28
Inventor: Yong Ju Cho , Soon-heung Jung , Hyun Cheol Kim , Jeongil Seo , Joo Myoung Seok , Sangwoo Ahn , Seung Jun Yang , Injae Lee , Hee Kyung Lee , Seong Yong Lim
IPC: G06K9/36 , G06K9/20 , H04N9/097 , G06T3/00 , H04N13/225 , G06K9/00 , H04N5/262 , H04N5/232 , G06T3/40 , H04N13/239 , H04N13/00
Abstract: Provided is a parallax minimization stitching method and apparatus using control points in an overlapping region. A parallax minimization stitching method may include defining a plurality of control points in an overlapping region of a first image and a second image received from a plurality of cameras, performing a first geometric correction by applying a homography to the control points, defining a plurality of patches based on the control points, and performing a second geometric correction by mapping the patches.
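The first geometric correction mentioned in this abstract applies a homography to the control points. As a rough illustration only, and not the patented procedure, the following Python sketch maps 2D points through a 3x3 homography using homogeneous coordinates. The function name `apply_homography` and the example matrix are hypothetical.

```python
def apply_homography(h, points):
    """Map (x, y) points through a 3x3 homography given as nested lists.

    Each point is lifted to homogeneous coordinates (x, y, 1), multiplied
    by the matrix, and divided by the resulting third component.
    """
    mapped = []
    for x, y in points:
        w = h[2][0] * x + h[2][1] * y + h[2][2]
        if w == 0:
            raise ValueError("point maps to infinity under this homography")
        u = (h[0][0] * x + h[0][1] * y + h[0][2]) / w
        v = (h[1][0] * x + h[1][1] * y + h[1][2]) / w
        mapped.append((u, v))
    return mapped


# Hypothetical example: a pure translation by (2, 3) in homography form.
translation = [[1, 0, 2], [0, 1, 3], [0, 0, 1]]
control_points = [(0, 0), (1, 1)]
corrected = apply_homography(translation, control_points)
```

How the homography is estimated, and how the patches for the second correction are built from the control points, is not specified in the abstract and is left out of this sketch.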
-
Publication No.: US10403293B2
Publication Date: 2019-09-03
Application No.: US15810732
Filing Date: 2017-11-13
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , Kwangwoon University Industry-Academic Collaboration Foundation
Inventor: Tae Jin Lee , Seung-Kwon Baek , Min Je Kim , Dae Young Jang , Jeongil Seo , Kyeongok Kang , Jin-Woo Hong , Hochong Park , Young-Cheol Park
Abstract: Provided is an encoding apparatus for integrally encoding and decoding a speech signal and an audio signal, which may include: an input signal analyzer to analyze a characteristic of an input signal; a stereo encoder to down mix the input signal to a mono signal when the input signal is a stereo signal, and to extract stereo sound image information; a frequency band expander to expand a frequency band of the input signal; a sampling rate converter to convert a sampling rate; a speech signal encoder to encode the input signal using a speech encoding module when the input signal is a speech characteristic signal; an audio signal encoder to encode the input signal using an audio encoding module when the input signal is an audio characteristic signal; and a bitstream generator to generate a bitstream.
-
Publication No.: US10121482B2
Publication Date: 2018-11-06
Application No.: US15618689
Filing Date: 2017-06-09
Inventor: Tae Jin Lee , Seung-Kwon Baek , Min Je Kim , Dae Young Jang , Jeongil Seo , Kyeongok Kang , Jin-Woo Hong , Hochong Park , Young-cheol Park
Abstract: Provided are an apparatus and a method for integrally encoding and decoding a speech signal and an audio signal. The encoding apparatus may include: an input signal analyzer to analyze a characteristic of an input signal; a first conversion encoder to convert the input signal to a frequency domain signal, and to encode the input signal when the input signal is an audio characteristic signal; a Linear Predictive Coding (LPC) encoder to perform LPC encoding of the input signal when the input signal is a speech characteristic signal; a frequency band expander for expanding a frequency band of the input signal whose output is transmitted to either the time-domain encoding module or the transform encoding module based on the input characteristic; and a bitstream generator to generate a bitstream using an output signal of the first conversion encoder and an output signal of the LPC encoder.
-