-
Publication No.: US10403293B2
Publication date: 2019-09-03
Application No.: US15810732
Application date: 2017-11-13
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , Kwangwoon University Industry-Academic Collaboration Foundation
Inventor: Tae Jin Lee , Seung-Kwon Baek , Min Je Kim , Dae Young Jang , Jeongil Seo , Kyeongok Kang , Jin-Woo Hong , Hochong Park , Young-Cheol Park
Abstract: Provided is an encoding apparatus for integrally encoding and decoding a speech signal and an audio signal. The encoding apparatus may include: an input signal analyzer to analyze a characteristic of an input signal; a stereo encoder to down mix the input signal to a mono signal when the input signal is a stereo signal, and to extract stereo sound image information; a frequency band expander to expand a frequency band of the input signal; a sampling rate converter to convert a sampling rate; a speech signal encoder to encode the input signal using a speech encoding module when the input signal is a speech characteristic signal; an audio signal encoder to encode the input signal using an audio encoding module when the input signal is an audio characteristic signal; and a bitstream generator to generate a bitstream.
-
Publication No.: US10121482B2
Publication date: 2018-11-06
Application No.: US15618689
Application date: 2017-06-09
Inventor: Tae Jin Lee , Seung-Kwon Baek , Min Je Kim , Dae Young Jang , Jeongil Seo , Kyeongok Kang , Jin-Woo Hong , Hochong Park , Young-cheol Park
Abstract: Provided are an apparatus and a method for integrally encoding and decoding a speech signal and an audio signal. The encoding apparatus may include: an input signal analyzer to analyze a characteristic of an input signal; a first conversion encoder to convert the input signal to a frequency domain signal, and to encode the input signal when the input signal is an audio characteristic signal; a Linear Predictive Coding (LPC) encoder to perform LPC encoding of the input signal when the input signal is a speech characteristic signal; a frequency band expander for expanding a frequency band of the input signal whose output is transmitted to either the time-domain encoding module or the transform encoding module based on the input characteristic; and a bitstream generator to generate a bitstream using an output signal of the first conversion encoder and an output signal of the LPC encoder.
-
Publication No.: US12225370B2
Publication date: 2025-02-11
Application No.: US18096439
Application date: 2023-01-12
Inventor: Dae Young Jang , Kyeongok Kang , Jae-hyoun Yoo , Yong Ju Lee
IPC: H04S7/00 , G10L19/008
Abstract: Disclosed is an apparatus for immersive spatial audio modeling and rendering for effectively transmitting and playing immersive spatial audio content. The apparatus for immersive spatial audio modeling and rendering disclosed herein may model a spatial audio scene, generate and transmit parameters necessary for spatial audio rendering, and generate various spatial audio effects using the spatial audio parameters, to provide an immersive three-dimensional (3D) audio source coinciding with visual experience in a virtual reality space in response to free changes in the position and direction of a remote user in the space.
-
Publication No.: US11895480B2
Publication date: 2024-02-06
Application No.: US17590288
Application date: 2022-02-01
Inventor: Dae Young Jang , Kyeongok Kang , Jae-hyoun Yoo , Yong Ju Lee
IPC: H04S7/00
CPC classification number: H04S7/302 , H04S7/307 , H04S2400/01 , H04S2400/11 , H04S2420/01
Abstract: A method and system for processing an obstacle effect in a virtual acoustic space are disclosed. The method includes receiving a parameter for an obstacle candidate plane extracted from spatial information, determining, in response to the parameter, whether the obstacle candidate plane is an obstacle related to a path between a position of a virtual sound source and a position of a user, and applying a sound effect according to the obstacle to an audio signal when the obstacle candidate plane is the obstacle. The obstacle candidate plane is a plane of an object that may become the obstacle in a sound propagation path between the virtual sound source and the user.
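The obstacle test described in this abstract can be illustrated with a minimal sketch. It assumes the candidate plane is a rectangle given by one corner and two perpendicular edge vectors, and it assumes the "sound effect" is a simple gain reduction; the abstract specifies neither, and all function and parameter names below are hypothetical.

```python
def sub(a, b):
    """Component-wise difference a - b of two 3D points."""
    return tuple(x - y for x, y in zip(a, b))

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def blocks_path(source, listener, corner, edge_u, edge_v, eps=1e-9):
    """Return True if the rectangle (corner, edge_u, edge_v) cuts the
    straight path from source to listener.
    Assumption: edge_u and edge_v are perpendicular (a rectangle)."""
    normal = cross(edge_u, edge_v)
    direction = sub(listener, source)
    denom = dot(normal, direction)
    if abs(denom) < eps:
        return False  # path runs parallel to the plane
    # Parameter t locates the plane crossing along the path (0 = source, 1 = listener).
    t = dot(normal, sub(corner, source)) / denom
    if not 0.0 < t < 1.0:
        return False  # plane is crossed outside the source-listener segment
    hit = tuple(s + t * d for s, d in zip(source, direction))
    rel = sub(hit, corner)
    # Project the hit point onto the two edges to check the rectangle bounds.
    u = dot(rel, edge_u) / dot(edge_u, edge_u)
    v = dot(rel, edge_v) / dot(edge_v, edge_v)
    return 0.0 <= u <= 1.0 and 0.0 <= v <= 1.0

def apply_obstacle_effect(samples, is_obstacle, gain=0.5):
    """Hypothetical effect: attenuate the signal when the path is blocked."""
    return [s * gain for s in samples] if is_obstacle else list(samples)

# Usage: a 2 x 2 wall at x = 1 between a source at the origin and a listener at x = 2.
wall = ((1.0, -1.0, -1.0), (0.0, 2.0, 0.0), (0.0, 0.0, 2.0))
blocked = blocks_path((0.0, 0.0, 0.0), (2.0, 0.0, 0.0), *wall)
processed = apply_obstacle_effect([1.0, -1.0, 0.5], blocked)
```

A real renderer would likely replace the flat gain with a frequency-dependent occlusion or diffraction model; the sketch only shows the per-plane decision step.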
-
Publication No.: US20220337968A1
Publication date: 2022-10-20
Application No.: US17590288
Application date: 2022-02-01
Inventor: Dae Young Jang , Kyeongok Kang , Jae-hyoun Yoo , Yong Ju Lee
IPC: H04S7/00
Abstract: A method and system for processing an obstacle effect in a virtual acoustic space are disclosed. The method includes receiving a parameter for an obstacle candidate plane extracted from spatial information, determining, in response to the parameter, whether the obstacle candidate plane is an obstacle related to a path between a position of a virtual sound source and a position of a user, and applying a sound effect according to the obstacle to an audio signal when the obstacle candidate plane is the obstacle. The obstacle candidate plane is a plane of an object that may become the obstacle in a sound propagation path between the virtual sound source and the user.
-
Publication No.: US11456002B2
Publication date: 2022-09-27
Application No.: US17018295
Application date: 2020-09-11
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
Inventor: Tae Jin Lee , Seung-Kwon Baek , Min Je Kim , Dae Young Jang , Jeongil Seo , Kyeongok Kang , Jin-Woo Hong , Hochong Park , Young-cheol Park
Abstract: Provided are an apparatus and a method for integrally encoding and decoding a speech signal and an audio signal. The encoding apparatus may include: an input signal analyzer to analyze a characteristic of an input signal; a first conversion encoder to convert the input signal to a frequency domain signal, and to encode the input signal when the input signal is an audio characteristic signal; a Linear Predictive Coding (LPC) encoder to perform LPC encoding of the input signal when the input signal is a speech characteristic signal; and a bitstream generator to generate a bitstream using an output.
-
Publication No.: US11195537B2
Publication date: 2021-12-07
Application No.: US16747533
Application date: 2020-01-21
Applicant: INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY , WILUS INSTITUTE OF STANDARDS AND TECHNOLOGY INC. , ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
Inventor: Taegyu Lee , Hyunoh Oh , Youngcheol Park , Daehee Youn , Jeongil Seo , Yongju Lee , Seungkwon Beack , Kyeongok Kang , Daeyoung Jang
IPC: G10L19/008 , H04S3/00 , H04R5/033 , H04R3/00
Abstract: The present invention relates to a method and an apparatus for binaural rendering of an audio signal using variable order filtering in the frequency domain. To this end, provided are a method for processing an audio signal including: receiving an input audio signal; receiving a set of truncated subband filter coefficients for filtering each subband signal of the input audio signal, the set of truncated subband filter coefficients being constituted by one or more FFT filter coefficients generated by performing FFT by a predetermined block size; generating at least one subframe for each subband; generating at least one filtered subframe for each subband; performing inverse FFT on the filtered subframe for each subband; and generating a filtered subband signal by overlap-adding the transformed subframe for each subband; and an apparatus for processing an audio signal using the same.
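The subframe, FFT, multiply, inverse FFT, and overlap-add sequence named in this abstract is the classic overlap-add fast convolution. The sketch below illustrates it for a single subband under simplifying assumptions: the truncated filter fits in one block (the abstract allows one or more FFT coefficient blocks), the block size is a power of two, and the FFT size is twice the block size. The function names and the small pure-Python FFT are illustrative only and are not taken from the patent.

```python
import cmath

def fft(x, inverse=False):
    """Recursive radix-2 FFT (unscaled); len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    sign = 1 if inverse else -1
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    out = [0j] * n
    for k in range(n // 2):
        tw = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + tw
        out[k + n // 2] = even[k] - tw
    return out

def ifft(x):
    """Inverse FFT with 1/n scaling."""
    n = len(x)
    return [v / n for v in fft(x, inverse=True)]

def overlap_add_filter(signal, taps, block_size):
    """Filter one subband signal by overlap-add fast convolution.
    Assumption: the truncated filter has at most block_size taps."""
    if block_size < 1 or block_size & (block_size - 1):
        raise ValueError("block_size must be a power of two")
    if not 1 <= len(taps) <= block_size:
        raise ValueError("filter must fit in one block")
    fft_size = 2 * block_size
    # FFT filter coefficients: computed once and reused for every subframe.
    coeffs = fft(list(taps) + [0] * (fft_size - len(taps)))
    out = [0j] * (len(signal) + len(taps) - 1)
    for start in range(0, len(signal), block_size):
        # Subframe: one block of input, zero-padded to the FFT size.
        sub = list(signal[start:start + block_size])
        spectrum = fft(sub + [0] * (fft_size - len(sub)))
        # Filtered subframe: bin-by-bin product, then back to the time domain.
        filtered = ifft([a * b for a, b in zip(spectrum, coeffs)])
        # Overlap-add: the tail of each block sums into the next block's head.
        for i, v in enumerate(filtered):
            if start + i < len(out):
                out[start + i] += v
    return out

# Usage: filter a short subband signal with a 3-tap filter in blocks of 4.
result = overlap_add_filter([1, 2, 3, 4, 5, 6, 7], [1, -1, 0.5], 4)
```

The result equals the direct linear convolution of the signal with the filter, up to floating-point rounding. Complex arithmetic is used throughout because subband signals from a complex filterbank are generally complex-valued.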
-
Publication No.: US10410646B2
Publication date: 2019-09-10
Application No.: US15625623
Application date: 2017-06-16
Inventor: Jeongil Seo , Seungkwon Beack , Kyeongok Kang , Jin Woo Hong , Jinwoong Kim , Chieteuk Ahn , Kwangki Kim , Minsoo Hahn
IPC: G10L19/20 , G10L19/008
Abstract: A multi-object audio encoding and decoding apparatus supporting a post downmix signal may be provided. The multi-object audio encoding apparatus may include: an object information extraction and downmix generation unit to generate object information and a downmix signal from input object signals; a parameter determination unit to determine a downmix information parameter using the extracted downmix signal and the post downmix signal; and a bitstream generation unit to combine the object information and the downmix information parameter, and to generate an object bitstream.
-
Publication No.: US12148438B2
Publication date: 2024-11-19
Application No.: US17373243
Application date: 2021-07-12
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
Inventor: Seung Kwon Beack , Tae Jin Lee , Min Je Kim , Dae Young Jang , Kyeongok Kang , Jin Woo Hong , Ho Chong Park , Young-cheol Park
IPC: G10L19/022 , G10L19/02 , G10L19/18
Abstract: An encoding apparatus and a decoding apparatus in a transform between a Modified Discrete Cosine Transform (MDCT)-based coder and a different coder are provided. The encoding apparatus may encode additional information to restore an input signal encoded according to the MDCT-based coding scheme, when switching occurs between the MDCT-based coder and the different coder. Accordingly, an unnecessary bitstream may be prevented from being generated, and minimum additional information may be encoded.
-
Publication No.: US12014744B2
Publication date: 2024-06-18
Application No.: US17517630
Application date: 2021-11-02
Applicant: INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY , WILUS INSTITUTE OF STANDARDS AND TECHNOLOGY INC. , ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
Inventor: Taegyu Lee , Hyunoh Oh , Youngcheol Park , Daehee Youn , Jeongil Seo , Yongju Lee , Seungkwon Beack , Kyeongok Kang , Daeyoung Jang
IPC: G10L19/008 , H04R3/00 , H04R5/033 , H04S3/00
CPC classification number: G10L19/008 , H04R5/033 , H04S3/00 , H04S3/002 , H04S3/004 , H04S3/008 , G10H2250/111 , G10H2250/145 , H04R3/00 , H04S2400/01 , H04S2420/01 , H04S2420/03
Abstract: The present invention relates to a method and an apparatus for binaural rendering of an audio signal using variable order filtering in the frequency domain. To this end, provided are a method for processing an audio signal including: receiving an input audio signal; receiving a set of truncated subband filter coefficients for filtering each subband signal of the input audio signal, the set of truncated subband filter coefficients being constituted by one or more FFT filter coefficients generated by performing FFT by a predetermined block size; generating at least one subframe for each subband; generating at least one filtered subframe for each subband; performing inverse FFT on the filtered subframe for each subband; and generating a filtered subband signal by overlap-adding the transformed subframe for each subband; and an apparatus for processing an audio signal using the same.
-