Abstract:
Provided are an apparatus and a method for integrally encoding and decoding a speech signal and an audio signal. The encoding apparatus may include: an input signal analyzer to analyze a characteristic of an input signal; a first conversion encoder to convert the input signal to a frequency-domain signal and to encode it when the input signal is an audio characteristic signal; a Linear Predictive Coding (LPC) encoder to perform LPC encoding of the input signal when the input signal is a speech characteristic signal; and a bitstream generator to generate a bitstream using an output signal of the first conversion encoder and an output signal of the LPC encoder.
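The analyzer-plus-dispatch structure above can be illustrated with a toy sketch. The zero-crossing-rate heuristic, the threshold, and all names below are assumptions standing in for the patent's (unspecified) input signal analyzer, not the claimed method:

```python
import numpy as np

def classify_and_encode(frame, transform_encode, lpc_encode, zcr_threshold=0.1):
    """Toy dispatcher: a zero-crossing-rate heuristic (an assumption, not
    the patent's analyzer) routes the frame to a frequency-domain coder or
    an LPC coder."""
    # Fraction of adjacent samples whose sign differs.
    zcr = np.mean(np.abs(np.diff(np.sign(frame))) > 0)
    if zcr > zcr_threshold:
        return transform_encode(frame)   # treated as an audio characteristic signal
    return lpc_encode(frame)             # treated as a speech characteristic signal
```

In a real codec the classifier would be far more elaborate (e.g. spectral and tonality features); this only shows the two-branch encoding path feeding one bitstream generator.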
Abstract:
The present invention relates to a method and an apparatus for processing a signal, used to effectively reproduce an audio signal, and more particularly to a method and an apparatus for filtering input audio signals with low computational complexity. To this end, provided is a method for processing an audio signal, including: receiving an input audio signal; receiving truncated subband filter coefficients for filtering each subband signal of the input audio signal, the truncated subband filter coefficients being at least a portion of subband filter coefficients obtained from binaural room impulse response (BRIR) filter coefficients for binaural filtering of the input audio signal, the lengths of the truncated subband filter coefficients being determined based on filter order information obtained at least partially from characteristic information extracted from the corresponding subband filter coefficients, and the truncated subband filter coefficients being constituted by at least one FFT filter coefficient on which a fast Fourier transform (FFT) with a predetermined block size in the corresponding subband has been performed; performing the fast Fourier transform of the subband signal based on a predetermined subframe size in the corresponding subband; generating a filtered subframe by multiplying the fast-Fourier-transformed subframe by the FFT filter coefficients; inverse fast Fourier transforming the filtered subframe; and generating a filtered subband signal by overlap-adding the at least one inverse-fast-Fourier-transformed subframe. Also provided is an apparatus for processing an audio signal using the same.
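The per-subband fast convolution described above amounts to block-wise FFT filtering with overlap-add. The function below is a minimal sketch under the assumption that the truncated filter is no longer than the block size; the names and the exact zero-padding scheme are illustrative, not the patent's procedure:

```python
import numpy as np

def subband_fast_convolution(subband_signal, trunc_filter, block_size):
    """Block-wise FFT filtering with overlap-add for one subband.
    Assumes len(trunc_filter) <= block_size so a 2*block_size FFT
    avoids circular-convolution wrap-around."""
    fft_size = 2 * block_size
    filt = np.fft.rfft(trunc_filter, fft_size)   # precomputed FFT filter coefficients
    out = np.zeros(len(subband_signal) + len(trunc_filter) - 1)
    for start in range(0, len(subband_signal), block_size):
        frame = subband_signal[start:start + block_size]
        spec = np.fft.rfft(frame, fft_size)      # FFT of the subframe
        seg = np.fft.irfft(spec * filt, fft_size)  # multiply, then inverse FFT
        end = min(start + fft_size, len(out))
        out[start:end] += seg[:end - start]      # overlap-add the filtered subframes
    return out
```

The result equals a direct linear convolution, but each subframe only costs two FFTs and one spectral multiply, which is the low-complexity point of the abstract.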
Abstract:
Disclosed is an object-based audio content generating/playing apparatus. The apparatus may include an object audio signal obtaining unit to obtain a plurality of object audio signals by recording a plurality of sound source signals, a recording space information obtaining unit to obtain recording space information with respect to a recording space of the plurality of sound source signals, a sound source location information obtaining unit to obtain location information of the plurality of sound source signals, and an encoding unit to generate object-based audio content by encoding at least one of the plurality of object audio signals, the recording space information, and the sound source location information, thereby enabling the object-based audio content to be played using at least one of a Wave Field Synthesis (WFS) scheme and a multi-channel surround scheme regardless of the audience's reproduction environment.
Abstract:
A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process a different window sequence for each situation when performing encoding or decoding, and thereby may improve coding efficiency.
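The frame overlap at a folding point can be illustrated with a power-complementary sine-window cross-fade between the outgoing and incoming frames. This is only a sketch: the actual USAC transition windows and folding behavior are defined in the standard, and the names here are assumptions:

```python
import numpy as np

def fold_point_crossfade(prev_tail, next_head):
    """Overlap the outgoing frame's tail and the incoming frame's head
    around a folding point using the sine-window pair, which satisfies
    w_out**2 + w_in**2 == 1 (the power-complementary condition used by
    MDCT-style overlap)."""
    n = len(prev_tail)
    t = (np.arange(n) + 0.5) / n
    w_out = np.cos(0.5 * np.pi * t)   # fades 1 -> 0 across the overlap
    w_in = np.sin(0.5 * np.pi * t)    # fades 0 -> 1 across the overlap
    return prev_tail * w_out + next_head * w_in
```

In the codec, choosing a different window shape per mode-switch situation (long-to-short, transform-to-LPC, etc.) is what the abstract means by processing a different window sequence for each situation.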
Abstract:
An audio rendering method and an electronic device performing the same are disclosed. The disclosed audio rendering method includes determining an air absorption attenuation amount of an audio signal based on a recording distance included in metadata of the audio signal and a source distance between a sound source of the audio signal and a listener; and rendering the audio signal based on the air absorption attenuation amount.
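One plausible reading of the abstract is that absorption should be applied only over the extra distance beyond the recording distance, since the recording already includes absorption up to that point. The sketch below assumes that reading and a hypothetical per-band absorption coefficient array in dB per metre; none of this is the patent's exact formula:

```python
import numpy as np

def air_absorption_gain(recording_distance, source_distance, alpha_db_per_m):
    """Per-band linear gains for air absorption over the extra propagation
    distance (source distance minus recording distance). alpha_db_per_m is
    a hypothetical per-band absorption coefficient array in dB/m."""
    extra = source_distance - recording_distance      # negative extra -> boost
    atten_db = np.asarray(alpha_db_per_m) * extra     # dB attenuation per band
    return 10.0 ** (-atten_db / 20.0)                 # convert dB to linear gain
```

Rendering would then multiply each frequency band of the audio signal by the corresponding gain before spatialization.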
Abstract:
Provided is an encoding apparatus for integrally encoding and decoding a speech signal and an audio signal, which may include: an input signal analyzer to analyze a characteristic of an input signal; a stereo encoder to downmix the input signal to a mono signal when the input signal is a stereo signal, and to extract stereo sound image information; a frequency band expander to expand a frequency band of the input signal; a sampling rate converter to convert a sampling rate; a speech signal encoder to encode the input signal using a speech encoding module when the input signal is a speech characteristic signal; an audio signal encoder to encode the input signal using an audio encoding module when the input signal is an audio characteristic signal; and a bitstream generator to generate a bitstream.
Abstract:
A method of rendering object-based audio and an electronic device for performing the method are disclosed. The method includes identifying metadata of the object-based audio, determining whether the metadata includes a parameter set for an atmospheric absorption effect for each distance, and, when the metadata includes the parameter set, rendering the object-based audio using a distance between the object-based audio and a listener obtained from the metadata and the atmospheric absorption effect of medium attenuation based on the parameter set.
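When metadata tabulates an absorption parameter set per distance, the renderer needs parameters for the listener's actual distance, which generally falls between tabulated entries. The sketch below assumes simple linear interpolation between the two nearest tabulated distances; the table layout and interpolation rule are assumptions, not the patent's specification:

```python
import numpy as np

def absorption_from_table(distances, param_table, query_distance):
    """Pick medium-attenuation parameters for the current listener distance
    by linearly interpolating a per-distance parameter table. Rows of
    param_table align with entries of distances (both hypothetical)."""
    distances = np.asarray(distances, dtype=float)
    param_table = np.asarray(param_table, dtype=float)
    # Interpolate each parameter column independently over distance.
    return np.array([np.interp(query_distance, distances, param_table[:, k])
                     for k in range(param_table.shape[1])])
```

Outside the tabulated range, `np.interp` clamps to the nearest row, which is a reasonable default for a renderer.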
Abstract:
Provided is an acoustic signal processing device for a spatially extended sound source and a method thereof. The acoustic signal processing device includes a memory configured to store instructions, and a processor electrically connected to the memory and configured to execute the instructions. When the instructions are executed by the processor, the processor performs a plurality of operations, and the plurality of operations includes transforming an object provided as a spatially extended sound source into a cuboid in a virtual reality (VR) space, obtaining coordinates of the cuboid, and determining a position of a sound source of the object based on the coordinates of the cuboid and coordinates of a user in the VR space.
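The abstract states that the source position is determined from the cuboid coordinates and the user coordinates, without fixing the rule. One common choice for an axis-aligned box, used purely as an illustrative assumption here, is the point of the cuboid closest to the listener, obtained by clamping the listener coordinates to the box bounds:

```python
import numpy as np

def source_position_on_cuboid(box_min, box_max, listener):
    """One plausible source position for a spatially extended source
    modelled as an axis-aligned cuboid: the cuboid point nearest the
    listener. The nearest-point rule is an assumption, not the patent's
    stated method."""
    return np.clip(np.asarray(listener, dtype=float),
                   np.asarray(box_min, dtype=float),
                   np.asarray(box_max, dtype=float))
```

If the listener is inside the cuboid, this returns the listener position itself, which matches the intuition that an extended source surrounds a listener standing within it.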
Abstract:
A method and apparatus for performing binaural rendering of an audio signal are provided. The method includes identifying an object-based input signal and metadata that includes distance information indicating a distance to the object, generating a binaural filter based on the metadata using a binaural room impulse response, obtaining a binaural filter to which a low-pass filter (LPF) is applied using a frequency response control based on the distance information, and generating a binaural-rendered output signal by performing a convolution of the input signal and the binaural filter to which the LPF is applied.
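The pipeline above can be sketched end to end: low-pass the binaural filter with a cutoff controlled by distance, then convolve. The distance-to-cutoff mapping and the one-pole LPF below are illustrative assumptions; the patent does not specify either:

```python
import numpy as np

def distance_lpf_binaural(input_sig, brir, distance, fs=48000.0):
    """Apply a one-pole low-pass to the binaural filter, with a cutoff
    that falls as distance grows (hypothetical control curve), then
    convolve the input with the filtered BRIR."""
    cutoff = max(1000.0, 20000.0 / (1.0 + distance))  # assumed frequency response control
    a = np.exp(-2.0 * np.pi * cutoff / fs)            # one-pole smoothing coefficient
    filt = np.empty_like(np.asarray(brir, dtype=float))
    y = 0.0
    for i, x in enumerate(brir):                      # IIR low-pass over the BRIR taps
        y = (1.0 - a) * x + a * y
        filt[i] = y
    return np.convolve(input_sig, filt)               # binaural-rendered output
```

A production renderer would do this per ear and per band (and likely in the frequency domain), but the structure -- metadata-driven filter shaping followed by convolution -- matches the abstract.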