Patent search: ap:("Tencent Technology (Shenzhen) Company Limited") AND inv:"Yupeng SHI", page 1

1.

Invention publication
AUDIO PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE, AND COMPUTER-READABLE STORAGE MEDIUM (Status: pending, published)

Publication number: US20240296854A1

Publication date: 2024-09-05

Application number: US18647394

Filing date: 2024-04-26

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Qingbo HUANG, Yuyong KANG, Wei XIAO, Meng WANG, Yupeng SHI

IPC classification: G10L19/02, G10L19/032

CPC classification: G10L19/0204, G10L19/032

Abstract: An audio processing method includes: filtering an audio signal to obtain a low-frequency signal and a high-frequency signal; encoding the low-frequency signal to obtain a bitstream of the low-frequency signal; performing frequency domain transform on the low-frequency signal and the high-frequency signal respectively, to obtain a low-frequency spectrum and a high-frequency spectrum; performing spectral envelope extraction on the low-frequency spectrum and the high-frequency spectrum to obtain spectral envelope information, and performing spectral flatness extraction on the high-frequency spectrum to obtain spectral flatness information; and performing quantization encoding on the spectral flatness information and the spectral envelope information to obtain a bandwidth extension bitstream, and combining the bandwidth extension bitstream and the bitstream of the low-frequency signal into an encoded bitstream.
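
The two spectral measurements named in this abstract can be illustrated with a minimal sketch. It is not the applicant's implementation: the naive DFT, the equal-width band layout, and the function names are all assumptions made for illustration, and the band-splitting filter and the quantization encoding are left out. Spectral flatness is computed with its usual definition, the geometric mean of the power spectrum divided by its arithmetic mean.

```python
import cmath
import math

def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (adequate for short demo frames)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]

def band_envelope(mags, num_bands):
    """Mean power per equal-width band (hypothetical layout; leftover bins are dropped)."""
    size = len(mags) // num_bands
    return [
        sum(m * m for m in mags[b * size:(b + 1) * size]) / size
        for b in range(num_bands)
    ]

def spectral_flatness(mags, eps=1e-12):
    """Geometric mean over arithmetic mean of the power spectrum, in (0, 1]."""
    power = [m * m + eps for m in mags]  # eps avoids log(0) on empty bins
    log_mean = sum(math.log(p) for p in power) / len(power)
    return math.exp(log_mean) / (sum(power) / len(power))
```

A pure tone concentrates its energy in one bin and gives a flatness close to 0, while an impulse has a flat spectrum and gives a flatness close to 1. A decoder doing bandwidth extension could use such a value to decide how noise-like the regenerated high band should be.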

2.

Invention publication
AUDIO CODING METHOD AND APPARATUS, ELECTRONIC DEVICE, AND STORAGE MEDIUM (Status: pending, published)

Publication number: US20240296855A1

Publication date: 2024-09-05

Application number: US18646521

Filing date: 2024-04-25

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Yuyong KANG, Meng WANG, Qingbo HUANG, Yupeng SHI, Wei XIAO

IPC classification: G10L19/26, G10L19/032

CPC classification: G10L19/265, G10L19/032

Abstract: An audio coding method includes: performing feature extraction on an audio signal at a first layer to obtain a signal feature at the first layer; splicing, for an ith layer among N layers, the audio signal and a signal feature at an (i-1)th layer to obtain a spliced feature, and performing feature extraction on the spliced feature at the ith layer to obtain a signal feature at the ith layer, traversing ith layers of the N layers to obtain a signal feature at each layer among the N layers, and a data dimension of the signal feature being less than a data dimension of the audio signal; and coding the signal feature at the first layer and the signal feature at each layer among the N layers separately to obtain a bitstream of the audio signal at each layer.
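
The layer-by-layer data flow in this abstract can be sketched as follows. This is only a toy illustration under stated assumptions: `extract_feature` is a hypothetical average-pooling stand-in for whatever feature extractor the application actually uses, `num_layers` counts the first layer together with the later ones, and the per-layer coding step is omitted.

```python
def extract_feature(vector, out_dim):
    """Stand-in extractor: average-pool `vector` down to `out_dim` values."""
    n = len(vector)
    # Integer chunk boundaries so that every input element is used exactly once.
    bounds = [i * n // out_dim for i in range(out_dim + 1)]
    return [sum(vector[a:b]) / (b - a) for a, b in zip(bounds, bounds[1:])]

def layered_features(audio, num_layers, feature_dim):
    """Return one feature vector per layer, each shorter than `audio`."""
    if not 0 < feature_dim < len(audio):
        raise ValueError("feature_dim must be positive and smaller than the audio frame")
    features = [extract_feature(audio, feature_dim)]  # the first layer sees the audio only
    for _ in range(1, num_layers):
        spliced = list(audio) + features[-1]  # splice the audio with the previous feature
        features.append(extract_feature(spliced, feature_dim))
    return features
```

Each layer yields its own low-dimensional feature, so each could be coded into a separate bitstream, which is the property the abstract relies on.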

3.

Invention publication
AUDIO SIGNAL PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE, AND COMPUTER-READABLE STORAGE MEDIUM (Status: pending, published)

Publication number: US20240276160A1

Publication date: 2024-08-15

Application number: US18647430

Filing date: 2024-04-26

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Tingzhao WU, Wei XIAO, Yuyong KANG, Yupeng SHI, Shidong SHANG, Zurong WU

IPC classification: H04R25/00, A61B5/12, G06F3/16

CPC classification: H04R25/505, A61B5/123, G06F3/165, H04R25/558, H04R25/70, H04R2225/41, H04R2225/43, H04R2225/55, H04R2430/01

Abstract: An audio signal processing method includes: displaying a hearing test control in a human-computer interaction interface; outputting a first test audio signal in response to a trigger operation on the hearing test control; displaying a first hearing test result of a target object in response to a feedback operation on the first test audio signal; and transmitting, to an audio device in response to a configuration operation on the audio device, a first hearing assistance policy generated according to the first hearing test result. The first hearing assistance policy is configured to be applied to the audio device to output a first audio signal adapted to the first hearing test result.

4.

Invention application
SPEECH ENHANCEMENT METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM (Status: granted)

Publication number: US20230097520A1

Publication date: 2023-03-30

Application number: US18076047

Filing date: 2022-12-06

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Wei XIAO, Yupeng SHI, Meng WANG

IPC classification: G10L13/02, G10L21/0316, G06N3/045

Abstract: A speech enhancement method includes: performing pre-enhancement on a target speech frame according to a complex spectrum corresponding to the target speech frame, to obtain a first complex spectrum; performing speech decomposition on the target speech frame according to the first complex spectrum, to obtain a glottal parameter, a gain, and an excitation signal that correspond to the target speech frame; and performing synthesis according to the glottal parameter, the gain, and the excitation signal, to obtain an enhanced speech signal corresponding to the target speech frame.
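
The final synthesis step can be illustrated with the classic source-filter model. The sketch below assumes, without confirmation from the abstract, that the glottal parameter is a set of linear-prediction coefficients describing an all-pole filter. The function name and the `history` argument are hypothetical, and the pre-enhancement and decomposition stages are not modeled.

```python
def synthesize_frame(lpc, gain, excitation, history=None):
    """All-pole synthesis: s[n] = gain * e[n] - sum_k lpc[k] * s[n-1-k].

    `history` holds earlier output samples (most recent last) and, if given,
    must contain at least len(lpc) samples. It defaults to silence.
    """
    past = list(history) if history is not None else [0.0] * len(lpc)
    out = []
    for e in excitation:
        # past[-1 - k] is the output sample from k + 1 steps ago.
        s = gain * e - sum(a * past[-1 - k] for k, a in enumerate(lpc))
        out.append(s)
        past.append(s)
    return out
```

For example, with a single coefficient of -0.5 and a gain of 2, a unit impulse as excitation produces a geometrically decaying output: the gain sets the level and the filter coefficient sets the spectral shape.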

5.

Invention publication
AUDIO CODING METHOD AND APPARATUS, AUDIO DECODING METHOD AND APPARATUS, ELECTRONIC DEVICE, COMPUTER-READABLE STORAGE MEDIUM, AND COMPUTER PROGRAM PRODUCT (Status: pending, published)

Publication number: US20240274144A1

Publication date: 2024-08-15

Application number: US18643717

Filing date: 2024-04-23

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Yupeng SHI, Wei XIAO, Meng WANG, Yuyong KANG, Qingbo HUANG

IPC classification: G10L19/06, G10L19/02

CPC classification: G10L19/06, G10L19/0204

Abstract: Embodiments of this application provide an audio coding method and apparatus, an audio decoding method and apparatus, an electronic device, and a storage medium, applied to an on-board scene. The audio decoding method includes: obtaining a bitstream of an audio signal; performing label extraction processing on a predicted value of a feature vector of the audio signal associated with the bitstream to obtain a label information vector, a dimension of the label information vector being the same as a dimension of the predicted value of the feature vector; performing signal reconstruction based on the predicted value of the feature vector and the label information vector; and identifying a predicted value of the audio signal obtained by the signal reconstruction as a decoding result of the bitstream.

6.

Invention publication
AUDIO PROCESSING METHOD AND APPARATUS, DEVICE, STORAGE MEDIUM, AND COMPUTER PROGRAM PRODUCT (Status: pending, published)

Publication number: US20240265928A1

Publication date: 2024-08-08

Application number: US18640393

Filing date: 2024-04-19

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Meng WANG, Wei XIAO, Yuyong KANG, Qingbo HUANG, Yupeng SHI

IPC classification: G10L19/008, G10L19/02, G10L19/032, G10L25/30

CPC classification: G10L19/008, G10L19/0204, G10L19/032, G10L25/30

Abstract: An audio processing method/apparatus includes: performing multichannel signal decomposition on an audio signal to obtain N subband signals of the audio signal, where the frequency bands of the N subband signals increase sequentially and N is an integer greater than 2; performing signal compression on each subband signal of the N subband signals to obtain a subband signal feature of each subband signal; and performing quantization encoding on the subband signal feature of each subband signal to obtain a bitstream of each subband signal.
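
The decompose-then-quantize structure described here can be sketched in a few lines. Everything specific in the sketch is an assumption made for illustration: a Haar octave filterbank stands in for the unspecified multichannel decomposition, the per-subband signal compression step is skipped, and a uniform scalar quantizer stands in for the quantization encoding.

```python
import math

def haar_split(signal):
    """Split an even-length signal into half-rate low and high bands."""
    s = 1 / math.sqrt(2)
    low = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal), 2)]
    high = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal), 2)]
    return low, high

def octave_subbands(signal, levels):
    """Return levels + 1 subbands, ordered from lowest to highest frequency.

    The signal length must be divisible by 2 ** levels.
    """
    bands = []
    low = list(signal)
    for _ in range(levels):
        low, high = haar_split(low)
        bands.insert(0, high)  # each later split covers a lower band, so it goes in front
    bands.insert(0, low)
    return bands

def quantize(band, step):
    """Uniform scalar quantization to integer indices (stand-in encoder)."""
    return [round(x / step) for x in band]
```

With `levels` of at least 2 this gives N greater than 2 subbands in increasing frequency order. A different step size per band would give each band its own bit budget, which is one plausible motivation for coding the bands separately.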

7.

Invention application
SPEECH ENHANCEMENT METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM (Status: granted)

Publication number: US20230050519A1

Publication date: 2023-02-16

Application number: US17977772

Filing date: 2022-10-31

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Wei XIAO, Yupeng SHI, Meng WANG, Shidong SHANG, Zurong WU

IPC classification: G10L21/0264, G10L21/034, G10L25/30, G10L21/0232, G10L21/0364

Abstract: A speech enhancement method includes: determining a glottal parameter corresponding to a target speech frame according to a frequency domain representation of the target speech frame; determining a gain corresponding to the target speech frame according to a gain corresponding to a historical speech frame of the target speech frame; determining an excitation signal corresponding to the target speech frame according to the frequency domain representation of the target speech frame; and synthesizing the glottal parameter corresponding to the target speech frame, the gain corresponding to the target speech frame, and the excitation signal corresponding to the target speech frame, to obtain an enhanced speech signal corresponding to the target speech frame.
