-
1.
公开(公告)号:US20240296854A1
公开(公告)日:2024-09-05
申请号:US18647394
申请日:2024-04-26
发明人: Qingbo HUANG , Yuyong KANG , Wei XIAO , Meng WANG , Yupeng SHI
IPC分类号: G10L19/02 , G10L19/032
CPC分类号: G10L19/0204 , G10L19/032
摘要: An audio processing method includes: filtering an audio signal to obtain a low-frequency signal and a high-frequency signal; encoding the low-frequency signal to obtain a bitstream of the low-frequency signal; performing frequency domain transform on the low-frequency signal and the high-frequency signal respectively, to obtain a low-frequency spectrum and a high-frequency spectrum; performing spectral envelope extraction on the low-frequency spectrum and the high-frequency spectrum to obtain spectral envelope information, and performing spectral flatness extraction on the high-frequency spectrum to obtain spectral flatness information; and performing quantization encoding on the spectral flatness information and the spectral envelope information to obtain a bandwidth extension bitstream, and combining the bandwidth extension bitstream and the bitstream of low-frequency signal into an encoded bitstream.
-
公开(公告)号:US20240296855A1
公开(公告)日:2024-09-05
申请号:US18646521
申请日:2024-04-25
发明人: Yuyong KANG , Meng WANG , Qingbo HUANG , Yupeng SHI , Wei XIAO
IPC分类号: G10L19/26 , G10L19/032
CPC分类号: G10L19/265 , G10L19/032
摘要: An audio coding method includes: performing feature extraction on an audio signal at a first layer to obtain a signal feature at the first layer; splicing, for an ith layer among N layers, the audio signal and a signal feature at an (i-1)th layer to obtain a spliced feature, and performing feature extraction on the spliced feature at the ith layer to obtain a signal feature at the ith layer, traversing ith layers of the N layers to obtain a signal feature at each layer among the N layers, and a data dimension of the signal feature being less than a data dimension of the audio signal; and coding the signal feature at the first layer and the signal feature at each layer among the N layers separately to obtain a bitstream of the audio signal at each layer.
-
3.
公开(公告)号:US20240276160A1
公开(公告)日:2024-08-15
申请号:US18647430
申请日:2024-04-26
发明人: Tingzhao WU , Wei XIAO , Yuyong KANG , Yupeng SHI , Shidong SHANG , Zurong WU
CPC分类号: H04R25/505 , A61B5/123 , G06F3/165 , H04R25/558 , H04R25/70 , H04R2225/41 , H04R2225/43 , H04R2225/55 , H04R2430/01
摘要: An audio signal processing method includes: displaying a hearing test control in a human-computer interaction interface; outputting a first test audio signal in response to a trigger operation on the hearing test control; displaying a first hearing test result of a target object in response to a feedback operation on the first test audio signal; and transmitting, to an audio device in response to a configuration operation on the audio device, a first hearing assistance policy generated according to the first hearing test result. The first hearing assistance policy is configured to be applied to the audio device to output a first audio signal adapted to the first hearing test result.
-
公开(公告)号:US20230097520A1
公开(公告)日:2023-03-30
申请号:US18076047
申请日:2022-12-06
发明人: Wei XIAO , Yupeng SHI , Meng WANG
IPC分类号: G10L13/02 , G10L21/0316 , G06N3/045
摘要: A speech enhancement method includes: performing pre-enhancement on a target speech frame according to a complex spectrum corresponding to the target speech frame, to obtain a first complex spectrum; performing speech decomposition on the target speech frame according to the first complex spectrum, to obtain a glottal parameter, a gain, and an excitation signal that correspond to the target speech frame; and performing synthesis according to the glottal parameter, the gain, and the excitation signal, to obtain an enhanced speech signal corresponding to the target speech frame.
-
公开(公告)号:US20240274144A1
公开(公告)日:2024-08-15
申请号:US18643717
申请日:2024-04-23
发明人: Yupeng SHI , Wei XIAO , Meng WANG , Yuyong KANG , Qingbo HUANG
CPC分类号: G10L19/06 , G10L19/0204
摘要: Embodiments of this application provide an audio coding method and apparatus, an audio decoding method and apparatus, an electronic device, and a storage medium, applied to an on board scene. The audio decoding method includes obtaining a bitstream of an audio signal; performing label extraction processing on a predicted value of a feature vector of the audio signal associated with the bitstream to obtain a label information vector, a dimension of the label information vector being the same as a dimension of the predicted value of the feature vector; performing signal reconstruction based on the predicted value of the feature vector and the label information vector; and identifying a predicted value of the audio signal obtained by the signal reconstruction as a decoding result of the bitstream.
-
6.
公开(公告)号:US20240265928A1
公开(公告)日:2024-08-08
申请号:US18640393
申请日:2024-04-19
发明人: Meng WANG , Wei XIAO , Yuyong KANG , Qingbo HUANG , Yupeng SHI
IPC分类号: G10L19/008 , G10L19/02 , G10L19/032 , G10L25/30
CPC分类号: G10L19/008 , G10L19/0204 , G10L19/032 , G10L25/30
摘要: An audio processing method/apparatus including performing multichannel signal decomposition on an audio signal to obtain N subband signals of the audio signal, the frequency bands of the N subband signals increase sequentially and N is an integer greater than 2, performing signal compression on each subband signal of the N subband signals to obtain a subband signal feature of each subband signal; and performing quantization encoding on the subband signal feature of each subband signal to obtain a bitstream of each subband signal.
-
公开(公告)号:US20230050519A1
公开(公告)日:2023-02-16
申请号:US17977772
申请日:2022-10-31
发明人: Wei XIAO , Yupeng SHI , Meng WANG , Shidong SHANG , Zurong WU
IPC分类号: G10L21/0264 , G10L21/034 , G10L25/30 , G10L21/0232 , G10L21/0364
摘要: A speech enhancement method includes: determining a glottal parameter corresponding to a target speech frame according to a frequency domain representation of the target speech frame; determining a gain corresponding to the target speech frame according to a gain corresponding to a historical speech frame of the target speech frame; determining an excitation signal corresponding to the target speech frame according to the frequency domain representation of the target speech frame; and synthesizing the glottal parameter corresponding to the target speech frame, the gain corresponding to the target speech frame, and the excitation signal corresponding to the target speech frame, to obtain an enhanced speech signal corresponding to the target speech frame.
-
-
-
-
-
-