AUDIO PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE, AND COMPUTER-READABLE STORAGE MEDIUM

    公开(公告)号:US20240296854A1

    公开(公告)日:2024-09-05

    申请号:US18647394

    申请日:2024-04-26

    IPC分类号: G10L19/02 G10L19/032

    CPC分类号: G10L19/0204 G10L19/032

    摘要: An audio processing method includes: filtering an audio signal to obtain a low-frequency signal and a high-frequency signal; encoding the low-frequency signal to obtain a bitstream of the low-frequency signal; performing frequency domain transform on the low-frequency signal and the high-frequency signal respectively, to obtain a low-frequency spectrum and a high-frequency spectrum; performing spectral envelope extraction on the low-frequency spectrum and the high-frequency spectrum to obtain spectral envelope information, and performing spectral flatness extraction on the high-frequency spectrum to obtain spectral flatness information; and performing quantization encoding on the spectral flatness information and the spectral envelope information to obtain a bandwidth extension bitstream, and combining the bandwidth extension bitstream and the bitstream of low-frequency signal into an encoded bitstream.

    AUDIO CODING METHOD AND APPARATUS, ELECTRONIC DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20240296855A1

    公开(公告)日:2024-09-05

    申请号:US18646521

    申请日:2024-04-25

    IPC分类号: G10L19/26 G10L19/032

    CPC分类号: G10L19/265 G10L19/032

    摘要: An audio coding method includes: performing feature extraction on an audio signal at a first layer to obtain a signal feature at the first layer; splicing, for an ith layer among N layers, the audio signal and a signal feature at an (i-1)th layer to obtain a spliced feature, and performing feature extraction on the spliced feature at the ith layer to obtain a signal feature at the ith layer, traversing ith layers of the N layers to obtain a signal feature at each layer among the N layers, and a data dimension of the signal feature being less than a data dimension of the audio signal; and coding the signal feature at the first layer and the signal feature at each layer among the N layers separately to obtain a bitstream of the audio signal at each layer.

    SPEECH ENHANCEMENT METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20230097520A1

    公开(公告)日:2023-03-30

    申请号:US18076047

    申请日:2022-12-06

    摘要: A speech enhancement method includes: performing pre-enhancement on a target speech frame according to a complex spectrum corresponding to the target speech frame, to obtain a first complex spectrum; performing speech decomposition on the target speech frame according to the first complex spectrum, to obtain a glottal parameter, a gain, and an excitation signal that correspond to the target speech frame; and performing synthesis according to the glottal parameter, the gain, and the excitation signal, to obtain an enhanced speech signal corresponding to the target speech frame.

    SPEECH ENHANCEMENT METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20230050519A1

    公开(公告)日:2023-02-16

    申请号:US17977772

    申请日:2022-10-31

    摘要: A speech enhancement method includes: determining a glottal parameter corresponding to a target speech frame according to a frequency domain representation of the target speech frame; determining a gain corresponding to the target speech frame according to a gain corresponding to a historical speech frame of the target speech frame; determining an excitation signal corresponding to the target speech frame according to the frequency domain representation of the target speech frame; and synthesizing the glottal parameter corresponding to the target speech frame, the gain corresponding to the target speech frame, and the excitation signal corresponding to the target speech frame, to obtain an enhanced speech signal corresponding to the target speech frame.