Multichannel Audio Speech Classification
    1.
    Invention publication

    Publication (announcement) number: US20240312477A1

    Publication (announcement) date: 2024-09-19

    Application number: US18396788

    Application date: 2023-12-27

    Abstract: Examples of the present disclosure describe systems and methods for multichannel audio speech classification. In examples, an audio signal comprising multiple audio channels is received at a processing device. Each of the audio channels in the audio signal is transcoded to a predefined audio format. For each of the transcoded audio channels, an average power value is calculated for one or more data windows in the audio signal. A correlation value is calculated between the average power value for each audio channel and the combined average power value of the other audio channels in the audio signal. Each of the correlation values (or an aggregated correlation value for the audio channels) is then compared against a threshold value to determine whether the audio signal is to be classified as a speech-based communication. Based on the classification, an action associated with the audio signal may be performed.
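
The pipeline described in this abstract (per-window average power, correlation of each channel against the combined power of the other channels, threshold comparison) might be sketched as follows. This is an illustrative sketch only, not the method claimed in the patent. The function names, the window length, the threshold value, the use of the Pearson coefficient, the averaging of the per-channel correlations, and the direction of the comparison (negative correlation, as in turn-taking conversation, is treated as speech) are all assumptions. The transcoding step is omitted, so the channels are assumed to already be equal-length lists of PCM samples.

```python
from math import sqrt

def window_power(samples, window):
    """Average power (mean of squared samples) for each non-overlapping window."""
    return [
        sum(x * x for x in samples[i:i + window]) / window
        for i in range(0, len(samples) - window + 1, window)
    ]

def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / sqrt(var_a * var_b)

def classify_speech(channels, window=4, threshold=-0.2):
    """Return (is_speech, per_channel_correlations).

    Assumption: channels whose power alternates (one party talks while the
    other listens) correlate negatively, so an aggregated correlation below
    the hypothetical threshold is classified as speech.
    """
    powers = [window_power(ch, window) for ch in channels]
    correlations = []
    for i, own_power in enumerate(powers):
        # Combined per-window power of every channel except channel i.
        others = [sum(vals) for vals in zip(*(p for j, p in enumerate(powers) if j != i))]
        correlations.append(pearson(own_power, others))
    aggregated = sum(correlations) / len(correlations)
    return aggregated < threshold, correlations

# Two channels that take turns, and two identical channels (e.g. stereo music).
caller = [1.0] * 4 + [0.0] * 4 + [1.0] * 4 + [0.0] * 4
agent = [0.0] * 4 + [1.0] * 4 + [0.0] * 4 + [1.0] * 4
turn_taking, turn_corr = classify_speech([caller, agent])
identical, same_corr = classify_speech([caller, caller])
```

A real system would tune the window length and threshold on labelled recordings; the values here are placeholders chosen to keep the example small.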

    Multichannel Audio Speech Classification
    4.
    Invention publication

    Publication (announcement) number: US20230386505A1

    Publication (announcement) date: 2023-11-30

    Application number: US17804606

    Application date: 2022-05-31

    Abstract: Examples of the present disclosure describe systems and methods for multichannel audio speech classification. In examples, an audio signal comprising multiple audio channels is received at a processing device. Each of the audio channels in the audio signal is transcoded to a predefined audio format. For each of the transcoded audio channels, an average power value is calculated for one or more data windows in the audio signal. A correlation value is calculated between the average power value for each audio channel and the combined average power value of the other audio channels in the audio signal. Each of the correlation values (or an aggregated correlation value for the audio channels) is then compared against a threshold value to determine whether the audio signal is to be classified as a speech-based communication. Based on the classification, an action associated with the audio signal may be performed.

    Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder

    Publication (announcement) number: US11832087B2

    Publication (announcement) date: 2023-11-28

    Application number: US17504080

    Application date: 2021-10-18

    Inventors: Zexin Liu, Lei Miao

    Abstract: A multi-channel signal encoding method includes determining a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, and reverberation gain parameters corresponding to different subbands of the first channel signal and the second channel signal, where the obtained reverberation gain parameters belong to at least two reverberation gain parameter groups. The method further includes selecting, from the at least two reverberation gain parameter groups, a target reverberation gain parameter group. The method further includes generating parameter indication information, where the parameter indication information indicates the target reverberation gain parameter group. The method further includes encoding reverberation gain parameters corresponding to the target reverberation gain parameter group, the parameter indication information, and the downmixed signal to obtain a bitstream.

    ENCODING AND DECODING OF AUDIO SIGNALS
    9.
    Invention application

    Publication (announcement) number: US20170243595A1

    Publication (announcement) date: 2017-08-24

    Application number: US15519007

    Application date: 2015-10-23

    Abstract: An audio signal (X) is represented by a bitstream (B) segmented into frames. An audio processing system (500) comprises a buffer (510) and a decoding section (520). The buffer joins sets of audio data (D1, D2, ..., DN) carried by N respective frames (F1, F2, ..., FN) into one decodable set of audio data (D) corresponding to a first frame rate and to a first number of samples of the audio signal per frame. The frames have a second frame rate corresponding to a second number of samples of the audio signal per frame. The first number of samples is N times the second number of samples. The decoding section decodes the decodable set of audio data into a segment of the audio signal by at least employing signal synthesis, based on the decodable set of audio data, with a stride corresponding to the first number of samples of the audio signal.
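
The buffering step in this abstract, where the data sets carried by N short frames are joined into one decodable set before decoding, can be illustrated with a small sketch. This is not the patented implementation. The class name `FrameBuffer`, the helper `samples_per_decodable_set`, the representation of frame payloads as `bytes`, joining by simple concatenation, and the example figures (N = 4, 512 samples per frame) are all hypothetical. The decoding section and its signal synthesis are not modelled.

```python
class FrameBuffer:
    """Collects the payloads of N consecutive frames into one decodable set.

    Assumption: payloads are byte strings and "joining" means concatenation
    in arrival order; the abstract does not specify the joining operation.
    """

    def __init__(self, n):
        if n < 1:
            raise ValueError("n must be at least 1")
        self.n = n
        self.pending = []

    def push(self, payload):
        """Add one frame's payload; return the joined set once N have arrived."""
        self.pending.append(payload)
        if len(self.pending) < self.n:
            return None
        joined = b"".join(self.pending)
        self.pending.clear()
        return joined


def samples_per_decodable_set(samples_per_frame, n):
    """First number of samples = N times the second number of samples."""
    return n * samples_per_frame


# Hypothetical figures: four frames of 512 samples form one 2048-sample set.
buffer = FrameBuffer(4)
results = [buffer.push(p) for p in (b"D1", b"D2", b"D3", b"D4")]
stride = samples_per_decodable_set(512, 4)
```

In this sketch the decoder would only be invoked when `push` returns a joined set, and it would advance by `stride` samples per decoded segment.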

    AUDIO TRANSMITTER AND RECEIVER
    10.
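    Invention application

    Publication (announcement) number: US20170235543A1

    Publication (announcement) date: 2017-08-17

    Application number: US15429140

    Application date: 2017-02-09

    IPC classification: G06F3/16 G10L19/00 G10L19/16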

    Abstract: Disclosed herein is an audio transmitter receiver device. The device includes an audio interface providing an audio signal, the audio signal including at least one of an audio input signal and an audio output signal; a digital communications interface for at least communicating audio information; and an audio codec for transcoding the audio information such that the audio information includes at least a high-quality, distortion-free, lossless representation of the audio signal and the audio signal includes an audio representation of the audio information.