Patent search ap:("QUALCOMM Incorporated") AND inv:"Zisis Iason SKORDILIS" Page 1

1.

发明申请
BUNDLED MULTI-RATE FEEDBACK AUTOENCODER 有权

公开(公告)号：US20250104723A1

公开(公告)日：2025-03-27

申请号：US18728154

申请日：2023-01-23

Applicant: QUALCOMM Incorporated

Inventor： Guillaume Konrad SAUTIERE , Vivek RAJENDRAN , Zisis Iason SKORDILIS

IPC: G10L19/04

Abstract: A method includes generating an input data state for each data sample in a time series of data samples of a portion of an audio data stream. The method also includes providing at least one input data state to a first bottleneck and at least one other input data state to a second bottleneck. The first bottleneck is associated with a first bitrate and the second bottleneck is associated with a second bitrate. The method further includes generating a first encoded frame based on a first output data state from the first bottleneck and a second encoded frame based on a second output data state from the second bottleneck. The first encoded frame and the second encoded frame are bundled in a packet.

2.

发明申请
AUDIO CODING USING MACHINE LEARNING BASED LINEAR FILTERS AND NON-LINEAR NEURAL SOURCES 有权

公开(公告)号：US20240428813A1

公开(公告)日：2024-12-26

申请号：US18689053

申请日：2022-10-10

Applicant: QUALCOMM Incorporated

Inventor： Guillaume Konrad SAUTIERE , Duminda DEWASURENDRA , Zisis Iason SKORDILIS , Vivek RAJENDRAN

IPC: G10L19/12 , G10L25/24 , G10L25/30

Abstract: Systems and techniques are described for coding audio signals. For example, a voice decoder can generate, using a first neural network, an excitation signal for at least one sample of an audio signal at least in part by performing a non-linear operation based on one or more inputs to the first neural network, the excitation signal being configured to excite a learned linear filter. The voice decoder can further generate, using the learned linear filter and the excitation signal, at least one sample of a reconstructed audio signal. For example, a second neural network can be used to generate coefficients for one or more learned linear filters, which receive as input the excitation signal generated by the first neural network trained to perform the non-linear operation.

3.

发明公开
AUDIO SIGNAL RECONSTRUCTION 审中-公开

公开(公告)号：US20240355344A1

公开(公告)日：2024-10-24

申请号：US18684074

申请日：2022-09-09

Applicant: QUALCOMM Incorporated

Inventor： Zisis Iason SKORDILIS , Duminda DEWASURENDRA , Vivek RAJENDRAN

IPC: G10L19/02 , G10L19/032 , G10L25/18 , G10L25/30

CPC classification number: G10L19/0204 , G10L19/032 , G10L25/18 , G10L25/30

Abstract: A method includes receiving audio data that includes magnitude spectrum data descriptive of an audio signal. The method also includes providing the audio data as input to a neural network to generate an initial phase estimate for one or more samples of the audio signal. The method further includes determining, using a phase estimation algorithm, target phase data for the one or more samples of the audio signal based on the initial phase estimate and a magnitude spectrum of the one or more samples of the audio signal indicated by the magnitude spectrum data. The method also includes reconstructing the audio signal based on a target phase of the one or more samples of the audio signal indicated by the target phase data and based on the magnitude spectrum.

4.

发明申请
SAMPLE GENERATION BASED ON JOINT PROBABILITY DISTRIBUTION 有权

公开(公告)号：US20250046323A1

公开(公告)日：2025-02-06

申请号：US18718717

申请日：2022-11-16

Applicant: QUALCOMM Incorporated

Inventor： Zisis Iason SKORDILIS , Vivek RAJENDRAN , Tushar AGARWAL

IPC: G10L19/09 , G10L25/12 , G10L25/30

Abstract: A device includes a neural network and a sample generator. The neural network is configured to process one or more neural network inputs to generate a joint probability distribution. The one or more neural network inputs include at least first previous sample data and second previous sample data associated with at least one previous data sample of a sequence of data samples. The sample generator is configured to generate first sample data and second sample data based on the joint probability distribution. The first sample data and the second sample data are associated with at least one data sample of the sequence of data samples.

5.

发明申请
ARTIFICIAL INTELLIGENCE BASED AUDIO CODING 有权

公开(公告)号：US20210074308A1

公开(公告)日：2021-03-11

申请号：US16709873

申请日：2019-12-10

Applicant: QUALCOMM Incorporated

Inventor： Zisis Iason SKORDILIS , Vivek RAJENDRAN , Guillaume Konrad SAUTIÈRE , Daniel Jared SINDER

IPC: G10L19/13 , G10L19/09 , G10L25/30 , G06N3/08

Abstract: Techniques are described for coding audio signals. For example, using a neural network, a residual signal is generated for a sample of an audio signal based on inputs to the neural network. The residual signal is configured to excite a long-term prediction filter and/or a short-term prediction filter. Using the long-term prediction filter and/or the short-term prediction filter, a sample of a reconstructed audio signal is determined. The sample of the reconstructed audio signal is determined based on the residual signal generated using the neural network for the sample of the audio signal.

6.

发明申请
EFFICIENT PACKET-LOSS PROTECTED DATA ENCODING AND/OR DECODING 有权

公开(公告)号：US20250090963A1

公开(公告)日：2025-03-20

申请号：US18294490

申请日：2022-09-08

Applicant: QUALCOMM Incorporated

Inventor： Zisis Iason SKORDILIS , Vivek RAJENDRAN , Guillaume Konrad SAUTIERE , Duminda DEWASURENDRA , Daniel Jared SINDER

IPC: A63F13/77 , A63F13/35

Abstract: A device includes a memory and one or more processors coupled to the memory and configured to execute instructions from the memory. Execution of the instructions causes the one or more processors to combine two or more data portions to generate input data for a decoder network. A first data portion of the two or more data portions is based on a first encoding of a data sample by a multiple description coding network and content of a second data portion of the two or more data portions depends on whether data based on a second encoding of the data sample by the multiple description coding network is available. Execution of the instructions also causes the one or more processors to obtain, from the decoder network, output data based on the input data and to generate a representation of the data sample based on the output data.

7.

发明申请
SAMPLE GENERATION USING PIPELINED PROCESSING UNITS 有权

公开(公告)号：US20250046295A1

公开(公告)日：2025-02-06

申请号：US18718939

申请日：2022-11-28

Applicant: QUALCOMM Incorporated

Inventor： Vivek RAJENDRAN , Prajakt KULKARNI , Zisis Iason SKORDILIS

IPC: G10L13/047 , G10L13/027 , G10L25/18

Abstract: A device includes a memory configured to store instructions and a processor coupled to the memory. The processor includes a first processing unit configured to perform a first stage of a sample synthesis operation. The processor includes a second processing unit configured to perform a second stage of the sample synthesis operation based on an output of the first processing unit. The processor also includes a sample synthesizer configured to process input data, using the first processing unit and the second processing unit, to generate output data. The first processing unit and the second processing unit are configured to operate in a pipelined configuration that includes performance of the second stage at the second processing unit in parallel with performance of the first stage at the first processing unit.

8.

发明申请
AUDIO CODING USING COMBINATION OF MACHINE LEARNING BASED TIME-VARYING FILTER AND LINEAR PREDICTIVE CODING FILTER 有权

公开(公告)号：US20240428814A1

公开(公告)日：2024-12-26

申请号：US18689054

申请日：2022-10-10

Applicant: QUALCOMM Incorporated

Inventor： Duminda DEWASURENDRA , Guillaume Konrad SAUTIERE , Zisis Iason SKORDILIS , Vivek RAJENDRAN

IPC: G10L19/12 , G10L25/24 , G10L25/30

Abstract: Systems and techniques are described for coding audio signals. For example, a voice decoder can generate, using a neural network, an excitation signal for at least one sample of an audio signal based on one or more inputs to the neural network, the excitation signal being configured to excite a linear predictive coding (LPC) filter. The voice decoder can further generate, using the LPC filter based on the excitation signal, at least one sample of a reconstructed audio signal. For example, the neural network can generate coefficients for one or more linear time-varying filters (e.g., a linear time-varying harmonic filter and a linear time-varying noise filter). The voice decoder can use the one or more linear time-varying filters including the generated coefficients to generate the excitation signal.

9.

发明申请
SYSTEMS AND METHODS FOR MULTI-BAND AUDIO CODING 有权

公开(公告)号：US20240371384A1

公开(公告)日：2024-11-07

申请号：US18689052

申请日：2022-10-10

Applicant: QUALCOMM Incorporated

Inventor： Zisis Iason SKORDILIS , Vivek RAJENDRAN , Duminda DEWASURENDRA , Guillaume Konrad SAUTIERE

IPC: G10L19/26 , G10L19/087

Abstract: Systems and techniques are described for audio coding. An audio system receives feature(s) corresponding an audio signal, for example from an encoder and/or a speech synthesis engine. The audio system generates an excitation signal, such as a harmonic signal and/or a noise signal, based on the feature(s). The audio system uses a filterbank to generate band-specific signals from the excitation signal. The band-specific signals correspond to frequency bands. The audio system inputs the feature(s) into a machine learning (ML) filter estimator to generate parameter(s) associated with linear filter(s). The audio system inputs the feature(s) into a voicing estimator to generate gain value(s). The audio system generates an output audio signal based on modification of the band-specific signals, application of the linear filter(s) according to the parameter(s), and amplification using the gain amplifier(s) according to the gain value(s).

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification