Patent search ap:("QUALCOMM INCORPORATED") AND inv:"Dipanjan Sen" Page 1

1.

发明授权
Six degrees of freedom and three degrees of freedom backward compatibility 有权

公开(公告)号：US11843932B2

公开(公告)日：2023-12-12

申请号：US17329120

申请日：2021-05-24

Applicant: QUALCOMM Incorporated

Inventor： Moo Young Kim , Nils Günther Peters , S M Akramus Salehin , Siddhartha Goutham Swaminathan , Dipanjan Sen

IPC: H04S7/00 , G06T19/00 , G06F3/0346 , H04W4/029 , H04W4/02 , G06F3/01 , G06V20/20

CPC classification number: H04S7/304 , G06F3/012 , G06F3/0346 , G06T19/006 , G06V20/20 , H04W4/027 , H04W4/029 , H04S2420/01

Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.

2.

发明授权
Higher order ambisonic audio data 有权

公开(公告)号：US11270711B2

公开(公告)日：2022-03-08

申请号：US16868259

申请日：2020-05-06

Applicant: QUALCOMM Incorporated

Inventor： Moo Young Kim , Nils Günther Peters , Shankar Thagadur Shivappa , Dipanjan Sen

IPC: G10L19/008 , H04S3/02

Abstract: In general, techniques are described by which to provide priority information for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may perform the techniques. The memory stores HOA coefficients of the HOA audio data, the HOA coefficients representative of a soundfield. The processor may decompose the HOA coefficients into a sound component and a corresponding spatial component, the corresponding spatial component defining shape, width, and directions of the sound component, and the corresponding spatial component defined in a spherical harmonic domain. The processor may also determine, based on one or more of the sound component and the corresponding spatial component, priority information indicative of a priority of the sound component relative to other sound components of the soundfield, and specify, in a data object representative of a compressed version of the HOA audio data, the sound component and the priority information.

3.

发明申请
SPATIAL TRANSFORMATION OF AMBISONIC AUDIO DATA 有权

公开(公告)号：US20220028401A1

公开(公告)日：2022-01-27

申请号：US17493789

申请日：2021-10-04

Applicant: Qualcomm Incorporated

Inventor： Nils Gunther Peters , Moo Young Kim , Dipanjan Sen

IPC: G10L19/008 , G10L19/16 , H04S3/00

Abstract: A device configured to decode a bitstream, where the device includes a memory configured to store a temporally encoded representation of spatial audio signals. The device is also configured to receive the bitstream that includes an indication of a spatial transformation, and includes a temporal decoding unit, coupled to the memory, configured to decode one or more spatial audio signals represented in a spatial domain, where the one or more spatial audio signals are associated with different angles in the spatial domain. In addition, the device includes an inverse spatial transformation unit, coupled to the temporal decoding unit, is configured to convert the one or more spatial audio signals represented in the spatial domain into at least three ambisonic coefficients that, in part, represent a soundfield in an ambisonics domain, and perform a spatial transformation of the soundfield based on the indication of the spatial transformation received in the bitstream.

4.

发明授权
Audio-driven viewport selection 有权

公开(公告)号：US11164606B2

公开(公告)日：2021-11-02

申请号：US15672080

申请日：2017-08-08

Applicant: QUALCOMM Incorporated

Inventor： Nils Günther Peters , Shankar Thagadur Shivappa , Dipanjan Sen

IPC: G11B27/22 , H04N21/81 , H04N21/439 , H04N21/2343 , H04N21/2368 , H04N21/6587 , H04N5/225 , H04N9/82 , H04N5/60 , H04N5/232 , H04N9/802 , H04S7/00 , G06T19/00 , H04N7/01 , H04R1/02 , H04R1/40

Abstract: An example device includes a memory device, and a processor coupled to the memory device. The memory is configured to store audio spatial metadata associated with a soundfield and video data. The processor is configured to identify one or more foreground audio objects of the soundfield using the audio spatial metadata stored to the memory device, and to select, based on the identified one or more foreground audio objects, one or more viewports associated with the video data. Display hardware coupled to the processor and the memory device is configured to output a portion of the video data being associated with the one or more viewports selected by the processor.

5.

发明授权
Signaling layers for scalable coding of higher order ambisonic audio data 有权

公开(公告)号：US11138983B2

公开(公告)日：2021-10-05

申请号：US16557650

申请日：2019-08-30

Applicant: QUALCOMM Incorporated

Inventor： Moo Young Kim , Nils Günther Peters , Dipanjan Sen

IPC: H04R5/04 , G10L19/008 , G10L19/16 , H04S3/00 , H04S5/00

Abstract: In general, techniques are described for signaling layers for scalable coding of higher order ambisonic audio data. A device comprising a memory and a processor may be configured to perform the techniques. The memory may be configured to store the bitstream. The processor may be configured to obtain, from the bitstream, an indication of a number of layers specified in the bitstream, and obtain the layers of the bitstream based on the indication of the number of layers.

6.

发明授权
Ambisonic signal noise reduction for microphone arrays 有权

公开(公告)号：US11026019B2

公开(公告)日：2021-06-01

申请号：US16352272

申请日：2019-03-13

Applicant: QUALCOMM Incorporated

Inventor： S M Akramus Salehin , Dipanjan Sen

IPC: G10L19/16 , H04R1/22 , H04S7/00 , H03G3/24 , H04R3/00 , H04R1/40

Abstract: A device to apply noise reduction to ambisonic signals includes a memory configured to store noise data corresponding to microphones in a microphone array. A processor is configured to perform signal processing operations on signals captured by microphones in the microphone array to generate multiple sets of ambisonic signals including a first set corresponding to a first particular ambisonic order and a second set corresponding to a second particular ambisonic order. The processor is configured to perform a first noise reduction operation that includes applying a first gain factor to each ambisonic signal in the first set and to perform a second noise reduction operation that includes applying a second gain factor to each ambisonic signal in the second set. The first gain factor and the second gain factor are based on the noise data, and the second gain factor is distinct from the first gain factor.

7.

发明授权
Six degrees of freedom and three degrees of freedom backward compatibility 有权

公开(公告)号：US11019449B2

公开(公告)日：2021-05-25

申请号：US16567700

申请日：2019-09-11

Applicant: QUALCOMM Incorporated

Inventor： Moo Young Kim , Nils Günther Peters , S M Akramus Salehin , Siddhartha Goutham Swaminathan , Dipanjan Sen

IPC: H04S7/00 , G06K9/00 , G06F3/0346 , G06T19/00 , G06F3/01 , H04W4/029 , H04W4/02

Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.

8.

发明授权
Signalling beam pattern with objects 有权

公开(公告)号：US10972853B2

公开(公告)日：2021-04-06

申请号：US16719392

申请日：2019-12-18

Applicant: QUALCOMM Incorporated

Inventor： Moo Young Kim , Nils Günther Peters , S M Akramus Salehin , Dipanjan Sen

IPC: H04S7/00 , G10L19/008 , H04R5/02 , H04S3/00 , H04R5/04 , H04R3/12

Abstract: A device for processing coded audio is disclosed. The device is configured to store an audio object and audio object metadata associated with the audio object. The audio object metadata includes frequency dependent beam pattern metadata. The device may apply, based on the frequency dependent beam pattern metadata, a renderer to the audio object to obtain one or more speaker feeds and output the one or more speaker feeds.

9.

发明申请
RENDERING DIFFERENT PORTIONS OF AUDIO DATA USING DIFFERENT RENDERERS 审中-公开

公开(公告)号：US20190394605A1

公开(公告)日：2019-12-26

申请号：US16450660

申请日：2019-06-24

Applicant: QUALCOMM Incorporated

Inventor： Moo Young Kim , Ferdinando Olivieri , Dipanjan Sen

IPC: H04S7/00 , H04R5/02 , G10L19/008 , G10L19/16 , H04S3/02 , H04S3/00

Abstract: In general, techniques are described by which to render different portions of audio data using different renderers. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store audio renderers. The processor(s) may obtain a first audio renderer of the plurality of audio renderers, and apply the first audio renderer with respect to a first portion of the audio data to obtain one or more first speaker feeds. The processor(s) may next obtain a second audio renderer of the plurality of audio renderers, and apply the second audio renderer with respect to a second portion of the audio data to obtain one or more second speaker feeds. The processor(s) may output, to one or more speakers, the one or more first speaker feeds and the one or more second speaker feeds.

10.

发明授权
Inserting audio channels into descriptions of soundfields 有权

公开(公告)号：US10412522B2

公开(公告)日：2019-09-10

申请号：US14663225

申请日：2015-03-19

Applicant: QUALCOMM Incorporated

Inventor： Dipanjan Sen , Nils Günther Peters

IPC: H04S7/00 , G10L19/008 , G10L25/48 , G10L19/018

Abstract: In general, techniques are described for inserting audio channels into descriptions of soundfields. A device comprising a processor may be configured to perform the techniques. The processor may be configured to obtain an audio channel separate from a higher-order ambisonic representation of a soundfield. The processor may further be configured to insert the audio channel at a spatial location within the soundfield such that the audio channel is able to be extracted from the soundfield.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification