-
Publication No.: US11843932B2
Publication date: 2023-12-12
Application No.: US17329120
Filing date: 2021-05-24
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
CPC classification number: H04S7/304, G06F3/012, G06F3/0346, G06T19/006, G06V20/20, H04W4/027, H04W4/029, H04S2420/01
Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.
-
Publication No.: US11270711B2
Publication date: 2022-03-08
Application No.: US16868259
Filing date: 2020-05-06
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim, Nils Günther Peters, Shankar Thagadur Shivappa, Dipanjan Sen
IPC: G10L19/008, H04S3/02
Abstract: In general, techniques are described by which to provide priority information for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may perform the techniques. The memory stores HOA coefficients of the HOA audio data, the HOA coefficients representative of a soundfield. The processor may decompose the HOA coefficients into a sound component and a corresponding spatial component, the corresponding spatial component defining shape, width, and directions of the sound component, and the corresponding spatial component defined in a spherical harmonic domain. The processor may also determine, based on one or more of the sound component and the corresponding spatial component, priority information indicative of a priority of the sound component relative to other sound components of the soundfield, and specify, in a data object representative of a compressed version of the HOA audio data, the sound component and the priority information.
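As a rough illustration of the priority step only, the following Python sketch ranks already-decomposed sound components by signal energy and stores each priority alongside its component. The energy criterion, the function name `assign_priorities`, and the dictionary layout are assumptions made for illustration; the abstract does not state how priority is computed, and the decomposition of the HOA coefficients is not shown.

```python
def assign_priorities(sound_components):
    """Rank sound components by energy (assumed criterion, not from the patent).

    sound_components: list of sample lists, one per decomposed sound component.
    Returns one dict per component; priority 0 marks the most energetic one.
    """
    energies = [sum(x * x for x in comp) for comp in sound_components]
    # Sort component indices from highest to lowest energy.
    order = sorted(range(len(energies)), key=lambda i: energies[i], reverse=True)
    priority_of = {idx: rank for rank, idx in enumerate(order)}
    # Hypothetical "data object": each component carried with its priority.
    return [
        {"component": i, "priority": priority_of[i], "energy": energies[i]}
        for i in range(len(sound_components))
    ]


components = [[0.1, -0.1, 0.1], [0.9, -0.8, 0.7], [0.4, 0.4, -0.4]]
ranked = assign_priorities(components)
```

A decoder or renderer with limited resources could then, for example, process components in ascending `priority` order and drop the tail.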
-
Publication No.: US20220028401A1
Publication date: 2022-01-27
Application No.: US17493789
Filing date: 2021-10-04
Applicant: Qualcomm Incorporated
Inventor: Nils Gunther Peters, Moo Young Kim, Dipanjan Sen
IPC: G10L19/008, G10L19/16, H04S3/00
Abstract: A device configured to decode a bitstream, where the device includes a memory configured to store a temporally encoded representation of spatial audio signals. The device is also configured to receive the bitstream that includes an indication of a spatial transformation, and includes a temporal decoding unit, coupled to the memory, configured to decode one or more spatial audio signals represented in a spatial domain, where the one or more spatial audio signals are associated with different angles in the spatial domain. In addition, the device includes an inverse spatial transformation unit, coupled to the temporal decoding unit, that is configured to convert the one or more spatial audio signals represented in the spatial domain into at least three ambisonic coefficients that, in part, represent a soundfield in an ambisonics domain, and to perform a spatial transformation of the soundfield based on the indication of the spatial transformation received in the bitstream.
-
Publication No.: US11164606B2
Publication date: 2021-11-02
Application No.: US15672080
Filing date: 2017-08-08
Applicant: QUALCOMM Incorporated
Inventor: Nils Günther Peters, Shankar Thagadur Shivappa, Dipanjan Sen
IPC: G11B27/22, H04N21/81, H04N21/439, H04N21/2343, H04N21/2368, H04N21/6587, H04N5/225, H04N9/82, H04N5/60, H04N5/232, H04N9/802, H04S7/00, G06T19/00, H04N7/01, H04R1/02, H04R1/40
Abstract: An example device includes a memory device, and a processor coupled to the memory device. The memory is configured to store audio spatial metadata associated with a soundfield and video data. The processor is configured to identify one or more foreground audio objects of the soundfield using the audio spatial metadata stored to the memory device, and to select, based on the identified one or more foreground audio objects, one or more viewports associated with the video data. Display hardware coupled to the processor and the memory device is configured to output a portion of the video data being associated with the one or more viewports selected by the processor.
-
Publication No.: US11138983B2
Publication date: 2021-10-05
Application No.: US16557650
Filing date: 2019-08-30
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim, Nils Günther Peters, Dipanjan Sen
IPC: H04R5/04, G10L19/008, G10L19/16, H04S3/00, H04S5/00
Abstract: In general, techniques are described for signaling layers for scalable coding of higher order ambisonic audio data. A device comprising a memory and a processor may be configured to perform the techniques. The memory may be configured to store the bitstream. The processor may be configured to obtain, from the bitstream, an indication of a number of layers specified in the bitstream, and obtain the layers of the bitstream based on the indication of the number of layers.
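The idea of reading a layer count first and then extracting that many layers can be sketched as follows. The byte layout used here (a 1-byte layer count, then a 2-byte big-endian length before each payload) and the names `pack_layers` and `unpack_layers` are purely hypothetical; they are not the syntax of the patent or of any audio coding standard.

```python
import struct


def pack_layers(layers):
    """Serialize layers with a leading layer count (hypothetical layout)."""
    out = bytearray([len(layers)])               # 1 byte: number of layers
    for payload in layers:
        out += struct.pack(">H", len(payload))   # 2 bytes: payload length
        out += payload
    return bytes(out)


def unpack_layers(bitstream):
    """Read the layer count first, then extract exactly that many layers."""
    num_layers = bitstream[0]
    offset = 1
    layers = []
    for _ in range(num_layers):
        (length,) = struct.unpack_from(">H", bitstream, offset)
        offset += 2
        layers.append(bitstream[offset:offset + length])
        offset += length
    return num_layers, layers


stream = pack_layers([b"base", b"enhancement-1", b"enhancement-2"])
count, layers = unpack_layers(stream)
```

Because the count is available up front, a decoder in this sketch could stop after the base layer when it only needs the lowest-quality representation.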
-
Publication No.: US11026019B2
Publication date: 2021-06-01
Application No.: US16352272
Filing date: 2019-03-13
Applicant: QUALCOMM Incorporated
Inventor: S M Akramus Salehin, Dipanjan Sen
Abstract: A device to apply noise reduction to ambisonic signals includes a memory configured to store noise data corresponding to microphones in a microphone array. A processor is configured to perform signal processing operations on signals captured by microphones in the microphone array to generate multiple sets of ambisonic signals including a first set corresponding to a first particular ambisonic order and a second set corresponding to a second particular ambisonic order. The processor is configured to perform a first noise reduction operation that includes applying a first gain factor to each ambisonic signal in the first set and to perform a second noise reduction operation that includes applying a second gain factor to each ambisonic signal in the second set. The first gain factor and the second gain factor are based on the noise data, and the second gain factor is distinct from the first gain factor.
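A minimal sketch of per-order noise reduction, under stated assumptions: the ambisonic sets are already generated and grouped by order, the noise data has been reduced to one noise-power estimate per order, and the gain follows a Wiener-style rule. None of these specifics come from the abstract, which only says that the gain factors are based on the noise data and differ between the two sets. The names `order_gain` and `reduce_noise` are hypothetical.

```python
def order_gain(signal_power, noise_power):
    """Wiener-style gain in [0, 1] (assumed rule, not taken from the patent)."""
    if signal_power <= 0.0:
        return 0.0
    return max(0.0, 1.0 - noise_power / signal_power)


def reduce_noise(sets_by_order, noise_power_by_order):
    """Apply one gain factor per ambisonic order to every signal in that set."""
    cleaned = {}
    gains = {}
    for order, signals in sets_by_order.items():
        # Mean power across all samples of all signals in this order's set.
        total = sum(x * x for sig in signals for x in sig)
        count = sum(len(sig) for sig in signals)
        power = total / count if count else 0.0
        gain = order_gain(power, noise_power_by_order[order])
        gains[order] = gain
        cleaned[order] = [[gain * x for x in sig] for sig in signals]
    return cleaned, gains


sets_by_order = {
    0: [[1.0, -1.0, 1.0, -1.0]],        # one zeroth-order signal
    1: [[0.5, -0.5, 0.5, -0.5]] * 3,    # three first-order signals
}
noise = {0: 0.1, 1: 0.1}  # hypothetical per-order noise power estimates
cleaned, gains = reduce_noise(sets_by_order, noise)
```

With equal noise power in both sets, the weaker first-order signals receive a smaller gain than the zeroth-order signal, so the two gain factors are distinct.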
-
Publication No.: US11019449B2
Publication date: 2021-05-25
Application No.: US16567700
Filing date: 2019-09-11
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.
-
Publication No.: US10972853B2
Publication date: 2021-04-06
Application No.: US16719392
Filing date: 2019-12-18
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Dipanjan Sen
Abstract: A device for processing coded audio is disclosed. The device is configured to store an audio object and audio object metadata associated with the audio object. The audio object metadata includes frequency dependent beam pattern metadata. The device may apply, based on the frequency dependent beam pattern metadata, a renderer to the audio object to obtain one or more speaker feeds and output the one or more speaker feeds.
-
Publication No.: US20190394605A1
Publication date: 2019-12-26
Application No.: US16450660
Filing date: 2019-06-24
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim, Ferdinando Olivieri, Dipanjan Sen
Abstract: In general, techniques are described by which to render different portions of audio data using different renderers. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a plurality of audio renderers. The processor(s) may obtain a first audio renderer of the plurality of audio renderers, and apply the first audio renderer with respect to a first portion of the audio data to obtain one or more first speaker feeds. The processor(s) may next obtain a second audio renderer of the plurality of audio renderers, and apply the second audio renderer with respect to a second portion of the audio data to obtain one or more second speaker feeds. The processor(s) may output, to one or more speakers, the one or more first speaker feeds and the one or more second speaker feeds.
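To make the flow concrete, the sketch below treats each renderer as a plain gain matrix (speakers by input channels) and applies a different stored renderer to each portion of the audio data. Modeling a renderer as a matrix, the renderer names, and the function `apply_renderer` are illustrative assumptions; the abstract does not say what form the renderers take.

```python
def apply_renderer(renderer, portion):
    """Multiply a (speakers x channels) gain matrix by channel sample lists."""
    num_samples = len(portion[0])
    return [
        [sum(g * ch[n] for g, ch in zip(row, portion)) for n in range(num_samples)]
        for row in renderer
    ]


# Hypothetical stored renderers: each maps 2 input channels to 2 speakers.
renderers = {
    "passthrough": [[1.0, 0.0], [0.0, 1.0]],
    "downmix": [[0.5, 0.5], [0.5, 0.5]],
}

first_portion = [[1.0, 2.0], [3.0, 4.0]]    # 2 channels x 2 samples
second_portion = [[2.0, 0.0], [0.0, 2.0]]

# A different renderer is obtained and applied for each portion.
first_feeds = apply_renderer(renderers["passthrough"], first_portion)
second_feeds = apply_renderer(renderers["downmix"], second_portion)
```

Both sets of speaker feeds would then be sent to the speakers, for instance one after the other if the portions are consecutive in time.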
-
Publication No.: US10412522B2
Publication date: 2019-09-10
Application No.: US14663225
Filing date: 2015-03-19
Applicant: QUALCOMM Incorporated
Inventor: Dipanjan Sen, Nils Günther Peters
IPC: H04S7/00, G10L19/008, G10L25/48, G10L19/018
Abstract: In general, techniques are described for inserting audio channels into descriptions of soundfields. A device comprising a processor may be configured to perform the techniques. The processor may be configured to obtain an audio channel separate from a higher-order ambisonic representation of a soundfield. The processor may further be configured to insert the audio channel at a spatial location within the soundfield such that the audio channel is able to be extracted from the soundfield.
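One simple way to picture insertion and later extraction is the sketch below, which uses first-order ambisonics only (the abstract concerns higher-order representations) with ACN channel order (W, Y, Z, X) and SN3D normalization. The function names are hypothetical, and the beam-based extraction shown here is a swapped-in textbook technique, not the method claimed in the patent.

```python
import math


def direction_gains(azimuth, elevation):
    """First-order gains in ACN order (W, Y, Z, X) with SN3D normalization."""
    return [
        1.0,
        math.sin(azimuth) * math.cos(elevation),
        math.sin(elevation),
        math.cos(azimuth) * math.cos(elevation),
    ]


def insert_channel(soundfield, channel, azimuth, elevation):
    """Mix a mono channel into the soundfield at the given direction."""
    gains = direction_gains(azimuth, elevation)
    return [
        [s + g * c for s, c in zip(coeff, channel)]
        for coeff, g in zip(soundfield, gains)
    ]


def extract_channel(soundfield, azimuth, elevation):
    """Steer a simple first-order beam at the direction to recover the channel."""
    gains = direction_gains(azimuth, elevation)
    num_samples = len(soundfield[0])
    return [
        0.5 * sum(g * coeff[n] for g, coeff in zip(gains, soundfield))
        for n in range(num_samples)
    ]


channel = [0.5, -0.25, 1.0]
empty_field = [[0.0] * len(channel) for _ in range(4)]   # silent W, Y, Z, X
az, el = math.pi / 4, math.pi / 6
mixed = insert_channel(empty_field, channel, az, el)
recovered = extract_channel(mixed, az, el)
```

In this sketch the recovery is exact only because the rest of the soundfield is silent; with other sources present, a first-order beam would also pick up part of their energy.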