-
Publication No.: US10728689B2
Publication date: 2020-07-28
Application No.: US16219714
Filing date: 2018-12-13
Applicant: QUALCOMM Incorporated
Inventor: Siddhartha Goutham Swaminathan , S M Akramus Salehin , Dipanjan Sen , Michael Ericson
Abstract: Methods, systems, computer-readable media, and apparatuses for characterizing portions of a soundfield are presented. Some configurations include estimating a total energy of a soundfield associated with a scene space and, for each of at least some of a plurality of regions of the scene space, estimating an energy of a portion of the soundfield that corresponds to the region and creating a corresponding metadata field that indicates a location of the region within the space and a relation between the estimated total energy and the estimated energy that corresponds to the region.
-
Publication No.: US20190385622A1
Publication date: 2019-12-19
Application No.: US16557650
Filing date: 2019-08-30
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Dipanjan Sen
IPC: G10L19/008 , H04S3/00 , G10L19/16
Abstract: In general, techniques are described for signaling layers for scalable coding of higher order ambisonic audio data. A device comprising a memory and a processor may be configured to perform the techniques. The memory may be configured to store the bitstream. The processor may be configured to obtain, from the bitstream, an indication of a number of layers specified in the bitstream, and obtain the layers of the bitstream based on the indication of the number of layers.
-
Publication No.: US10455321B2
Publication date: 2019-10-22
Application No.: US15727334
Filing date: 2017-10-06
Applicant: QUALCOMM Incorporated
Inventor: Ricardo De Jesus Bernal Castillo , Wade Heimbigner , Dipanjan Sen
IPC: H04R1/08 , H04R9/08 , H04R11/04 , H04R1/32 , H04R1/02 , H04R5/027 , H04R1/40 , H04R3/00 , H04R3/02 , H04S7/00
Abstract: A microphone device includes a microphone array configured to capture one or more audio objects associated with a three-dimensional sound field. The microphone array includes clusters of two or more microphone elements. Each cluster includes one or more acoustic port openings and two or more microphone elements coupled to the one or more acoustic port openings via corresponding acoustic ports. The microphone device also includes a processor coupled to the microphone array.
-
Publication No.: US20190198028A1
Publication date: 2019-06-27
Application No.: US16227880
Filing date: 2018-12-20
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Shankar Thagadur Shivappa , Dipanjan Sen
IPC: G10L19/008 , H04S3/00 , H04S7/00
CPC classification number: G10L19/008 , G10L19/167 , H04S3/008 , H04S7/30 , H04S2400/15 , H04S2420/11
Abstract: In general, techniques are described by which to provide priority information for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may perform the techniques. The memory stores HOA coefficients of the HOA audio data, the HOA coefficients representative of a soundfield. The processor may decompose the HOA coefficients into a sound component and a corresponding spatial component, the corresponding spatial component defining shape, width, and directions of the sound component, and the corresponding spatial component defined in a spherical harmonic domain. The processor may also determine, based on one or more of the sound component and the corresponding spatial component, priority information indicative of a priority of the sound component relative to other sound components of the soundfield, and specify, in a data object representative of a compressed version of the HOA audio data, the sound component and the priority information.
-
Publication No.: US20190005986A1
Publication date: 2019-01-03
Application No.: US15672080
Filing date: 2017-08-08
Applicant: QUALCOMM Incorporated
Inventor: Nils Günther Peters , Shankar Thagadur Shivappa , Dipanjan Sen
Abstract: An example device includes a memory device, and a processor coupled to the memory device. The memory is configured to store audio spatial metadata associated with a soundfield and video data. The processor is configured to identify one or more foreground audio objects of the soundfield using the audio spatial metadata stored to the memory device, and to select, based on the identified one or more foreground audio objects, one or more viewports associated with the video data. Display hardware coupled to the processor and the memory device is configured to output a portion of the video data being associated with the one or more viewports selected by the processor.
-
Publication No.: US20180338212A1
Publication date: 2018-11-22
Application No.: US15804718
Filing date: 2017-11-06
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Dipanjan Sen
IPC: H04S7/00 , G10L19/008 , H04R1/40 , H04R3/00 , H04S3/00
CPC classification number: G10L19/008 , G10L19/167 , G10L19/173 , H04R3/005 , H04R5/027 , H04R2499/13 , H04S3/008 , H04S2400/01 , H04S2400/15 , H04S2420/11
Abstract: In general, techniques are described for performing layered intermediate compression for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may be configured to perform the techniques. The memory may store HOA coefficients of the HOA audio data. The processor may decompose the HOA coefficients into a predominant sound component and a corresponding spatial component. The spatial component may be representative of the directions, shape, and width of the predominant sound component, and defined in the spherical harmonic domain. The processor may specify, in a bitstream conforming to an intermediate compression format, a subset of the HOA coefficients that represent an ambient component. The processor may also specify, in the bitstream and irrespective of a determination of a minimum number of ambient channels and a number of elements to specify in the bitstream for the spatial component, all elements of the spatial component.
-
Publication No.: US20180206057A1
Publication date: 2018-07-19
Application No.: US15868656
Filing date: 2018-01-11
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Dipanjan Sen
IPC: H04S7/00 , G10L19/008 , H04S3/00
CPC classification number: H04S7/304 , G10L19/008 , H04R5/033 , H04S3/008 , H04S7/306 , H04S2400/01 , H04S2400/03 , H04S2400/11 , H04S2400/13 , H04S2420/01 , H04S2420/03 , H04S2420/11
Abstract: An example audio decoding device includes processing circuitry and a memory device coupled to the processing circuitry. The processing circuitry is configured to receive, in a bitstream, encoded representations of audio objects of a three-dimensional (3D) soundfield, to receive metadata associated with the bitstream, to obtain, from the received metadata, one or more transmission factors associated with one or more of the audio objects, and to apply the transmission factors to the one or more audio objects to obtain parallax-adjusted audio objects of the 3D soundfield. The memory device is configured to store at least a portion of the received bitstream, the received metadata, or the parallax-adjusted audio objects of the 3D soundfield.
-
Publication No.: US09984693B2
Publication date: 2018-05-29
Application No.: US14878729
Filing date: 2015-10-08
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Dipanjan Sen
IPC: G10L19/008 , H04S7/00 , G10L19/16
CPC classification number: G10L19/008 , G10L19/167 , H04S7/30 , H04S2420/11
Abstract: In general, techniques are described for signaling channels for scalable coding of higher order ambisonic audio data. A device comprising a memory and a processor may be configured to perform the techniques. The memory may be configured to store the bitstream. The processor may be configured to obtain, from the bitstream, an indication of a number of channels specified in one or more layers in the bitstream, and obtain the channels specified in the one or more layers in the bitstream based on the indication of the number of channels.
-
Publication No.: US09959876B2
Publication date: 2018-05-01
Application No.: US14712638
Filing date: 2015-05-14
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Dipanjan Sen
IPC: G10L19/008 , G10L19/038 , H04S3/02 , H04S5/00 , G10L19/032 , G10L19/20
CPC classification number: G10L19/008 , G10L19/032 , G10L19/038 , G10L19/20 , H04S3/02 , H04S5/005 , H04S2400/01 , H04S2420/11
Abstract: In general, techniques are described for closed loop quantization of HOA coefficients that provide a three-dimensional representation of the sound field. An audio encoding device may perform closed loop quantization of an audio object based at least in part on a result of performing quantization of directional information associated with the audio object. An audio decoding device may obtain an audio object that has been closed loop quantized based at least in part on a result of performing quantization of directional information associated with the audio object, and may dequantize the audio object.
-
Publication No.: US09940937B2
Publication date: 2018-04-10
Application No.: US14878948
Filing date: 2015-10-08
Applicant: QUALCOMM Incorporated
Inventor: Nils Günther Peters , Martin James Morrell , Dipanjan Sen
IPC: G10L19/008 , G10L19/032 , H04S3/00 , H04S7/00
CPC classification number: G10L19/008 , G10L19/032 , H04S3/008 , H04S7/301 , H04S7/302 , H04S2420/11
Abstract: This disclosure describes techniques for coding of higher-order ambisonics audio data comprising at least one higher-order ambisonic (HOA) coefficient corresponding to a spherical harmonic basis function having an order greater than one. This disclosure describes techniques for adjusting HOA soundfields to potentially improve spatial alignment of the acoustic elements to the visual component in a mixed audio/video reproduction scenario. In one example, a device for rendering an HOA audio signal includes one or more processors configured to render the HOA audio signal over one or more speakers based on one or more field of view (FOV) parameters of a reference screen and one or more FOV parameters of a viewing window.