-
公开(公告)号:US20140355766A1
公开(公告)日:2014-12-04
申请号:US14289602
申请日:2014-05-28
Applicant: QUALCOMM Incorporated
Inventor: Martin James Morrell , Dipanjan Sen , Nils Günther Peters
IPC: H04S5/00
CPC classification number: G10L19/008 , H04S7/30 , H04S7/304 , H04S2400/01 , H04S2420/01 , H04S2420/11
Abstract: A device comprising one or more processors is configured to obtain transformation information, the transformation information describing how a sound field was transformed to reduce a number of a plurality of hierarchical elements to a reduced plurality of hierarchical elements; and perform binaural audio rendering with respect to the reduced plurality of hierarchical elements based on the transformation information.
Abstract translation: 包括一个或多个处理器的设备被配置为获得变换信息,所述变换信息描述如何变换声场以将多个分层元素的数量减少到减少的多个分层元素; 并且基于变换信息对所减少的多个分层元素执行双耳音频呈现。
-
112.
公开(公告)号:US20140226823A1
公开(公告)日:2014-08-14
申请号:US14174769
申请日:2014-02-06
Applicant: QUALCOMM Incorporated
Inventor: Dipanjan Sen , Martin James Morrell , Nils Günther Peters
IPC: H04S5/00
CPC classification number: H04S7/30 , G10L19/008 , G10L19/167 , H04S7/301 , H04S7/308 , H04S2420/03 , H04S2420/11
Abstract: In general, techniques are described for specifying audio rendering information in a bitstream. A device configured to generate the bitstream may perform various aspects of the techniques. The bitstream generation device may comprise one or more processors configured to specify audio rendering information that includes a signal value identifying an audio renderer used when generating the multi-channel audio content. A device configured to render multi-channel audio content from a bitstream may also perform various aspects of the techniques. The rendering device may comprise one or more processors configured to determine audio rendering information that includes a signal value identifying an audio renderer used when generating the multi-channel audio content, and render a plurality of speaker feeds based on the audio rendering information.
Abstract translation: 通常,描述了用于在比特流中指定音频呈现信息的技术。 被配置为生成比特流的设备可以执行该技术的各个方面。 比特流生成装置可以包括一个或多个处理器,其被配置为指定音频呈现信息,其包括识别当生成多声道音频内容时使用的音频渲染器的信号值。 被配置为从比特流呈现多声道音频内容的设备还可以执行该技术的各个方面。 渲染设备可以包括一个或多个处理器,其被配置为确定音频呈现信息,该音频呈现信息包括识别在生成多声道音频内容时使用的音频渲染器的信号值,以及基于该音频呈现信息呈现多个扬声器馈送。
-
公开(公告)号:US20140016786A1
公开(公告)日:2014-01-16
申请号:US13844383
申请日:2013-03-15
Applicant: QUALCOMM INCORPORATED
Inventor: Dipanjan Sen
IPC: G10L19/008
CPC classification number: G10L19/008 , H04S3/008 , H04S2400/01 , H04S2400/03
Abstract: Systems, methods, and apparatus for a unified approach to encoding different types of audio inputs are described.
-
公开(公告)号:US20210264927A1
公开(公告)日:2021-08-26
申请号:US17180255
申请日:2021-02-19
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Dipanjan Sen , Siddhartha Goutham Swaminathan , S M Akramus Salehin , Jason Filos
Abstract: An example audio decoding device includes a memory configured to store at least a portion of a coded audio bitstream; and one or more processors configured to: decode, based on the coded audio bitstream, a representation of a soundfield; decode, based on the coded audio bitstream, a syntax element indicating a selection of either a head-related transfer function (HRTF) or a binaural room impulse response (BRIR); and render, using the selected HRTF or BRIR, speaker feeds from the soundfield.
-
公开(公告)号:US10972851B2
公开(公告)日:2021-04-06
申请号:US16152153
申请日:2018-10-04
Applicant: QUALCOMM Incorporated
Inventor: Jeongook Song , Dipanjan Sen
IPC: H04S5/00 , H04S3/00 , G10L19/008
Abstract: In general, techniques are described by which to perform spatial relation coding of higher order ambisonic coefficients using expanded parameters. A device comprising a memory and a processor may perform the techniques. The memory may be configured to store at least a portion of a bitstream, the bitstream including a first indication representative of an HOA coefficient associated with the spherical basis function having an order of zero, and a second indication representative of one or more parameters. The processor may be configured to perform parameter expansion with respect to the one or more parameters to obtain one or more expanded parameters, and synthesize, based on the one or more expanded parameters and the HOA coefficient associated with the spherical basis function having the order of zero, one or more HOA coefficients associated with one or more spherical basis functions having an order greater than zero.
-
公开(公告)号:US10952009B2
公开(公告)日:2021-03-16
申请号:US16863626
申请日:2020-04-30
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Dipanjan Sen
Abstract: An example audio decoding device includes processing circuitry and a memory device coupled to the processing circuitry. The processing circuitry is configured to receive, in a bitstream, encoded representations of audio objects of a three-dimensional (3D) soundfield, to receive metadata associated with the bitstream, to obtain, from the received metadata, one or more transmission factors associated with one or more of the audio objects, and to apply the transmission factors to the one or more audio objects to obtain parallax-adjusted audio objects of the 3D soundfield. The memory device is configured to store at least a portion of the received bitstream, the received metadata, or the parallax-adjusted audio objects of the 3D soundfield.
-
公开(公告)号:US10924876B2
公开(公告)日:2021-02-16
申请号:US16513436
申请日:2019-07-16
Applicant: QUALCOMM Incorporated
Inventor: Siddhartha Goutham Swaminathan , S M Akramus Salehin , Dipanjan Sen
Abstract: In general, various aspects of the techniques are described for interpolating audio streams. A device comprising a memory and a processor may be configured to perform the techniques. The memory may store the one or more audio streams. The processor may obtain one or more microphone locations, each of the one or more microphone locations identifying a location of a respective one or more microphones that captured each of the corresponding one or more audio streams. The processor may also obtain a listener location identifying a location of a listener, and perform interpolation, based on the one or more microphone locations and the listener location, with respect to the audio streams to obtain an interpolated audio stream. The processor may next obtain, based on the interpolated audio stream, one or more speaker feeds, and output the one or more speaker feeds.
-
公开(公告)号:US20200304935A1
公开(公告)日:2020-09-24
申请号:US16822556
申请日:2020-03-18
Applicant: QUALCOMM Incorporated
Inventor: Nils Günther Peters , Moo Young Kim , S M Akramus Salehin , Siddhartha Goutham Swaminathan , Isaac Garcia Munoz , Dipanjan Sen
Abstract: In general, techniques are described for rendering metadata to control user movement based audio rendering. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store audio data representative of a soundfield. The one or more processors may be coupled to the memory, and configured to obtain rendering metadata indicative of controls for enabling or disabling adaptations, based on an indication of a movement of a user of the device, of a renderer used to render audio data representative of a soundfield, specify, in a bitstream representative of the audio data, the rendering metadata, and output the bitstream.
-
119.
公开(公告)号:US10770087B2
公开(公告)日:2020-09-08
申请号:US14712849
申请日:2015-05-14
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Dipanjan Sen
IPC: G10L19/038 , G10L19/008 , G10L19/00
Abstract: In general, techniques are described for performing codebook selection when coding vectors decomposed from higher-order ambisonic coefficients. A device comprising a memory and a processor may perform the techniques. The memory may be configured to store a plurality of codebooks to use when performing vector dequantization with respect to a vector quantized spatial component of a soundfield. The vector quantized spatial component may be obtained through application of a decomposition to a plurality of higher order ambisonic coefficients. The processor may be configured to select one of the plurality of codebooks.
-
公开(公告)号:US10657974B2
公开(公告)日:2020-05-19
申请号:US16227880
申请日:2018-12-20
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Shankar Thagadur Shivappa , Dipanjan Sen
IPC: G10L19/008 , H04S7/00 , G10L19/16 , H04S3/00
Abstract: In general, techniques are described by which to provide priority information for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may perform the techniques. The memory stores HOA coefficients of the HOA audio data, the HOA coefficients representative of a soundfield. The processor may decompose the HOA coefficients into a sound component and a corresponding spatial component, the corresponding spatial component defining shape, width, and directions of the sound component, and the corresponding spatial component defined in a spherical harmonic domain. The processor may also determine, based on one or more of the sound component and the corresponding spatial component, priority information indicative of a priority of the sound component relative to other sound components of the soundfield, and specify, in a data object representative of a compressed version of the HOA audio data, the sound component and the priority information.
-
-
-
-
-
-
-
-
-