-
公开(公告)号:US20200029164A1
公开(公告)日:2020-01-23
申请号:US16513436
申请日:2019-07-16
Applicant: QUALCOMM Incorporated
Inventor: Siddhartha Goutham Swaminathan , S M Akramus Salehin , Dipanjan Sen
Abstract: In general, various aspects of the techniques are described for interpolating audio streams. A device comprising a memory and a processor may be configured to perform the techniques. The memory may store the one or more audio streams. The processor may obtain one or more microphone locations, each of the one or more microphone locations identifying a location of a respective one or more microphones that captured each of the corresponding one or more audio streams. The processor may also obtain a listener location identifying a location of a listener, and perform interpolation, based on the one or more microphone locations and the listener location, with respect to the audio streams to obtain an interpolated audio stream. The processor may next obtain, based on the interpolated audio stream, one or more speaker feeds, and output the one or more speaker feeds.
-
公开(公告)号:US20200013414A1
公开(公告)日:2020-01-09
申请号:US16450698
申请日:2019-06-24
Applicant: QUALCOMM Incorporated
Inventor: Shankar Thagadur Shivappa , Richard Paul Walters , Dipanjan Sen , Nils Günther Peters , Moo Young Kim
IPC: G10L19/008 , G10L19/16 , H04R5/02 , H04S3/00
Abstract: In general, techniques are described by which to embed enhanced audio transports in backward compatible bitstreams. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the backward compatible bitstream, which conforms to a legacy transport format. The processor(s) may obtain, from the backward compatible bitstream, legacy audio data that conforms to a legacy audio format, and obtain, from the backward compatible bitstream, extended audio data that enhances the legacy audio data. The processor(s) may also obtain, based on the legacy audio data and the extended audio data, enhanced audio data that conforms to an enhanced audio format, and output the enhanced audio data to one or more speakers.
-
公开(公告)号:US20190392846A1
公开(公告)日:2019-12-26
申请号:US16450625
申请日:2019-06-24
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Dipanjan Sen , Nils Günther Peters , Ferdinando Olivieri
IPC: G10L19/008 , H03M7/30 , H04S5/00
Abstract: In general, techniques are described by which to obtain demixing data for backward compatible rendering of higher order ambisonic audio data. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the higher order ambisonic (HOA) audio data. The processor may obtain, from a bitstream, legacy audio data that conforms to a legacy audio format, and obtain, from the bitstream, de-mixing data. The processor(s) may process, based on the de-mixing data, the legacy audio data to obtain the first portion of the HOA audio data. The processor(s) may next obtain, from the bitstream, a second portion of the HOA audio data. The processor(s) may render the first portion and the second portion to obtain a speaker feed. Further, the processor(s) may output the speaker feed to a speaker to reproduce a soundfield represented by the HOA audio data.
-
公开(公告)号:US10514887B2
公开(公告)日:2019-12-24
申请号:US16148837
申请日:2018-10-01
Applicant: QUALCOMM Incorporated
Inventor: Shankar Thagadur Shivappa , Martin Morrell , S M Akramus Salehin , Dipanjan Sen
Abstract: In a particular aspect, a multimedia device is configured to compensate for navigational movement within a visual environment. The multimedia device includes a processor configured to generate a first version of a spatialized audio signal of a sound field in a first audio frame based on a first position within a visual environment. The processor is configured to generate a second version of the spatialized audio signal of the sound field in a second audio frame based on compensating for the speed of movement of the multimedia device that indicates the first position within the visual environment changed to a second position within the visual environment, being different than the speed of movement in changing from a first location of the sound field to a second location of the sound field.
-
公开(公告)号:US10469968B2
公开(公告)日:2019-11-05
申请号:US15782252
申请日:2017-10-12
Applicant: QUALCOMM Incorporated
Inventor: Nils Günther Peters , S M Akramus Salehin , Shankar Thagadur Shivappa , Moo Young Kim , Dipanjan Sen
Abstract: In general, techniques are described for adapting higher order ambisonic audio data to include three degrees of freedom plus effects. An example device configured to perform the techniques includes a memory, and a processor coupled to the memory. The memory may be configured to store higher order ambisonic audio data representative of a soundfield. The processor may be configured to obtain a translational distance representative of a translational head movement of a user interfacing with the device. The processor may further be configured to adapt, based on the translational distance, higher order ambisonic audio data to provide three degrees of freedom plus effects that adapt the soundfield to account for the translational head movement, and generate speaker feeds based on the adapted higher order ambient audio data.
-
公开(公告)号:US20190110148A1
公开(公告)日:2019-04-11
申请号:US16152153
申请日:2018-10-04
Applicant: QUALCOMM Incorporated
Inventor: Jeongook Song , Dipanjan Sen
IPC: H04S5/00
Abstract: In general, techniques are described by which to perform spatial relation coding of higher order ambisonic coefficients using expanded parameters. A device comprising a memory and a processor may perform the techniques. The memory may be configured to store at least a portion of a bitstream, the bitstream including a first indication representative of an HOA coefficient associated with the spherical basis function having an order of zero, and a second indication representative of one or more parameters. The processor may be configured to perform parameter expansion with respect to the one or more parameters to obtain one or more expanded parameters, and synthesize, based on the one or more expanded parameters and the HOA coefficient associated with the spherical basis function having the order of zero, one or more HOA coefficients associated with one or more spherical basis functions having an order greater than zero.
-
公开(公告)号:US20190110147A1
公开(公告)日:2019-04-11
申请号:US16152130
申请日:2018-10-04
Applicant: QUALCOMM Incorporated
Inventor: Jeongook Song , Dipanjan Sen
IPC: H04S5/00
Abstract: In general, techniques are described by which to perform spatial relation coding using virtual higher order ambisonic coefficients. A device comprising a memory and a processor may perform the techniques. The memory may be configured to store audio data, the audio data representative of zero-ordered higher order ambisonic (HOA) coefficient, and one or more greater-than-zero-ordered HOA coefficients. The processor may be configured to obtain, based on the one or more greater-than-zero-ordered HOA coefficients, a virtual zero-ordered HOA coefficient. The processor may also be configured to obtain, based on the virtual HOA coefficient, one or more parameters from which to synthesize the one or more greater-than-zero-ordered HOA coefficients. The processor may further be configured to generate a bitstream that includes a first indication representative of the zero-ordered HOA coefficients, and a second indication representative of the one or more parameters.
-
公开(公告)号:US10178489B2
公开(公告)日:2019-01-08
申请号:US14174769
申请日:2014-02-06
Applicant: QUALCOMM Incorporated
Inventor: Dipanjan Sen , Martin James Morrell , Nils Günther Peters
IPC: H04S5/00 , H04S7/00 , G10L19/16 , G10L19/008
Abstract: In general, techniques are described for specifying audio rendering information in a bitstream. A device configured to generate the bitstream may perform various aspects of the techniques. The bitstream generation device may comprise one or more processors configured to specify audio rendering information that includes a signal value identifying an audio renderer used when generating the multi-channel audio content. A device configured to render multi-channel audio content from a bitstream may also perform various aspects of the techniques. The rendering device may comprise one or more processors configured to determine audio rendering information that includes a signal value identifying an audio renderer used when generating the multi-channel audio content, and render a plurality of speaker feeds based on the audio rendering information.
-
公开(公告)号:US20180317002A1
公开(公告)日:2018-11-01
申请号:US15727334
申请日:2017-10-06
Applicant: QUALCOMM Incorporated
Inventor: Ricardo De Jesus Bernal Castillo , Wade Heimbigner , Dipanjan Sen
CPC classification number: H04R1/326 , H04R1/021 , H04R1/08 , H04R1/406 , H04R3/005 , H04R3/02 , H04R5/027 , H04R2201/401 , H04R2410/01 , H04R2410/03 , H04R2430/20 , H04S7/303 , H04S2400/15
Abstract: A microphone device includes a microphone array configured to capture one or more audio objects associated with a three-dimensional sound field. The microphone array includes clusters of two or more microphone elements. Each cluster includes one or more acoustic port openings and two or more microphone elements coupled to the one or more acoustic port openings via corresponding acoustic ports. The microphone device also includes a processor coupled to the microphone array.
-
公开(公告)号:US10089063B2
公开(公告)日:2018-10-02
申请号:US15233767
申请日:2016-08-10
Applicant: QUALCOMM Incorporated
Inventor: Shankar Thagadur Shivappa , Martin Morrell , S M Akramus Salehin , Dipanjan Sen
Abstract: In a particular aspect, a multimedia device includes one or more sensors configured to generate first sensor data and second sensor data. The first sensor data is indicative of a first position at a first time and the second sensor data is indicative of a second position at a second time. The multimedia device further includes a processor coupled to the one or more sensors. The processor is configured to generate a first version of a spatialized audio signal, determine a cumulative value based on an offset, the first position, and the second position, and generate a second version of the spatialized audio signal based on the cumulative value.
-
-
-
-
-
-
-
-
-