Patent search ap:("QUALCOMM INCORPORATED") AND inv:"Dipanjan Sen" Page 16

151.

Invention application
SYSTEMS, METHODS, APPARATUS, AND COMPUTER-READABLE MEDIA FOR GENERATING OBFUSCATED SPEECH SIGNAL (status: under examination, published)
Publication number: US20140006017A1

Publication date: 2014-01-02

Application number: US13780233

Application date: 2013-02-28

Applicant: QUALCOMM INCORPORATED

Inventor: Dipanjan Sen

IPC: G10L21/003

CPC classification number: G10L21/003, G10K11/175, G10L19/008, G10L21/06, G10L25/48, H04K1/02, H04K1/10, H04K3/42, H04K3/825, H04K2203/12, H04K2203/32, H04R1/403, H04R3/12, H04R2201/403, H04R2203/12, H04S7/30, H04S2400/11

Abstract: Arrangements are described that may be used to reduce the intelligibility of speech using masker signals which are obfuscated yet correlated versions of the speech. Other applications of pitch analysis and demodulation are also described. A system may be used to drive an array of loudspeakers to produce a sound field that includes a source component, whose energy is concentrated along a first direction relative to the array, and a masking component that is based on an estimated intensity of the source component in a second direction that is different from the first direction.

152.

Granted invention patent
Reordering of foreground audio objects in the ambisonics domain (status: in force)

Publication number: US11962990B2

Publication date: 2024-04-16

Application number: US17498707

Application date: 2021-10-11

Applicant: QUALCOMM Incorporated

Inventor: Dipanjan Sen, Sang-Uk Ryu

IPC: G10L19/008, G06F17/16, G10L19/002, G10L19/02, G10L19/038, G10L19/06, G10L19/16, G10L19/20, G10L25/18, H04S5/00, H04S7/00, G10L19/00

CPC classification number: H04S5/005, G06F17/16, G10L19/002, G10L19/008, G10L19/0204, G10L19/038, G10L19/06, G10L19/167, G10L19/20, G10L25/18, H04S7/30, H04S7/304, H04S7/40, G10L2019/0001, G10L2019/0005, H04R2205/021, H04S2400/01, H04S2400/15, H04S2420/01, H04S2420/03, H04S2420/11

Abstract: In general, disclosed is a device that includes one or more processors, coupled to the memory, configured to perform an energy analysis with respect to one or more audio objects, in the ambisonics domain, in the first time segment. The one or more processors are also configured to perform a similarity measure between the one or more audio objects, in the ambisonics domain, in the first time segment, and the one or more audio objects, in the ambisonics domain, in the second time segment. In addition, the one or more processors are configured to perform a reorder of the one or more audio objects, in the ambisonics domain, in the first time segment with the one or more audio objects, in the ambisonics domain, in the second time segment, to generate one or more reordered audio objects in the first time segment.

153.

Granted invention patent
Spatial transformation of ambisonic audio data (status: in force)

Publication number: US11664035B2

Publication date: 2023-05-30

Application number: US17493789

Application date: 2021-10-04

Applicant: QUALCOMM Incorporated

Inventor: Nils Günther Peters, Moo Young Kim, Dipanjan Sen

IPC: G10L19/008, G10L19/16, H04S3/00, H04R5/04, H04S5/00

CPC classification number: G10L19/008, G10L19/167, H04S3/008, H04R5/04, H04S5/00, H04S2420/11

Abstract: A device configured to decode a bitstream, where the device includes a memory configured to store a temporally encoded representation of spatial audio signals. The device is also configured to receive the bitstream that includes an indication of a spatial transformation, and includes a temporal decoding unit, coupled to the memory, configured to decode one or more spatial audio signals represented in a spatial domain, where the one or more spatial audio signals are associated with different angles in the spatial domain. In addition, the device includes an inverse spatial transformation unit, coupled to the temporal decoding unit, is configured to convert the one or more spatial audio signals represented in the spatial domain into at least three ambisonic coefficients that, in part, represent a soundfield in an ambisonics domain, and perform a spatial transformation of the soundfield based on the indication of the spatial transformation received in the bitstream.

154.

Granted invention patent
Rendering metadata to control user movement based audio rendering (status: in force)

Publication number: US11184731B2

Publication date: 2021-11-23

Application number: US16822556

Application date: 2020-03-18

Applicant: QUALCOMM Incorporated

Inventor: Nils Günther Peters, Moo Young Kim, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, Dipanjan Sen

IPC: H04S7/00, H04R5/033, H04S3/00, H04R5/04

Abstract: In general, techniques are described for rendering metadata to control user movement based audio rendering. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store audio data representative of a soundfield. The one or more processors may be coupled to the memory, and configured to obtain rendering metadata indicative of controls for enabling or disabling adaptations, based on an indication of a movement of a user of the device, of a renderer used to render audio data representative of a soundfield, specify, in a bitstream representative of the audio data, the rendering metadata, and output the bitstream.

155.

Granted invention patent
Representing occlusion when rendering for computer-mediated reality systems (status: in force)

Publication number: US11128976B2

Publication date: 2021-09-21

Application number: US16584614

Application date: 2019-09-26

Applicant: QUALCOMM Incorporated

Inventor: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Moo Young Kim, Nils Günther Peters, Dipanjan Sen

IPC: H04S7/00, H04R3/04, H04R5/02, H04R5/04, H04S3/00

Abstract: In general, techniques are described for modeling occlusions when rendering audio data. A device comprising a memory and one or more processors may perform the techniques. The memory may store audio data representative of a soundfield. The one or more processors may obtain occlusion metadata representative of an occlusion within the soundfield in terms of propagation of sound through the occlusion, the occlusion separating the soundfield into two or more sound spaces. The one or more processors may obtain a location of the device, and obtain, based on the occlusion metadata and the location, a renderer by which to render the audio data into one or more speaker feeds that account for propagation of the sound in one of the two or more sound spaces in which the device resides. The one or more processors may apply the renderer to the audio data to generate the speaker feeds.

156.

Granted invention patent
Embedding enhanced audio transports in backward compatible audio bitstreams (status: in force)

Publication number: US11081116B2

Publication date: 2021-08-03

Application number: US16450698

Application date: 2019-06-24

Applicant: QUALCOMM Incorporated

Inventor: Shankar Thagadur Shivappa, Richard Paul Walters, Dipanjan Sen, Nils Günther Peters, Moo Young Kim

IPC: G10L19/008, G10L19/16, H04R5/02, H04S3/00

Abstract: In general, techniques are described by which to embed enhanced audio transports in backward compatible bitstreams. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the backward compatible bitstream, which conforms to a legacy transport format. The processor(s) may obtain, from the backward compatible bitstream, legacy audio data that conforms to a legacy audio format, and obtain, from the backward compatible bitstream, extended audio data that enhances the legacy audio data. The processor(s) may also obtain, based on the legacy audio data and the extended audio data, enhanced audio data that conforms to an enhanced audio format, and output the enhanced audio data to one or more speakers.

157.

Granted invention patent
Spatially formatted enhanced audio data for backward compatible audio bitstreams (status: in force)

Publication number: US11062713B2

Publication date: 2021-07-13

Application number: US16450514

Application date: 2019-06-24

Applicant: QUALCOMM Incorporated

Inventor: Nils Günther Peters, Ferdinando Olivieri, Moo Young Kim, Dipanjan Sen, Shankar Thagadur Shivappa

IPC: G10L19/008, H04S7/00, H04R5/04, H04R5/02

Abstract: In general, techniques are described by which to specify spatially formatted enhanced audio data for backward compatible audio bitstreams. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the backward compatible bitstream that conforms to a legacy transport format. The processor(s) may obtain, from the backward compatible bitstream, legacy audio data that conforms to a legacy audio format and a spatially formatted extended audio stream. The processor(s) may process the spatially formatted extended audio stream to obtain extended audio data that enhances the legacy audio data. The processor(s) may next obtain, based on the legacy audio data and the extended audio data, enhanced audio data that conforms to an enhanced audio format. The processor(s) may output the enhanced audio data to one or more speakers.

158.

Granted invention patent
Rendering different portions of audio data using different renderers (status: in force)

Publication number: US10999693B2

Publication date: 2021-05-04

Application number: US16450660

Application date: 2019-06-24

Applicant: QUALCOMM Incorporated

Inventor: Moo Young Kim, Ferdinando Olivieri, Dipanjan Sen

IPC: H04S3/00, H04S7/00, H04R5/02, G10L19/008, G10L19/16, H04S3/02

Abstract: In general, techniques are described by which to render different portions of audio data using different renderers. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store audio renderers. The processor(s) may obtain a first audio renderer of the plurality of audio renderers, and apply the first audio renderer with respect to a first portion of the audio data to obtain one or more first speaker feeds. The processor(s) may next obtain a second audio renderer of the plurality of audio renderers, and apply the second audio renderer with respect to a second portion of the audio data to obtain one or more second speaker feeds. The processor(s) may output, to one or more speakers, the one or more first speaker feeds and the one or more second speaker feeds.

159.

Invention application
HIGHER ORDER AMBISONIC AUDIO DATA (status: under examination, published)

Publication number: US20200335113A1

Publication date: 2020-10-22

Application number: US16868259

Application date: 2020-05-06

Applicant: QUALCOMM Incorporated

Inventor: Moo Young Kim, Nils Günther Peters, Shankar Thagadur Shivappa, Dipanjan Sen

IPC: G10L19/008, H04S3/02

Abstract: In general, techniques are described by which to provide priority information for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may perform the techniques. The memory stores HOA coefficients of the HOA audio data, the HOA coefficients representative of a soundfield. The processor may decompose the HOA coefficients into a sound component and a corresponding spatial component, the corresponding spatial component defining shape, width, and directions of the sound component, and the corresponding spatial component defined in a spherical harmonic domain. The processor may also determine, based on one or more of the sound component and the corresponding spatial component, priority information indicative of a priority of the sound component relative to other sound components of the soundfield, and specify, in a data object representative of a compressed version of the HOA audio data, the sound component and the priority information.

160.

Granted invention patent
Transporting coded audio data (status: in force)

Publication number: US10693936B2

Publication date: 2020-06-23

Application number: US15246370

Application date: 2016-08-24

Applicant: QUALCOMM Incorporated

Inventor: Thomas Stockhammer, Dipanjan Sen, Nils Günther Peters, Moo Young Kim

IPC: G06F15/16, H04L29/06, H04L29/08, H04N21/81, H04N21/845, H04N21/6373, H04N21/462, H04N21/442

Abstract: In one example, a device for retrieving audio data includes one or more processors configured to receive availability data representative of a plurality of available adaptation sets, the available adaptation sets including a scene-based audio adaptation set and one or more object-based audio adaptation sets, receive selection data identifying which of the scene-based audio adaptation set and the one or more object-based audio adaptation sets are to be retrieved, and provide instruction data to a streaming client to cause the streaming client to retrieve data for each of the adaptation sets identified by the selection data, and a memory configured to store the retrieved data for the audio adaptation sets.
