Patent search ap:("QUALCOMM INCORPORATED") AND inv:"Dipanjan Sen" Page 8

71.

Invention application
NEAR FIELD COMPENSATION FOR DECOMPOSED REPRESENTATIONS OF A SOUND FIELD
Status: Under examination (published)

Publication number: US20150127354A1

Publication date: 2015-05-07

Application number: US14505276

Application date: 2014-10-02

Applicant: QUALCOMM Incorporated

Inventor: Nils Günther Peters , Dipanjan Sen

IPC: G10L19/008

CPC classification number: G10L19/008 , H04S3/02 , H04S2420/11

Abstract: In general, techniques are described for compressing higher order ambisonics (HOA) audio data. A device comprising one or more processors may be configured to perform the techniques. The one or more processors may be configured to obtain a plurality of spherical harmonic coefficients from a plurality of near field compensated spherical harmonic coefficients by, at least in part, counterbalancing application of a near field compensation filter to the plurality of spherical harmonic coefficients.

72.

Invention application
SYSTEMS AND METHODS FOR FEATURE EXTRACTION
Status: In force

Publication number: US20150006164A1

Publication date: 2015-01-01

Application number: US14314022

Application date: 2014-06-24

Applicant: QUALCOMM Incorporated

Inventor: Wenliang Lu , Dipanjan Sen

IPC: G10L15/02

CPC classification number: G10L15/02 , G10L25/00 , G10L25/69

Abstract: A method for feature extraction by an electronic device is described. The method includes processing speech using a physiological cochlear model. The method also includes analyzing sections of an output of the physiological cochlear model. The method further includes extracting a place-based analysis vector and a time-based analysis vector for each section. The method additionally includes determining one or more features from each analysis vector.

73.

Invention application
IDENTIFYING CODEBOOKS TO USE WHEN CODING SPATIAL COMPONENTS OF A SOUND FIELD
Status: Under examination (published)

Publication number: US20140358561A1

Publication date: 2014-12-04

Application number: US14289440

Application date: 2014-05-28

Applicant: QUALCOMM Incorporated

Inventor: Dipanjan Sen , Sang-Uk Ryu

IPC: G10L19/038 , G10L19/008

CPC classification number: H04S5/005 , G06F17/16 , G10L19/002 , G10L19/008 , G10L19/0204 , G10L19/038 , G10L19/06 , G10L19/167 , G10L19/20 , G10L25/18 , G10L2019/0001 , G10L2019/0005 , H04R2205/021 , H04S7/30 , H04S7/304 , H04S7/40 , H04S2400/01 , H04S2400/15 , H04S2420/01 , H04S2420/03 , H04S2420/11

Abstract: In general, techniques are described for identifying a codebook to be used when compressing spatial components of a sound field. A device comprising one or more processors may be configured to perform the techniques. The one or more processors may be configured to identify a Huffman codebook to use when compressing a spatial component of a plurality of spatial components based on an order of the spatial component relative to remaining ones of the plurality of spatial components, the spatial component generated by performing a vector based synthesis with respect to a plurality of spherical harmonic coefficients.

74.

Invention application
IDENTIFYING SOURCES FROM WHICH HIGHER ORDER AMBISONIC AUDIO DATA IS GENERATED
Status: In force

Publication number: US20140358558A1

Publication date: 2014-12-04

Application number: US14289174

Application date: 2014-05-28

Applicant: QUALCOMM Incorporated

Inventor: Dipanjan Sen , Martin James Morrell , Nils Günther Peters

IPC: G10L19/008

CPC classification number: H04S5/005 , G06F17/16 , G10L19/002 , G10L19/008 , G10L19/0204 , G10L19/038 , G10L19/06 , G10L19/167 , G10L19/20 , G10L25/18 , G10L2019/0001 , G10L2019/0005 , H04R2205/021 , H04S7/30 , H04S7/304 , H04S7/40 , H04S2400/01 , H04S2400/15 , H04S2420/01 , H04S2420/03 , H04S2420/11

Abstract: In general, techniques are described for obtaining an indication of whether spherical harmonic coefficients are representative of a synthetic audio object. In accordance with the techniques, a device comprising one or more processors may be configured to obtain an indication of whether spherical harmonic coefficients representative of a sound field are generated from a synthetic audio object.

75.

Invention application
PERFORMING SPATIAL MASKING WITH RESPECT TO SPHERICAL HARMONIC COEFFICIENTS
Status: In force

Publication number: US20140355768A1

Publication date: 2014-12-04

Application number: US14288219

Application date: 2014-05-27

Applicant: QUALCOMM Incorporated

Inventor: Dipanjan Sen , Martin James Morrell

IPC: G10L19/008

CPC classification number: G10L19/008 , G10L19/0212

Abstract: In general, techniques are described by which to perform spatial masking with respect to spherical harmonic coefficients. As one example, an audio encoding device comprising a processor may perform various aspects of the techniques. The processor may be configured to perform spatial analysis based on the spherical harmonic coefficients describing a three-dimensional sound field to identify a spatial masking threshold. The processor may further be configured to render the multi-channel audio data from the plurality of spherical harmonic coefficients, and compress the multi-channel audio data based on the identified spatial masking threshold to generate a bitstream.

76.

Invention grant
Mixed-order ambisonics (MOA) audio data for computer-mediated reality systems
Status: In force

Publication number: US12047764B2

Publication date: 2024-07-23

Application number: US16557798

Application date: 2019-08-30

Applicant: Qualcomm Incorporated

Inventor: Nils Günther Peters , Dipanjan Sen , Thomas Stockhammer

IPC: H04S7/00 , G06F3/01 , G06F3/16 , H04R1/02 , H04R1/40 , H04S3/00 , G06T19/00

CPC classification number: H04S7/303 , G06F3/013 , G06F3/165 , G06F3/167 , H04R1/028 , H04R1/406 , H04S3/008 , G06T19/006 , H04R2420/07 , H04S2400/11 , H04S2400/15 , H04S2420/11

Abstract: An example device includes a memory configured to store a plurality of representations of a soundfield, each representation of the soundfield comprising a different set of ambisonic coefficients representative of the same soundfield at concurrent periods of time. The device also includes a processor, coupled to the memory, and the processor is configured to perform audio playback based on a field of view and on a particular representation of the soundfield from the plurality of representations.

77.

Invention application
Reordering Of Audio Objects In The Ambisonics Domain
Status: In force

Publication number: US20220030372A1

Publication date: 2022-01-27

Application number: US17498707

Application date: 2021-10-11

Applicant: QUALCOMM Incorporated

Inventor: Dipanjan Sen , Sang-Uk Ryu

IPC: H04S7/00

Abstract: In general, disclosed is a device that includes one or more processors, coupled to the memory, configured to perform an energy analysis with respect to one or more audio objects, in the ambisonics domain, in the first time segment. The one or more processors are also configured to perform a similarity measure between the one or more audio objects, in the ambisonics domain, in the first time segment, and the one or more audio objects, in the ambisonics domain, in the second time segment. In addition, the one or more processors are configured to perform a reorder of the one or more audio objects, in the ambisonics domain, in the first time segment with the one or more audio objects, in the ambisonics domain, in the second time segment, to generate one or more reordered audio objects in the first time segment.
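
Illustrative sketch (not part of the patent record): the reordering idea in this abstract can be pictured with a small Python example. Everything specific here is an assumption made for illustration: the function names, the use of cosine similarity as the similarity measure, the greedy matching strategy, and the requirement that both time segments hold the same number of objects. The abstract does not say how the energy analysis, similarity measure, or reorder are actually computed.

```python
import math


def energy(signal):
    """Sum of squared samples (a simple stand-in for an energy analysis)."""
    return sum(x * x for x in signal)


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length sample lists."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(energy(a) * energy(b))
    return dot / norm if norm > 0 else 0.0


def reorder_to_match(first_segment, second_segment):
    """Reorder objects of the first segment to line up with the second.

    Hypothetical greedy matching: for each object in the second segment,
    pick the most similar not-yet-used object from the first segment.
    Assumes both segments contain the same number of objects.
    """
    remaining = list(range(len(first_segment)))
    order = []
    for reference in second_segment:
        best = max(
            remaining,
            key=lambda i: cosine_similarity(first_segment[i], reference),
        )
        order.append(best)
        remaining.remove(best)
    return [first_segment[i] for i in order]


first = [[0.0, 1.0, 0.0], [1.0, 0.0, 0.0]]
second = [[0.9, 0.1, 0.0], [0.1, 0.8, 0.0]]
reordered = reorder_to_match(first, second)
print(reordered)
```

In this toy input the two objects of the first segment are swapped relative to the second, so the sketch returns them in swapped order.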

78.

Invention grant
Selecting audio streams based on motion
Status: In force

Publication number: US11089428B2

Publication date: 2021-08-10

Application number: US16714150

Application date: 2019-12-13

Applicant: QUALCOMM Incorporated

Inventor: S M Akramus Salehin , Siddhartha Goutham Swaminathan , Dipanjan Sen

IPC: H04S7/00 , H04S3/00 , H04R5/04 , H04R5/033

Abstract: In general, various aspects of the techniques are described for selecting audio streams based on motion. A device comprising a processor and a memory may be configured to perform the techniques. The processor may be configured to obtain a current location of the device, and obtain capture locations. Each of the capture locations may identify a location at which a respective one of audio streams is captured. The processor may also be configured to select, based on the current location and the capture locations, a subset of the audio streams, where the subset of the audio streams have less audio streams than the audio streams. The processor may further be configured to reproduce, based on the subset of the audio streams, a soundfield. The memory may be configured to store the subset of the plurality of audio streams.
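
Illustrative sketch (not part of the patent record): one simple way to picture the selection step in this abstract is to keep the audio streams whose capture locations lie closest to the device. The function name, the Euclidean distance criterion, and the `max_streams` parameter are assumptions made for illustration only. The abstract states just that a subset is chosen from the current location and the capture locations, and that the subset is smaller than the full set.

```python
import math


def select_nearest_streams(current_location, capture_locations, max_streams):
    """Return indices of the capture locations closest to the device.

    Hypothetical selection rule: rank streams by Euclidean distance and
    keep at most `max_streams`, always fewer than the total available.
    """
    ranked = sorted(
        range(len(capture_locations)),
        key=lambda i: math.dist(current_location, capture_locations[i]),
    )
    # Keep a strict subset: never return every available stream.
    limit = min(max_streams, len(capture_locations) - 1)
    return ranked[:max(limit, 0)]


capture_locations = [(0.0, 0.0), (4.0, 0.0), (0.0, 5.0), (10.0, 10.0)]
selected = select_nearest_streams((1.0, 1.0), capture_locations, max_streams=2)
print(selected)
```

A real system would presumably re-run the selection as the device moves. The sketch covers a single selection for one device position.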

79.

Invention application
SELECTING AUDIO STREAMS BASED ON MOTION
Status: In force

Publication number: US20210185470A1

Publication date: 2021-06-17

Application number: US16714150

Application date: 2019-12-13

Applicant: QUALCOMM Incorporated

Inventor: S M Akramus Salehin , Siddhartha Goutham Swaminathan , Dipanjan Sen

IPC: H04S7/00 , H04S3/00 , H04R5/033 , H04R5/04

Abstract: In general, various aspects of the techniques are described for selecting audio streams based on motion. A device comprising a processor and a memory may be configured to perform the techniques. The processor may be configured to obtain a current location of the device, and obtain capture locations. Each of the capture locations may identify a location at which a respective one of audio streams is captured. The processor may also be configured to select, based on the current location and the capture locations, a subset of the audio streams, where the subset of the audio streams have less audio streams than the audio streams. The processor may further be configured to reproduce, based on the subset of the audio streams, a soundfield. The memory may be configured to store the subset of the plurality of audio streams.

80.

Invention grant
Spatial relation coding using virtual higher order ambisonic coefficients
Status: In force

Publication number: US10986456B2

Publication date: 2021-04-20

Application number: US16152130

Application date: 2018-10-04

Applicant: QUALCOMM Incorporated

Inventor: Jeongook Song , Dipanjan Sen

IPC: H04S5/00 , H04S3/00 , G10L19/008

Abstract: In general, techniques are described by which to perform spatial relation coding using virtual higher order ambisonic coefficients. A device comprising a memory and a processor may perform the techniques. The memory may be configured to store audio data, the audio data representative of zero-ordered higher order ambisonic (HOA) coefficient, and one or more greater-than-zero-ordered HOA coefficients. The processor may be configured to obtain, based on the one or more greater-than-zero-ordered HOA coefficients, a virtual zero-ordered HOA coefficient. The processor may also be configured to obtain, based on the virtual HOA coefficient, one or more parameters from which to synthesize the one or more greater-than-zero-ordered HOA coefficients. The processor may further be configured to generate a bitstream that includes a first indication representative of the zero-ordered HOA coefficients, and a second indication representative of the one or more parameters.
