Coding independent frames of ambient higher-order ambisonic coefficients
    181.
    Granted invention patent
    Legal status: In force

    Publication number: US09502045B2

    Publication date: 2016-11-22

    Application number: US14609208

    Filing date: 2015-01-29

    Abstract: In general, techniques are described for coding an ambient higher order ambisonic coefficient. An audio decoding device comprising a memory and a processor may perform the techniques. The memory may store a first frame of a bitstream and a second frame of the bitstream. The processor may obtain, from the first frame, one or more bits indicative of whether the first frame is an independent frame that includes additional reference information to enable the first frame to be decoded without reference to the second frame. The processor may further obtain, in response to the one or more bits indicating that the first frame is not an independent frame, prediction information for first channel side information data of a transport channel. The prediction information may be used to decode the first channel side information data of the transport channel with reference to second channel side information data of the transport channel.
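The decoder behavior in this abstract can be illustrated with a minimal sketch. The bit layout is an assumption made for illustration only: one leading flag bit marks an independent frame, and a hypothetical 2-bit prediction field follows when the flag is clear. The names `BitReader`, `FrameHeader`, and `parse_frame_header` are not from the patent.

```python
from dataclasses import dataclass
from typing import Optional


class BitReader:
    """Reads bits MSB-first from a bytes object."""

    def __init__(self, data: bytes) -> None:
        self.data = data
        self.pos = 0  # current bit position

    def read(self, nbits: int) -> int:
        value = 0
        for _ in range(nbits):
            byte = self.data[self.pos // 8]
            bit = (byte >> (7 - self.pos % 8)) & 1
            value = (value << 1) | bit
            self.pos += 1
        return value


@dataclass
class FrameHeader:
    independent: bool
    prediction_info: Optional[int]  # None when the frame is independent


def parse_frame_header(frame: bytes) -> FrameHeader:
    reader = BitReader(frame)
    # Assumed layout: the first bit says whether the frame is independent.
    independent = reader.read(1) == 1
    if independent:
        # An independent frame decodes without reference to another frame,
        # so no prediction information is read here.
        return FrameHeader(True, None)
    # Hypothetical 2-bit field: prediction info for the channel side
    # information, which is decoded relative to another frame's data.
    return FrameHeader(False, reader.read(2))
```

In this sketch a decoder that joins a stream mid-way could skip frames until `independent` is true, which is the usual motivation for such a flag.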


    Scalable downmix design with feedback for object-based surround codec
    182.
    Granted invention patent
    Legal status: In force

    Publication number: US09479886B2

    Publication date: 2016-10-25

    Application number: US13945806

    Filing date: 2013-07-18

    Abstract: In general, techniques are described for grouping audio objects into clusters. In some examples, a device for audio signal processing comprises a cluster analysis module configured to group, based on spatial information for each of N audio objects, a plurality of audio objects that includes the N audio objects into L clusters, where L is less than N, wherein the cluster analysis module is configured to receive information from at least one of a transmission channel, a decoder, and a renderer, and wherein a maximum value for L is based on the information received. The device also comprises a downmix module configured to mix the plurality of audio objects into L audio streams, and a metadata downmix module configured to produce, based on the spatial information and the grouping, metadata that indicates spatial information for each of the L audio streams.
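As a rough illustration of the grouping and downmix described in this abstract, the sketch below clusters N objects by position into L clusters, sums each cluster's signals into one stream, and reports the cluster centroid as per-stream spatial metadata. The plain k-means loop, the function name `cluster_and_downmix`, and the `max_clusters` parameter (standing in for the limit derived from channel, decoder, or renderer feedback) are assumptions; the abstract does not name a clustering algorithm.

```python
from typing import List, Sequence, Tuple

Position = Tuple[float, float, float]


def cluster_and_downmix(
    positions: Sequence[Position],
    signals: Sequence[Sequence[float]],
    max_clusters: int,
    iterations: int = 10,
) -> Tuple[List[List[float]], List[Position], List[int]]:
    n = len(positions)
    if not 0 < max_clusters < n:
        raise ValueError("need 0 < max_clusters < number of objects")
    num_clusters = max_clusters  # L, bounded by the feedback-derived maximum

    # Deterministic start: the first L object positions seed the centroids.
    centroids = [tuple(positions[i]) for i in range(num_clusters)]
    assignment = [0] * n
    for _ in range(iterations):
        # Assign every object to its nearest centroid (squared distance).
        for i, p in enumerate(positions):
            assignment[i] = min(
                range(num_clusters),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
        # Move each centroid to the mean position of its members.
        for c in range(num_clusters):
            members = [positions[i] for i in range(n) if assignment[i] == c]
            if members:
                centroids[c] = tuple(sum(axis) / len(members) for axis in zip(*members))

    # Downmix: sum the signals of all objects in a cluster into one stream.
    length = len(signals[0])
    streams = [[0.0] * length for _ in range(num_clusters)]
    for i, signal in enumerate(signals):
        for t, sample in enumerate(signal):
            streams[assignment[i]][t] += sample

    # The centroids serve as spatial metadata for the L streams.
    return streams, centroids, assignment
```

Lowering `max_clusters` when the channel or renderer reports less capacity trades spatial resolution for fewer streams, which is the scalability the title refers to.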


    Performing positional analysis to code spherical harmonic coefficients
    183.
    Granted invention patent
    Legal status: In force

    Publication number: US09466305B2

    Publication date: 2016-10-11

    Application number: US14288320

    Filing date: 2014-05-27

    CPC classification number: G10L19/02 G10L19/008 H04S2420/11

    Abstract: In general, techniques are described for performing a positional analysis to code audio data. Typically, this audio data comprises a hierarchical representation of a soundfield and may include, as one example, spherical harmonic coefficients (which may also be referred to as higher-order ambisonic coefficients). An audio compression device that includes one or more processors may perform the techniques. The processors may be configured to allocate bits to one or more portions of the audio data, at least in part by performing positional analysis on the audio data.


    SCREEN RELATED ADAPTATION OF HOA CONTENT
    184.
    Invention patent application
    Legal status: In force

    Publication number: US20160104495A1

    Publication date: 2016-04-14

    Application number: US14878948

    Filing date: 2015-10-08

    Abstract: This disclosure describes techniques for coding of higher-order ambisonics audio data comprising at least one higher-order ambisonic (HOA) coefficient corresponding to a spherical harmonic basis function having an order greater than one. This disclosure describes techniques for adjusting HOA soundfields to potentially improve spatial alignment of the acoustic elements to the visual component in a mixed audio/video reproduction scenario. In one example, a device for rendering an HOA audio signal includes one or more processors configured to render the HOA audio signal over one or more speakers based on one or more field of view (FOV) parameters of a reference screen and one or more FOV parameters of a viewing window.
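One plausible way FOV parameters of a reference screen and a viewing window could drive rendering is an azimuth remapping, sketched below. This is an assumption, not the method stated in the abstract: both screens are taken to be centered at 0 degrees, angles inside the reference screen are scaled linearly onto the viewing window, and angles outside are compressed or stretched linearly so that 180 degrees stays at 180 degrees. The function name `remap_azimuth` is hypothetical.

```python
def remap_azimuth(az_deg: float, ref_fov_deg: float, view_fov_deg: float) -> float:
    """Map an azimuth authored for the reference screen onto the viewing window."""
    if not (0.0 < ref_fov_deg < 360.0 and 0.0 < view_fov_deg < 360.0):
        raise ValueError("FOV values must lie strictly between 0 and 360 degrees")
    half_ref = ref_fov_deg / 2.0
    half_view = view_fov_deg / 2.0
    sign = -1.0 if az_deg < 0 else 1.0
    a = abs(az_deg)
    if a <= half_ref:
        # On-screen region: scale so screen edges map to window edges.
        mapped = a * half_view / half_ref
    else:
        # Off-screen region: linear map of [half_ref, 180] to [half_view, 180].
        mapped = half_view + (a - half_ref) * (180.0 - half_view) / (180.0 - half_ref)
    return sign * mapped
```

With a 58-degree reference screen and a 116-degree viewing window, a sound authored at the right screen edge (29 degrees) would move to the window edge (58 degrees), keeping it aligned with the picture.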


    SIGNALING LAYERS FOR SCALABLE CODING OF HIGHER ORDER AMBISONIC AUDIO DATA
    185.
    Invention patent application
    Legal status: Under examination (published)

    Publication number: US20160104493A1

    Publication date: 2016-04-14

    Application number: US14878691

    Filing date: 2015-10-08

    CPC classification number: G10L19/008 G10L19/167 H04S3/008 H04S2420/11

    Abstract: In general, techniques are described for signaling layers for scalable coding of higher order ambisonic audio data. A device comprising a memory and a processor may be configured to perform the techniques. The memory may be configured to store the bitstream. The processor may be configured to obtain, from the bitstream, an indication of a number of layers specified in the bitstream, and obtain the layers of the bitstream based on the indication of the number of layers.
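The idea of signaling a layer count and then reading that many layers can be sketched as follows. The container layout is invented for illustration: one byte holds the number of layers, and each layer is a 2-byte big-endian length followed by its payload. `pack_layers` and `unpack_layers` are hypothetical names, and the real bitstream syntax is not given in the abstract.

```python
import struct
from typing import List


def pack_layers(layers: List[bytes]) -> bytes:
    """Write the layer count first, then each length-prefixed layer."""
    if len(layers) > 255:
        raise ValueError("this toy layout stores the layer count in one byte")
    out = bytearray([len(layers)])
    for payload in layers:
        out += struct.pack(">H", len(payload)) + payload
    return bytes(out)


def unpack_layers(bitstream: bytes) -> List[bytes]:
    """Read the signaled layer count, then extract exactly that many layers."""
    num_layers = bitstream[0]
    offset = 1
    layers = []
    for _ in range(num_layers):
        (size,) = struct.unpack_from(">H", bitstream, offset)
        offset += 2
        layers.append(bitstream[offset:offset + size])
        offset += size
    return layers
```

Because the count comes first, a receiver that only wants the base layer can stop after the first entry without parsing the rest.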


    EDITING OF HIGHER-ORDER AMBISONIC AUDIO DATA
    186.
    Invention patent application
    Legal status: In force

    Publication number: US20160035386A1

    Publication date: 2016-02-04

    Application number: US14670074

    Filing date: 2015-03-26

    Abstract: In general, techniques are described for audio editing of higher-order ambisonic audio data. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store a higher order ambisonic (HOA) representation of the audio object. The one or more processors may be configured to add a source tail to the HOA representation of the audio object by storing one or more spherical harmonic (SH) basis functions associated with the audio object to a buffer.


    EDITING OF HIGHER-ORDER AMBISONIC AUDIO DATA
    188.
    Invention patent application
    Legal status: In force

    Publication number: US20160035356A1

    Publication date: 2016-02-04

    Application number: US14670029

    Filing date: 2015-03-26

    Abstract: In general, techniques are described for editing of higher-order ambisonic audio data. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store spherical harmonic (SH) basis functions. The one or more processors may be configured to manipulate the SH basis functions associated with higher order ambisonics coefficients to alter a direction of an audio object represented by the higher order ambisonics coefficients.
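To give a feel for changing the direction of an object in the spherical harmonic domain, the sketch below applies a yaw rotation to first-order coefficients. It is a stand-in illustration, not the patented method: the abstract speaks of manipulating SH basis functions, while this code rotates the coefficients directly. It assumes first-order coefficients ordered (W, X, Y, Z) with X pointing front and Y pointing left; `rotate_yaw` is a hypothetical name.

```python
import math
from typing import Tuple

FirstOrder = Tuple[float, float, float, float]  # assumed order: (W, X, Y, Z)


def rotate_yaw(coeffs: FirstOrder, angle_deg: float) -> FirstOrder:
    """Rotate a first-order soundfield about the vertical axis."""
    w, x, y, z = coeffs
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    # W (omnidirectional) and Z (vertical) are unchanged by a yaw rotation;
    # X and Y rotate like the components of a 2-D vector.
    x_rot = x * cos_t - y * sin_t
    y_rot = x * sin_t + y * cos_t
    return (w, x_rot, y_rot, z)
```

A source encoded straight ahead (X = 1, Y = 0) ends up on the Y axis after a 90-degree rotation. Higher orders would need larger rotation matrices, which this sketch does not cover.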


    INDICATING FRAME PARAMETER REUSABILITY FOR CODING VECTORS
    189.
    Invention patent application
    Legal status: In force

    Publication number: US20150213805A1

    Publication date: 2015-07-30

    Application number: US14609190

    Filing date: 2015-01-29

    Abstract: In general, techniques are described for indicating frame parameter reusability for decoding vectors. A device comprising a processor and a memory may perform the techniques. The processor may be configured to obtain a bitstream comprising a vector representative of an orthogonal spatial axis in a spherical harmonics domain. The bitstream may further comprise an indicator for whether to reuse, from a previous frame, at least one syntax element indicative of information used when compressing the vector. The memory may be configured to store the bitstream.
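The reuse indicator can be pictured with a small decoder-side sketch. The frame representation is an assumption: each frame is a dictionary with a `reuse_previous` flag and, when the flag is clear, a `params` entry holding the syntax elements used to decompress the vector. The function name `resolve_frame_params` and both keys are hypothetical.

```python
from typing import Any, Dict, Iterable, List, Optional


def resolve_frame_params(frames: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Return the effective parameters for each frame, honoring the reuse flag."""
    resolved: List[Dict[str, Any]] = []
    previous: Optional[Dict[str, Any]] = None
    for index, frame in enumerate(frames):
        if frame["reuse_previous"]:
            if previous is None:
                raise ValueError(f"frame {index} reuses parameters but none precede it")
            # Reuse: no parameters were transmitted for this frame.
            params = previous
        else:
            # Fresh parameters are carried in the frame itself.
            params = frame["params"]
        resolved.append(params)
        previous = params
    return resolved
```

The saving comes from frames that set the flag and omit the parameters entirely. The cost is a dependency on the earlier frame, which is why the first frame in this sketch must carry its own parameters.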


    SYSTEMS AND METHODS FOR MEASURING SPEECH SIGNAL QUALITY
    190.
    Invention patent application
    Legal status: In force

    Publication number: US20150006162A1

    Publication date: 2015-01-01

    Application number: US14314019

    Filing date: 2014-06-24

    CPC classification number: G10L15/02 G10L25/00 G10L25/69

    Abstract: A method for measuring speech signal quality by an electronic device is described. The method includes obtaining a modified single-channel speech signal. The method also includes estimating multiple objective distortions based on the modified single-channel speech signal. The multiple objective distortions include at least one foreground distortion and at least one background distortion. The method further includes estimating a foreground quality and a background quality based on the multiple objective distortions. The method additionally includes estimating an overall quality based on the foreground quality and the background quality.
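The hierarchy in this abstract (distortions, then foreground and background quality, then overall quality) can be sketched as below. Every number and formula here is an assumption chosen for illustration: distortions are taken to lie in [0, 1], each quality is one minus the mean distortion, and the overall score is a weighted average with a made-up default weight. The abstract does not give the actual estimators, and `estimate_quality` is a hypothetical name.

```python
from typing import Dict, Sequence


def estimate_quality(
    foreground_distortions: Sequence[float],
    background_distortions: Sequence[float],
    foreground_weight: float = 0.7,  # illustrative value, not from the patent
) -> Dict[str, float]:
    """Combine objective distortions into foreground, background, and overall quality."""
    if not foreground_distortions or not background_distortions:
        raise ValueError("need at least one foreground and one background distortion")
    if not 0.0 <= foreground_weight <= 1.0:
        raise ValueError("foreground_weight must lie in [0, 1]")

    # Stage 1: one quality score per category (higher is better).
    foreground = 1.0 - sum(foreground_distortions) / len(foreground_distortions)
    background = 1.0 - sum(background_distortions) / len(background_distortions)

    # Stage 2: overall quality from the two category scores.
    overall = foreground_weight * foreground + (1.0 - foreground_weight) * background
    return {"foreground": foreground, "background": background, "overall": overall}
```

Keeping the two intermediate scores separate lets a caller see whether a low overall score comes from damage to the speech itself or from background artifacts.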

