SPATIAL ERROR METRICS OF AUDIO CONTENT
    1.
    发明申请
    SPATIAL ERROR METRICS OF AUDIO CONTENT 审中-公开
    音频内容的空间误差度量

    公开(公告)号:US20160337776A1

    公开(公告)日:2016-11-17

    申请号:US15110371

    申请日:2015-01-05

    Abstract: Audio objects that are present in input audio content in one or more frames are determined. Output clusters that are present in output audio content in the one or more frames are also determined. Here, the audio objects in the input audio content are converted to the output clusters in the output audio content. One or more spatial error metrics are computed based at least in part on positional metadata of the audio objects and positional metadata of the output clusters.

    Abstract translation: 确定存在于一个或多个帧中的输入音频内容中的音频对象。 还确定存在于一个或多个帧中的输出音频内容中的输出簇。 这里,输入音频内容中的音频对象被转换为输出音频内容中的输出群集。 至少部分地基于音频对象的位置元数据和输出簇的位置元数据来计算一个或多个空间误差度量。

    UPMIXING OF AUDIO SIGNALS
    4.
    发明申请

    公开(公告)号:US20180262856A1

    公开(公告)日:2018-09-13

    申请号:US15538892

    申请日:2016-02-09

    Abstract: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.

    METADATA-PRESERVED AUDIO OBJECT CLUSTERING

    公开(公告)号:US20220272474A1

    公开(公告)日:2022-08-25

    申请号:US17737184

    申请日:2022-05-05

    Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least a category based rendering mode information metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. Corresponding system and computer program product are also disclosed.

    Projection-Based Audio Object Extraction from Audio Content

    公开(公告)号:US20170344852A1

    公开(公告)日:2017-11-30

    申请号:US15538306

    申请日:2015-12-18

    Abstract: A method is disclosed for audio object extraction from an audio content which includes identifying a first set of projection spaces including a first subset for a first channel and a second subset for a second channel of the plurality of channels. The method may further include determining a first set of correlations between the first and second channels, each of the first set of correlations corresponding to one of the first subset of projection spaces and one of the second subset of projection spaces. Still further, the method may include extracting an audio object from an audio signal of the first channel at least in part based on a first correlation among the first set of correlations and the projection space from the first subset corresponding to the first correlation, the first correlation being greater than a first predefined threshold. Corresponding system and computer program products are also disclosed.

    AUDIO OBJECT CLUSTERING BY UTILIZING TEMPORAL VARIATIONS OF AUDIO OBJECTS
    9.
    发明申请
    AUDIO OBJECT CLUSTERING BY UTILIZING TEMPORAL VARIATIONS OF AUDIO OBJECTS 有权
    使用音频对象的时间变化的音频对象聚类

    公开(公告)号:US20160358618A1

    公开(公告)日:2016-12-08

    申请号:US15117647

    申请日:2015-02-23

    Abstract: Embodiments of the present invention relate to audio object clustering by utilizing temporal variation of audio objects. There is provided a method of estimating temporal variation of an audio object for use in audio object clustering. The method comprises obtaining at least one segment of an audio track associated with the audio object, the at least one segment containing the audio object; estimating variation of the audio object over a time duration of the at least one segment based on at least one property of the audio object and adjusting, at least partially based on the estimated variation of the audio object, a contribution of the audio object to the determination of a centroid in the audio object clustering. Corresponding system and computer program product are disclosed.

    Abstract translation: 本发明的实施例涉及通过利用音频对象的时间变化的音频对象聚类。 提供了一种估计用于音频对象聚类的音频对象的时间变化的方法。 所述方法包括获得与所述音频对象相关联的音轨的至少一个段,所述至少一个段包含所述音频对象; 基于所述音频对象的至少一个属性来估计所述音频对象在所述至少一个段的持续时间上的变化,并且至少部分地基于所估计的所述音频对象的变化来调整所述音频对象对所述音频对象的贡献 确定音频对象聚类中的质心。 披露了相应的系统和计算机程序产品。

Patent Agency Ranking