-
公开(公告)号:US20160078882A1
公开(公告)日:2016-03-17
申请号:US14952820
申请日:2015-11-25
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Lie LU , Mingqing HU
IPC: G10L25/51 , H04R29/00 , G10L19/038
CPC classification number: G10L25/51 , G10L19/038 , H04R29/00
Abstract: Embodiments for measuring content coherence and embodiments for measuring content similarity are described. Content coherence between a first audio section and a second audio section is measured. For each audio segment in the first audio section, a predetermined number of audio segments in the second audio section are determined. Content similarity between the audio segment in the first audio section and the determined audio segments is higher than that between the audio segment and all the other audio segments in the second audio section. An average of the content similarity between the audio segment in the first audio section and the determined audio segments is calculated. The content coherence is calculated as an average, the maximum or the minimum of the averages calculated for the audio segments in the first audio section. The content similarity may be calculated based on Dirichlet distribution.
Abstract translation: 描述用于测量内容相干性的实施例和用于测量内容相似性的实施例。 测量第一音频部分和第二音频部分之间的内容相干性。 对于第一音频部分中的每个音频段,确定第二音频部分中的预定数量的音频片段。 第一音频部分中的音频片段与所确定的音频片段之间的内容相似性高于音频片段和第二音频片段中的所有其他音频片段之间的内容相似度。 计算第一音频部分中的音频片段与确定的音频片段之间的内容相似度的平均值。 内容相干性被计算为平均值,对于第一音频部分中的音频段计算的平均值的最大值或最小值。 内容相似性可以基于Dirichlet分布来计算。
-
公开(公告)号:US20170133039A1
公开(公告)日:2017-05-11
申请号:US15321741
申请日:2015-06-24
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Claus BAUER , Lie LU , Mingqing HU , Jun WANG , Poppy CRUM , Rhonda WILSON , Regunathan RADHAKRISHNAN
IPC: G10L25/54
CPC classification number: G10L25/54 , G06K9/6259 , G06K9/6261 , G10L25/03
Abstract: Example embodiments disclosed herein relate to perception based multimedia processing. There is provided a method for processing multimedia data, the method includes automatically determining user perception on a segment of the multimedia data based on a plurality of clusters, the plurality of clusters obtained in association with predefined user perceptions and processing the segment of the multimedia data at least in part based on determined user perception on the segment. Corresponding system and computer program products are disclosed as well.
-
公开(公告)号:US20160056787A1
公开(公告)日:2016-02-25
申请号:US14780485
申请日:2014-03-17
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Lie LU , Jun WANG , Alan SEEFELDT , Mingqing HU
Abstract: Equalizer controller and controlling method are disclosed. In one embodiment, an equalizer controller includes an audio classifier for identifying the audio type of an audio signal in real time; and an adjusting unit for adjusting an equalizer in a continuous manner based on the confidence value of the audio type as identified.
Abstract translation: 公开了均衡器控制器和控制方法。 在一个实施例中,均衡器控制器包括用于实时地识别音频信号的音频类型的音频分类器; 以及调整单元,用于基于识别的音频类型的置信度值以连续的方式调整均衡器。
-
公开(公告)号:US20190052991A9
公开(公告)日:2019-02-14
申请号:US15538892
申请日:2016-02-09
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Jun WANG , Lie LU , Lianwu CHEN , Mingqing HU
Abstract: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.
-
公开(公告)号:US20160267914A1
公开(公告)日:2016-09-15
申请号:US15031887
申请日:2014-11-25
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Mingqing HU , Lie LU , Jun WANG
IPC: G10L19/02 , G10L19/038 , H04S3/00 , G10L19/008
CPC classification number: G10L19/02 , G10L19/008 , G10L19/038 , H04S3/008 , H04S2400/11
Abstract: Embodiments of the present invention relate to audio object extraction. A method for audio object extraction from audio content of a format based on a plurality of channels is disclosed. The method comprises applying audio object extraction on individual frames of the audio content at least partially based on frequency spectral similarities among the plurality of channels. The method further comprises performing audio object composition across the frames of the audio content, based on the audio object extraction on the individual frames, to generate a track of at least one audio object. Corresponding system and computer program product are also disclosed.
Abstract translation: 本发明的实施例涉及音频对象提取。 公开了一种基于多个频道的格式的音频内容提取音频对象的方法。 该方法包括至少部分地基于多个频道之间的频谱相似度来对音频内容的各个帧应用音频对象提取。 该方法还包括基于在各个帧上的音频对象提取来执行跨音频内容的帧的音频对象组合,以产生至少一个音频对象的轨道。 还公开了相应的系统和计算机程序产品。
-
公开(公告)号:US20190325894A1
公开(公告)日:2019-10-24
申请号:US16455178
申请日:2019-06-27
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Claus BAUER , Lie LU , Mingqing HU , Jun WANG , Poppy CRUM , Rhonda WILSON , Regunathan RADHAKRISHNAN
Abstract: Example embodiments disclosed herein relate to perception based multimedia processing. There is provided a method for processing multimedia data, the method includes automatically determining user perception on a segment of the multimedia data based on a plurality of clusters, the plurality of clusters obtained in association with predefined user perceptions and processing the segment of the multimedia data at least in part based on determined user perception on the segment. Corresponding system and computer program products are disclosed as well.
-
公开(公告)号:US20170344852A1
公开(公告)日:2017-11-30
申请号:US15538306
申请日:2015-12-18
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Mingqing HU , Lie LU , Lianwu CHEN
CPC classification number: G06K9/624 , G06F17/15 , H03H21/00 , H03H2021/0034 , H04S5/00 , H04S2400/03 , H04S2400/11
Abstract: A method is disclosed for audio object extraction from an audio content which includes identifying a first set of projection spaces including a first subset for a first channel and a second subset for a second channel of the plurality of channels. The method may further include determining a first set of correlations between the first and second channels, each of the first set of correlations corresponding to one of the first subset of projection spaces and one of the second subset of projection spaces. Still further, the method may include extracting an audio object from an audio signal of the first channel at least in part based on a first correlation among the first set of correlations and the projection space from the first subset corresponding to the first correlation, the first correlation being greater than a first predefined threshold. Corresponding system and computer program products are also disclosed.
-
公开(公告)号:US20170238117A1
公开(公告)日:2017-08-17
申请号:US15508065
申请日:2015-08-31
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Mingqing HU , Lie LU
CPC classification number: H04S7/302 , G01S5/18 , G10L19/008 , G11B27/10 , H04S7/30 , H04S7/301 , H04S2400/01 , H04S2400/11 , H04S2420/01
Abstract: Example embodiments disclosed herein relate to audio object processing. A method for processing audio content, which includes at least one audio object of a multi-channel format, is disclosed. The method includes generating metadata associated with the audio object, the metadata including at least one of an estimated trajectory of the audio object and an estimated perceptual size of the audio object, the perceptual size being a perceived area of a phantom of the audio object produced by at least two transducers. Corresponding system and computer program product are also disclosed.
-
公开(公告)号:US20180262856A1
公开(公告)日:2018-09-13
申请号:US15538892
申请日:2016-02-09
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Jun WANG , Lie LU , Lianwu CHEN , Mingqing HU
Abstract: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.
-
10.
公开(公告)号:US20180144759A1
公开(公告)日:2018-05-24
申请号:US15572067
申请日:2016-05-12
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Lie LU , Mingqing HU
IPC: G10L21/0308 , G10L25/18 , G10L19/008
CPC classification number: G10L21/0308 , G10L19/008 , G10L21/0264 , G10L21/0272 , G10L25/18
Abstract: Example embodiments disclosed herein relate to audio source separation with source direction determined based on iterative weighted component analysis. A method of separating audio sources in audio content is disclosed. The audio content includes a plurality of channels. The method includes obtaining multiple data samples from multiple time-frequency tiles of the audio content. The method also includes analyzing the data samples to generate multiple components in a plurality of iterations, wherein each of the components indicates a direction with a variance of the data samples, and wherein in each of the plurality of iterations, each of the data samples is weighted with a weight that is determined based on a selected component from the multiple components. The method further includes determining a source direction of the audio content based on the selected component for separating an audio source from the audio content. Corresponding system and computer program product of separating audio sources in audio content are also disclosed.
-
-
-
-
-
-
-
-
-