Patent search ap:("Dolby Laboratories Licensing Corporation") AND inv:"Mingqing Hu" Page 1

1.

发明授权
Perception based multimedia processing 有权

公开(公告)号：US10748555B2

公开(公告)日：2020-08-18

申请号：US16455178

申请日：2019-06-27

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Claus Bauer , Lie Lu , Mingqing Hu , Jun Wang , Poppy Crum , Rhonda Wilson , Regunathan Radhakrishnan

IPC: G10L25/54 , G06K9/62 , G10L25/03

Abstract: Example embodiments disclosed herein relate to perception based multimedia processing. There is provided a method for processing multimedia data, the method includes automatically determining user perception on a segment of the multimedia data based on a plurality of clusters, the plurality of clusters obtained in association with predefined user perceptions and processing the segment of the multimedia data at least in part based on determined user perception on the segment. Corresponding system and computer program products are disclosed as well.

2.

发明授权
Equalizer controller and controlling method 有权

公开(公告)号：US10044337B2

公开(公告)日：2018-08-07

申请号：US15433486

申请日：2017-02-15

Applicant: Dolby Laboratories Licensing Corporation

Inventor： Lie Lu , Jun Wang , Alan J. Seefeldt , Mingqing Hu

IPC: H03G5/00 , H03G5/16 , H04R3/04

Abstract: Equalizer controller and controlling method are disclosed. In one embodiment, an equalizer controller includes an audio classifier for identifying the audio type of an audio signal in real time; and an adjusting unit for adjusting an equalizer in a continuous manner based on the confidence value of the audio type as identified.

3.

发明申请
Identifying Multimedia Objects Based on Multimedia Fingerprint 有权
Title translation: 基于多媒体指纹识别多媒体对象

公开(公告)号：US20130279740A1

公开(公告)日：2013-10-24

申请号：US13854276

申请日：2013-04-01

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Claus Bauer , Lie Lu , Mingqing Hu

IPC: G06T1/00

CPC classification number: G06T1/0021 , G06F17/30023 , G06F17/30247 , G06F17/30345 , G06F17/30598 , G06F17/30743 , G06F17/30784 , G06K9/00744 , G06K9/6292

Abstract: Embodiments of identifying multimedia objects based on multimedia fingerprints are provided. Query fingerprints are derived from a multimedia object according to differing fingerprint algorithms. For each fingerprint algorithm, decisions are calculated through at least one classifier corresponding to the fingerprint algorithm based on the query fingerprint and reference fingerprints, the reference fingerprints being derived from reference multimedia objects according to the same fingerprint algorithm. Each of the decisions indicates a possibility that the query fingerprint and the reference fingerprint are not derived from the same multimedia content. For each of the reference multimedia objects, a distance is calculated as a weighted sum of the decisions relating to the reference fingerprints. The multimedia object is identified as matching the reference multimedia object with the smallest distance less than a threshold.

Abstract translation: 提供了基于多媒体指纹识别多媒体对象的实施例。根据不同的指纹算法，从多媒体对象中导出查询指纹。对于每个指纹算法，通过基于查询指纹和参考指纹的至少一个与指纹算法相对应的分类器来计算决策，参考指纹根据相同的指纹算法从参考多媒体对象中导出。每个决定都指示查询指纹和参考指纹不是从相同的多媒体内容导出的。对于每个参考多媒体对象，将距离计算为与参考指纹相关的决定的加权和。多媒体对象被识别为具有小于阈值的最小距离的参考多媒体对象的匹配。

4.

发明授权
Audio source separation with source direction determination based on iterative weighting 有权

公开(公告)号：US10930299B2

公开(公告)日：2021-02-23

申请号：US15572067

申请日：2016-05-12

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Lie Lu , Mingqing Hu

IPC: G10L21/0308 , G10L25/18 , G10L21/0272 , G10L19/008 , G10L21/0264

Abstract: Example embodiments disclosed herein relate to audio source separation with source direction determined based on iterative weighted component analysis. A method of separating audio sources in audio content is disclosed. The audio content includes a plurality of channels. The method includes obtaining multiple data samples from multiple time-frequency tiles of the audio content. The method also includes analyzing the data samples to generate multiple components in a plurality of iterations, wherein each of the components indicates a direction with a variance of the data samples, and wherein in each of the plurality of iterations, each of the data samples is weighted with a weight that is determined based on a selected component from the multiple components. The method further includes determining a source direction of the audio content based on the selected component for separating an audio source from the audio content. Corresponding system and computer program product of separating audio sources in audio content are also disclosed.

5.

发明授权
Audio object extraction 有权

公开(公告)号：US09786288B2

公开(公告)日：2017-10-10

申请号：US15031887

申请日：2014-11-25

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Mingqing Hu , Lie Lu , Jun Wang

IPC: H04R3/04 , G10L19/12 , G10L19/02 , G10L19/008 , G10L19/038 , H04S3/00

CPC classification number: G10L19/02 , G10L19/008 , G10L19/038 , H04S3/008 , H04S2400/11

Abstract: Embodiments of the present invention relate to audio object extraction. A method for audio object extraction from audio content of a format based on a plurality of channels is disclosed. The method comprises applying audio object extraction on individual frames of the audio content at least partially based on frequency spectral similarities among the plurality of channels. The method further comprises performing audio object composition across the frames of the audio content, based on the audio object extraction on the individual frames, to generate a track of at least one audio object. Corresponding system and computer program product are also disclosed.

6.

发明授权
Generating metadata for audio object 有权

公开(公告)号：US10362427B2

公开(公告)日：2019-07-23

申请号：US15508065

申请日：2015-08-31

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Mingqing Hu , Lie Lu

IPC: G06F17/00 , H04S7/00 , G01S5/18 , G10L19/008 , G11B27/10

Abstract: Example embodiments disclosed herein relate to audio object processing. A method for processing audio content, which includes at least one audio object of a multi-channel format, is disclosed. The method includes generating metadata associated with the audio object, the metadata including at least one of an estimated trajectory of the audio object and an estimated perceptual size of the audio object, the perceptual size being a perceived area of a phantom of the audio object produced by at least two transducers. Corresponding system and computer program product are also disclosed.

7.

发明授权
Perception based multimedia processing 有权

公开(公告)号：US10339959B2

公开(公告)日：2019-07-02

申请号：US15321741

申请日：2015-06-24

Applicant: Dolby Laboratories Licensing Corporation

Inventor： Claus Bauer , Lie LU , Mingqing Hu , Jun Wang , Poppy Crum , Rhonda Wilson , Regunathan Radhakrishnan

IPC: G06K9/62 , G10L25/54 , G10L25/03

Abstract: Example embodiments disclosed herein relate to perception based multimedia processing. There is provided a method for processing multimedia data, the method includes automatically determining user perception on a segment of the multimedia data based on a plurality of clusters, the plurality of clusters obtained in association with predefined user perceptions and processing the segment of the multimedia data at least in part based on determined user perception on the segment. Corresponding system and computer program products are disclosed as well.

8.

发明申请
IDENTIFYING MULTIMEDIA OBJECTS BASED ON MULTIMEDIA FINGERPRINT 审中-公开
Title translation: 基于多媒体指纹识别多媒体对象

公开(公告)号：US20160019671A1

公开(公告)日：2016-01-21

申请号：US14869554

申请日：2015-09-29

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Claus Bauer , Lie Lu , Mingqing Hu

IPC: G06T1/00 , G06F17/30 , G06K9/62

CPC classification number: G06T1/0021 , G06F16/23 , G06F16/285 , G06F16/43 , G06F16/583 , G06F16/683 , G06F16/783 , G06K9/00744 , G06K9/6292

Abstract: Embodiments of identifying multimedia objects based on multimedia fingerprints are provided. Query fingerprints are derived from a multimedia object according to differing fingerprint algorithms. For each fingerprint algorithm, decisions are calculated through at least one classifier corresponding to the fingerprint algorithm based on the query fingerprint and reference fingerprints, the reference fingerprints being derived from reference multimedia objects according to the same fingerprint algorithm. Each of the decisions indicates a possibility that the query fingerprint and the reference fingerprint are not derived from the same multimedia content. For each of the reference multimedia objects, a distance is calculated as a weighted sum of the decisions relating to the reference fingerprints. The multimedia object is identified as matching the reference multimedia object with the smallest distance less than a threshold.

Abstract translation: 提供了基于多媒体指纹识别多媒体对象的实施例。根据不同的指纹算法，从多媒体对象中导出查询指纹。对于每个指纹算法，通过基于查询指纹和参考指纹的至少一个与指纹算法相对应的分类器来计算决策，参考指纹根据相同的指纹算法从参考多媒体对象中导出。每个决定都指示查询指纹和参考指纹不是从相同的多媒体内容导出的。对于每个参考多媒体对象，将距离计算为与参考指纹相关的决定的加权和。多媒体对象被识别为具有小于阈值的最小距离的参考多媒体对象的匹配。

9.

发明授权
Identifying multimedia objects based on multimedia fingerprint 有权
Title translation: 基于多媒体指纹识别多媒体对象

公开(公告)号：US09202255B2

公开(公告)日：2015-12-01

申请号：US13854276

申请日：2013-04-01

Applicant: Dolby Laboratories Licensing Corporation

Inventor： Claus Bauer , Lie Lu , Mingqing Hu

IPC: G06T1/00 , G06K9/00 , G06K9/62 , G06F17/30

CPC classification number: G06T1/0021 , G06F17/30023 , G06F17/30247 , G06F17/30345 , G06F17/30598 , G06F17/30743 , G06F17/30784 , G06K9/00744 , G06K9/6292

Abstract: Embodiments of identifying multimedia objects based on multimedia fingerprints are provided. Query fingerprints are derived from a multimedia object according to differing fingerprint algorithms. For each fingerprint algorithm, decisions are calculated through at least one classifier corresponding to the fingerprint algorithm based on the query fingerprint and reference fingerprints, the reference fingerprints being derived from reference multimedia objects according to the same fingerprint algorithm. Each of the decisions indicates a possibility that the query fingerprint and the reference fingerprint are not derived from the same multimedia content. For each of the reference multimedia objects, a distance is calculated as a weighted sum of the decisions relating to the reference fingerprints. The multimedia object is identified as matching the reference multimedia object with the smallest distance less than a threshold.

Abstract translation: 提供了基于多媒体指纹识别多媒体对象的实施例。根据不同的指纹算法，从多媒体对象中导出查询指纹。对于每个指纹算法，通过基于查询指纹和参考指纹的至少一个与指纹算法相对应的分类器来计算决策，参考指纹根据相同的指纹算法从参考多媒体对象中导出。每个决定都指示查询指纹和参考指纹不是从相同的多媒体内容导出的。对于每个参考多媒体对象，将距离计算为与参考指纹相关的决定的加权和。多媒体对象被识别为具有小于阈值的最小距离的参考多媒体对象的匹配。

10.

发明授权
Upmixing of audio signals 有权

公开(公告)号：US10362426B2

公开(公告)日：2019-07-23

申请号：US15538892

申请日：2016-02-09

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Jun Wang , Lie Lu , Lianwu Chen , Mingqing Hu

IPC: H04S5/00 , H04R1/32 , H04S7/00

Abstract: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification