-
Publication No.: US20240163340A1
Publication date: 2024-05-16
Application No.: US18415544
Filing date: 2024-01-17
Inventor: Glenn N. Dickins, Mark R.P. Thomas, Alan J. Seefeldt, Joshua B. Lando, Daniel Arteaga, Carlos Medaglia Dyonisio, David Gunawan, Richard J. Cartwright, Christopher Graham Hines
CPC classification number: H04L67/141, H04R1/326, H04R1/403, H04R1/406, H04R3/005, H04R3/12, H04S7/303
Abstract: An audio session management method may involve: determining, by an audio session manager, one or more first media engine capabilities of a first media engine of a first smart audio device, the first media engine being configured for managing one or more audio media streams received by the first smart audio device and for performing first smart audio device signal processing for the one or more audio media streams according to a first media engine sample clock; receiving, by the audio session manager and via a first application communication link, first application control signals from the first application; and controlling the first smart audio device according to the first media engine capabilities, by the audio session manager, via first audio session management control signals transmitted to the first smart audio device via a first smart audio device communication link and without reference to the first media engine sample clock.
-
Publication No.: US20230217173A1
Publication date: 2023-07-06
Application No.: US18179698
Filing date: 2023-03-07
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Carlos Eduardo Medaglia Dyonisio, David Gunawan
CPC classification number: H04R3/12, G10L15/22, G10L15/063, G10L15/16, H04R3/005, H04R1/403, H04R1/406, G10L2015/223, G10L2015/225
Abstract: Methods and systems for performing at least one audio activity (e.g., conducting a phone call or playing music or other audio content) in an environment including by determining an estimated location of a user in the environment in response to sound uttered by the user (e.g., a voice command), and controlling the audio activity in response to determining the estimated user location. The environment may have zones which are indicated by a zone map and estimation of the user location may include estimating in which of the zones the user is located. The audio activity may be performed using microphones and loudspeakers which are implemented in or coupled to smart audio devices.
-
Publication No.: US20230208921A1
Publication date: 2023-06-29
Application No.: US17630779
Filing date: 2020-07-28
Inventor: Glenn N. Dickins, Mark Richard Paul Thomas, Alan J. Seefeldt, Joshua B. Lando, Daniel Arteaga, Carlos Medaglia Dyonisio, David Gunawan, Richard J. Cartwright, Christopher Graham Hines
CPC classification number: H04L67/141, H04S7/303, H04R1/326, H04R1/403, H04R1/406, H04R3/005, H04R3/12
Abstract: An audio session management method may involve: determining, by an audio session manager, one or more first media engine capabilities of a first media engine of a first smart audio device, the first media engine being configured for managing one or more audio media streams received by the first smart audio device and for performing first smart audio device signal processing for the one or more audio media streams according to a first media engine sample clock; receiving, by the audio session manager and via a first application communication link, first application control signals from the first application; and controlling the first smart audio device according to the first media engine capabilities, by the audio session manager, via first audio session management control signals transmitted to the first smart audio device via a first smart audio device communication link and without reference to the first media engine sample clock.
-
Publication No.: US11538486B2
Publication date: 2022-12-27
Application No.: US17075659
Filing date: 2020-10-20
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Dong Shi, Kai Li, Hannes Muesch, David Gunawan, Paul Holmberg, Glenn N. Dickins
IPC: G10L21/0232, H04R3/02, G10L21/0264, H04R3/04, G10L21/0208, H04R27/00
Abstract: Methods for echo estimation or echo management (echo suppression or cancellation) on an input audio signal, with at least one of adaptation of a sparse prediction filter set, modification (for example, truncation) of adapted prediction filter impulse responses, generation of a composite impulse response from adapted prediction filter impulse responses, or use of echo estimation and/or echo management resources in a manner determined at least in part by classification of the input audio signal as being (or not being) echo free. Other aspects are systems configured to perform any embodiment of any of the methods.
-
Publication No.: US11277518B2
Publication date: 2022-03-15
Application No.: US16640169
Filing date: 2018-09-27
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Kai Li, David Gunawan, Feng Deng, Qianqian Fang
Abstract: The disclosed teleconferencing methods involve detecting a howl state during a teleconference which involves two or more teleconference client locations and a teleconference server. The teleconference server is configured for providing full-duplex audio connectivity between the teleconference client locations. The howl state is a state of acoustic feedback involving two or more teleconference devices in a teleconference client location. Detecting the howl state involves an analysis of both spectral and temporal characteristics of teleconference audio data. The disclosed teleconferencing methods involve determining which client location is causing the howl state and involve mitigating the howl state or sending a howl state detection message.
-
Publication No.: US11122239B2
Publication date: 2021-09-14
Application No.: US16786799
Filing date: 2020-02-10
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Glenn N. Dickins, Ludovic Christophe Malfait, David Gunawan
Abstract: Systems and methods are described for detecting and remedying potential incongruence in a video conference. A camera of a video conferencing system may capture video images of a conference room. A processor of the video conferencing system may identify locations of a plurality of participants within an image plane of a video image. Using face and shape detection, a location of a center point of each identified participant's torso may be calculated. A region of congruence bounded by key parallax lines may be calculated, the key parallax lines being a subset of all parallax lines running through the center points of each identified participant. When the audio device location is not within the region of congruence, audio captured by an audio device may be adjusted to reduce effects of incongruence when the captured audio is replayed at a far end of the video conference.
-
Publication No.: US10923132B2
Publication date: 2021-02-16
Application No.: US16073663
Filing date: 2017-02-15
Applicant: Dolby Laboratories Licensing Corporation
Inventor: David Gunawan, Chunjian Li
IPC: G10L19/00, G10L19/008, G10L21/034, H03G3/30, H03G3/34, H03G5/00, H04R5/04, H04S7/00, H03G5/16, H03G3/32
Abstract: A sound processing system operative to measure the level of diffusivity of the sounds present in the input sound signal. The system includes a plurality of input channels for receiving audio signals from an audio scene, the audio scene comprising at least one target sound in the presence of background noise. A diffusivity measurement unit is included so as to be operably coupled to the plurality of input channels to receive the audio signals therefrom and measure a level of diffusivity of the sounds present therein. A leveler unit is operably coupled to the plurality of input channels for receiving the audio signals therefrom and for applying a gain to the audio signals to minimize variations in the audio signal levels. A controller is operably coupled to the diffusivity measurement unit and the leveler unit to control the gain applied to the audio signals by the leveler unit based on the level of diffusivity of the sounds present therein.
-
Publication No.: US10811027B2
Publication date: 2020-10-20
Application No.: US16308761
Filing date: 2017-06-07
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Dong Shi, Kai Li, Hannes Muesch, David Gunawan, Paul Holmberg, Glenn N. Dickins
IPC: H04R27/00, H04R3/02, G10L21/0208, G10L21/0232, G10L21/0264, H04R3/04
Abstract: Methods for echo estimation or echo management (echo suppression or cancellation) on an input audio signal, with at least one of adaptation of a sparse prediction filter set, modification (for example, truncation) of adapted prediction filter impulse responses, generation of a composite impulse response from adapted prediction filter impulse responses, or use of echo estimation and/or echo management resources in a manner determined at least in part by classification of the input audio signal as being (or not being) echo free. Other aspects are systems configured to perform any embodiment of any of the methods.
-
Publication No.: US10771631B2
Publication date: 2020-09-08
Application No.: US15667510
Filing date: 2017-08-02
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: David Gunawan, Glenn N. Dickins
IPC: H04M3/56, G10L21/0208, H04L29/06, G10L21/02, G10L25/78
Abstract: Systems and methods are described for modifying one of far-end signal playback and capture of local audio on an audio device. Frames of both a far-end audio stream and a near-end audio stream may be analyzed using a measure of voice activity, the analyzing producing voice data associated with each frame. Based on the voice data, a conference state may be determined, and one of playback of the far-end audio stream and capture of local audio on an audio device may be modified based on the determined conference state. By associating the likely intent with a predefined state, the device may further cull or remove unwanted or unlikely content from the device input and output. This may have a substantial advantage in allowing for full duplex operation in the case of more meaningful and continuing voice activity, particularly in the case where there are many connected endpoints.
-
Publication No.: US10410653B2
Publication date: 2019-09-10
Application No.: US15558181
Filing date: 2016-03-21
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Dong Shi, Glenn N. Dickins, David Gunawan, Xuejing Sun
IPC: G10L19/00, G10L21/0232, G10L21/0208, G10L21/028, G10L25/21, H04M9/08
Abstract: In an audio processing system (300), a filtering section (350, 400): receives subband signals (410, 420, 430) corresponding to audio content of a reference signal (301) in respective frequency subbands; receives subband signals (411, 421, 431) corresponding to audio content of a response signal (304) in the respective subbands; and forms filtered inband references (412, 422, 432) by applying respective filters (413, 423, 433) to the subband signals of the reference signal. For a frequency subband: filtered crossband references (424, 425) are formed by multiplying, by scalar factors (426, 427), filtered inband references of other subbands; a composite filtered reference (428) is formed by summing the filtered inband reference of the subband (422) and the filtered crossband references; a residual signal (429) is computed as a difference between the composite filtered reference and the subband signal of the response signal corresponding to the subband; and the scalar factors and the filter applied to the subband signal of the reference signal corresponding to the subband are adjusted based on the residual signal.
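Illustrative note (not part of the patent record): the signal flow in the abstract above can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the function names `fir` and `composite_residual`, the use of real-valued samples (subband signals in practice are typically complex), and the dictionary layout for the scalar factors are all hypothetical. The adaptation step that adjusts the filters and scalar factors from the residual is not shown.

```python
def fir(x, h):
    """Causal FIR filter: y[n] = sum over k of h[k] * x[n - k]."""
    return [
        sum(h[k] * x[n - k] for k in range(len(h)) if n - k >= 0)
        for n in range(len(x))
    ]


def composite_residual(ref_subbands, response_subband, filters, scalars, band):
    """Form the composite filtered reference and residual for one subband.

    ref_subbands     : list of reference subband signals (one list per subband)
    response_subband : response signal samples for the subband `band`
    filters          : one FIR tap list per subband (hypothetical values)
    scalars          : {other_band_index: scalar factor} for crossband terms
    band             : index of the subband being processed
    """
    # Filtered inband references: each subband's own filter applied to its reference.
    inband = [fir(x, h) for x, h in zip(ref_subbands, filters)]

    # Start from the inband reference of this subband, then add the crossband
    # references (scaled inband references of the other subbands).
    composite = list(inband[band])
    for other, factor in scalars.items():
        for n in range(len(composite)):
            composite[n] += factor * inband[other][n]

    # Residual: difference between the composite reference and the response.
    residual = [c - d for c, d in zip(composite, response_subband)]
    return composite, residual
```

A call such as `composite_residual(refs, resp, filters, {0: 0.1, 2: -0.2}, band=1)` would treat subbands 0 and 2 as the crossband contributors for subband 1; which neighbouring subbands contribute is a design choice this sketch leaves to the caller.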