-
公开(公告)号:US11950081B2
公开(公告)日:2024-04-02
申请号:US17669574
申请日:2022-02-11
Applicant: Nuance Communications, Inc.
Inventor: Dushyant Sharma , Patrick A. Naylor , Uwe Helmut Jost
IPC: H04S7/00 , G06T7/70 , G10L15/06 , G10L15/22 , G10L19/00 , G10L19/008 , G10L19/16 , G10L21/0208 , G10L21/0216 , H04R1/40 , H04R3/00 , H04R5/027 , H04S3/00
CPC classification number: H04S7/30 , G06T7/70 , G10L15/063 , G10L15/22 , G10L19/008 , G10L19/167 , G10L21/0208 , H04R1/406 , H04R3/005 , H04R5/027 , H04S3/008 , G10L2019/0001 , G10L2019/0002 , G10L2021/02166 , H04R2201/401 , H04S2400/01 , H04S2400/15
Abstract: A method, computer program product, and computing system for generating a plurality of acoustic relative transfer functions for a plurality of audio acquisition devices of an audio recording system deployed in an acoustic environment. The plurality of acoustic relative transfer functions may be encoded into a first embedding of acoustic relative transfer functions and at least a second embedding of acoustic relative transfer functions. Information may be extracted from at least the first embedding of acoustic relative transfer functions.
-
2.
公开(公告)号:US11769486B2
公开(公告)日:2023-09-26
申请号:US17178734
申请日:2021-02-18
Applicant: Nuance Communications, Inc.
Inventor: Patrick A. Naylor , Dushyant Sharma , Uwe Helmut Jost , William F. Ganong, III
CPC classification number: G10L15/063 , G06N20/00 , G10L25/03 , H04R1/406
Abstract: A method, computer program product, and computing system for defining model representative of a plurality of acoustic variations to a speech signal, thus defining a plurality of time-varying spectral modifications. The plurality of time-varying spectral modifications may be applied to a plurality of feature coefficients of a target domain of a reference signal, thus generating a plurality of time-varying spectrally-augmented feature coefficients of the reference signal.
-
公开(公告)号:US20230230580A1
公开(公告)日:2023-07-20
申请号:US17579766
申请日:2022-01-20
Applicant: Nuance Communications, Inc.
Inventor: Dushyant Sharma , Ljubomir Milanovic , Philipp Salletmayr , Rong Gong , Patrick A. Naylor
IPC: G10L15/08
CPC classification number: G10L15/08
Abstract: A method, computer program product, and computing system for obtaining one or more speech signals from a first device, thus defining one or more first device speech signals. One or more speech signals may be obtained from a second device, thus defining one or more second device speech signals. A noise component model may be selected from a plurality of noise component models based upon, at least in part, the one or more first device speech signals and the one or more second device speech signals. The one or more second device speech signals may be augmented, at run-time, based upon, at least in part, the noise component model.
-
公开(公告)号:US11699440B2
公开(公告)日:2023-07-11
申请号:US17314527
申请日:2021-05-07
Applicant: Nuance Communications, Inc.
Inventor: Dushyant Sharma , Patrick A. Naylor , Rong Gong , Stanislav Kruchinin , Ljubomir Milanovic
IPC: H04R3/00 , H04R3/04 , G10L15/22 , H04R1/40 , G10L25/84 , G10L15/32 , G10L15/20 , G06F16/65 , G06F16/68 , G10L17/06 , G10L25/78 , H04R5/04 , H04S7/00 , H04R29/00 , G16H15/00 , G06N20/00 , G10L21/028 , G10L15/26 , G16H10/60 , G16H40/20 , G10L21/0216 , G10L21/0272
CPC classification number: G10L15/22 , G06F16/65 , G06F16/686 , G06N20/00 , G10L15/20 , G10L15/32 , G10L17/06 , G10L21/028 , G10L25/78 , G10L25/84 , G16H15/00 , H04R1/406 , H04R3/005 , H04R3/04 , H04R5/04 , H04R29/005 , H04S7/307 , G10L15/26 , G10L21/0216 , G10L21/0272 , G10L2021/02166 , G16H10/60 , G16H40/20
Abstract: A method, computer program product, and computing system for receiving a signal from each microphone of a plurality of microphones, thus defining a plurality of signals. One or more inter-microphone gain-based augmentations may be performed on the plurality of signals, thus defining one or more inter-microphone gain-augmented signals.
-
公开(公告)号:US11631411B2
公开(公告)日:2023-04-18
申请号:US17315890
申请日:2021-05-10
Applicant: Nuance Communications, Inc.
Inventor: Dushyant Sharma , Patrick A. Naylor
IPC: G10L15/22 , H04R1/40 , H04R3/00 , G10L25/84 , G10L15/32 , G10L15/20 , G06F16/65 , G10L17/06 , G10L25/78 , H04R5/04 , H04S7/00 , H04R29/00 , G10L21/028 , G06F16/68 , H04R3/04 , G16H15/00 , G06N20/00 , G16H10/60 , G16H40/20 , G10L15/26 , G10L21/0216 , G10L21/0272
Abstract: A method, computer program product, and computing system for receiving information associated with an acoustic environment. Acoustic metadata associated with audio encounter information received by a first microphone system may be received. One or more speaker representations may be defined based upon, at least in part, the acoustic metadata associated with the audio encounter information and the information associated with the acoustic environment. One or more portions of the audio encounter information may be labeled with the one or more speaker representations and a speaker location within the acoustic environment.
-
公开(公告)号:US20220254358A1
公开(公告)日:2022-08-11
申请号:US17669567
申请日:2022-02-11
Applicant: Nuance Communications, Inc.
Inventor: Dushyant Sharma , Patrick A. Naylor , Uwe Helmut Jost
IPC: G10L19/008 , G10L19/16 , G10L21/0208 , G10L15/22
Abstract: A method, computer program product, and computing system for generating a plurality of acoustic relative transfer functions between a plurality of audio acquisition devices of an audio recording system based upon, at least in part, one or more of a predefined speech processing application and a predefined acoustic environment. An acoustic relative transfer function codebook may be generated using the plurality of acoustic relative transfer functions. One or more channels from the plurality of audio acquisition devices of the audio recording system may be encoded using the acoustic relative transfer function codebook.
-
公开(公告)号:US20210350815A1
公开(公告)日:2021-11-11
申请号:US17315857
申请日:2021-05-10
Applicant: Nuance Communications, Inc.
Inventor: Dushyant Sharma , Patrick A. Naylor
IPC: G10L21/0216 , G10L15/26 , H04R3/00 , G10L21/028 , G06N20/00 , G16H15/00
Abstract: A method, computer program product, and computing system for receiving audio encounter information from a microphone array. Speech activity within one or more portions of the audio encounter information may be identified based upon, at least in part, a correlation among the audio encounter information received from the microphone array. Location information for the one or more portions of the audio encounter information may be determined based upon, at least in part, the correlation among the signals received by each microphone of the microphone array. The one or more portions of the audio encounter information may be labeled with the speech activity and the location information.
-
公开(公告)号:US20210287660A1
公开(公告)日:2021-09-16
申请号:US17197587
申请日:2021-03-10
Applicant: Nuance Communications, Inc.
Inventor: Dushyant Sharma , Patrick A. Naylor , James W. Fosburgh
Abstract: A method, computer program product, and computing system for receiving feature-based voice data associated with a first acoustic domain. One or more gain-based augmentations may be performed on at least a portion of the feature-based voice data, thus defining gain-augmented feature-based voice data.
-
公开(公告)号:US10978187B2
公开(公告)日:2021-04-13
申请号:US16058856
申请日:2018-08-08
Applicant: Nuance Communications, Inc.
IPC: G16H10/60 , G06K9/00 , H04R3/12 , G16H40/20 , H04L12/58 , G06F40/30 , G06F40/40 , G06F40/174 , G10L17/00 , G16H50/30 , G16H80/00 , A61B5/00 , G16H30/00 , G16H40/63 , H04N7/18 , H04R3/00 , G16H10/20 , G06T7/00 , H04R1/32 , G16H15/00 , G16H40/60 , G06F21/62 , G06F16/904 , G06F3/16 , G16H30/20 , G06K9/62 , G16H30/40 , G10L21/0232 , G16H50/20 , G06N3/00 , G16B50/00 , G06F16/635 , G06K19/077 , G10L15/08 , G11B27/10 , H04R3/02 , G10L15/26 , H04R1/40 , G10L15/22 , G10L21/0208 , G10L15/18
Abstract: A mixed-media ACD device is configured to monitor one or more encounter participants of a patient encounter and includes a machine vision system configured to obtain machine vision encounter information concerning the patient encounter. An audio recording system is configured to obtain audio encounter information concerning the patient encounter, wherein the audio recording system includes a plurality of discrete audio acquisition devices.
-
公开(公告)号:US20190272905A1
公开(公告)日:2019-09-05
申请号:US16271616
申请日:2019-02-08
Applicant: Nuance Communications, Inc.
Inventor: Daniel Paulino Almendro Barreda , Dushyant Sharma , Joel Praveen Pinto , Uwe Helmut Jost , Patrick A. Naylor
Abstract: A method, computer program product, and computing system for obtaining encounter information of a patient encounter, wherein the encounter information includes machine vision encounter information; processing the machine vision encounter information to identify one or more humanoid shapes; and steering one or more audio recording beams toward the one or more humanoid shapes to capture audio encounter information.
-
-
-
-
-
-
-
-
-