-
1.
公开(公告)号:US20250095662A1
公开(公告)日:2025-03-20
申请号:US18883681
申请日:2024-09-12
Applicant: Pindrop Security, Inc.
Inventor: David Looney , Nikolay Gaubitch
IPC: G10L19/018 , G10L25/30
Abstract: Embodiments disclosed herein include software processes executed by a computer for encoding and decoding watermarks for a speech signal in a call signal communicated via telephony channels. An encoder uses Linear Predictive Coding (LPC) to analyzes the call signal's spectral envelope and embeds the watermark into the LPC log-spectrum of the speech signal of the call signal. The encoder may reduce the watermark's strength at a formant peak of the speech signal, balancing the watermark's robustness and detectability. A deep decoder includes a neural network architecture trained on watermarked and watermark-free speech signals having various types of degradation to extract a feature vector of a call signal and compute a watermark detection score for one or more frames or for the call signal. At inference time, the deep decoder detects the watermark when the watermark detection score satisfies a detection threshold.
-
公开(公告)号:US20240311474A1
公开(公告)日:2024-09-19
申请号:US18598595
申请日:2024-03-07
Applicant: PINDROP SECURITY, INC.
Inventor: Nikolay Gaubitch , David Looney
CPC classification number: G06F21/554 , G06N20/00 , G10L25/18 , G10L25/51 , G10L25/69 , G06F2221/034
Abstract: Embodiments include a computing device that executes software routines and/or one or more machine-learning architectures including obtaining training audio signals having corresponding training impulse responses associated with reverberation degradation, training a machine-learning model of a presentation attack detection engine to generate one or more acoustic parameters by executing the presentation attack detection engine using the training impulse responses of the training audio signals and a loss function, obtaining an audio signal having an acoustic impulse response associated with reverberation degradation caused by one or more rooms, generating the one or more acoustic parameters for the audio signal by executing the machine-learning model using the audio signal as input, and generating an attack score for the audio signal based upon the one or more parameters generated by the machine-learning model.
-
公开(公告)号:US12087319B1
公开(公告)日:2024-09-10
申请号:US17079082
申请日:2020-10-23
Applicant: PINDROP SECURITY, INC.
Inventor: David Looney , Nikolay Gaubitch
Abstract: Embodiments described herein provide for end-to-end joint determination of degradation parameter scores for certain types of degradation. Degradation parameters include degradation describing additive noise and multiplicative noise such as Signal-to-Noise Ratio (SNR), reverberation time (T60), and Direct-to-Reverberant Ratio (DRR). Various neural network architectures are described such that the inherent interplay between the degradation parameters is considered in both the degradation parameter score and degradation score determination. The neural network architectures are trained according to computer generated audio datasets.
-
-