Abstract:
In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.
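The decoding step described above can be illustrated with a minimal Viterbi sketch. It assumes each utterance has already been scored against each speaker model, giving a table of log-likelihoods. The function name `viterbi_decode`, the uniform initial distribution, and the single `stay_prob` self-transition parameter are assumptions made for illustration; the abstract does not specify how the hidden Markov model is parameterized.

```python
import math


def viterbi_decode(log_likes, stay_prob=0.9):
    """Return the most likely speaker index for each utterance.

    log_likes[t][s] is the log-likelihood of utterance t under speaker
    model s. stay_prob is a hypothetical self-transition probability;
    the remaining mass is split evenly over the other speakers.
    """
    n_speakers = len(log_likes[0])
    if n_speakers == 1:
        return [0] * len(log_likes)

    log_stay = math.log(stay_prob)
    log_switch = math.log((1.0 - stay_prob) / (n_speakers - 1))

    def trans(prev, cur):
        return log_stay if prev == cur else log_switch

    # Uniform initial distribution (assumed): the constant term is dropped.
    scores = list(log_likes[0])
    backptrs = []
    for frame in log_likes[1:]:
        new_scores, ptrs = [], []
        for cur in range(n_speakers):
            best_prev = max(range(n_speakers),
                            key=lambda prev: scores[prev] + trans(prev, cur))
            new_scores.append(scores[best_prev] + trans(best_prev, cur) + frame[cur])
            ptrs.append(best_prev)
        scores = new_scores
        backptrs.append(ptrs)

    # Trace back from the best final state.
    state = max(range(n_speakers), key=lambda s: scores[s])
    path = [state]
    for ptrs in reversed(backptrs):
        state = ptrs[state]
        path.append(state)
    path.reverse()
    return path
```

A high `stay_prob` discourages rapid speaker changes, so an utterance that weakly favors a different speaker tends to be absorbed into the surrounding speaker's turn.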
Abstract:
Arrangements described herein include identifying a voice communication session established between a first communication device and a second communication device and, based on the voice communication session established between the first communication device and the second communication device, identifying a plurality of contacts who potentially may be the second user. A list including at least a name of each of the plurality of contacts who potentially may be the second user is presented to a first user using the first communication device.
Abstract:
A system and method for providing guidance to persuade a caller are provided. A call is received from a caller into a call center and an offer is provided to the caller. A likelihood of the caller to accept the offer is measured by analyzing voice input of the caller during the call. One or more paralinguistic voice characteristics in the voice input are determined. A stage of persuasion is assigned to the caller based on the paralinguistic voice characteristics, and a recommendation is made for guidance to persuade the caller to accept the offer.
Abstract:
Methods, systems, computer-readable media, and apparatuses for handling calls based on a voice biometric confidence score are presented. In some embodiments, a computing device may receive a voice sample associated with a telephone call. Subsequently, the computing device may determine a voice biometric confidence score based on the voice sample. The computing device then may determine to route the telephone call to a certain endpoint based on the voice biometric confidence score.
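As a rough sketch of score-based routing, the function below maps a confidence score to an endpoint using two thresholds. The endpoint names, the threshold values, and the assumption that the score lies in the range 0 to 1 are all hypothetical; the abstract states only that routing depends on the voice biometric confidence score.

```python
def route_call(confidence, high=0.8, low=0.4):
    """Pick an endpoint for a call from a voice biometric confidence score.

    The thresholds and endpoint names are illustrative assumptions,
    not values taken from the abstract.
    """
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must be between 0 and 1")
    if confidence >= high:
        return "self_service"            # caller is very likely who they claim to be
    if confidence >= low:
        return "agent_with_verification" # ask further identity questions
    return "fraud_review"                # low confidence: escalate
```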
Abstract:
In many scenarios, a speaker verification system is given a single-channel audio recording that contains multiple speakers. To perform accurate speaker verification, the system can isolate the speech of one speaker. In one embodiment, a method, and corresponding system, of speaker verification includes extracting a target speaker's speech, using a known speaker voiceprint, from an audio recording that includes the target speaker's speech and the known speaker's speech. The known speaker voiceprint can correspond to the known speaker. Extracting the target speaker's speech can include determining portions of the audio recording where the known speaker voiceprint matches the known speaker's speech above a particular threshold, and extracting the target speaker's speech from the other portions of the audio recording. In this manner, speaker verification is performed on the target speaker's speech without interference from the known speaker's speech, allowing for more accurate verification.
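The threshold-based extraction can be sketched as follows, assuming the recording has already been divided into time segments and each segment scored against the known speaker voiceprint. The function name, the `(start, end)` segment representation, and the default threshold are assumptions for illustration.

```python
def extract_target_segments(segments, known_scores, threshold=0.7):
    """Return the time spans attributed to the target speaker.

    segments is a list of (start, end) times; known_scores[i] is the match
    score of segment i against the known speaker voiceprint. Segments that
    match above the threshold are treated as the known speaker and dropped.
    """
    if len(segments) != len(known_scores):
        raise ValueError("segments and known_scores must have the same length")
    kept = []
    for (start, end), score in zip(segments, known_scores):
        if score > threshold:
            continue  # attributed to the known speaker
        if kept and kept[-1][1] == start:
            kept[-1] = (kept[-1][0], end)  # merge with the previous kept span
        else:
            kept.append((start, end))
    return kept
```

The returned spans would then be passed to the verification step, so that the known speaker's speech does not contribute to the target speaker's score.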
Abstract:
An apparatus for use in verbal communication between a speaker and at least one listener, where the speaker and the at least one listener are spatially separate from each other, provides the listener with a sensory output associated with the identity of the speaker. The apparatus includes an identification device and a display device. The identification device includes a vibration sensor for detecting vibrations associated with speech of the speaker, is configured to store identification data representative of the identity of the speaker or of the identification device, is associable with the identity of the speaker, and, in response to detecting vibrations, transmits an identification signal comprising or generated from the stored identification data. The display device is operable to receive the identification signal and to generate from the received identification signal a sensory output indicating the identity of either or both of the speaker and the identification device.
Abstract:
An embodiment according to the invention provides automatic discovery, via Automatic Speech Recognition (ASR) and Voice Biometrics, of the identification of a caller, when the caller is making a phone call from, for example, a residential line. The caller may, for example, initiate a phone call by voice request to a computer or other device. The device initiates the call, but rather than using the conventional technique of determining Calling Name via lookup to the Transaction Capabilities Application Part (TCAP) database, the embodiment uses a technique of ASR in tandem with voice or other biometrics to recognize who within the residence is making the call, and to use the name associated with the requesting caller's voiceprint for determining the Calling Name to display to the called party. Other forms of biometrics, such as image biometrics (e.g., facial or iris biometrics), may alternatively be employed.
Abstract:
A method and system use conversational biometrics and speaker identification and/or verification to filter voice streams during mixed mode communication. The method includes receiving an audio stream of a communication between participants. Additionally, the method includes filtering the audio stream of the communication into separate audio streams, one for each of the participants. Each of the separate audio streams contains portions of the communication attributable to a respective participant. Furthermore, the method includes outputting the separate audio streams to a storage system.
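The filtering step can be sketched as below, assuming speaker identification has already attached a participant label to each chunk of audio. The function name and the `(speaker, bytes)` chunk representation are hypothetical; the abstract does not describe a data format.

```python
from collections import defaultdict


def split_by_speaker(labelled_chunks):
    """Split one audio stream into a separate stream per participant.

    labelled_chunks is an iterable of (speaker_id, audio_bytes) pairs in
    chronological order. Each output stream keeps that order.
    """
    streams = defaultdict(list)
    for speaker_id, audio in labelled_chunks:
        streams[speaker_id].append(audio)
    # Concatenate each participant's chunks into a single byte stream.
    return {speaker_id: b"".join(chunks) for speaker_id, chunks in streams.items()}
```

Each value in the returned mapping could then be written to the storage system as its own file or record.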
Abstract:
Technology for crime control includes receiving a voucher identifier for a mobile phone credit voucher purchased under duress by a victim and generating a request for a legal order directing a telecommunication service provider to obtain certain information about use of the voucher. Approval for the legal order is received, and the legal order and the voucher identifier are transmitted by a law enforcement agency computer system via a network to a computer system of the telecommunication service provider. A phone number associated with a mobile phone to which a credit associated with the voucher identifier was applied, and a recording of a telephone call to or from the phone number, are received via the network from the telecommunication service provider computer system. The law enforcement agency computer system performs an automated analysis of the call by a voice recognition process.