Abstract:
A computer system may communicate metadata that identifies a current speaker. The computer system may receive audio data that represents speech of the current speaker, generate an audio fingerprint of the current speaker based on the audio data, and perform automated speaker recognition by comparing the audio fingerprint of the current speaker against stored audio fingerprints contained in a speaker fingerprint repository. The computer system may communicate data indicating that the current speaker is unrecognized to a client device of an observer and receive tagging information that identifies the current speaker from the client device of the observer. The computer system may store the audio fingerprint of the current speaker and metadata that identifies the current speaker in the speaker fingerprint repository and communicate the metadata that identifies the current speaker to at least one of the client device of the observer or a client device of a different observer.
Abstract:
A system is described herein that facilitates the monitoring of inmate communications. The system provides a remotely-accessible means for a reviewer to monitor a call between an inmate and another person. The system includes a monitoring server and a monitoring station. The monitoring server is configured to receive a call and call information from a communication center and process the call for monitoring, schedule a review of the call; and store the call, the call information, and scheduling data. The monitoring station is configured to receive the call and the call information from the monitoring server based on the scheduling data, and to display the identifying information and facilitate the review of the call.
Abstract:
Aspects of the subject disclosure may include, for example, a method in which a device comprising a processor detects a communication session between a calling device and a called device, and receives a motion signal from the called device; the motion signal is generated at a motion sensor of the called device during or after the communication session. The device analyzes the motion signal to determine whether a portion of the motion signal corresponds to a preselected motion of the called device and whether a subsequent call from the calling device accordingly is to be blocked. Responsive to a determination that the subsequent call is to be blocked, the device also updates a list of blocked caller identifiers associated with the called device to add an identifier associated with the calling device. Other embodiments are disclosed.
Abstract:
Embodiments of the present invention relate to speaker identification using spatial information. A method of speaker identification for audio content being of a format based on multiple channels is disclosed. The method comprises extracting, from a first audio clip in the format, a plurality of spatial acoustic features across the multiple channels and location information, the first audio clip containing voices from a speaker, and constructing a first model for the speaker based on the spatial acoustic features and the location information, the first model indicating a characteristic of the voices from the speaker. The method further comprises identifying whether the audio content contains voices from the speaker based on the first model. Corresponding system and computer program product are also disclosed.
Abstract:
Methods, systems, computer-readable media, and apparatuses for handling calls based on a voice biometric confidence score are presented. In some embodiments, a computing device may receive a voice sample associated with a telephone call. Subsequently, the computing device may determine a voice biometric confidence score based on the voice sample. The computing device then may determine to route the telephone call to a certain endpoint based on the voice biometric confidence score.
Abstract:
An event associated with a communication session between a first party and a second party is detected. The event is related to the communication session. For example, the event may be a phrase spoken in the communication session or a presentation of a call progression screen to a contact center agent. At this point, the communication session is analyzed for a period of non-communication that meets a defined threshold during a monitoring period associated with the event. If a period of non-communication is detected that is longer than the threshold during the monitoring period, the period of non-communication is flagged. The flagged period of non-communication can then be used to evaluate how a contact center agent is performing.
Abstract:
A process and system for generating three dimensional audio for television broadcast includes generating a virtual map of participants with a plurality of positions, each participant selecting one of the positions, determining a direction from each position to each other position on the map and to predetermined listener position, receiving sound from each participant, converting the received sound according to the direction of the speaking participant to the listener, mixing the converted sounds, transforming the mixed sound into binaural audio, and directing the binaural audio sound for transmission to a television viewer. The result is a clarified sound that gives to the television viewer a sense of where the speaking participant is positioned relative to the listening television viewer.
Abstract:
One-to-many comparisons of callers' words and/or voice prints with known words and/or voice prints to identify any substantial matches between them. When a customer communicates with a particular entity, such as a customer service center, the system makes a recording of the real-time call including both the customer's and agent's voices. The system segments the recording to extract different words, such as words of anger. The system may also segment at least a portion of the customer's voice to create a tone profile, and it formats the segmented words and tone profiles for network transmission to a server. The server compares the customer's words and/or tone profiles with multiple known words and/or tone profiles stored on a database to determine any substantial matches. The identification of any matches may be used for a variety of purposes, such as providing representative feedback or customer follow-up.
Abstract:
A secure telephone call management system is provided for authenticating users of a telephone system in an institutional facility. Authentication of the users of the telephone call management system is accomplished by using a personal identification number, biometric means, and/or radio frequency means. The secure telephone call management system includes accounting software capable of limiting access to the system based on funds in a user's account, and includes management software capable of implementing widespread or local changes to the system. The system monitors a conversation in the telephone call to detect a presence of a first characteristic in audio of the conversation, and terminates the telephone call if the first characteristic does not match a second characteristic of biometric information of a user or a called party.
Abstract:
Arrangements described herein include identifying a voice communication session established between a first communication device and a second communication device and, based on the voice communication session established between the first communication device and the second communication device, identifying a plurality of contacts who potentially may be the second user. A list including at least a name of each of the plurality of contacts who potentially may be the second user is presented to a first user using the first communication device.