Abstract:
A process and system for generating three dimensional audio for television broadcast includes generating a virtual map of participants with a plurality of positions, each participant selecting one of the positions, determining a direction from each position to each other position on the map and to predetermined listener position, receiving sound from each participant, converting the received sound according to the direction of the speaking participant to the listener, mixing the converted sounds, transforming the mixed sound into binaural audio, and directing the binaural audio sound for transmission to a television viewer. The result is a clarified sound that gives to the television viewer a sense of where the speaking participant is positioned relative to the listening television viewer.
Abstract:
A method and apparatus of applying user profile information to a customized application are disclosed. One example method of operation may include receiving an inquiry message or call from a user device, identifying and authorizing the user from inquiry message information received from the inquiry message, retrieving a user profile comprising at least one user preference, applying the at least one user preference to a user call processing application, and transmitting menu options to the user device based on the applied at least user preference.
Abstract:
Methods and systems for identifying intended recipients of remarks from a speaker in a communications session established among a plurality of participant devices are provided herein. In some embodiments, a method for identifying intended recipients of remarks from a speaker in a communications session established among a plurality of participant devices may include receiving an indication of a first participant to whom remarks are to be addressed; determining identification information associated with the first participant; and transmitting the identification information associated with the first participant to one or more of the plurality of participant devices.
Abstract:
Systems and methods for analyzing digital recordings of the human voice in order to find characteristics unique to an individual. A biometrics engine may use an analytics service in a contact center to supply audio streams based on configured rules and providers for biometric detection. The analytics service may provide call audio data and attributes to connected engines based on a provider-set of selection rules. The connected providers send call audio data and attributes through the analytics service. The engines are notified when a new call is available for processing and can then retrieve chunks of audio data and call attributes by polling an analytics service interface. A mathematical model of the human vocal tract in the call audio data is created and/or matched against existing models. The result is analogous to a fingerprint, i.e., a pattern unique to an individual to within some level of probability.
Abstract:
Systems, methods, and media for disambiguating call data are provided herein. Some exemplary methods include receiving notification of a fraud event including a customer account identifier and a fraud time stamp; determining a time frame that is proximate the fraud time stamp; collecting call events associated with the customer account identifier that occur during the determined time frame, each call event including a unique call event identifier, a voice sample, and a call event time stamp; identifying a first call event belonging to a first speaker and a second call event belonging to a second speaker; and generating a timeline presentation that includes the first call event and call event timestamp and an identification of a first voice sample as belonging to the first speaker, the second call event and call event timestamp and an identification of a second voice sample as belonging to the second speaker.
Abstract:
A voice input system (100) includes a processing receiving unit (104) that receives identification information of a telephone that is to receive a callback, in order to input a voice, together with a voice recognition process request, a call processing unit (106) that originates a callback based on the identification information of the telephone received by the processing receiving unit (104), a voice data receiving unit (108) that receives voice data of a voice when the callback originated by the call processing unit (106) is received and the user's voice is input, and a voice recognition result storage unit (122) that stores result data which is data of a voice recognition result of the voice data received by the voice data receiving unit (108) in association with the identification information of the telephone.
Abstract:
A process and system for generating three dimensional audio for television broadcast includes generating a virtual map of participants with a plurality of positions, each participant selecting one of the positions, determining a direction from each position to each other position on the map and to predetermined listener position, receiving sound from each participant, converting the received sound according to the direction of the speaking participant to the listener, mixing the converted sounds, transforming the mixed sound into binaural audio, and directing the binaural audio sound for transmission to a television viewer. The result is a clarified sound that gives to the television viewer a sense of where the speaking participant is positioned relative to the listening television viewer.
Abstract:
One-to-many comparisons of callers' words and/or voice prints with known words and/or voice prints to identify any substantial matches between them. When a customer communicates with a particular entity, such as a customer service center, the system makes a recording of the real-time call including both the customer's and agent's voices. The system segments the recording to extract different words, such as words of anger. The system may also segment at least a portion of the customer's voice to create a tone profile, and it formats the segmented words and tone profiles for network transmission to a server. The server compares the customer's words and/or tone profiles with multiple known words and/or tone profiles stored on a database to determine any substantial matches. The identification of any matches may be used for a variety of purposes, such as providing representative feedback or customer follow-up.
Abstract:
In one aspect, the present invention facilitates the investigation of networks of criminals, by gathering associations between phone numbers, the names of persons reached at those phone numbers, and voice print data. In another aspect the invention automatically detects phone calls from a prison where the voiceprint of the person called matches the voiceprint of a past inmate. In another aspect the invention detects identity scams in prisons, by monitoring for known voice characteristics of likely imposters on phone calls made by prisoners. In another aspect, the invention automatically does speech-to-text conversion of phone numbers spoken within a predetermined time of detecting data indicative of a three-way call event while monitoring a phone call from a prison inmate. In another aspect, the invention automatically thwarts attempts of prison inmates to use re-dialing services. In another aspect, the invention automatically tags audio data retrieved from a database, by steganographically encoding into the audio data the identity of the official retrieving the audio data.
Abstract:
In one aspect, the present invention facilitates the investigation of networks of criminals, by gathering associations between phone numbers, the names of persons reached at those phone numbers, and voice print data. In another aspect the invention automatically detects phone calls from a prison where the voiceprint of the person called matches the voiceprint of a past inmate. In another aspect the invention detects identity scams in prisons, by monitoring for known voice characteristics of likely imposters on phone calls made by prisoners. In another aspect, the invention automatically does speech-to-text conversion of phone numbers spoken within a predetermined time of detecting data indicative of a three-way call event while monitoring a phone call from a prison inmate. In another aspect, the invention automatically thwarts attempts of prison inmates to use re-dialing services. In another aspect, the invention automatically tags audio data retrieved from a database, by steganographically encoding into the audio data the identity of the official retrieving the audio data.