Abstract:
A system and method for extracting demographic information from a contact is disclosed. The method discloses: initiating a dialog between a contact and a call handling system; selecting a set of demographic characteristics; assigning a set of acoustic confidence scores; assigning a set of substantive confidence scores; combining the acoustic and substantive confidence scores; and tailoring information presented to the contact using the set of combined confidence scores. The system discloses: an Interactive Voice Response module for initiating a dialog between a contact and a call handling system, and selecting a set of demographic characteristics; an acoustic classifier for assigning a set of acoustic confidence scores; a substantive classifier for assigning a set of substantive confidence scores; and a data combiner for combining the acoustic and substantive confidence scores. The Interactive Voice Response module also tailors information presented to the contact using the set of combined confidence scores.
Abstract:
Generally described, the present invention is directed to a method and system that provides the ability to record and store a digital conversation exchanged among mutually agreed parties in a specified archive database. In some instances, content of a digital conversations may be stored (e.g., for legal and/or medical purposes) along with the authenticity of those conversations. Parties who are involved in the digital conversation can authenticate and associate themselves with the conversation. The authentication may be integrated or bound with the digital conversation so that the digital conversation can be stored with a proof of authenticity in the specified archive database. The stored conversation may be retrieved by the parties or any entity with a delegation from the party for reviewing the conversation, or incorporating new information into the stored conversation.
Abstract:
Techniques for positioning participants of a conference call in a three dimensional (3D) audio space are described. Aspects of a system for positioning include a client component that extracts speech frames of a currently speaking participant of a conference call from a transmission signal. A speech analysis component determines a voice fingerprint of the currently speaking participant based upon any of a number of factors, such as a pitch value of the participant. A control component determines a category position of the currently speaking participant in a three dimensional audio space based upon the voice fingerprint. An audio engine outputs audio signals of the speech frame based upon the determined category position of the currently speaking participant. The category position of one or more participants may be changed as new participants are added to the conference call.
Abstract:
A voice processing system is provided in which sets of engines running on a plurality of servers are configured differently from one another. The sets of engines may be configured to achieve different trade-offs between performance of a task and resources required to perform the task. In the voice processing system, a task routing server is provided that assigns different sets of sub-tasks to different sets of task engines. The number of engines used to perform a task and the number of engines in each set are adjusted. By adjusting the parameters settings for the set of engines based on the type of application, the particular requirements of the application, or the nature and importance of the subtasks, for example, advantages such as improvement of resource utilization and the hardware and software costs reduction may be obtained.
Abstract:
A multi-party conversation analyzer and logger uses a variety of techniques including spectrographic voice analysis, absolute loudness measurements, directional microphones, and telephonic directional separation to determine the number of parties who take part in a conversation, and segment the conversation by speaking party. In one aspect, the invention monitors telephone conversations in real time to detect conditions of interest (for instance, calls to non-allowed parties or calls of a prohibited nature from prison inmates). In another aspect, automated prosody measurement algorithms are used in conjunction with speaker segmentation to extract emotional content of the speech of participants within a particular conversation, and speaker interactions and emotions are displayed in graphical form. A conversation database is generated which contains conversation recordings, and derived data such as transcription text, derived emotions, alert conditions, and correctness probabilities associated with derived data. Investigative tools allow flexible queries of the conversation database.
Abstract:
A method for distinguishing speakers in a conference call of a plurality of participants, in which method speech frames of the conference call are received in a receiving unit, which speech frames include encoded speech parameters. At least one parameter of the received speech frames is examined in an audio codec of the receiving unit, and the speech frames are classified to belong to one of the participants, the classification being carried out according to differences in the examined at least one speech parameter. These functions may be carried out in a speaker identification block, which is applicable in various positions of a teleconferencing processing chain. Finally, a spatialization effect is created in a terminal reproducing the audio signal according to notified differences by placing the participants at distinct positions in an acoustical space of the audio signal.
Abstract:
A method and system for identifying conference participants who dial in to a telephone conference of an electronic conference that includes a web conference is provided. To identify a conference participant, a conference system displays to a user the names of those conference participants who have not yet been associated with a telephone line of the telephone conference. The conference system plays to the user the identification announcement of a conference participant who is not yet associated with a telephone line. When the user hears the identification announcement, the user recognizes the name of the conference participant and selects the name of that conference participant from the displayed names. The conference system can then associate that conference participant with the telephone line associated with the identification announcement that was played to the user.
Abstract:
A voiceprint of a calling party, and identifying information, are obtained during a call. One or more voiceprint samples stored in association with the identifying information are selected, and the calling party's identity is verified based on a comparison of the voiceprint to at least one of the selected voiceprint samples. For example, the voiceprint samples may be derived from a verbal utterance by the calling party before the call, mixed with different background noises likely corrupting the calling party's utterance during the call. If the comparison fails to determine the calling party's identity, an attempt to verify the calling party's identity is made based on additional information received during the call. In another example, if the calling party's identity is verified based on the additional information, the voiceprint may be stored in association with the identifying information as a voiceprint sample for use in future verification of the calling party's identity.
Abstract:
In one embodiment, a system authenticates a caller before providing the caller with access to one or more secured resources. Identification and verification data is stored during system enrollment of known persons. A processing system, in a current call initiated by a caller and prior to identification of the caller, receives voice-based information for the caller including a digital representation of a telephone number spoken by the caller in the current call, the received information including voice-based identification information and voice-based verification information for the caller. The processing system automatically compares the received identification information and the received verification information for the caller with the stored identification and verification data for known persons, respectively, to uniquely identify and verify the identity of the caller as a particular known person. The processing system authenticates the caller for access to the one or more secured resources in response.
Abstract:
There is provided a system for facilitating a conference call. The system includes a module to generate a real-time voiceprint from a voice input of a participant in the conference call, and a module to provide information indicative of the participant based on the real-time voiceprint.