Abstract:
A method and apparatus for maintaining state information on a client device configured for voice-over-internet protocol (VOIP) communication are described. In one example, a VOIP call between the client device and an endpoint device through a packet network is established. State information is sent from the client device to the endpoint device during the VOIP call. A state object having the state information embedded therein is received from the endpoint device at the client device. The state object is stored in the client device. The state object may be retrieved from the client device during the VOIP call or a subsequent VOIP call in response to an authenticated request from the endpoint device. Alternatively, the state object may be retrieved from the client device during a subsequent web session in response to an authenticated request from a host.
Abstract:
A system and method are disclosed that improve automatic speech recognition in a spoken dialog system. The method comprises partitioning speech recognizer output into self-contained clauses, identifying a dialog act in each of the self-contained clauses, qualifying dialog acts by identifying a current domain object and/or a current domain action, and determining whether further qualification is possible for the current domain object and/or current domain action. If further qualification is possible, then the method comprises identifying another domain action and/or another domain object associated with the current domain object and/or current domain action, reassigning that other domain action and/or domain object as the current domain action and/or current domain object, and then recursively qualifying the new current domain action and/or current domain object. This process continues until nothing is left to qualify.
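The recursive qualification step can be illustrated with a minimal Python sketch. Everything named here is hypothetical: the abstract does not say how associations between domain objects and actions are stored, so the sketch assumes a plain dictionary (`associations`) and treats a clause as a set of words. It shows only the control flow of "reassign, then qualify again until nothing is left".

```python
from typing import Dict, FrozenSet, List, Set


def qualify(
    current: str,
    associations: Dict[str, List[str]],
    words: Set[str],
    visited: FrozenSet[str] = frozenset(),
) -> List[str]:
    """Return the chain of domain items reached from `current`.

    `associations` (assumed structure) maps a domain action or object to
    the items that may further qualify it. `words` is the clause content.
    """
    visited = visited | {current}  # guard against cyclic associations
    for candidate in associations.get(current, []):
        if candidate in words and candidate not in visited:
            # Reassign the associated item as the current one and recurse.
            return [current] + qualify(candidate, associations, words, visited)
    # Nothing left to qualify.
    return [current]


associations = {"change": ["address"], "address": ["account"], "account": []}
clause = set("i want to change the address on my account".split())
chain = qualify("change", associations, clause)
```

With this toy data the chain runs from the action `change` to the object `address` and then to `account`, at which point no further association applies and the recursion stops.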
Abstract:
Systems, methods, and programs for evaluating audio messages store a model that may include language patterns, audio patterns, and/or metafeatures that indicate a likelihood that an audio message is a spam message or a non-spam message, and compare the content of an input audio message with the model. Based on the comparison, the systems, methods, and programs identify the input audio message as a spam message or a non-spam message.
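As a rough illustration of comparing a message with a stored model, the Python sketch below scores a transcript against weighted language patterns. It is a sketch under stated assumptions, not the disclosed method: the message is assumed to be already transcribed to text, the model is assumed to be a dictionary of substring patterns with signed weights, and the threshold rule is invented for the example. Audio patterns and metafeatures are not modeled.

```python
from typing import Dict


def classify(transcript: str, model: Dict[str, float], threshold: float = 0.0) -> str:
    """Label a transcript as "spam" or "non-spam".

    `model` (assumed form) maps a language pattern to a weight: positive
    weights suggest spam, negative weights suggest non-spam.
    """
    text = transcript.lower()
    # Sum the weights of every pattern that occurs in the transcript.
    score = sum(weight for pattern, weight in model.items() if pattern in text)
    return "spam" if score > threshold else "non-spam"


model = {"free prize": 2.0, "call me back": -1.0}  # hypothetical patterns
first = classify("You won a FREE PRIZE", model)
second = classify("Hi, call me back tonight", model)
```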
Abstract:
A system and method to provide content and call attributes for a destination phone number using a click-to-dial connection include accepting a query, retrieving links to a document, and searching through cached data using the query to identify relevant or destination number information. If no match is found, the method continues by accessing the document identified by the link to identify relevant number information and creating a click-to-dial icon that links to the relevant numbers included in the relevant phone number information identified by the query. Next, a popup box for content relevant to the click-to-dial icon and a click feature on the popup box are created to retrieve call destination attributes for viewing by a user. Further, computer instructions create at least one communication connection between two communication devices after the user views the call destination attributes.
Abstract:
A method of generating a natural language understanding (NLU) model for use in a spoken dialog system is disclosed. The method comprises using sample utterances and creating a number of hand-crafted rules for each call-type defined in a labeling guide. A first NLU model is generated and tested using the hand-crafted rules and sample utterances. A second NLU model is built using the sample utterances as new training data and using the hand-crafted rules. The second NLU model is tested for performance using a first batch of labeled data. A series of NLU models is built by adding the previous batch of labeled data to the training data and using a new batch of labeled data as test data, so that the training data grows with each model in the series. If not all the labeling data has been received, the method comprises repeating the step of building a series of NLU models until all labeling data is received. After all the training data is received, the method comprises building, at least once, a third NLU model using all the labeling data, wherein the third NLU model is used in generating the spoken dialog service.
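The batch-by-batch loop described above can be sketched in Python as follows. This is a minimal sketch that starts from the second model onward; `train` and `evaluate` are hypothetical callables supplied by the caller, since the abstract does not name a training or scoring procedure. Each new batch is first used as test data and only then folded into the training data.

```python
from typing import Any, Callable, List, Sequence, Tuple


def incremental_build(
    rules: Any,
    samples: Sequence[Any],
    batches: Sequence[Sequence[Any]],
    train: Callable[[Any, List[Any]], Any],
    evaluate: Callable[[Any, Sequence[Any]], Any],
) -> Tuple[Any, List[Any]]:
    """Build a series of models on growing training data.

    Returns the final model (trained on all data) and the per-batch scores.
    """
    training = list(samples)
    model = train(rules, training)  # model built from rules + sample utterances
    scores = []
    for batch in batches:
        scores.append(evaluate(model, batch))  # new batch serves as test data
        training.extend(batch)                 # then joins the training data
        model = train(rules, training)         # next model in the series
    return model, scores


# Toy stand-ins: the "model" is just the training-set size.
final_model, scores = incremental_build(
    rules=None,
    samples=["a", "b"],
    batches=[["c"], ["d", "e"]],
    train=lambda rules, data: len(data),
    evaluate=lambda model, batch: (model, len(batch)),
)
```

Because every batch is scored before the model has seen it, each score is a held-out measurement, and the model returned after the last batch is the one trained on all of the data.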
Abstract:
A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer the frequently asked question.
Abstract:
A computerized method is disclosed for presenting advertising data extracted from a video data stream, the method including storing a plurality of advertising data items extracted from the video data stream at an end user device; and displaying a plurality of sorted advertising indicator data items at the end user device, wherein each of the advertising indicator data items indicates one of the plurality of stored advertising data items. A system is disclosed for performing the method. A data structure is disclosed providing a functional and structural interrelationship between a processor in the system and data in the data structure.
Abstract:
A system for understanding entries, such as speech, develops a classifier by employing prior knowledge with which a given corpus of training entries is enlarged threefold. A rule is created for each of the labels employed in the classifier, and the created rules are applied to the given corpus to create a corpus of attachments by appending a weight of ηp(x), or 1−ηp(x), to labels of entries that meet, or fail to meet, respectively, conditions of the labels' rules, and to also create a corpus of non-attachments by appending a weight of 1−ηp(x), or ηp(x), to labels of entries that meet, or fail to meet conditions of the labels' rules.
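The weighting scheme can be made concrete with a short Python sketch. It follows the weights exactly as stated in the abstract, but the rest is assumed: `p` is taken to be a caller-supplied function returning a value in [0, 1], `eta` a scalar chosen by the caller, and each rule a predicate over the entry text. The abstract defines none of these, so treat the names and types as hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Weighted = Tuple[str, str, float]  # (entry, label, weight)


def enlarge_corpus(
    entries: List[str],
    rules: Dict[str, Callable[[str], bool]],
    p: Callable[[str], float],
    eta: float,
) -> Tuple[List[Weighted], List[Weighted]]:
    """Create the attachment and non-attachment corpora.

    Together with the original corpus these give three corpora in total.
    """
    attachments: List[Weighted] = []
    non_attachments: List[Weighted] = []
    for x in entries:
        w = eta * p(x)
        for label, rule in rules.items():
            met = rule(x)
            # Attachment: eta*p(x) if the rule is met, 1 - eta*p(x) otherwise.
            attachments.append((x, label, w if met else 1.0 - w))
            # Non-attachment: the complementary weight.
            non_attachments.append((x, label, 1.0 - w if met else w))
    return attachments, non_attachments


entries = ["i want to pay my bill", "hello"]
rules = {"Billing": lambda x: "bill" in x}  # hypothetical label rule
att, non_att = enlarge_corpus(entries, rules, p=lambda x: 0.9, eta=0.5)
```

For each entry and label, the attachment and non-attachment weights sum to 1 by construction.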
Abstract:
A clausifier and method of extracting clauses for spoken language understanding are disclosed. The method relates to generating a set of clauses from speech utterance text and comprises inserting at least one boundary tag in speech utterance text related to sentence boundaries, inserting at least one edit tag indicating a portion of the speech utterance text to remove, and inserting at least one conjunction tag within the speech utterance text. The result is a set of clauses that may be identified within the speech utterance text according to the inserted at least one boundary tag, at least one edit tag and at least one conjunction tag. The disclosed clausifier comprises a sentence boundary classifier, an edit detector classifier, and a conjunction detector classifier. The clausifier may comprise a single classifier or a plurality of classifiers to perform the steps of identifying sentence boundaries, editing text, and identifying conjunctions within the text.
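The final step, reading clauses off text that already carries the three kinds of tags, might look like the Python sketch below. The tag syntax is an assumption made for illustration: `<s>` marks a sentence boundary, `<edit>...</edit>` wraps a portion to remove, and `<conj>...</conj>` wraps a conjunction, which this sketch drops when splitting. The classifiers that would insert the tags are not modeled.

```python
import re
from typing import List


def extract_clauses(tagged: str) -> List[str]:
    """Split tagged utterance text into clauses (assumed tag syntax)."""
    # Remove the portions marked for editing out.
    without_edits = re.sub(r"<edit>.*?</edit>", " ", tagged, flags=re.DOTALL)
    # Split at sentence boundaries and at tagged conjunctions.
    pieces = re.split(r"<s>|<conj>.*?</conj>", without_edits)
    # Normalize whitespace and discard empty pieces.
    return [" ".join(piece.split()) for piece in pieces if piece.strip()]


tagged = "yes <s> i got <edit>i got</edit> the bill <conj>and</conj> it is wrong <s>"
clauses = extract_clauses(tagged)
```

Here the repeated "i got" is removed as an edit, and the utterance splits at the boundary tag and at the conjunction into three clauses.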