摘要:
An interaction simulator uses computer vision, and inputs of other modalities, to analyze the user's mental state and/or personality. The mental state and/or personality are classified and this information used to guide conversation and other interaction. In a chatterbot embodiment, the substance of the conversation may be altered in response to the mental state and/or personality class, for example, by changing the topic of conversation to a favorite subject when the user is sad or by telling a joke when the user is in a good mood.
摘要:
The learning history memory 9 controls, based on the accumulated learning history, the word dictionary memory 6, semantic/syntactic analysis grammar memory 7, and a correction information memory 13 to limit the syntactic/semantic language structure and range of vocabulary, which are not used frequently. Then, the speech recognition collator 4 collates the feature vector with information, having high priority within the limited range, in the word dictionary memory 6 and semantic/syntactic analysis grammar memory 7 to recognize the speech.
摘要:
An information and control system for personnel transport devices. In one embodiment, the information and control system is coupled to the elevator system of a building, and includes a touch panel input device, a flat panel display having a touch sensitive screen, and speech recognition and synthesis systems serving each elevator car. The speech recognition and synthesis systems and input device(s) are operatively coupled to a processor and storage devices having a plurality of different types of data stored thereon. Each elevator car is also a client connected to a LAN, WAN, intranet, or Internet, and capable of exchanging data with and retrieving data therefrom. Functions performed by the information and control system include a voice-actuated building directory, download of selected data to personal electronic devices (PEDs), monitoring of areas adjacent to the elevator car on destination floors, and control of lighting and security monitoring in selectable areas of destination floors. The system is also optionally fitted with an RFID interrogator/reader capable of recognizing RFID tags carried by passengers on the elevator, thereby granting access to various controlled locations automatically after password authentication. The RFID system also allows the authenticated passenger(s) to control utilities such as lighting and HVAC within specific zones on their destination floors. The information and control system is also optionally equipped with an occupancy estimating sub-system which allows elevator cars to bypass calling floors when their capacity is reached or exceeded.
摘要:
A portable dialogue management system includes a dialogue manager and a hierarchical task description table. The hierarchical task description table has a plurality of base tables connected with a hierarchical structure. Each base table defines the strategy of a sub-dialogue and stores the dialogue states, a number of domain parameters, and a plurality of response actions corresponding to each dialogue state. The dialogue manager manages the dialogue state of a dialogue system, determines the dialogue state and executes the appropriate response action. Because the domain knowledge is defined in the hierarchical task description table and the dialogue manager is not dependent on the application domain, the dialogue management system is easily portable to different applications. A stack may also be used to push in or pop up a dialogue state so that dialogues of multiple purposes can be accomplished.
摘要:
A distributed voice user interface system includes a local device which receives speech input issued from a user. Such speech input may specify a command or a request by the user. The local device performs preliminary processing of the speech input and determines whether it is able to respond to the command or request by itself. If not, the local device initiates communication with a remote system for further processing of the speech input.
摘要:
An IVR system for an information network and method for storing and executing user queries stored on the network so that such queries do not have to be re-entered each time a user wants to access information from or execute a transaction on the network. The system can also be programmed to automatically execute the query at a predetermined time or times, and deliver information retrieved from the network and/or confirmation of the execution of a transaction on the network to the user in a format specified by the user.
摘要:
An audio retrieval system and method are provided for augmenting the transcription of an audio file with one or more alternate word or phrase choices, such as next-best guesses for each word or phrase, in addition to the best word sequence identified by the transcription process. The audio retrieval system can utilize a primary index file containing the best identified words and/or phrases for each portion of the input audio stream and a supplemental index file containing alternative choices for each word or phrase in the transcript. The present invention allows words that are incorrectly transcribed during speech recognition to be identified in response to a textual query by searching the supplemental index files. During an indexing process, the list of alternative word or phrase choices provided by the speech recognition system are collected to produce a set of supplemental index files. During a retrieval process, the user-specified textual query is matched against the primary and supplemental indexes derived from the transcribed audio to identify relevant documents. An objective ranking function scales matches found in the supplemental index file(s) using a predefined scaling factor, or a value reflecting the confidence value of the corresponding alternative choice as identified by the speech recognition system.
摘要:
A voice-input document creation system which reduces correction times to a wrongly-recognized input to ensure input efficiency. The system has a determination means for comparing the feature content of a newest voice input extracted by a feature extracting module with a feature content of an immediately preceding voice input to determine if the newest voice input is a correction to the immediately preceding voice input. When a first-time correction is received, the system displays a list of all output candidates for the immediately preceding voice input stored in a second memory. When a second-time correction is received, the system stores the output candidates for the newest voice input into a third memory and, at the same time, displays the output candidates; at this time, the system neither displays nor stores in the third memory those output candidates displayed upon the first-time correction.
摘要:
A system and apparatus for using speech recognition and verification to provide secure and authorized data transmissions between networked computer systems is provided. The system includes first and second network computer systems wherein a request for a transaction by user of the first computer system causes the user to be prompted to enter a spoken identifier such as a credit card number, PIN number or password. This spoken identifier is converted from speech data into speech feature data using either a resident software application or a downloaded application from the second computer system. The speech feature data is transmitted to the second computer system wherein speech recognition and verification engines identify the spoken identifier and determine whether or not the user who spoke the identifier is properly associated with the spoken identifier. Upon successful completion of this recognition and verification process, the requested transaction is completed.
摘要:
An operator receives a request for directory information about a listed party over a telephone connection. It is then determined if enhanced directory information for the listed party is available in an enhanced information database. The enhanced directory information associated with the listed party is retrieved, from the enhanced information database, as a script that incorporates the enhanced directory information. The enhanced directory information can then be provided using the script. The enhanced directory information can be stored, for example, on a Web server that is updated by the listed party and the information can be transferred using a Phone Markup Language script.