摘要:
A method for improving the performance of a noise cancellation device, the method includes determining whether one or more noise making objects (NMO) are near an audible range of the noise cancellation device and receiving a signal from the one or more NMOs indicative of a kind of noise the one or more NMOs is generating. The method also includes selecting a specific noise cancellation model to reduce an expected noise in response to the received kind of noise the one or more NMOs is generating.
摘要:
Pretraining for a DBN initializes weights of the DBN (Deep Belief Network) using a hybrid pre-training methodology. Hybrid pre-training employs generative component that allows the hybrid PT method to have better performance in WER (Word Error Rate) compared to the discriminative PT method. Hybrid pre-training learns weights which are more closely linked to the final objective function, allowing for a much larger batch size compared to generative PT, which allows for improvements in speed; and a larger batch size allows for parallelization of the gradient computation, speeding up training further.
摘要:
Methods and apparatus for language translation in a computing environment associated with a virtual application are presented. For example, a method for providing language translation includes determining languages of a user and a correspondent; determining one or more sequences of translators; determining a selected sequence of selected translators from the one or more sequences of the translators; requesting a change in virtual locations, within the computing environment associated with the virtual application, of one or more selected translator virtual representations of the selected translators to a virtual meeting location within the computing environment associated with the virtual application; and changing virtual locations of the one or more selected translator virtual representations to the virtual meeting location. One or more of determining languages, determining one or more sequences, determining a selected sequence, requesting a change in virtual locations, and changing virtual locations occur on a processor device.
摘要:
Techniques are disclosed for using phonetic features for speech recognition. For example, a method comprises the steps of obtaining a first dictionary and a training data set associated with a speech recognition system, computing one or more support parameters from the training data set, transforming the first dictionary into a second dictionary, wherein the second dictionary is a function of one or more phonetic labels of the first dictionary, and using the one or more support parameters to select one or more samples from the second dictionary to create a set of one or more exemplar-based class identification features for a pattern recognition task.
摘要:
Operation of an automated dialog system is described using a source language to conduct a real time human machine dialog process with a human user using a target language. A user query in the target language is received and automatically machine translated into the source language. An automated reply of the dialog process is then delivered to the user in the target language. If the dialog process reaches an initial assistance state, a first human agent using the source language is provided to interact in real time with the user in the target language by machine translation to continue the dialog process. Then if the dialog process reaches a further assistance state, a second human agent using the target language is provided to interact in real time with the user in the target language to continue the dialog process.
摘要:
A method, information processing system, and computer readable article of manufacture model data. A first dataset is received that includes a first set of physical world data. At least one data model associated with the first dataset is generated based on the receiving. A second dataset is received that includes a second set of physical world data. The second dataset is compared to the at least one data model. A probability that the second dataset is modeled by the at least one data model is determined. A determination is made that the probability is above a given threshold. A decision associated with the second dataset based on the at least one data model is generated in response to the probability being above the given threshold. The probability and the decision are stored in memory. The probability and the decision are provided to user via a user interface.
摘要:
Techniques are disclosed for using phonetic features for speech recognition. For example, a method comprises the steps of obtaining a first dictionary and a training data set associated with a speech recognition system, computing one or more support parameters from the training data set, transforming the first dictionary into a second dictionary, wherein the second dictionary is a function of one or more phonetic labels of the first dictionary, and using the one or more support parameters to select one or more samples from the second dictionary to create a set of one or more exemplar-based class identification features for a pattern recognition task.
摘要:
A simulation method and system. A computing system receives a first audio and/or video data stream. The first audio and/or video data stream includes data associated with a first person. The computing system monitors the first audio and/or video data stream. The computing system identifies emotional attributes comprised by the first audio and/or video data stream. The computing system generates a second audio and/or video data stream associated with the first audio and/or video data stream. The second audio and/or video data stream includes the data without the emotional attributes. The computing system stores the second audio and/or video data stream.
摘要:
An optimization system and method includes determining a best gradient as a sparse direction in a function having a plurality of parameters. The sparse direction includes a direction that maximizes change of the function. This maximum change of the function is determined by performing an optimization process that gives maximum growth subject to a sparsity regularized constraint. An extended Baum Welch (EBW) method can be used to identify the sparse direction. A best step size is determined along the sparse direction by finding magnitudes of entries of direction that maximizes the function restricted to the sparse direction. A solution is recursively refined for the function optimization using a processor and storage media.
摘要:
Techniques are disclosed for authentication and identification of a user by use of an electroencephalographic (EEG) signal. For example, a method for authenticating a user includes the following steps. At least one electroencephalographic response is obtained from a user in accordance with perceptory stimuli presented to the user. The user is authenticated based on the obtained electroencephalographic response. The authenticating step may be based on detection of an event-related potential in the obtained electroencephalographic response. The event-related potential may be a P300 event-related potential. The method may also include the step of enrolling the user prior to authenticating the user. The enrolling step may include a supervised enrollment procedure or an unsupervised enrollment procedure.