Abstract:
An apparatus and a method for processing a natural language arranged so as to improve the speech recognition rate. In an example search section, the degree of similarity between each of a plurality of examples of the actual use of the language stored in an example data base and each of a plurality of probable recognition results output from a recognition section, and one of the examples corresponding to the highest degree of similarity is selected. A final speech recognition result is obtained by using the selected example. The example search section calculates the degree of similarity by weighting the degree of similarity on the basis of a context according to at least one of the examples previously selected.
Abstract:
A speech recognition system includes an acoustic analyzer which produces a time sequence of acoustic parameters from an input speech signal in an utterance boundary thereof, and estimates a trajectory in a parameter space from the time sequence of acoustic parameters. The trajectory is re-sampled in the parameter space at predetermined constant intervals sequentially each time the acoustic parameters are produced by the acoustic analyzing means, thereby producing an input utterance pattern. The input utterance pattern is matched with reference speech patterns to recognize the input speech signal. The speech recognition system also has an utterance boundary detector for detecting the utterance boundary of the input speech signal. The trajectory is re-sampled while the utterance boundary is being detected by the utterance boundary detector.
Abstract:
Apparatus for obtaining an orthogonal transformer for use in signal analysis. This apparatus is especially useful in cepstral analysis where vocal tract characteristics of a signal are separated from a sound source. A buffer memory 10 temporarily stores input time series data and an orthogonal transformation is carried out by an orthogonal transform means 20 by multiplying each predetermined data read sequentially out of the buffer memory 10 by a predetermined twiddle factor. A controller 30 is used to specify the data read out of the buffer memory 10 and for giving the twiddle factor a value of 1 and r/2.sup.l corresponding to the magnitude of the real and imaginary part of the twiddle factor. Cepstrum information is obtainable using simple hardware to analyze the data generated by the orthogonal transformer.
Abstract:
A start point vicinity of an uttered input speech pattern is set. DP-matching is performed between the input speech pattern and a plurality of reference patterns obtained by connecting previously prepared reference patterns in the start point vicinity of a portion for the head word of the input speech pattern. A point within the start point vicinity is determined as a temporary start point. The dissimilarity proportional to the time length of the reference pattern is calculated and then converted into a value proportional to the time length of the input pattern from the temporary start point. The dissimilarity measure between the input speech pattern and the reference pattern on the second and the following digits are determined as a value proportional to the time length of the input speech pattern. The end time point of the input speech pattern is determined on the basis of the minimum value of a normalized dissimilarity measure and by the time length of the input speech pattern.
Abstract:
Phoneme recognition uses the silence-phoneme and phoneme-phoneme transition spectral information rather than the phoneme information itself. The transition detector features first and second differences in level for each frequency band.
Abstract:
A minature computer comprising a main section which includes a central processor, a read only memory storing a monitor program, a random access memory for executing the monitor program and a controlling circuit for peripheral equipments developed which has a hand-held controller electrically connected to the main section through a flat cable, having a keyboard and display device such that the keyboard has keys of a numeral 0 to 9 and the alpha characters A to Z for enabling symbolic languages to be input according to an assembler program. The assembler program is stored in the read only memory and the symbolic languages are converted to corresponding machine languages. The main section has a first jack capable of inputting a program from a audio cassette tape deck and a second jack capable of storing a program in a audio cassette tape deck. The main section also has third and fourth jacks which control the operation of the audio cassette tape decks connected to the first two jacks.
Abstract:
A book database stores at least phonetic signal information including phoneme information and rhythm information as document data, a central system transmits phonetic signal information stored on the book database to a terminal and the terminal receives the phonetic signal information is then carried out at the terminal and the document is then recited via synthesized sounds.
Abstract:
A voice recognition device according to the present invention including a voice analyzer for acoustically analyzing voice every predetermined frame unit to extract a feature vector X, a converter for subjecting the feature vector X output from the analyzer to a predetermined conversion process, and a voice recognizer for recognizing the voice on the basis of a new feature vector output from the converter, wherein the converter conducts the predetermined conversion processing according to a mapping F from an N-dimensional vector space .OMEGA..sub.N to an M-dimensional vector space .OMEGA..sub.M, the feature vector X is a vector on the N-dimensional vector space .OMEGA..sub.N and the function f.sub.m (X) of an m-th component of the mapping F is represented by the following linear summation of the products of functions g.sub.m.sup.k (X) and coefficients c.sub.m.sup.k of L.sub.m : ##EQU1## Each function g.sub.m.sup.k (X) may be set to a monomial.
Abstract:
A recognition system for recognizing a plurality of continuous hand-written characters, employing a first memory in which isolated characters are stored, and a second memory which stores information, including interstroke character information, for connecting isolated characters. According to various embodiments of the invention, this interstroke information may be stored as part of a continuous character, or by itself.
Abstract:
A map determination method and apparatus for calculating the coefficients to give a minimum evaluation function quickly and reliably where a map is expressed as the linear sum of a function g.sub.i (X) and a coefficient c.sub.i while a map for transforming a N-dimensional vector (x.sub.0, x.sub.1, x.sub.2, x.sub.3) to a M-dimensional vector y is being decided. The coefficient ci for the map is obtained by giving a learning sample and a teaching sample, obtaining an evaluation function and solving a simultaneous linear equation for which the partial differential is zero.