Abstract:
The present invention provides an interface device for processing a user's voice, and a method thereof, that efficiently outputs various information so that the user can contribute to voice recognition or automatic interpretation. To this end, the interface device includes an utterance input unit configured to receive an utterance of a user; an utterance end recognizing unit configured to recognize the end of the input utterance; and an utterance result output unit configured to output at least one of a voice recognition result, a translation result, and an interpretation result of the ended utterance.
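The abstract does not say how the end of an utterance is recognized. As a minimal sketch, assuming a simple rule that is not taken from the source (the utterance ends once speech has been heard and a run of low-energy frames follows), the utterance end recognizing unit could be illustrated as below. The function name, the threshold, and the frame count are all hypothetical.

```python
from typing import Optional, Sequence


def find_utterance_end(
    frame_energies: Sequence[float],
    silence_threshold: float = 0.1,   # hypothetical energy threshold
    min_silent_frames: int = 3,       # hypothetical length of closing silence
) -> Optional[int]:
    """Return the index of the first frame of the silence that ends the
    utterance, or None if no end has been observed yet.

    Assumption: the end is declared after speech has been seen and
    `min_silent_frames` consecutive frames fall below `silence_threshold`.
    """
    speech_seen = False
    silent_run = 0
    for i, energy in enumerate(frame_energies):
        if energy >= silence_threshold:
            # A speech frame: remember it and reset the silence counter.
            speech_seen = True
            silent_run = 0
        elif speech_seen:
            # Silence only counts once some speech has been heard.
            silent_run += 1
            if silent_run >= min_silent_frames:
                return i - min_silent_frames + 1
    return None


# Toy per-frame energies, invented for illustration only.
ENERGIES = [0.0, 0.5, 0.6, 0.05, 0.7, 0.02, 0.01, 0.0, 0.0]
END_INDEX = find_utterance_end(ENERGIES)
```

In this toy input the short dip at frame 3 does not end the utterance because speech resumes at frame 4; only the longer silence starting at frame 5 does. The output unit would then be triggered with whichever recognition, translation, or interpretation results are available.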
Abstract:
Provided are an automatic interpretation system and method for generating a synthetic sound having characteristics similar to those of an original speaker's voice. The system includes a speech recognition module configured to generate text data by performing speech recognition on an original speech signal of an original speaker and to extract at least one piece of characteristic information among pitch information, vocal intensity information, speech speed information, and vocal tract characteristic information of the original speech; an automatic translation module configured to generate a synthesis-target translation by translating the text data; and a speech synthesis module configured to generate a synthetic sound of the synthesis-target translation.
Abstract:
A method of correcting errors in a speech recognition system includes a process of searching a speech recognition error-answer pair DB based on a sound model for a first candidate answer group for a speech recognition error, a process of searching a word relationship information DB for a second candidate answer group for the speech recognition error, a process of searching a user error correction information DB for a third candidate answer group for the speech recognition error, a process of searching a domain articulation pattern DB and a proper noun DB for a fourth candidate answer group for the speech recognition error, and a process of aligning candidate answers within each of the retrieved candidate answer groups and displaying the aligned candidate answers.
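The search-and-align flow above can be sketched roughly as follows. This is an illustration under stated assumptions, not the claimed method: each DB is modeled as a plain dictionary from a recognition error to scored candidates, "aligning" is read as sorting each group by descending score, and all names, scores, and data are invented for the example.

```python
from typing import Dict, List, Tuple

Candidate = Tuple[str, float]           # (candidate answer, illustrative score)
Database = Dict[str, List[Candidate]]   # recognition error -> candidate answers


def search_candidate_groups(
    error: str, groups: Dict[str, List[Database]]
) -> Dict[str, List[Candidate]]:
    """Search every DB backing each candidate group and order each group."""
    result: Dict[str, List[Candidate]] = {}
    for group_name, databases in groups.items():
        found: List[Candidate] = []
        for db in databases:
            found.extend(db.get(error, []))
        # Assumption: "aligning" is interpreted as sorting by descending score.
        result[group_name] = sorted(found, key=lambda c: c[1], reverse=True)
    return result


# Toy databases, invented for illustration only.
error_answer_pair_db: Database = {"wether": [("whether", 0.7), ("weather", 0.9)]}
word_relationship_db: Database = {"wether": [("whether", 0.6)]}
user_correction_db: Database = {}
domain_pattern_db: Database = {"wether": [("weather forecast", 0.5)]}
proper_noun_db: Database = {"wether": [("Wetherby", 0.8)]}

# The fourth group draws on two DBs, as in the abstract.
GROUPS: Dict[str, List[Database]] = {
    "first": [error_answer_pair_db],
    "second": [word_relationship_db],
    "third": [user_correction_db],
    "fourth": [domain_pattern_db, proper_noun_db],
}

RESULT = search_candidate_groups("wether", GROUPS)
```

Keeping the groups separate, rather than merging them into one ranked list, mirrors the abstract's wording that candidates are aligned within each retrieved group before being displayed.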
Abstract:
Disclosed herein are a translation apparatus and a translation method. The translation apparatus includes: a speech input unit that receives speech in a first language from a user; a control unit that generates sentences to be translated, in the first language, from the speech input through the speech input unit; a communication unit that transmits the sentences to be translated to a translation server and receives translated sentences in a second language from the translation server; a display unit that displays the translated sentences of the second language along with previously translated sentences; a memory that stores a translation history including the sentences to be translated of the first language and the translated sentences of the second language; and a user input unit that receives an operation input on the previously translated sentences from a user.
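As a minimal sketch of how the memory, communication unit, and display unit might cooperate, the example below keeps a translation history of sentence pairs and returns the new translation together with earlier ones. The class and method names are hypothetical, and the translation server is replaced by a stand-in callable, since the abstract does not describe its interface.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class TranslationHistory:
    # Stand-in for the round trip to the translation server (assumption).
    translate: Callable[[str], str]
    # Stored pairs: (first-language sentence, second-language sentence).
    entries: List[Tuple[str, str]] = field(default_factory=list)

    def add(self, source_sentence: str) -> str:
        """Translate a sentence and record the pair in the history."""
        translated = self.translate(source_sentence)
        self.entries.append((source_sentence, translated))
        return translated

    def display_lines(self) -> List[str]:
        """Translated sentences in order, previous ones first, newest last."""
        return [target for _, target in self.entries]


# Toy dictionary-based "server", invented for illustration only.
_FAKE_SERVER = {"hello": "bonjour", "thank you": "merci"}


def fake_translate(sentence: str) -> str:
    return _FAKE_SERVER.get(sentence, sentence)


history = TranslationHistory(translate=fake_translate)
history.add("hello")
history.add("thank you")
LINES = history.display_lines()
```

Storing both sides of each pair is what would let a user input unit act on a previously translated sentence, for example by selecting an entry to edit or retranslate; that interaction is not modeled here.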