Abstract:
A method of obtaining a grammar model to perform speech recognition includes obtaining information about a state of at least one device, obtaining grammar model information about the at least one device based on the obtained information, and generating a grammar model to perform the speech recognition based on the obtained grammar model information.
Abstract:
A speech recognition method and apparatus for performing speech recognition in response to an activation word determined based on a situation are provided. The speech recognition method and apparatus include an artificial intelligence (AI) system and its application, which simulates functions such as recognition and judgment of a human brain using a machine learning algorithm such as deep learning.
Abstract:
A method of updating a grammar model used during speech recognition includes obtaining a corpus including at least one word, obtaining the at least one word from the corpus, splitting the at least one obtained word into at least one segment, generating a hint for recombining the at least one segment into the at least one word, and updating the grammar model by using at least one segment comprising the hint.
Abstract:
A speech recognition device is provided. The speech recognition device includes at least one microphone configured to receive a sound signal from a first sound source, and at least one processor configured to determine a direction of the first sound source based on the sound signal, determine whether the direction of the first sound source is in a registered direction, and based on whether the direction of the first sound source is in the registered direction, recognize a speech from the sound signal regardless of whether the sound signal comprises a wake-up keyword.
Abstract:
A medical apparatus includes an elastic compensation member providing a first link that is rotatable with respect to a first rotational axis so that a display apparatus is movable, with a torque in an opposite direction to a torque acting due to a load of the display apparatus in order to compensate for the torque acting due to the load of the display apparatus, in order to minimize a length variation of the elastic compensation member despite movement of the display apparatus, a first end portion of the elastic compensation member is not fixed to the first link but is supported by an additional rotatable supporting portion so that the first end portion of the elastic compensation member is movable relative the first link while the display apparatus is being moved.
Abstract:
A server, a user terminal, and a method for controlling the server and the user terminal are provided. The server controlling method includes receiving a text from a user terminal, translating the received text to generate a translated text, extracting at least one core word from the translated text, obtaining image information corresponding to the at least one core word with respect to each of the at least one core word, and transmitting the translated text and the extracted image information to the user terminal.
Abstract:
A device detects a wake-up keyword from a received speech signal of a user by using a wake-up keyword model, and transmits a wake-up keyword detection/non-detection signal and the received speech signal of the user to a speech recognition server. The speech recognition server performs a recognition process on the speech signal of the user by setting a speech recognition model according to the detection or non-detection of the wake-up keyword.
Abstract:
An apparatus and a method for constructing a multilingual acoustic model, and a computer readable recording medium are provided. The method for constructing a multilingual acoustic model includes dividing an input feature into a common language portion and a distinctive language portion, acquiring a tandem feature by training the divided common language portion and distinctive language portion using a neural network to estimate and remove correlation between phonemes, dividing parameters of an initial acoustic model constructed using the tandem feature into common language parameters and distinctive language parameters, adapting the common language parameters using data of a training language, adapting the distinctive language parameters using data of a target language, and constructing an acoustic model for the target language using the adapted common language parameters and the adapted distinctive language parameters.
Abstract:
A light emitting device may include a substrate, an n-type clad layer, an active layer, and a p-type clad layer. A concave-convex pattern having a plurality of grooves and a mesa between each of the plurality of grooves may be formed on the substrate, and a reflective layer may be formed on the surfaces of the plurality of grooves or the mesa between each of the plurality of grooves. Therefore, light generated in the active layer may be reflected by the reflective layer, and extracted to an external location.