Apparatus and method for statistical memory network

    Publication number: US11526732B2

    Publication date: 2022-12-13

    Application number: US16260637

    Filing date: 2019-01-29

    Abstract: Provided are an apparatus and method for a statistical memory network. The apparatus includes a stochastic memory, an uncertainty estimator configured to estimate uncertainty information of external input signals from the input signals and provide the uncertainty information of the input signals, a writing controller configured to generate parameters for writing in the stochastic memory using the external input signals and the uncertainty information and generate additional statistics by converting statistics of the external input signals, a writing probability calculator configured to calculate a probability of a writing position of the stochastic memory using the parameters for writing, and a statistic updater configured to update stochastic values composed of an average and a variance of signals in the stochastic memory using the probability of a writing position, the parameters for writing, and the additional statistics.
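The abstract above describes updating per-slot means and variances using a writing-position probability, but it does not state the update rule. The following is a minimal sketch under stated assumptions: the softmax scoring, the moment-blending update, and every function name here are hypothetical illustrations, not the patented method.

```python
import math


def write_probabilities(scores):
    """Softmax over per-slot scores (an ASSUMED form of the writing-position probability)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def update_slot(mean, var, x, x_var, w):
    """Blend one slot's (mean, variance) with an input of value x and uncertainty x_var.

    ASSUMPTION: the first and second moments are mixed with weight w in [0, 1],
    and the variance is recovered as E[x^2] - E[x]^2.
    """
    new_mean = (1.0 - w) * mean + w * x
    second_moment = (1.0 - w) * (var + mean * mean) + w * (x_var + x * x)
    new_var = max(second_moment - new_mean * new_mean, 0.0)  # clamp rounding error
    return new_mean, new_var


def write(memory, x, x_var):
    """Write a scalar input into every slot, weighted by its write probability.

    memory is a list of (mean, variance) pairs. ASSUMPTION: slots whose mean is
    closer to x receive a higher score (negative squared distance).
    """
    scores = [-(x - mean) ** 2 for mean, _ in memory]
    probs = write_probabilities(scores)
    return [update_slot(m, v, x, x_var, p) for (m, v), p in zip(memory, probs)]
```

With a write weight of 0 a slot is left unchanged, and with a weight of 1 it takes on the input's value and uncertainty, so the probability acts as a soft address.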

    Mobile communication terminal and operating method thereof
    Type: Patent application (status: in force)

    Publication number: US20140221043A1

    Publication date: 2014-08-07

    Application number: US14018068

    Filing date: 2013-09-04

    CPC classification number: H04M1/72519 G10L15/25 H04M2250/52 H04M2250/74

    Abstract: Provided is a mobile communication terminal including: a camera module which captures an image of a set area; a microphone module which, when a sound including a voice of a user is input, extracts a sound level corresponding to the sound and a sound generating position; and a control module which estimates a position of a lip of the user from the image, extracts a voice level from the sound level corresponding to the position of the lip of the user and a voice generating position from the sound generating position, and recognizes the voice of the user based on at least one of the voice level and the voice generating position.


    Feature compensation apparatus and method for speech recognition in noisy environment

    Publication number: US09799331B2

    Publication date: 2017-10-24

    Application number: US15074579

    Filing date: 2016-03-18

    CPC classification number: G10L15/20 G10L15/02

    Abstract: A feature compensation apparatus includes a feature extractor configured to extract corrupt speech features from a corrupt speech signal with additive noise that consists of two or more frames; a noise estimator configured to estimate noise features based on the extracted corrupt speech features and compensated speech features; a probability calculator configured to calculate a correlation between adjacent frames of the corrupt speech signal; and a speech feature compensator configured to generate compensated speech features by eliminating noise features of the extracted corrupt speech features while taking into consideration the correlation between adjacent frames of the corrupt speech signal and the estimated noise features, and to transmit the generated compensated speech features to the noise estimator.
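The abstract above relies on a correlation between adjacent frames but does not say how it is computed. As a minimal sketch only, assuming each frame is a plain list of feature values and using an ordinary Pearson correlation (the patent may use a different, probabilistic measure), the per-frame-pair correlation could look like this. The function names are hypothetical.

```python
import math


def pearson(a, b):
    """Pearson correlation of two equal-length feature vectors (0.0 if either is constant)."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / math.sqrt(var_a * var_b)


def adjacent_frame_correlations(frames):
    """Correlation between each frame and the one before it (one value per adjacent pair)."""
    return [pearson(frames[t - 1], frames[t]) for t in range(1, len(frames))]
```

A signal of N frames yields N - 1 values, which is why the abstract requires two or more frames.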

    Pre-training apparatus and method for speech recognition

    Publication number: US09875737B2

    Publication date: 2018-01-23

    Application number: US15207673

    Filing date: 2016-07-12

    Inventor: Ho Young Jung

    Abstract: Provided are a pre-training apparatus and method for speech recognition, which initialize a deep neural network layer by layer to correct node connection weights. The pre-training apparatus includes an input unit configured to receive speech data, a model generation unit configured to initialize a connection weight of a deep neural network based on the speech data, and an output unit configured to output information about the connection weight. So that a state of a phoneme unit corresponding to the speech data is output, the model generation unit trains the connection weight by stacking a plurality of hidden layers according to a determined structure of the deep neural network, and applies an output layer to a certain layer among the plurality of hidden layers to correct the trained connection weight in each hidden layer, thereby initializing the connection weight.

    Method and apparatus for generating summarized information, and server for the same
    Type: Granted patent (status: in force)

    Publication number: US09426411B2

    Publication date: 2016-08-23

    Application number: US13850646

    Filing date: 2013-03-26

    Inventor: Ho Young Jung

    Abstract: The present invention relates to automatic summarization so as to recognize entire contents of multimedia data. A method of generating summarized information according to the present invention includes: generating index information on a specific audio signal or a specific video signal among input signals; synchronizing text information extracted from the input signal or received for the input signal with the index information; and generating first summarized information by using the synchronized text information and index information.
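The synchronization step in the abstract above can be pictured as a timestamp join. The sketch below is illustrative only and rests on assumptions not stated in the source: index entries and text segments both carry times in seconds, and a segment belongs to an index entry if the entry falls within a tolerance window of the segment. All class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class IndexEntry:
    time: float  # seconds from the start of the media (assumed unit)
    label: str   # e.g. a detected audio or video event


@dataclass
class TextSegment:
    start: float
    end: float
    text: str    # e.g. a caption or recognized speech


def synchronize(index, segments, window=2.0):
    """Pair each index entry with the text segments lying within `window` seconds of it."""
    paired = []
    for entry in sorted(index, key=lambda e: e.time):
        texts = [s.text for s in segments
                 if s.start - window <= entry.time <= s.end + window]
        paired.append((entry, texts))
    return paired


def summarize(index, segments, window=2.0):
    """Build one summary line per index entry that has matching text."""
    return [f"[{entry.time:.1f}s] {entry.label}: {' '.join(texts)}"
            for entry, texts in synchronize(index, segments, window)
            if texts]
```

For example, an index entry labeled "applause" at 10.0 s is paired with a caption spanning 8.0 to 11.0 s, while an entry with no nearby text is left out of the summary.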


    Mobile communication terminal and operating method thereof
    Type: Granted patent (status: in force)

    Publication number: US09100492B2

    Publication date: 2015-08-04

    Application number: US14018068

    Filing date: 2013-09-04

    CPC classification number: H04M1/72519 G10L15/25 H04M2250/52 H04M2250/74

    Abstract: Provided is a mobile communication terminal including: a camera module which captures an image of a set area; a microphone module which, when a sound including a voice of a user is input, extracts a sound level corresponding to the sound and a sound generating position; and a control module which estimates a position of a lip of the user from the image, extracts a voice level from the sound level corresponding to the position of the lip of the user and a voice generating position from the sound generating position, and recognizes the voice of the user based on at least one of the voice level and the voice generating position.


    Sentence embedding method and apparatus based on subword embedding and skip-thoughts

    Publication number: US11423238B2

    Publication date: 2022-08-23

    Application number: US16671773

    Filing date: 2019-11-01

    Abstract: Provided are a sentence embedding method and apparatus based on subword embedding and skip-thoughts. To integrate skip-thought sentence embedding learning with subword embedding, two methodologies are provided for applying intra-sentence contextual information to subword embedding learning: a skip-thought sentence embedding learning method based on subword embedding, and a multitask learning methodology that learns subword embedding and skip-thought sentence embedding simultaneously. This makes it possible to apply a bag-of-words sentence embedding approach to agglutinative languages such as Korean. The proposed model minimizes the additional training parameters required for sentence embedding so that most training results are accumulated in the subword embedding parameters.
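To make the bag-of-words idea concrete, here is a minimal sketch of a sentence vector built by averaging subword vectors. It is an illustration under assumptions, not the patented model: subwords are taken to be character n-grams with boundary markers, vectors come from a fixed random table indexed by a CRC32 hash, and no skip-thought training is shown. All names and constants are hypothetical.

```python
import random
import zlib

DIM = 8          # embedding width (arbitrary for the sketch)
BUCKETS = 1000   # number of hashed subword rows (arbitrary for the sketch)


def char_ngrams(word, n_min=2, n_max=3):
    """Character n-grams of a word wrapped in boundary markers '<' and '>'."""
    marked = f"<{word}>"
    return [marked[i:i + n]
            for n in range(n_min, n_max + 1)
            for i in range(len(marked) - n + 1)]


def make_table(seed=0):
    """Stand-in for a trained subword embedding table: fixed pseudo-random rows."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(DIM)] for _ in range(BUCKETS)]


def sentence_embedding(sentence, table):
    """Average the vectors of all subwords in the sentence (word order is ignored)."""
    grams = [g for word in sentence.split() for g in char_ngrams(word)]
    if not grams:
        return [0.0] * DIM
    total = [0.0] * DIM
    for gram in grams:
        row = table[zlib.crc32(gram.encode("utf-8")) % BUCKETS]
        for k in range(DIM):
            total[k] += row[k]
    return [value / len(grams) for value in total]
```

Because the sentence vector is just an average over subword rows, the only trainable parameters in such a setup live in the subword table, which matches the abstract's point about keeping additional parameters small.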

    Apparatus and method for controlling mobile device by conversation recognition, and apparatus for providing information by conversation recognition during meeting
    Type: Granted patent (status: in force)

    Publication number: US09258406B2

    Publication date: 2016-02-09

    Application number: US14030034

    Filing date: 2013-09-18

    Inventor: Ho Young Jung

    Abstract: An apparatus for controlling a mobile device according to the present invention includes: a conversation recognition unit configured to recognize a conversation between users through mobile devices; a user intent verification unit configured to verify an intent of at least one user among the users based on the recognition result; and an additional function control unit configured to execute an additional function corresponding to the verified user's intent in a mobile device of the user. According to the present invention, communication between users may be substantially improved by recognizing the conversation between the users and thereby directly providing information associated with the conversation or providing a service.

