-
Publication No.: US20240304190A1
Publication Date: 2024-09-12
Application No.: US18663779
Application Date: 2024-05-14
Inventors: Taewoo LEE, Minseok KWON, Kyungtae KIM, Gajin SONG, Hoseon SHIN, Jungin LEE, Seokyeong JUNG
Abstract: An electronic device is configured to perform speaker verification on a voice input to determine whether the voice input matches the voice of an enrolled speaker. Based on determining that the voice input does not match the voice of the enrolled speaker, the device performs first speech recognition on the voice input based on a first automatic speech recognition (ASR) model. Based on determining that the voice input matches the voice of the enrolled speaker, the device performs second speech recognition on the voice input based on a sequence summarizing neural network (SSN) and a second ASR model.
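The routing described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the cosine-similarity check, the `threshold` value, and the names `recognize`, `first_asr`, and `second_asr_with_ssn` are all assumptions introduced here for illustration, and the two recognizers are passed in as plain callables.

```python
import math
from typing import Callable, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def recognize(
    voice_embedding: Sequence[float],
    enrolled_embedding: Sequence[float],
    first_asr: Callable[[Sequence[float]], str],
    second_asr_with_ssn: Callable[[Sequence[float]], str],
    threshold: float = 0.7,  # hypothetical decision threshold
) -> str:
    """Route the input to one of two recognizers based on speaker verification."""
    # Assumed verification rule: similarity to the enrolled speaker's embedding.
    is_enrolled = cosine_similarity(voice_embedding, enrolled_embedding) >= threshold
    if is_enrolled:
        # Enrolled speaker: second recognition path (SSN plus second ASR model).
        return second_asr_with_ssn(voice_embedding)
    # Any other speaker: first recognition path (first ASR model).
    return first_asr(voice_embedding)
```

In a real system the verification score would come from a trained speaker model rather than a raw cosine check, and the recognizers would consume audio features rather than the embedding itself.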
-
Publication No.: US20220319500A1
Publication Date: 2022-10-06
Application No.: US17425211
Application Date: 2021-07-08
Inventors: Taewoo LEE, Taegyoon KANG, Hogyeong KIM, Minjoong LEE, Seokyeong JUNG, Jiseung JEONG
Abstract: Disclosed is an electronic device including a processor and a memory operatively connected to the processor and storing a language model. The electronic device may enter data into the language model, generate an embedding vector in the input embedding layer, add position information to the embedding vector in the positional encoding layer, branch the embedding vector based on domain information, normalize the branched embedding vectors, and enter the normalized embedding vectors into the multi-head attention layer. It may then enter the output data of the multi-head attention layer into the first layer, normalize the pieces of output data of the first layer, enter the normalized pieces into the feed-forward layer, enter the output data of the feed-forward layer into the second layer, normalize the pieces of output data of the second layer, and enter the normalized pieces into the linearization layer and the softmax layer to obtain result data. Various other embodiments understood from the specification are also possible.
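One way to read the "branch based on domain information, then normalize" step is as layer normalization with a separate set of parameters per domain. The sketch below shows only that reading, under stated assumptions: the class name `DomainLayerNorm`, the domain labels, and the per-domain gain and bias parameters are hypothetical and do not come from the abstract.

```python
import math
from typing import Dict, List, Sequence, Tuple


def layer_norm(
    x: Sequence[float],
    gain: Sequence[float],
    bias: Sequence[float],
    eps: float = 1e-5,
) -> List[float]:
    """Standard layer normalization over one vector, followed by scale and shift."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    scale = 1.0 / math.sqrt(var + eps)
    return [g * (v - mean) * scale + b for v, g, b in zip(x, gain, bias)]


class DomainLayerNorm:
    """Hypothetical per-domain normalization: each domain has its own gain and bias."""

    def __init__(self, dim: int, domains: Sequence[str]) -> None:
        # Assumption: parameters start at identity (gain 1, bias 0) for every domain.
        self.params: Dict[str, Tuple[List[float], List[float]]] = {
            d: ([1.0] * dim, [0.0] * dim) for d in domains
        }

    def __call__(self, x: Sequence[float], domain: str) -> List[float]:
        # "Branching": select the parameter set that matches the domain label.
        gain, bias = self.params[domain]
        return layer_norm(x, gain, bias)
```

For example, `DomainLayerNorm(3, ["music", "weather"])([1.0, 2.0, 3.0], "music")` normalizes the vector with the parameters of the hypothetical `music` branch. The attention, feed-forward, and softmax layers named in the abstract are outside the scope of this sketch.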
-