SPEECH SIGNAL RECOGNITION SYSTEM AND METHOD
    Invention application

    Publication number: US20190088251A1

    Publication date: 2019-03-21

    Application number: US15916512

    Filing date: 2018-03-09

    Abstract: A speech signal recognition method, apparatus, and system. The speech signal recognition method may include: obtaining, by or from a terminal, an output of a personalization layer for a speech signal provided by a user of the terminal, the output having been produced by inputting the speech signal to the personalization layer, which was previously trained based on speech features of the user; implementing a global model by providing the obtained output of the personalization layer to the global model, which was previously trained based on speech features common to a plurality of users and is configured to output a phonemic signal indicating a phoneme included in the speech signal; and re-training the personalization layer based on the phonemic signal output from the global model. The personalization layer and the global model collectively represent an acoustic model.
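
The split described in this abstract (a per-user layer feeding a shared, frozen model, with only the per-user layer re-trained) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the class names, the element-wise scale-and-bias form of the personalization layer, the linear-plus-softmax global model, and the use of the global model's most likely phoneme as a pseudo-label for re-training.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [v / total for v in exps]


class PersonalizationLayer:
    """Hypothetical per-user layer: element-wise scale and bias on the features."""

    def __init__(self, dim):
        self.scale = [1.0] * dim
        self.bias = [0.0] * dim

    def forward(self, x):
        return [s * v + b for s, v, b in zip(self.scale, x, self.bias)]


class GlobalModel:
    """Hypothetical shared model: one weight row per phoneme, never updated here."""

    def __init__(self, weights):
        self.weights = weights

    def forward(self, h):
        logits = [sum(w * v for w, v in zip(row, h)) for row in self.weights]
        return softmax(logits)


def retrain_step(layer, model, x, lr=0.1):
    """One gradient step on the personalization layer only.

    Assumption: the phoneme the global model outputs is used as a pseudo-label,
    and the layer is updated to lower the cross-entropy against that label.
    """
    h = layer.forward(x)
    probs = model.forward(h)
    target = max(range(len(probs)), key=probs.__getitem__)
    # Gradient of cross-entropy with respect to the logits: probs - one_hot(target).
    dlogits = [p - (1.0 if k == target else 0.0) for k, p in enumerate(probs)]
    # Back-propagate through the frozen linear layer to the layer output h.
    dh = [
        sum(dlogits[k] * model.weights[k][i] for k in range(len(probs)))
        for i in range(len(h))
    ]
    for i in range(len(h)):
        layer.scale[i] -= lr * dh[i] * x[i]
        layer.bias[i] -= lr * dh[i]
    return target, probs[target]
```

In this sketch the global model's weights are read but never written, which mirrors the abstract's division of labor: the shared model stays fixed while the small per-user layer adapts.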

    METHOD OF TRAINING IMAGE REPRESENTATION MODEL

    Publication number: US20240078785A1

    Publication date: 2024-03-07

    Application number: US18116602

    Filing date: 2023-03-02

    CPC classification number: G06V10/761 G06V10/764

    Abstract: A method generates an anchor image embedding vector for an anchor image using an image representation model, determines first similarities between the anchor image and negative samples of the anchor image using first image embedding vectors for the negative samples and the generated anchor image embedding vector, determines second similarities between the anchor image and positive samples of the anchor image using second image embedding vectors for the positive samples and the generated anchor image embedding vector, obtains one of a vector corresponding to a label of the anchor image and third similarities between the label of the anchor image and labels of the negative samples, and determines a loss value for the anchor image based on the determined first similarities, the determined second similarities, and one of the obtained third similarities and a fourth similarity.
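
The abstract names the inputs to the loss but not its formula, so the following is only a sketch of one way such a loss could be assembled. It assumes cosine similarity, a temperature-scaled contrastive form, and a weighting in which a negative sample whose label is similar to the anchor's label (a third similarity, assumed to lie in [0, 1]) contributes less to the denominator. The function names, the `temperature` parameter, and the weighting are all hypothetical, and the fourth similarity is left out because the abstract does not define it.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def anchor_loss(anchor, positives, negatives, label_sims, temperature=0.1):
    """Contrastive-style loss for one anchor embedding (assumed form).

    label_sims[i] is the similarity between the anchor's label and the label
    of negatives[i]; it is assumed to lie in [0, 1].
    """
    first = [cosine(anchor, n) for n in negatives]    # anchor vs. negative samples
    second = [cosine(anchor, p) for p in positives]   # anchor vs. positive samples
    # Assumed weighting: negatives with similar labels are pushed away less.
    weights = [1.0 - s for s in label_sims]
    neg_term = sum(w * math.exp(s / temperature) for w, s in zip(weights, first))
    losses = []
    for s in second:
        pos = math.exp(s / temperature)
        losses.append(-math.log(pos / (pos + neg_term)))
    return sum(losses) / len(losses)
```

With this weighting, raising a negative's label similarity toward 1 shrinks its contribution, so the loss for the same embeddings goes down; whether the patent weights negatives this way is not stated in the abstract.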

    MULTILEVEL SPEECH RECOGNITION METHOD AND APPARATUS
    Invention application
    Status: Under examination (published)

    Publication number: US20160012820A1

    Publication date: 2016-01-14

    Application number: US14558479

    Filing date: 2014-12-02

    CPC classification number: G10L15/32 G10L15/1822 G10L2015/223

    Abstract: A multilevel speech recognition method and an apparatus performing the method are disclosed. The method includes receiving a first speech command from a user through a speech interface, and extracting a keyword from the first speech command. The method also includes providing a candidate application group of a category providing a service associated with the keyword, and processing a second speech command from the user associated with an application selected from the candidate application group.
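
The two-stage flow in this abstract can be sketched as follows. The sketch assumes the speech commands have already been transcribed to text, and the keyword table, category names, application names, and function names are all hypothetical placeholders rather than anything specified in the patent.

```python
# Hypothetical keyword-to-category and category-to-application tables.
CATEGORY_KEYWORDS = {
    "music": {"song", "music", "play"},
    "navigation": {"route", "map", "directions"},
}
CATEGORY_APPS = {
    "music": ["PlayerA", "PlayerB"],
    "navigation": ["MapsA"],
}


def extract_keyword(command):
    """Return (keyword, category) for the first known keyword in the command."""
    for raw in command.lower().split():
        word = raw.strip(".,!?")
        for category, keywords in CATEGORY_KEYWORDS.items():
            if word in keywords:
                return word, category
    return None, None


def candidate_group(first_command):
    """First level: map the first command to a candidate application group."""
    _, category = extract_keyword(first_command)
    return CATEGORY_APPS.get(category, [])


def select_application(candidates, utterance):
    """Pick the candidate application whose name appears in the utterance."""
    spoken = utterance.lower()
    for app in candidates:
        if app.lower() in spoken:
            return app
    return None


def process_second_command(app, second_command):
    """Second level: bind the follow-up command to the selected application."""
    return {"application": app, "command": second_command.strip()}
```

For example, a first command such as "I want to hear a song." yields the hypothetical music group, after which a follow-up utterance is handled in the context of whichever application the user selected from that group.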

