Method and terminal for generating a text based on self-encoding neural network, and medium

    公开(公告)号:US11487952B2

    公开(公告)日:2022-11-01

    申请号:US16637274

    申请日:2019-06-26

    IPC分类号: G06F40/40 G06N3/04 G06N3/08

    摘要: The present disclosure relates to the technical field of natural language understanding, and provides a method, a terminal and a medium for generating a text based on a self-encoding neural network. The method includes: obtaining a text word vector and a classification requirement of a statement to be input; reversely inputting the text word vector into a trained self-encoding neural network model to obtain a hidden feature of an intermediate hidden layer of the self-encoding neural network model; modifying the hidden feature according to a preset classification scale and the classification requirement; defining the modified hidden feature as the intermediate hidden layer of the self-encoding neural network model, and reversely generating a word vector corresponding to an input layer of the self-encoding neural network model by the intermediate hidden layer; and generating the corresponding text, according to the generated word vector.

    Car damage picture angle correction method, electronic device, and readable storage medium

    公开(公告)号:US11386499B2

    公开(公告)日:2022-07-12

    申请号:US16084993

    申请日:2017-09-30

    摘要: Disclosed are a car damage picture angle correction method, an electronic device, and a readable storage medium. The method includes: after receiving a car damage picture to be classified and identified, identifying a rotation category corresponding to the received car damage picture by using a pre-trained picture rotation category identification model; determining a rotation control parameter corresponding to the identified rotation category according to a pre-determined mapping relation between rotation categories and rotation control parameters, the rotation control parameter including a rotation angle and a rotation direction; and rotating the received car damage picture according to the determined rotation control parameter, so as to generate an angle-normal car damage picture. The disclosure can perform car damage picture angle correction more comprehensively and more effectively with no need to artificially perform angle identification on a car damage picture and to manually rotate the picture, thereby achieving a higher efficiency and accuracy.

    Speech recognition with trained GMM-HMM and LSTM models

    公开(公告)号:US11062699B2

    公开(公告)日:2021-07-13

    申请号:US16348807

    申请日:2017-08-31

    摘要: A speech recognition method, comprising: acquiring speech data to be recognized; extracting a Filter Bank feature and a Mel-Frequency Cepstral Coefficient (MFCC) feature in the speech data; using the MFCC feature as input data of a trained Gaussian Mixture Model-Hidden Markov Model (GMM-HMM) model, acquiring a first likelihood probability matrix outputted by the trained GMM-HMM model; using the Filter Bank feature as an input feature of a trained long short-term memory (LSTM) model which has a connection unit, acquiring a posterior probability matrix outputted by the LSTM model; using the posterior probability matrix and the first likelihood probability matrix as input data of a trained HMM model, acquiring an second likelihood probability matrix; and acquiring a target word sequence corresponding to the speech data to be recognized from a phoneme decoding network according to the second likelihood probability matrix.

    Acoustic model training method, speech recognition method, apparatus, device and medium

    公开(公告)号:US11030998B2

    公开(公告)日:2021-06-08

    申请号:US16097850

    申请日:2017-08-31

    IPC分类号: G10L15/14 G10L15/02 G10L15/16

    摘要: An acoustic model training method, a speech recognition method, an apparatus, a device and a medium. The acoustic model training method comprises: performing feature extraction on a training speech signal to obtain an audio feature sequence; training the audio feature sequence by a phoneme mixed Gaussian Model-Hidden Markov Model to obtain a phoneme feature sequence; and training the phoneme feature sequence by a Deep Neural Net-Hidden Markov Model-sequence training model to obtain a target acoustic model. The acoustic model training method can effectively save time required for an acoustic model training, improve the training efficiency, and ensure the recognition efficiency.

    METHOD AND TERMINAL FOR GENERATING A TEXT BASED ON SELF-ENCODING NEURAL NETWORK, AND MEDIUM

    公开(公告)号:US20210165970A1

    公开(公告)日:2021-06-03

    申请号:US16637274

    申请日:2019-06-26

    IPC分类号: G06F40/40 G06N3/04 G06N3/08

    摘要: The present disclosure relates to the technical field of natural language understanding, and provides a method, a terminal and a medium for generating a text based on a self-encoding neural network. The method includes: obtaining a text word vector and a classification requirement of a statement to be input; reversely inputting the text word vector into a trained self-encoding neural network model to obtain a hidden feature of an intermediate hidden layer of the self-encoding neural network model; modifying the hidden feature according to a preset classification scale and the classification requirement; defining the modified hidden feature as the intermediate hidden layer of the self-encoding neural network model, and reversely generating a word vector corresponding to an input layer of the self-encoding neural network model by the intermediate hidden layer; and generating the corresponding text, according to the generated word vector.