Method and device for correcting error in text

    Publication No.: US11080492B2

    Publication Date: 2021-08-03

    Application No.: US16692549

    Filing Date: 2019-11-22

    Abstract: A method and device for correcting an error in a text are provided. The method includes: preprocessing the text to obtain at least one segment of the text; generating a plurality of candidate segments for the segment; scoring the plurality of candidate segments with a tree model, to obtain respective first scoring results of the plurality of candidate segments; scoring the plurality of candidate segments with a deep neural network model, to obtain respective second scoring results of the plurality of candidate segments; for each candidate segment, calculating a scoring of the candidate segment based on the first scoring result and the second scoring result of the candidate segment; ranking the plurality of candidate segments according to the scorings of the candidate segments, to obtain a ranking result; and correcting the error in the text according to the ranking result.
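
The scoring and ranking steps in this abstract can be illustrated with a minimal Python sketch. The abstract does not say how the first and second scoring results are combined, so the weighted sum below is an assumption, and `rank_candidates`, `correct_segment`, `tree_weight` and the toy score tables are hypothetical names standing in for a trained tree model and deep neural network.

```python
from typing import Callable, List, Tuple

Scorer = Callable[[str], float]


def rank_candidates(
    candidates: List[str],
    tree_score: Scorer,
    dnn_score: Scorer,
    tree_weight: float = 0.5,
) -> List[Tuple[str, float]]:
    """Score each candidate with both models and sort best-first.

    The weighted sum is an assumed combination rule, not one taken
    from the abstract.
    """
    scored = []
    for candidate in candidates:
        first = tree_score(candidate)    # first scoring result (tree model)
        second = dnn_score(candidate)    # second scoring result (DNN model)
        combined = tree_weight * first + (1.0 - tree_weight) * second
        scored.append((candidate, combined))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored


def correct_segment(
    text: str,
    segment: str,
    candidates: List[str],
    tree_score: Scorer,
    dnn_score: Scorer,
) -> str:
    """Replace the first occurrence of `segment` with the top-ranked candidate."""
    ranking = rank_candidates(candidates, tree_score, dnn_score)
    best = ranking[0][0]
    return text.replace(segment, best, 1)


# Toy lookup tables standing in for the two trained models (hypothetical values).
tree_scores = {"their": 0.9, "there": 0.4, "thier": 0.1}
dnn_scores = {"their": 0.8, "there": 0.6, "thier": 0.2}

print(correct_segment("I saw thier dog", "thier", list(tree_scores),
                      tree_scores.get, dnn_scores.get))
# prints "I saw their dog"
```

Keeping the two scorers as plain callables means either model can be swapped out without touching the ranking logic.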

    Word segmentation method based on artificial intelligence, server and storage medium

    Publication No.: US10650096B2

    Publication Date: 2020-05-12

    Application No.: US15934410

    Filing Date: 2018-03-23

    Abstract: Embodiments of the present disclosure disclose a word segmentation method based on artificial intelligence, a server and a storage medium. The word segmentation method may include: acquiring a corpus to be segmented and a segmentation model corresponding to a preset segmentation template; matching the corpus to be segmented with the segmentation model according to a preset matching algorithm, and acquiring a target phrase satisfying a first preset rule in the corpus to be segmented; modifying an emission matrix corresponding to the segmentation model and the corpus to be segmented according to the target phrase; and performing a word segmentation on the corpus to be segmented according to the emission matrix modified, to acquire a first segmentation result.
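
One way to picture "modifying an emission matrix according to the target phrase" is the sketch below. It assumes a B/M/E/S character-tagging scheme decoded with Viterbi, which the abstract does not specify. The tag set, the `bonus` value, and the helpers `boost_phrase`, `viterbi` and `tags_to_words` are all hypothetical.

```python
from typing import List

TAGS = ["B", "M", "E", "S"]  # begin, middle, end of a word; single-character word
NEG = -1e9                   # score used for forbidden transitions
ALLOWED = {("B", "M"), ("B", "E"), ("M", "M"), ("M", "E"),
           ("E", "B"), ("E", "S"), ("S", "B"), ("S", "S")}


def boost_phrase(emission: List[List[float]], start: int, end: int,
                 bonus: float = 10.0) -> None:
    """In place: favour tagging characters start..end-1 as a single word."""
    length = end - start
    for offset in range(length):
        if length == 1:
            tag = "S"
        elif offset == 0:
            tag = "B"
        elif offset == length - 1:
            tag = "E"
        else:
            tag = "M"
        emission[start + offset][TAGS.index(tag)] += bonus


def transition(prev: int, cur: int) -> float:
    return 0.0 if (TAGS[prev], TAGS[cur]) in ALLOWED else NEG


def viterbi(emission: List[List[float]]) -> List[str]:
    """Return the best-scoring tag sequence for the emission matrix."""
    # A sentence can only start with B or S.
    score = [emission[0][t] if TAGS[t] in ("B", "S") else NEG for t in range(4)]
    back = []
    for row in emission[1:]:
        new_score, pointers = [], []
        for cur in range(4):
            prev = max(range(4), key=lambda p: score[p] + transition(p, cur))
            new_score.append(score[prev] + transition(prev, cur) + row[cur])
            pointers.append(prev)
        score = new_score
        back.append(pointers)
    # A sentence can only end with E or S.
    last = max((t for t in range(4) if TAGS[t] in ("E", "S")),
               key=lambda t: score[t])
    path = [last]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return [TAGS[t] for t in path]


def tags_to_words(text: str, tags: List[str]) -> List[str]:
    words, current = [], ""
    for char, tag in zip(text, tags):
        current += char
        if tag in ("E", "S"):
            words.append(current)
            current = ""
    if current:
        words.append(current)
    return words


text = "abcd"
# Toy emission scores that slightly prefer single-character words.
emission = [[0.0, 0.0, 0.0, 1.0] for _ in text]
boost_phrase(emission, 1, 3)  # pretend "bc" was matched as a target phrase
print(tags_to_words(text, viterbi(emission)))
```

In this sketch the matched phrase only biases the emission scores, so the decoder is still free to override the match when the rest of the sentence disagrees strongly enough.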

    Method and apparatus for correcting character errors, electronic device and storage medium

    Publication No.: US11443100B2

    Publication Date: 2022-09-13

    Application No.: US16950975

    Filing Date: 2020-11-18

    Abstract: A method and apparatus for correcting character errors, an electronic device and a storage medium are disclosed, which relates to the natural language processing field and the deep learning field. The method may include: for a character to be processed, acquiring the score of each character in a pre-constructed vocabulary, the score being a score of the reasonability of the character in the vocabulary at the position of the character to be processed; selecting top K characters as candidates of the character to be processed, K being a positive integer greater than one; selecting an optimal candidate from the K candidates; and replacing the character to be processed with the optimal candidate if the optimal candidate is different from the character to be processed. With the solution of the present application, the accuracy of an error correction result, or the like, may be improved.
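
The candidate selection described here can be sketched in a few lines of Python. The per-character scores would come from a trained model in practice; the `scores` dictionary, the `rerank` callable used to pick the optimal candidate, and the function names are assumptions made for illustration.

```python
import heapq
from typing import Callable, Dict, List


def top_k_candidates(scores: Dict[str, float], k: int) -> List[str]:
    """Return the K highest-scoring vocabulary characters (K must exceed one)."""
    if k <= 1:
        raise ValueError("K must be a positive integer greater than one")
    return heapq.nlargest(k, scores, key=scores.get)


def correct_character(
    text: str,
    position: int,
    scores: Dict[str, float],
    k: int,
    rerank: Callable[[str], float],
) -> str:
    """Replace text[position] with the optimal candidate if it differs."""
    candidates = top_k_candidates(scores, k)
    best = max(candidates, key=rerank)  # assumed rule for choosing the optimum
    if best != text[position]:
        return text[:position] + best + text[position + 1:]
    return text


# Toy reasonability scores for position 1 of "bxt" (hypothetical values).
scores = {"a": 0.1, "e": 0.7, "o": 0.6, "u": 0.05}
print(correct_character("bxt", 1, scores, k=2, rerank=scores.get))
# prints "bet"
```

Passing `rerank` separately from `scores` leaves room for a second-stage selector that uses different evidence from the first-stage vocabulary scores.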

    Method and apparatus for generating parallel text in same language

    Publication No.: US10650102B2

    Publication Date: 2020-05-12

    Application No.: US15900166

    Filing Date: 2018-02-20

    Abstract: The present disclosure discloses a method and apparatus for generating a parallel text in the same language. The method comprises: acquiring a source segmented word sequence and a pre-trained word vector table; determining a source word vector sequence corresponding to the source segmented word sequence, according to the word vector table; importing the source word vector sequence into a first pre-trained recurrent neural network model, to generate an intermediate vector of a preset dimension for characterizing semantics of the source segmented word sequence; importing the intermediate vector into a second pre-trained recurrent neural network model, to generate a target word vector sequence corresponding to the intermediate vector; and determining a target segmented word sequence corresponding to the target word vector sequence according to the word vector table, and determining the target segmented word sequence as a parallel text in the same language corresponding to the source segmented word sequence.
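
The data flow in this abstract (word vectors, encoder, intermediate vector, decoder, nearest word) can be traced with a small pure-Python sketch. It uses a plain tanh recurrent cell and hand-written placeholder weights in place of the pre-trained models, so the output is not a meaningful paraphrase. The cell type, the fixed number of decoding `steps`, the nearest-neighbour lookup and all names are assumptions.

```python
import math
from typing import Dict, List

Vector = List[float]
Matrix = List[List[float]]


def matvec(matrix: Matrix, vector: Vector) -> Vector:
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def encode(vectors: List[Vector], w_in: Matrix, w_hid: Matrix,
           hidden_dim: int) -> Vector:
    """First RNN: fold the source word vectors into one intermediate vector."""
    hidden = [0.0] * hidden_dim
    for x in vectors:
        pre = [a + b for a, b in zip(matvec(w_in, x), matvec(w_hid, hidden))]
        hidden = [math.tanh(v) for v in pre]
    return hidden


def decode(intermediate: Vector, w_hid: Matrix, w_out: Matrix,
           steps: int) -> List[Vector]:
    """Second RNN: unroll the intermediate vector into target word vectors."""
    hidden = intermediate
    outputs = []
    for _ in range(steps):
        hidden = [math.tanh(v) for v in matvec(w_hid, hidden)]
        outputs.append(matvec(w_out, hidden))
    return outputs


def nearest_word(vector: Vector, table: Dict[str, Vector]) -> str:
    """Map a vector back to the closest entry in the word vector table."""
    def squared_distance(word: str) -> float:
        return sum((a - b) ** 2 for a, b in zip(table[word], vector))
    return min(table, key=squared_distance)


def parallel_text(words: List[str], table: Dict[str, Vector], enc_in: Matrix,
                  enc_hid: Matrix, dec_hid: Matrix, dec_out: Matrix,
                  hidden_dim: int, steps: int) -> List[str]:
    source_vectors = [table[w] for w in words]
    intermediate = encode(source_vectors, enc_in, enc_hid, hidden_dim)
    target_vectors = decode(intermediate, dec_hid, dec_out, steps)
    return [nearest_word(v, table) for v in target_vectors]


# Placeholder word vectors and weights (not trained; illustration only).
table = {"big": [1.0, 0.0], "large": [0.9, 0.1], "dog": [0.0, 1.0]}
enc_in = [[0.5, 0.1], [0.2, 0.4], [0.3, 0.3]]        # hidden_dim x word_dim
enc_hid = [[0.1, 0.0, 0.2], [0.0, 0.1, 0.0], [0.2, 0.0, 0.1]]
dec_hid = [[0.3, 0.1, 0.0], [0.0, 0.3, 0.1], [0.1, 0.0, 0.3]]
dec_out = [[0.6, 0.2, 0.1], [0.1, 0.5, 0.3]]         # word_dim x hidden_dim

result = parallel_text(["big", "dog"], table, enc_in, enc_hid,
                       dec_hid, dec_out, hidden_dim=3, steps=2)
print(result)
```

With trained weights, the intermediate vector would carry the semantics of the source sequence. Here it only demonstrates that the vector has the preset dimension and that each decoded vector maps back to a table entry.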

    METHOD AND APPARATUS FOR GENERATING PARALLEL TEXT IN SAME LANGUAGE

    Publication No.: US20180365231A1

    Publication Date: 2018-12-20

    Application No.: US15900166

    Filing Date: 2018-02-20

    Abstract: The present disclosure discloses a method and apparatus for generating a parallel text in the same language. The method comprises: acquiring a source segmented word sequence and a pre-trained word vector table; determining a source word vector sequence corresponding to the source segmented word sequence, according to the word vector table; importing the source word vector sequence into a first pre-trained recurrent neural network model, to generate an intermediate vector of a preset dimension for characterizing semantics of the source segmented word sequence; importing the intermediate vector into a second pre-trained recurrent neural network model, to generate a target word vector sequence corresponding to the intermediate vector; and determining a target segmented word sequence corresponding to the target word vector sequence according to the word vector table, and determining the target segmented word sequence as a parallel text in the same language corresponding to the source segmented word sequence.
