-
公开(公告)号:US11080492B2
公开(公告)日:2021-08-03
申请号:US16692549
申请日:2019-11-22
Inventor: Zhuobin Deng , Liqun Zheng , Xiyi Luo , Zhihong Fu , Jingzhou He
IPC: G06F40/30 , G06F40/289 , G06N3/04 , G06N3/08
Abstract: A method and device for correcting an error in a text are provided. The method includes: preprocessing the text to obtain at least one segment of the text; generating a plurality of candidate segments for the segment; scoring the plurality of candidate segments with a tree model, to obtain respective first scoring results of the plurality of candidate segments; scoring the plurality of candidate segments with a deep neural network model, to obtain respective second scoring results of the plurality of candidate segments; for each candidate segment, calculating a scoring of the candidate segment based on the first scoring result and the second scoring result of the candidate segment; ranking the plurality of candidate segments according to the scorings of the candidate segments, to obtain a ranking result; and correcting the error in the text according to the ranking result.
-
公开(公告)号:US20180144024A1
公开(公告)日:2018-05-24
申请号:US15677612
申请日:2017-08-15
Inventor: Zhihong Fu , Zengfeng Zeng , Qiugen Xiao , Jingzhou He , Lei Shi , Pengkai Li
CPC classification number: G06F16/243 , G06F16/2453 , G06F16/951 , G06F17/2705 , G06F17/273 , G06N5/048
Abstract: A method and an apparatus for correcting a query based on artificial intelligence, including: receiving a first query input by a user, and judging whether the first query satisfies an error correcting condition according to a preset error correcting strategy; determining a first segment to be corrected in the first query if the first query satisfies the error correcting condition; acquiring one or more first candidate results corresponding to the first segment according to a preset candidate recalling strategy; determining an error correcting result corresponding to the first segment according to quality feature values of the one or more first candidate results; and performing an error correction on the first query according to the error correcting result, and generating a second query.
-
3.
公开(公告)号:US20180365227A1
公开(公告)日:2018-12-20
申请号:US15941065
申请日:2018-03-30
Inventor: Liqun Zheng , Jinbo Zhan , Qiugen Xiao , Zhihong Fu , Jingzhou He , Guyue Zhou
IPC: G06F17/27
Abstract: Embodiments of the present disclosure disclose a method and an apparatus for customizing a word segmentation model based on artificial intelligence, a device and a medium. The method includes the followings. A customized segmentation training corpus is acquired. A first preset word segmentation model is rectified with an increment training method or a weight intervention method, based on the customized segmentation training corpus, to obtain a customized word segmentation model corresponding to the customized segmentation training corpus.
-
公开(公告)号:US10929390B2
公开(公告)日:2021-02-23
申请号:US15677612
申请日:2017-08-15
Inventor: Zhihong Fu , Zengfeng Zeng , Qiugen Xiao , Jingzhou He , Lei Shi , Pengkai Li
IPC: G06F15/16 , G06F16/242 , G06N5/04 , G06F16/951 , G06F16/2453 , G06F40/205 , G06F40/232
Abstract: A method and an apparatus for correcting a query based on artificial intelligence, including: receiving a first query input by a user, and judging whether the first query satisfies an error correcting condition according to a preset error correcting strategy; determining a first segment to be corrected in the first query if the first query satisfies the error correcting condition; acquiring one or more first candidate results corresponding to the first segment according to a preset candidate recalling strategy; determining an error correcting result corresponding to the first segment according to quality feature values of the one or more first candidate results; and performing an error correction on the first query according to the error correcting result, and generating a second query.
-
公开(公告)号:US10650096B2
公开(公告)日:2020-05-12
申请号:US15934410
申请日:2018-03-23
Inventor: Liqun Zheng , Jinbo Zhan , Qiugen Xiao , Zhihong Fu , Jingzhou He , Guyue Zhou
Abstract: Embodiments of the present disclosure disclose a word segmentation method based on artificial intelligence, a server and a storage medium. The word segmentation method may include: acquiring a corpus to be segmented and a segmentation model corresponding to a preset segmentation template; matching the corpus to be segmented with the segmentation model according to a preset matching algorithm, and acquiring a target phrase satisfying a first preset rule in the corpus to be segmented; modifying an emission matrix corresponding to the segmentation model and the corpus to be segmented according to the target phrase; and performing a word segmentation on the corpus to be segmented according to the emission matrix modified, to acquire a first segmentation result.
-
6.
公开(公告)号:US11443100B2
公开(公告)日:2022-09-13
申请号:US16950975
申请日:2020-11-18
Inventor: Mengdi Xu , Zhuobin Deng , Jiawei Lai , Zhihong Fu , Jingzhou He
IPC: G06F40/00 , G06F40/166 , G06N20/00 , G06F40/20 , G06K9/62
Abstract: A method and apparatus for correcting character errors, an electronic device and a storage medium are disclosed, which relates to the natural language processing field and the deep learning field. The method may include: for a character to be processed, acquiring the score of each character in a pre-constructed vocabulary, the score being a score of the reasonability of the character in the vocabulary at the position of the character to be processed; selecting top K characters as candidates of the character to be processed, K being a positive integer greater than one; selecting an optimal candidate from the K candidates; and replacing the character to be processed with the optimal candidate if the optimal candidate is different from the character to be processed. With the solution of the present application, the accuracy of an error correction result, or the like, may be improved.
-
7.
公开(公告)号:US10664659B2
公开(公告)日:2020-05-26
申请号:US15934496
申请日:2018-03-23
Inventor: Liqun Zheng , Jinbo Zhan , Qiugen Xiao , Zhihong Fu , Jingzhou He , Guyue Zhou
IPC: G06F40/284 , G06N3/08 , G06N3/04 , G06F40/242
Abstract: Embodiments of the present disclosure disclose a method for modifying a segmentation model based on artificial intelligence, a device and a storage medium. The method may include: acquiring a model parameter of the segmentation model, and performing a training on a feature vector corresponding to a preset generalized feature of a first training corpus via a neural network so as to acquire a model parameter of the preset generalized feature; performing a word segmentation on the first training corpus according to the model parameter of the segmentation model and the model parameter of the preset generalized feature, so as to acquire a segmentation result; and comparing the segmentation result with the first training corpus according to a preset rule, and modifying the model parameter of the segmentation model and a parameter of the neural network according to a comparison result.
-
公开(公告)号:US10650102B2
公开(公告)日:2020-05-12
申请号:US15900166
申请日:2018-02-20
Inventor: Pengkai Li , Jingzhou He , Zhihong Fu , Xianwei Xin
Abstract: The present disclosure discloses a method and apparatus for generating a parallel text in the same language. The method comprises: acquiring a source segmented word sequence and a pre-trained word vector table; determining a source word vector sequence corresponding to the source segmented word sequence, according to the word vector table; importing the source word vector sequence into a first pre-trained recurrent neural network model, to generate an intermediate vector of a preset dimension for characterizing semantics of the source segmented word sequence; importing the intermediate vector into a second pre-trained recurrent neural network model, to generate a target word vector sequence corresponding to the intermediate vector; and determining a target segmented word sequence corresponding to the target word vector sequence according to the word vector table, and determining the target segmented word sequence as a parallel text in the same language corresponding to the source segmented word sequence.
-
公开(公告)号:US20180365231A1
公开(公告)日:2018-12-20
申请号:US15900166
申请日:2018-02-20
Inventor: Pengkai Li , Jingzhou He , Zhihong Fu , Xianwei Xin
Abstract: The present disclosure discloses a method and apparatus for generating a parallel text in the same language. The method comprises: acquiring a source segmented word sequence and a pre-trained word vector table; determining a source word vector sequence corresponding to the source segmented word sequence, according to the word vector table; importing the source word vector sequence into a first pre-trained recurrent neural network model, to generate an intermediate vector of a preset dimension for characterizing semantics of the source segmented word sequence;importing the intermediate vector into a second pre-trained recurrent neural network model, to generate a target word vector sequence corresponding to the intermediate vector; and determining a target segmented word sequence corresponding to the target word vector sequence according to the word vector table, and determining the target segmented word sequence as a parallel text in the same language corresponding to the source segmented word sequence.
-
-
-
-
-
-
-
-