-
公开(公告)号:US11269966B2
公开(公告)日:2022-03-08
申请号:US16918869
申请日:2020-07-01
Applicant: Advanced New Technologies Co., Ltd.
Inventor: Xinxing Yang , Jun Zhou , Xiaolong Li
IPC: G06F16/00 , G06F16/9535 , G06K9/62
Abstract: The present specification discloses a multi-classifier-based recommendation method and device, and an electronic device. In the method, a target sub-classifier is obtained from k sub-classifiers in a multi-classifier based on received data, where the target sub-classifier is a sub-classifier whose data distribution has a highest similarity with feature data in the multi-classifier; the first prediction data obtained after the primary classifier performs prediction on the feature data is obtained, the second prediction data obtained after the target sub-classifier performs prediction on the feature data is obtained, and then the third prediction data is obtained by combining the first prediction data and the second prediction data, so that more accurate prediction data is obtained, thereby resolving the technical problem of low accuracy of data classification prediction in the existing technology; and the feature data is recommended based on the third prediction data, so that the accuracy of data recommendation is improved.
-
公开(公告)号:US20210027177A1
公开(公告)日:2021-01-28
申请号:US16818927
申请日:2020-03-13
Applicant: Advanced New Technologies Co., Ltd.
Inventor: Kaisheng Yao , Jiaxing Zhang , Jia Liu , Xiaolong Li
Abstract: Implementations of the present specification provide a method and an apparatus for extending question and answer samples. According to the method, a random number is generated for each existing sample, a question is blurred for a sample whose random number belongs to sample extension random numbers, to generate an extended sample, so that an overall sample blurring extension rate can be effectively controlled. In addition, for a sample needing blurring extension, a question is extended by deleting a word with a predetermined part of speech in the corresponding question, and then an extended sample is generated based on an extended question, so that more question expression ways are compatible. As such, a question and answer model is trained by using a sample set to which extended samples are added, so that an answer can be provided to a user more effectively.
-
公开(公告)号:US10891161B2
公开(公告)日:2021-01-12
申请号:US16907637
申请日:2020-06-22
Applicant: Advanced New Technologies Co., Ltd.
Inventor: Jun Zhou , Xiaolong Li
Abstract: Evaluation results of a plurality of users are received from a plurality of data providers. The evaluation results are obtained by the plurality of data providers evaluating the plurality of users based on evaluation models of the plurality of data providers. A plurality of training samples is constructed by using the evaluation results. Each training sample includes a respective subset of the evaluation results corresponding to a same user of the plurality of users. A label for each training sample is generated based on an actual service execution status of the same user. A model is trained based on the plurality of training samples and the plurality of labels, including setting a plurality of variable coefficients, each variable coefficient specifying a contribution level of a corresponding data provider. Virtual resources to each data provider are allocated based on the plurality of variable coefficients.
-
公开(公告)号:US11226993B2
公开(公告)日:2022-01-18
申请号:US16684831
申请日:2019-11-15
Applicant: ADVANCED NEW TECHNOLOGIES CO., LTD.
Inventor: Jun Zhou , Xiaolong Li
IPC: G06F16/28 , G06F16/2455 , G06F16/2458 , G06F16/21 , G06F16/27 , G06N7/00
Abstract: Provided is a method for clustering a data stream. The method comprises: acquiring a plurality of resulting models of a plurality of preceding data partitions prior to a current data partition in a data stream, wherein data partitions in the data stream have a temporal relationship, and wherein each of the plurality of resulting models is generated according to a clustering result of a corresponding preceding data partition, and each of the plurality of resulting models comprises one or more representative parameters in different categories; determining a starting model of the current data partition according to the plurality of resulting models, wherein the starting model comprises one or more representative parameters in different categories determined based on representative parameters of the same category in the plurality of resulting models; and clustering data records in the current data partition by using the starting model.
-
公开(公告)号:US11106804B2
公开(公告)日:2021-08-31
申请号:US16720931
申请日:2019-12-19
Applicant: Advanced New Technologies Co., Ltd.
Inventor: Peilin Zhao , Jun Zhou , Xiaolong Li , Longfei Li
Abstract: Techniques for data sharing between a data miner and a data provider are provided. A set of public parameters is downloaded from the data miner. The public parameters are data miner parameters associated with a feature set of training sample data. A set of private parameters in the data provider can be replaced with the set of public parameters. The private parameters are data provider parameters associated with the feature set of training sample data. The private parameters are updated to provide a set of update results. The private parameters are updated based on a model parameter update algorithm associated with the data provider. The update results is uploaded to the data miner.
-
公开(公告)号:US11354512B2
公开(公告)日:2022-06-07
申请号:US16704358
申请日:2019-12-05
Applicant: Advanced New Technologies Co., Ltd.
Inventor: Xiaofu Chang , Linlin Chao , Peng Xu , Xiaolong Li
Abstract: A dialog generation method includes: training a sequence to sequence (seq2seq)-based dialog model using a loss function including topic range constraint information; and generating a dialog using the trained dialog model. With the dialog generation method, topic range constraint information is introduced in the process of dialog model training using a loss function including the topic range constraint information, thus helping to prevent the trained model from producing low-quality meaningless replies.
-
公开(公告)号:US10929558B2
公开(公告)日:2021-02-23
申请号:US16047399
申请日:2018-07-27
Applicant: Advanced New Technologies Co., Ltd.
Inventor: Ling Xie , Xiaolong Li
Abstract: Encrypted user data are received at a service device from at least one user equipment, and the user data is encrypted in a trusted zone of the at least one user equipment. The encrypted user data then be decrypted in a trust zone of the service device by a first central processing unit (CPU) to obtain decrypted user data. A model is trained by using the decrypted user data to determine a training intermediate value and a training effective representative value, and a determination is made whether the training effective representative value satisfies a specified condition is determined. If so, the trained model is generated based on a model parameter. Otherwise, a model parameter is iterately adjusted and the model is iteratively trained based on an adjusted model parameter until the trained effective representative value satisfies the specified condition.
-
公开(公告)号:US11257007B2
公开(公告)日:2022-02-22
申请号:US16587977
申请日:2019-09-30
Applicant: Advanced New Technologies Co., Ltd.
Inventor: Xinxing Yang , Shaosheng Cao , Jun Zhou , Xiaolong Li
Abstract: An N×M dimensional target matrix is generated based on N data samples and M dimensional data features respectively corresponding to the N data samples. Encryption calculation is performed on the N×M dimensional target matrix based on a Principal Component Analysis (PCA) algorithm to obtain an N×K dimensional encryption matrix K is less than M. The N×K dimensional encryption matrix is transmitted to a modeling server. The modeling server trains a machine learning model by using the N×K dimensional encryption matrix as a training sample.
-
公开(公告)号:US11106802B2
公开(公告)日:2021-08-31
申请号:US16053606
申请日:2018-08-02
Applicant: Advanced New Technologies Co., Ltd.
Inventor: Peilin Zhao , Jun Zhou , Xiaolong Li , Longfei Li
Abstract: Techniques for data sharing between a data miner and a data provider are provided. A set of public parameters is downloaded from the data miner. The public parameters are data miner parameters associated with a feature set of training sample data. A set of private parameters in the data provider can be replaced with the set of public parameters. The private parameters are data provider parameters associated with the feature set of training sample data. The private parameters are updated to provide a set of update results. The private parameters are updated based on a model parameter update algorithm associated with the data provider. The update results is uploaded to the data miner.
-
公开(公告)号:US11100412B2
公开(公告)日:2021-08-24
申请号:US16818927
申请日:2020-03-13
Applicant: Advanced New Technologies Co., Ltd.
Inventor: Kaisheng Yao , Jiaxing Zhang , Jia Liu , Xiaolong Li
Abstract: Implementations of the present specification provide a method and an apparatus for extending question and answer samples. According to the method, a random number is generated for each existing sample, a question is blurred for a sample whose random number belongs to sample extension random numbers, to generate an extended sample, so that an overall sample blurring extension rate can be effectively controlled. In addition, for a sample needing blurring extension, a question is extended by deleting a word with a predetermined part of speech in the corresponding question, and then an extended sample is generated based on an extended question, so that more question expression ways are compatible. As such, a question and answer model is trained by using a sample set to which extended samples are added, so that an answer can be provided to a user more effectively.
-
-
-
-
-
-
-
-
-