Multi-classifier-based recommendation method and device, and electronic device

    公开(公告)号:US11269966B2

    公开(公告)日:2022-03-08

    申请号:US16918869

    申请日:2020-07-01

    Abstract: The present specification discloses a multi-classifier-based recommendation method and device, and an electronic device. In the method, a target sub-classifier is obtained from k sub-classifiers in a multi-classifier based on received data, where the target sub-classifier is a sub-classifier whose data distribution has a highest similarity with feature data in the multi-classifier; the first prediction data obtained after the primary classifier performs prediction on the feature data is obtained, the second prediction data obtained after the target sub-classifier performs prediction on the feature data is obtained, and then the third prediction data is obtained by combining the first prediction data and the second prediction data, so that more accurate prediction data is obtained, thereby resolving the technical problem of low accuracy of data classification prediction in the existing technology; and the feature data is recommended based on the third prediction data, so that the accuracy of data recommendation is improved.

    EXTENDING QUESTION AND ANSWER SAMPLES

    公开(公告)号:US20210027177A1

    公开(公告)日:2021-01-28

    申请号:US16818927

    申请日:2020-03-13

    Abstract: Implementations of the present specification provide a method and an apparatus for extending question and answer samples. According to the method, a random number is generated for each existing sample, a question is blurred for a sample whose random number belongs to sample extension random numbers, to generate an extended sample, so that an overall sample blurring extension rate can be effectively controlled. In addition, for a sample needing blurring extension, a question is extended by deleting a word with a predetermined part of speech in the corresponding question, and then an extended sample is generated based on an extended question, so that more question expression ways are compatible. As such, a question and answer model is trained by using a sample set to which extended samples are added, so that an answer can be provided to a user more effectively.

    Method and device for virtual resource allocation, modeling, and data prediction

    公开(公告)号:US10891161B2

    公开(公告)日:2021-01-12

    申请号:US16907637

    申请日:2020-06-22

    Abstract: Evaluation results of a plurality of users are received from a plurality of data providers. The evaluation results are obtained by the plurality of data providers evaluating the plurality of users based on evaluation models of the plurality of data providers. A plurality of training samples is constructed by using the evaluation results. Each training sample includes a respective subset of the evaluation results corresponding to a same user of the plurality of users. A label for each training sample is generated based on an actual service execution status of the same user. A model is trained based on the plurality of training samples and the plurality of labels, including setting a plurality of variable coefficients, each variable coefficient specifying a contribution level of a corresponding data provider. Virtual resources to each data provider are allocated based on the plurality of variable coefficients.

    Method and apparatus for clustering data stream

    公开(公告)号:US11226993B2

    公开(公告)日:2022-01-18

    申请号:US16684831

    申请日:2019-11-15

    Abstract: Provided is a method for clustering a data stream. The method comprises: acquiring a plurality of resulting models of a plurality of preceding data partitions prior to a current data partition in a data stream, wherein data partitions in the data stream have a temporal relationship, and wherein each of the plurality of resulting models is generated according to a clustering result of a corresponding preceding data partition, and each of the plurality of resulting models comprises one or more representative parameters in different categories; determining a starting model of the current data partition according to the plurality of resulting models, wherein the starting model comprises one or more representative parameters in different categories determined based on representative parameters of the same category in the plurality of resulting models; and clustering data records in the current data partition by using the starting model.

    Model training method and apparatus based on data sharing

    公开(公告)号:US11106804B2

    公开(公告)日:2021-08-31

    申请号:US16720931

    申请日:2019-12-19

    Abstract: Techniques for data sharing between a data miner and a data provider are provided. A set of public parameters is downloaded from the data miner. The public parameters are data miner parameters associated with a feature set of training sample data. A set of private parameters in the data provider can be replaced with the set of public parameters. The private parameters are data provider parameters associated with the feature set of training sample data. The private parameters are updated to provide a set of update results. The private parameters are updated based on a model parameter update algorithm associated with the data provider. The update results is uploaded to the data miner.

    Data secruity enhancement by model training

    公开(公告)号:US10929558B2

    公开(公告)日:2021-02-23

    申请号:US16047399

    申请日:2018-07-27

    Abstract: Encrypted user data are received at a service device from at least one user equipment, and the user data is encrypted in a trusted zone of the at least one user equipment. The encrypted user data then be decrypted in a trust zone of the service device by a first central processing unit (CPU) to obtain decrypted user data. A model is trained by using the decrypted user data to determine a training intermediate value and a training effective representative value, and a determination is made whether the training effective representative value satisfies a specified condition is determined. If so, the trained model is generated based on a model parameter. Otherwise, a model parameter is iterately adjusted and the model is iteratively trained based on an adjusted model parameter until the trained effective representative value satisfies the specified condition.

    Model training method and apparatus based on data sharing

    公开(公告)号:US11106802B2

    公开(公告)日:2021-08-31

    申请号:US16053606

    申请日:2018-08-02

    Abstract: Techniques for data sharing between a data miner and a data provider are provided. A set of public parameters is downloaded from the data miner. The public parameters are data miner parameters associated with a feature set of training sample data. A set of private parameters in the data provider can be replaced with the set of public parameters. The private parameters are data provider parameters associated with the feature set of training sample data. The private parameters are updated to provide a set of update results. The private parameters are updated based on a model parameter update algorithm associated with the data provider. The update results is uploaded to the data miner.

    Extending question and answer samples

    公开(公告)号:US11100412B2

    公开(公告)日:2021-08-24

    申请号:US16818927

    申请日:2020-03-13

    Abstract: Implementations of the present specification provide a method and an apparatus for extending question and answer samples. According to the method, a random number is generated for each existing sample, a question is blurred for a sample whose random number belongs to sample extension random numbers, to generate an extended sample, so that an overall sample blurring extension rate can be effectively controlled. In addition, for a sample needing blurring extension, a question is extended by deleting a word with a predetermined part of speech in the corresponding question, and then an extended sample is generated based on an extended question, so that more question expression ways are compatible. As such, a question and answer model is trained by using a sample set to which extended samples are added, so that an answer can be provided to a user more effectively.

Patent Agency Ranking