Evaluation of content items against policies regulating content presentation by an online system using machine learning

    公开(公告)号:US10853696B1

    公开(公告)日:2020-12-01

    申请号:US16382162

    申请日:2019-04-11

    Applicant: Facebook, Inc.

    Abstract: An online system uses a model to detect violations of policies enforced by the online system for content uploaded to the online system by users for viewing by other users. The online system trains the model in multiple stages. To train the model, the online system obtains a set of training content items, with each content item of the set labeled with both a policy violated by the content item and a source of the content item, which acts as a proxy for a sub-category identifying a way in which the content item violated the policy. In the first stage, the online system trains the model using the set of training content items. In a second stage, the model of trained to predict policy violations from content items that are not labeled with a source. For example, the second stage is performed by freezing earlier layers in the model.

    MEMORIZATION MODEL FOR CONTEXT VIOLATIONS
    12.
    发明申请

    公开(公告)号:US20180349942A1

    公开(公告)日:2018-12-06

    申请号:US15608803

    申请日:2017-05-30

    Applicant: Facebook, Inc.

    Abstract: For various content campaigns (or content), an online system predicts a likelihood score of context violations (e.g., account term violations) of a content campaign. The online system derives a plurality of feature vectors of the content campaign. The online system predicts a likelihood score of context violation of the content campaign using a memorization model based on the plurality of feature vectors. The memorization model comprises a plurality of categories and a plurality of items of each category. Each of the plurality of categories has a category weight, and each of the plurality of items of each category has an item weight. The predicted likelihood score is based on a combination of a plurality of category weights and a plurality of item weights associated with the plurality of feature vectors. The online system performs an action affecting the content campaign based in part on the predicted likelihood score.

    EVALUATING CONTENT FOR COMPLIANCE WITH A CONTENT POLICY ENFORCED BY AN ONLINE SYSTEM USING A MACHINE LEARNING MODEL DETERMINING COMPLIANCE WITH ANOTHER CONTENT POLICY

    公开(公告)号:US20180253661A1

    公开(公告)日:2018-09-06

    申请号:US15449448

    申请日:2017-03-03

    Applicant: Facebook, Inc.

    CPC classification number: G06N20/00 G06N7/005 G06Q30/0269 G06Q50/01

    Abstract: An online system maintains machine learning models that determine risk scores for content items indicating likelihoods of content items violating content policies associated with the machine learning models. When the online system obtains an additional content policy, the online system applies a maintained machine learning model to a set including content items previously identified as violating or not violating the additional content policy. The online system maps the risk scores determined for content items of the set to likelihoods of violating the additional content policy based on the identifications of content times in the set violating or not violating the additional content policy. Subsequently, the online system applies the maintained machine learning model to content items and determines likelihoods of the content items violating the additional content policy based on the mapping of risk scores to likelihood of violating the additional content policy.

    Detecting content items in violation of an online system policy using semantic vectors

    公开(公告)号:US11195099B2

    公开(公告)日:2021-12-07

    申请号:US15694321

    申请日:2017-09-01

    Applicant: Facebook, Inc.

    Abstract: A content review system for an online system automatically determines if received content items to be displayed to users violate any policies of the online system. The content review system generates a semantic vector representing the semantic features of a content item, for example, using a neural network. By comparing the semantic vector for the content item with semantic vectors of content items previously determined to violate one or more policies, the content review system determines whether the content item also violates one or more policies. The content review system may also maintain templates corresponding to portions of semantic vectors shared by multiple content items. An analysis of historical content items that conform to the template is performed to determine a probability that received content items that conform to the template violate a policy.

    Regular expression generation and screening of textual items

    公开(公告)号:US10956522B1

    公开(公告)日:2021-03-23

    申请号:US16003770

    申请日:2018-06-08

    Applicant: Facebook, Inc.

    Abstract: An online system enforces policies to content items that are distributed on its platform and blocks content items that violate one or more of those policies. To identify content items that are slightly varied from each other, the online system generates an embedding for each of the known content items that have already been determined to be noncompliant with one or more policies. The online system then groups the known noncompliant content items that are clustered together in the embedding space. The texts of the group of known noncompliant content items are converted to finite state automata and are merged to generate a common automaton. The common automaton is used to generate a common regular expression that is used to screen new content items. When a new content item matches the textual pattern defined by the common regular expression, the system may block the new content item.

    Evaluating content items based upon semantic similarity of text

    公开(公告)号:US10599774B1

    公开(公告)日:2020-03-24

    申请号:US15905709

    申请日:2018-02-26

    Applicant: Facebook, Inc.

    Abstract: A content review system for an online system automatically determines if received content items to be displayed to users contain text that violates a policy of the online system. The content review system generates a semantic vector representing semantic features of text extracted from the content item, for example, using a neural network. By comparing the semantic vector for the extracted text with stored semantic vectors of extracted text previously determined to violate one or more policies, the content review system determines whether the content item contains text that also violates one or more policies. The content review system also reviews stored semantic vectors previously determined to be unsuitable, in order to remove false positives, as well as unsuitable semantic vectors that are sufficiently similar to known suitable semantic vectors and as such may cause content items having suitable text to be erroneously rejected.

Patent Agency Ranking