Abstract:
In one embodiment, a method includes accessing an input vector representing an input post, wherein the input post includes one or more n-grams and an image, the input vector corresponds to a point in a d-dimensional vector space, the input vector was generated by an artificial neural network (ANN) based on a text vector representing the one or more n-grams of the input post and an image vector representing the image of the input post; and the ANN was jointly trained to receive a text vector representing one or more n-grams of a post and an image vector representing an image of the post and then output a probability that the received post is related to the training posts of a training page; and determining a topic of the input post based on the input vector.
Abstract:
In one embodiment, a method includes accessing a user profile associated with a user of an online social network, wherein the user profile identifies one or more topics that the user is interested in; accessing post vectors, wherein each post vector represents one of a plurality of posts, indicates one or more topics, and for each of the topics, indicates a probability that the post is related to the corresponding topic; ranking the posts based on comparisons between the user profile and the post vectors; and providing for display to the user posts based on the ranking.
Abstract:
An online system simplifies modification of features used by machine learned models used by the online system, such as machined learned models with high dimensionality. The online system obtains a superset of features including features used by at least one machine learned model and may include additional features. From the superset of features, the online system generates various groups of features for a machine learned model. The groups of features may be a group including features currently used by the machine learned model, a group including all available features, and one or more intermediate groups. Intermediate groups include various numbers of features from the set selected based on measures of feature impact on the machine learned model associated with various features. A user may select a group of features, test the machine learning model using the selected group, and then launch the tested model based on the results.
Abstract:
An optimized computer architecture for training an neural network includes a system having multiple GPUs. The neural network may be divided into separate portions, and a different portion is assigned to each of the multiple GPUs. Within each GPU, its portion is further divided across multiple training worker threads in multiple processing cores, and each processing core has lock-free access to a local parameter memory. The local parameter memory of each GPU is separately, and individually, synchronized with a remote master parameter memory by lock memory access. Each GPU has a separate set of communication worker threads dedicated to data transfer between the GPU and the remote parameter memory so that the GPU's training worker threads are not involved with cross GPU communications.
Abstract:
An online system receives content items from a third party content provider. For each content item, the online system inputs an image into a neural network and extracts a feature vector from a hidden layer of the neural network. The online system compresses each feature vector by assigning a label to each feature value representing whether the feature value was above a threshold value. The online system identifies a set of content items that the user has interacted with and determines a user feature vector by aggregating feature vectors of the set of content items. For a new set of content items, the online system compares the compressed feature vectors of the content item with the user feature vector. The online system selects one or more of the new content items based on the comparison and sends the selected content items to the user.
Abstract:
In one embodiment, a method includes accessing an input vector representing an input post, wherein: the vector space comprises clusters each associated with a topic; each cluster was determined based on a clustering of training-page vectors corresponding to training pages that each comprise training posts, each training post submitted by a user to a training page and comprises content selected by the user; and each training-page vector was generated by an ANN that was trained, based on the training posts of training pages associated with the ANN, to receive a post and then output a probability that the received post is related to the training posts of the training pages; determining that the input vector is located within a particular cluster in the vector space; and determining a topic of the input post based on the topic associated with the particular cluster that the input vector is located within.
Abstract:
An advertising system receives from an advertiser at a social networking system an advertisement request, the advertisement request comprising advertisement content and a specification of a target audience for the advertisement content. The advertising system defines a plurality of bid values for the advertisement request. For each of the plurality of bid values, the advertisement system estimates a corresponding value of advertisement reach for the target audience, for example, by estimating a number of users of the target audience for each of whom the given bid value is expected to have resulted in at least one successful impression. The advertiser is provided a visual representation of a bid-reach landscape representing the estimated plurality of advertisement reach values as a function of the plurality of bid values. The advertising system provides, to the advertiser, one or more recommendations for bid values for which corresponding return-on-investment metrics exceed a specified threshold.
Abstract:
The present disclosure is directed to a high-capacity training and prediction machine learning platform that can support high-capacity parameter models (e.g., with 10 billion weights). The platform implements a generic feature transformation layer for joint updating and a distributed training framework utilizing shard servers to increase training speed for the high-capacity model size. The models generated by the platform can be utilized in conjunction with existing dense baseline models to predict compatibilities between different groupings of objects (e.g., a group of two objects, three objects, etc.).
Abstract:
The present disclosure is directed to a high-capacity training and prediction machine learning platform that can support high-capacity parameter models (e.g., with 10 billion weights). The platform implements a generic feature transformation layer for joint updating and a distributed training framework utilizing shard servers to increase training speed for the high-capacity model size. The models generated by the platform can be utilized in conjunction with existing dense baseline models to predict compatibilities between different groupings of objects (e.g., a group of two objects, three objects, etc.).
Abstract:
An advertising system predicts advertisement reach for a received advertisement request based on an advertiser-specified bid amount and a specification of a target audience. The system samples the target audience, and for each sampled user of the target audience, accesses a recent impression history to obtain costs or bids associated with recent advertisement impressions. The system compares the advertiser-specified bid amount in the received advertisement request to costs or bid values associated with successful advertisement impressions, for each sampled user, in order to determine whether the received advertisement request would have won a bid auction for each given sampled user to successfully reach each given sampled user. An estimated aggregate reach for the sampled users is computed and extrapolated to the targeted user population to estimate a total reach of the advertisement content for the target audience.