Abstract:
Disclosed herein is an intelligent agent to analyze a media object. The agent comprises a trained model comprising a number of state layers for storing a history of actions taken by the agent in each of a number of previous iterations performed by the agent in analyzing a media object. The stored state may be used by the agent in a current iteration to determine whether or not to make, or abstain from making, a prediction from output generated by the model, identify another portion of the media object to analyze, end analysis. Output from the agent's model may comprise a semantic vector that can be mapped to a semantic vector space to identify a number of labels for a media object.
Abstract:
The technologies described herein serve contextually relevant advertisements under a guaranteed advertisement campaign. A publisher retrieves a guaranteed advertisement campaign related to a webpage available for serving an advertisement, and identifies a set of advertisements relating to the guaranteed advertisement campaign. Advertisement selecting circuitry of the publisher determines whether an advertisement that is contextually relevant to content published at the webpage is present in the set of advertisements. If there is no contextually relevant advertisement in the set of advertisements, the advertisement selecting circuitry selects an alternative advertisement from the set of advertisements that minimizes an under-delivery risk related to the guaranteed advertisement campaign. If there is a contextually relevant advertisement in the set of advertisements, the advertisement selecting circuitry selects the contextually relevant advertisement. Then, the publisher provides the selected advertisement to a client device.
Abstract:
Embodiments of the present disclosure may be used to gather, rank, categorize, and perform other processing of various types of content. In some embodiments, content items such as text, images, video, and other content are received from a variety of different sources and are processed to generate an article containing selected content items. While there may be hundreds or thousands of separate articles and stories regarding a particular topic, embodiments of the present disclosure help provide users with a single concise article that contains high-quality content items selected from among a potentially vast number of disparate sources.
Abstract:
Embodiments of the present disclosure may be utilized to analyze a content item comprising text to identify: a quote, a named entity that is the source of the quote (e.g., a person or organization such as a company), and identification information for the named entity (such as a title of the person giving the quote). Quotes may also be ranked to determine, for example, the best quotes to include in an article.
Abstract:
One particular embodiment clusters a plurality of documents using one or more clustering algorithms to obtain one or more first sets of clusters, wherein: each first set of clusters results from clustering the documents using one of the clustering algorithms; and with respect to each first set of clusters, each of the documents belongs to one of the clusters from the first set of clusters; accesses a search query; identifies a search result in response to the search query, wherein the search result comprises two or more of the documents; and clusters the search result to obtain a second set of clusters, wherein each document of the search result belongs to one of the clusters from the second set of clusters.
Abstract:
Online experimentation has been widely used to evaluate an effect of a new feature of an online product on user engagement. One challenge is that an existence of outliers can often complicate the analysis of such experimental results. Thus, a procedure is provided herein to detect and remove outliers from experimental results. The procedure can use statistical tests based on parametric distributions of sample maximum or minimum. These tests can be performed using an inward testing procedure to identify multiple outliers. Finally, these filtered test results can be used to control delivery of a new feature of an online product.
Abstract:
A seasonality-aware search assist identifies seasonal search query suggestions using seasonal information associated with the seasonal search query suggestions. A user's search query input, temporal information associated with the user's search query input and a seasonal information associated with each seasonal-aware search query suggestion candidates may be used to identify a number of seasonal search query suggestions for presentation to the user. Seasonal search query suggestion candidates may be promoted in a list of search query suggestions presented to the user, and any non-seasonal search query suggestions included in the list may be positioned below the seasonal search query suggestions in the list. A popularity score associated with each of the listed seasonal search query suggestions may be used to order the seasonal search query candidates, and a popularity score associated with each of the listed non-seasonal search query suggestions may be used to order the non-seasonal search query suggestions.
Abstract:
Exemplary methods and apparatuses are provided which may be used for classifying and indexing segmented portions of web pages and providing related information for use in information extraction and/or information retrieval systems. In an embodiment, an index of segmented portions may be used by a search engine to respond to a search query. In an embodiment, one or more machine learned models may be used to identify one or more feature properties of a plurality of segmented portions within one or more files, or otherwise inferable from the one or more files. In an embodiment, one or more machine learned models may be used to classify one or more of a plurality of segmented portions as being at least one of a plurality of segment types.
Abstract:
The present teaching, which includes methods, systems and computer-readable media, relates to providing a representation of a relationship between entities related to online content interaction. The disclosed techniques may include receiving data related to online content interactions between a set of first entities and a set of second entities, and based on the received data, determining, for each one of the set of first entities, a set of first interaction frequency values each corresponding to one of the set of second entities, and determining, for each one of the set of second entities, a second interaction frequency value. Further, for each one of the set of first entities, a set of relation values may be determined based on the set of first interaction frequency values for that first entity and the second interaction frequency values, each relation value indicating an interaction relationship between that first entity and one second entity.