Abstract:
Systems and methods for identifying inadequate search content are provided. Inadequate search content, for example, can be identified based on statistics associated with the search queries related to the content.
Abstract:
When a website has a number of equivalent domain names including a preferred domain name, the locator for a document in the website can be rewritten using the preferred domain name before indexing the document, according to certain embodiments. According to certain embodiments, a user interface is provided to allow a user to specify the preferred domain name for a website for which the user is a verified owner.
Abstract:
Systems and methods for identifying inadequate search content are provided. Inadequate search content, for example, can be identified based on statistics associated with the search queries related to the content.
Abstract:
Methods, systems, and apparatus, including computer program products, for testing web search scoring functions. A query is received. A first and a second scoring function are selected by receiving search results responsive to the query; applying candidate scoring functions to the search results to determine scores for the search results for each candidate scoring function; identifying pairs of the candidate scoring functions, and calculating a diversity score for each of the pairs. A pair of candidate scoring functions is chosen from the one or more pairs of candidate scoring functions based on the diversity scores, and the alpha function is selected as the first scoring function and the beta function is selected as the second scoring function. The plurality of search results are presented in an order according to scores from the first scoring function and are presented in an order according to scores from the second scoring function.
Abstract:
Systems and methods for identifying inadequate search content are provided. Inadequate search content, for example, can be identified based on statistics associated with the search queries related to the content.
Abstract:
Methods, systems, and apparatus, including computer program products, for testing web search scoring functions. A query is received. A first and a second scoring function are selected by receiving search results responsive to the query; applying candidate scoring functions to the search results to determine scores for the search results for each candidate scoring function; identifying pairs of the candidate scoring functions, and calculating a diversity score for each of the pairs. A pair of candidate scoring functions is chosen from the one or more pairs of candidate scoring functions based on the diversity scores, and the alpha function is selected as the first scoring function and the beta function is selected as the second scoring function. The plurality of search results are presented in an order according to scores from the first scoring function and are presented in an order according to scores from the second scoring function.
Abstract:
Systems and methods for identifying inadequate search content are provided. Inadequate search content, for example, can be identified based on statistics associated with the search queries related to the content.
Abstract:
When a website has a number of equivalent domain names including a preferred domain name, the locator for a document in the website can be rewritten using the preferred domain name before indexing the document, according to certain embodiments. According to certain embodiments, a user interface is provided to allow a user to specify the preferred domain name for a website for which the user is a verified owner.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for privacy protection. In one aspect, a method includes accessing personally identifiable information (PII) type definitions that characterize PII types; identifying PII type information included in content of a web page, the PII type information being information matching at least one PII type definition; identifying secondary information included in the content of the web page, the secondary information being information that is predefined as being associated with PII type information; determining a risk score from the PII type information and the secondary information; and classifying the web page as a personal information exposure risk if the risk score meets a confidentiality threshold, wherein the personal information exposure risk is indicative of the web page including personally identifiable information.
Abstract:
When a website has a number of equivalent domain names including a preferred domain name, the URL for a document in the website can be rewritten using the preferred domain name before indexing the document, according to certain embodiments. According to certain embodiments, a user interface is provided to allow a user to specify the preferred domain name for a website for which the user is a verified owner.