Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for re-ranking resources for categorical queries. In one aspect, a method includes receiving queries and, for each received query: receiving data indicating resources identified by a search operation as being responsive to the query and ranked according to a first order, each resource having a corresponding search score by which the resources are ranked in responsiveness to the query; and determining whether a proper subset of the resources meets a quality condition based on a quality measure that is indicative of the quality of the resources in the proper subset and independent of the search scores of the resources for the received query. For each query for which the proper subset meets the quality condition, the method includes determining a quality score for each resource in the proper subset and re-ranking the resources in the proper subset according to their respective quality scores.
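The re-ranking step can be illustrated with a minimal sketch. The abstract does not say how the proper subset is chosen or what the quality condition is, so the choices here are assumptions made only for illustration: the subset is the top `k` results, the condition is a mean-quality threshold, and the names `Resource`, `rerank_top_subset`, `k`, and `min_mean_quality` are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Resource:
    url: str
    search_score: float  # score assigned by the original search operation
    quality: float       # hypothetical quality signal, independent of search_score


def rerank_top_subset(ranked: List[Resource], k: int = 3,
                      min_mean_quality: float = 0.5) -> List[Resource]:
    """Re-rank the top-k subset by quality if it meets the quality condition.

    Assumptions: the proper subset is the first k results, and the quality
    condition is "mean quality of the subset >= min_mean_quality".
    """
    # A proper subset must be non-empty here and must omit at least one resource.
    if k <= 0 or k >= len(ranked):
        return list(ranked)
    subset, rest = ranked[:k], ranked[k:]
    mean_quality = sum(r.quality for r in subset) / len(subset)
    if mean_quality < min_mean_quality:
        return list(ranked)  # condition not met: keep the first order
    # Condition met: order the subset by quality score, leave the rest untouched.
    reranked = sorted(subset, key=lambda r: r.quality, reverse=True)
    return reranked + rest
```

Only the subset is reordered; resources outside it keep the positions given by their search scores.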
Abstract:
Methods, systems, and apparatus for selecting geographic areas of interest for a query. A method includes, for each query stored in search log data: determining an origin location for each occurrence of the query; determining a content location for each occurrence of the query; for each of a plurality of catchment areas, determining catchment area matches for the query based on the origin location and the content location of each occurrence of the query; determining, for each catchment area, a catchment area score that is indicative of an interest level in the catchment area for the query, the determination based on the number of catchment area matches for the catchment area for the query; and selecting, for the query and based on the catchment area scores, one of the catchment areas as a selected catchment area for the query.
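A minimal sketch of the scoring and selection steps follows. The abstract does not define a catchment area match or the selection rule, so this sketch makes several assumptions: catchment areas are modeled as radii in kilometres, an occurrence matches when its content location lies within that radius of its origin location, the score is the fraction of matching occurrences, and the smallest radius whose score reaches a threshold is selected. The function names and the `min_score` parameter are hypothetical.

```python
import math
from typing import Dict, List, Optional, Tuple

LatLng = Tuple[float, float]  # (latitude, longitude) in degrees


def haversine_km(a: LatLng, b: LatLng) -> float:
    """Great-circle distance between two points, using a 6371 km Earth radius."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(h))


def catchment_scores(occurrences: List[Tuple[LatLng, LatLng]],
                     radii_km: List[float]) -> Dict[float, float]:
    """Score each radius by the fraction of (origin, content) pairs it covers."""
    scores: Dict[float, float] = {}
    for radius in radii_km:
        matches = sum(1 for origin, content in occurrences
                      if haversine_km(origin, content) <= radius)
        scores[radius] = matches / len(occurrences) if occurrences else 0.0
    return scores


def select_catchment(scores: Dict[float, float],
                     min_score: float = 0.6) -> Optional[float]:
    """Pick the smallest radius whose score reaches min_score (assumed rule)."""
    eligible = [radius for radius, score in scores.items() if score >= min_score]
    return min(eligible) if eligible else None
```

Because nested radii always give the largest area the highest raw score, this sketch selects by threshold rather than by maximum; a real system could equally normalize scores by area size.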
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for ranking search results. One method includes ranking a plurality of web objects in response to a search query to generate a global ranking based on a relevance of each web object to the search query and a relative authority of each web object compared to other web objects in the plurality of web objects, each web object including a web page in a corresponding website that includes a plurality of web pages; ranking the plurality of web pages corresponding to each website based on onsite ranking criteria to generate an onsite ranking; generating a combined ranking for each web page based on a combination of the global ranking of the web object that includes the web page and the onsite ranking of the web page; and presenting web pages responsive to the search query based on the combined rankings.
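The combination of the two rankings can be sketched as follows. The abstract does not specify how the global and onsite rankings are combined, so this example assumes both are expressed as scores in [0, 1] and blends them with a weighted sum; `combined_ranking`, `weight`, and the example site names are hypothetical.

```python
from typing import Dict, List, Tuple


def combined_ranking(global_scores: Dict[str, float],
                     onsite_scores: Dict[str, Dict[str, float]],
                     weight: float = 0.5) -> List[Tuple[str, float]]:
    """Blend a per-site global score with a per-page onsite score.

    global_scores maps site -> global score (relevance and authority).
    onsite_scores maps site -> {page -> onsite score within that site}.
    The weighted sum is an assumed combination rule, not one from the source.
    """
    combined: List[Tuple[str, float]] = []
    for site, pages in onsite_scores.items():
        site_score = global_scores.get(site, 0.0)
        for page, onsite in pages.items():
            combined.append((page, weight * site_score + (1 - weight) * onsite))
    # Present pages in descending order of combined score.
    return sorted(combined, key=lambda item: item[1], reverse=True)
```

With `weight=0.5`, a strong page on a weaker site can outrank a weak page on a stronger site, which is the effect a combined ranking is meant to allow.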
Abstract:
One embodiment of the present invention provides a system for detecting correlations between terms. During operation, the system identifies one or more lists contained in one or more documents and identifies two terms co-occurring in the lists. The system further determines a correlation between the co-occurring terms, and places the co-occurring terms in a correlated-pair list based on the correlation.
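A minimal sketch of the co-occurrence step is shown below. It assumes the lists have already been extracted from the documents and are given as lists of terms. The abstract does not name a correlation measure, so the Jaccard coefficient over list membership is used here as a stand-in; `correlated_pairs` and `min_correlation` are hypothetical names.

```python
from collections import Counter
from itertools import combinations
from typing import Iterable, List, Tuple


def correlated_pairs(lists: Iterable[List[str]],
                     min_correlation: float = 0.5) -> List[Tuple[str, str, float]]:
    """Return term pairs whose Jaccard co-occurrence across lists is high enough."""
    term_counts: Counter = Counter()  # number of lists containing each term
    pair_counts: Counter = Counter()  # number of lists containing both terms
    for items in lists:
        terms = sorted(set(items))    # deduplicate and fix the pair order
        term_counts.update(terms)
        pair_counts.update(combinations(terms, 2))
    result = []
    for (a, b), both in pair_counts.items():
        # Jaccard: lists containing both terms / lists containing either term.
        jaccard = both / (term_counts[a] + term_counts[b] - both)
        if jaccard >= min_correlation:
            result.append((a, b, jaccard))
    # Strongest correlations first; ties broken alphabetically.
    return sorted(result, key=lambda t: (-t[2], t[0], t[1]))
```

Pairs below the threshold are simply left out of the correlated-pair list.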
Abstract:
Methods, systems, and apparatus, including computer program products, for structured content ranking. In an aspect, a method determines a service requirement from terms of a query, the service requirement being one of a plurality of service requirements fulfilled by databases; determines, for each of the databases, a service requirement score for the database, the service requirement score being a measure of an ability of the database to fulfill the service requirement; selects databases based on the service requirement scores; generates data responsive to the service requirement based on the terms of the query and one or more of the selected databases; and generates, from the data identifying resources that are determined to be responsive to the query and from the data responsive to the service requirement, search results that include first search results that each identify a corresponding resource that was determined to be responsive to the query.
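The first three steps (determining a service requirement, scoring databases, selecting databases) can be sketched as below. The abstract does not say how requirements are detected or how scores are computed, so this sketch assumes a keyword lookup and a precomputed capability table; `REQUIREMENT_KEYWORDS`, the database names, and `min_score` are all hypothetical.

```python
from typing import Dict, List, Optional

# Hypothetical mapping from service requirements to trigger keywords.
REQUIREMENT_KEYWORDS: Dict[str, set] = {
    "weather": {"weather", "forecast"},
    "flight_status": {"flight", "arrival"},
}


def service_requirement(query: str) -> Optional[str]:
    """Return the first requirement whose keywords appear in the query terms."""
    terms = set(query.lower().split())
    for requirement, keywords in REQUIREMENT_KEYWORDS.items():
        if terms & keywords:
            return requirement
    return None


def select_databases(requirement: str,
                     capabilities: Dict[str, Dict[str, float]],
                     min_score: float = 0.5) -> List[str]:
    """Select databases whose score for the requirement reaches min_score.

    capabilities maps database -> {requirement -> assumed ability score}.
    """
    scored = [(db, caps.get(requirement, 0.0)) for db, caps in capabilities.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return [db for db, score in scored if score >= min_score]
```

The selected databases would then be queried with the query terms to produce the data responsive to the service requirement.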
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for correcting potentially inaccurate terms in queries. In an aspect, a search system determines that an original query includes a potentially inaccurate term. The determination is made on the basis that the term of the original query meets an inaccuracy criterion. Derivative queries are generated from the original query, none of which includes the potentially inaccurate term. For each derivative query, resources responsive to the derivative query are identified, and a corrected term is determined from those resources. The corrected term is substituted for the potentially inaccurate term to form a corrected query, and a search operation that uses the corrected query as input is performed.
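The correction flow can be illustrated with a simplified sketch. The abstract leaves the inaccuracy criterion and the term-selection method open, so the following are assumptions: a term is potentially inaccurate when it is missing from a known vocabulary, a single derivative query is formed per inaccurate term by dropping that term, a document is responsive when it contains every derivative term, and the corrected term is the closest string match found in the responsive documents. `correct_query`, `vocabulary`, and `documents` are hypothetical names.

```python
import difflib
from typing import List, Set


def correct_query(query: str, vocabulary: Set[str], documents: List[str]) -> str:
    """Replace out-of-vocabulary terms using terms from responsive documents."""
    terms = query.lower().split()
    corrected = list(terms)
    for i, term in enumerate(terms):
        if term in vocabulary:
            continue  # assumed criterion: known terms are treated as accurate
        # Derivative query: the original terms without the suspect term.
        derivative = [t for j, t in enumerate(terms) if j != i]
        # Responsive documents contain every term of the derivative query.
        responsive = [d for d in documents
                      if all(t in d.lower().split() for t in derivative)]
        # Candidate corrections: terms in those documents not already in the query.
        candidates = sorted({w for d in responsive for w in d.lower().split()
                             if w not in derivative})
        close = difflib.get_close_matches(term, candidates, n=1, cutoff=0.6)
        if close:
            corrected[i] = close[0]
    return " ".join(corrected)
```

The returned string plays the role of the corrected query that would be passed to the search operation; a term with no sufficiently close candidate is left unchanged.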