Abstract:
Techniques are provided for maintaining data that indicates, for a plurality of query terms, whether each of the query terms is a date-qualified query term. A query is received, and in response to receiving the query, the query is inspected to determine that the query contains a particular date-qualified query term. It is then determined that the particular date-qualified query term has been associated with a plurality of dates, and it is determined which of the plurality of dates to associate with the date-qualified query term for the query, based at least in part on the frequency with which each particular date of the plurality of dates has been associated with the particular date-qualified query term.
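The frequency-based date selection can be illustrated with a minimal sketch. The association list, the function names `build_date_counts` and `pick_date`, and the substring matching are assumptions made for illustration; the abstract does not say how associations are stored or how terms are detected.

```python
from collections import Counter

# Hypothetical (query term, date) associations, e.g. mined from past queries.
ASSOCIATIONS = [
    ("olympics", 2008), ("olympics", 2008), ("olympics", 2004),
    ("world cup", 2006),
]

def build_date_counts(associations):
    """Map each date-qualified term to a Counter of the dates seen with it."""
    counts = {}
    for term, date in associations:
        counts.setdefault(term, Counter())[date] += 1
    return counts

def pick_date(query, date_counts):
    """Return (term, most frequent date) for the first known term in the query.

    Returns None when the query contains no date-qualified term.
    Matching is a plain substring test, which is an illustrative simplification.
    """
    lowered = query.lower()
    for term, counter in date_counts.items():
        if term in lowered:
            date, _count = counter.most_common(1)[0]
            return term, date
    return None
```

Here the term "olympics" has been seen with 2008 twice and 2004 once, so a query containing it would be associated with 2008 under this sketch.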
Abstract:
Embodiments of methods, apparatuses, devices and systems associated with categorizing or selecting potential search engine results are disclosed.
Abstract:
A system and method are described herein that cache web documents in two or more caches in an optimized manner. The web documents result from a plurality of queries presented to a web search engine. Each web document has a tier score indicating a position of the web document in the caches. The tier scores are modified in a manner that rearranges the web documents in the caches. This rearrangement of the web documents in the caches can advantageously improve the retrieval time for the web documents.
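As a rough sketch of how tier scores could determine cache placement, the following assumes two tiers (a small fast cache and an unbounded slow cache) and a score that is boosted on each hit. The class `TieredCache`, its methods, and the boost rule are hypothetical; the abstract does not state how tier scores are modified.

```python
class TieredCache:
    """Two-tier cache where placement follows each document's tier score."""

    def __init__(self, fast_capacity):
        self.fast_capacity = fast_capacity  # number of documents in the fast tier
        self.scores = {}                    # doc_id -> tier score
        self.docs = {}                      # doc_id -> cached content

    def put(self, doc_id, content, score=0.0):
        self.docs[doc_id] = content
        self.scores[doc_id] = score

    def record_hit(self, doc_id, boost=1.0):
        # Assumed update rule: each retrieval raises the tier score.
        self.scores[doc_id] += boost

    def tiers(self):
        """Return (fast_tier_ids, slow_tier_ids), highest scores first.

        Ties are broken by doc_id so the arrangement is deterministic.
        """
        ranked = sorted(self.scores, key=lambda d: (-self.scores[d], d))
        return ranked[:self.fast_capacity], ranked[self.fast_capacity:]
```

Changing a score and recomputing `tiers()` moves a document between caches, which is the rearrangement effect the abstract describes.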
Abstract:
A system for detecting artificial promotion of a resource, including a search engine operative to index a set of incoming links (“inlinks”) which reference the resource, a log module coupled with the search engine and configured to store log data associated with the set of inlinks, a partitioning module coupled with the log module and operative to partition the set of inlinks into a plurality of groups of inlinks based on at least one partitioning scheme, a statistics module coupled with the partitioning module and operative to compute a statistic associated with the inlinks within each of the plurality of groups of inlinks, and a computation module coupled with the statistics module and operative to process the computed statistic associated with the inlinks of each of the plurality of groups of inlinks and compute a metric associated with the set of inlinks, where the metric indicates a level of uniformity of a distribution of values of the respective computed statistics among the plurality of groups of inlinks, and where the search engine places a list of search results, generated in response to a search query, in a pattern based on the metric.
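The partition, statistic, and metric steps can be sketched as follows. The abstract does not name a specific uniformity metric; normalized Shannon entropy is used here purely as an assumption, and the function name `uniformity`, the `partition_key` callback, and the default statistic (group size) are hypothetical.

```python
import math
from collections import defaultdict

def uniformity(inlinks, partition_key, statistic=len):
    """Return a value in [0, 1]; 1.0 means the statistic is evenly spread.

    inlinks:       iterable of inlink records (any objects)
    partition_key: function mapping an inlink to its group (the partitioning scheme)
    statistic:     function mapping a list of inlinks to a non-negative number
    """
    groups = defaultdict(list)
    for link in inlinks:
        groups[partition_key(link)].append(link)

    values = [statistic(group) for group in groups.values()]
    total = sum(values)
    if len(values) < 2 or total == 0:
        return 1.0  # a single group or all-zero statistics is trivially uniform

    probs = [v / total for v in values if v > 0]
    entropy = -sum(p * math.log(p) for p in probs)
    return entropy / math.log(len(values))  # normalize by the maximum entropy
```

Under this sketch, a low value means a few groups dominate the statistic, which is one possible signal to feed into result placement.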
Abstract:
Techniques are provided for generating quality measures for items, including web pages, based on a “random searcher” behavior model. The random searcher behavior model takes into account “implicit” links between items, instead of or in addition to the explicit links. After identifying the implicit links between items, the implicit links may be used as the basis for generating quality measures for the items to which the implicit links point. A variety of types of implicit links are described. To facilitate the generation of quality measures for items based on implicit links, a graph of the implicit links may be constructed in a manner similar to a webgraph.
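One way to picture quality measures computed over a graph of implicit links is a PageRank-style power iteration. This is a sketch under assumptions: the abstract says only that the graph is constructed "in a manner similar to a webgraph", so the damping factor, iteration count, and the function name `quality_scores` are illustrative choices, and the edge list stands in for whatever implicit links have been identified.

```python
def quality_scores(links, damping=0.85, iterations=50):
    """Score items from a list of (source, target) implicit links.

    Uses a PageRank-style iteration; items without outgoing links spread
    their score evenly over all items.
    """
    nodes = sorted({node for edge in links for node in edge})
    outgoing = {node: [] for node in nodes}
    for src, dst in links:
        outgoing[src].append(dst)

    n = len(nodes)
    score = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        nxt = {node: (1.0 - damping) / n for node in nodes}
        for src in nodes:
            targets = outgoing[src] or nodes  # handle dangling items
            share = damping * score[src] / len(targets)
            for dst in targets:
                nxt[dst] += share
        score = nxt
    return score
```

An item that many implicit links point to ends up with a higher score than one that nothing points to.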
Abstract:
Techniques are provided for storing queries received by a search engine in a query log. For a particular query term in a query, it is determined how many queries in the query log contain that particular query term and an intent-indicating term, and how many queries in the query log contain that particular query term without an intent-indicating term. Based on the ratio between the number of queries in the query log that contain the particular query term and the intent-indicating term and the number of queries in the query log that contain the particular query term without the intent-indicating term, it is determined whether the particular query term is an intent-qualified query term. In response to determining that the particular query term is an intent-qualified query term, data is stored in a computer-readable medium that identifies the query term as an intent-qualified query term. Implicit-intent queries that contain the intent-qualified query term are processed based, at least in part, on the intent associated with the intent-qualified query term.
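The ratio test can be sketched in a few lines. The intent-indicating terms, the threshold value, the whitespace tokenization, and the function name `is_intent_qualified` are all assumptions for illustration; the abstract does not give a threshold or a tokenization scheme, and this sketch handles only single-word query terms.

```python
# Hypothetical intent-indicating terms (here, a purchase intent).
INTENT_TERMS = frozenset({"buy", "price", "cheap"})

def is_intent_qualified(term, query_log, intent_terms=INTENT_TERMS, threshold=1.0):
    """Decide whether `term` is intent-qualified from a list of logged queries.

    Compares the count of queries containing the term plus an
    intent-indicating term against the count containing the term alone.
    """
    with_intent = 0
    without_intent = 0
    for query in query_log:
        tokens = set(query.lower().split())
        if term not in tokens:
            continue
        if tokens & intent_terms:
            with_intent += 1
        else:
            without_intent += 1

    if with_intent == 0:
        return False  # never seen with an intent-indicating term
    if without_intent == 0:
        return True   # always seen with an intent-indicating term
    return with_intent / without_intent >= threshold
```

A term flagged this way could then be recorded so that later queries containing it, but no explicit intent word, are processed with the associated intent.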