Abstract:
A method and apparatus for receiving training data that comprise a plurality of event-and-time-specific texts that are contextually related to a plurality of events; iteratively processing the training data to generate a modified network model that defines a plurality of states; receiving additional data that comprise a plurality of additional event-and-time-specific texts that are contextually related to a particular event; processing the additional data by applying the modified network model to the additional data to identify, within the plurality of additional event-and-time specific texts, a particular set of texts that belong to a particular state of the plurality of states; identifying, within the particular set of texts, one or more texts that are most representative of all texts in the particular set of texts that belong to the particular state; wherein the method is performed by one or more special-purpose computing devices.
Abstract:
Descriptive data relating to at least a subset of a plurality of entities on a website is retrieved over a network. Endorsement data relating to the plurality of entities is retrieved from the website. A first set of probabilities is determined reflecting a probability that endorsements can be attributed to specific aspects. A second set of probabilities is determined reflecting a probability that terms can be attributed to aspects. Using the first set of probabilities and the second set of probabilities, a subset of the terms that are most probably associated with each entity are selected. Tags are then generated for each entity using the selected terms.
Abstract:
An optimization-based framework is utilized to extract broad query aspects from query reformulations performed by users in historical user session logs. Objective functions are optimized to yield query aspects. At run-time, the best broad but unspecified query aspects relevant to any user query are presented along with the results of the run time query.