Abstract:
A method and system for extracting opinions about a subject of interest from a text document in which each sentence is analyzed individually to identify the opinions. The most relevant feature terms related to the subject are extracted from the document based on their relevancy scores. Candidate feature terms are definite noun phrases at the beginning of the sentences. For each sentence that refers to the subject or a feature term, the invention determines whether the sentence includes an opinion polarity about the subject or the feature term. The opinion polarity is detected by identifying opinion terms in the sentence using an opinion dictionary or an opinion rule base, parsing the sentence with an English parser to identify grammatical components in the sentence and their relationships, and finding a matching entry in the dictionary or the rule base.
Abstract:
A method and system for extracting opinions about a subject of interest from a text document in which each sentence is analyzed individually to identify the opinions. The most relevant feature terms related to the subject are extracted from the document based on their relevancy scores. Candidate feature terms are definite noun phrases at the beginning of the sentences. For each sentence that refers to the subject or a feature term, the invention determines whether the sentence includes an opinion polarity about the subject or the feature term. The opinion polarity is detected by identifying opinion terms in the sentence using an opinion dictionary or an opinion rule base, parsing the sentence with an English parser to identify grammatical components in the sentence and their relationships, and finding a matching entry in the dictionary or the rule base.
Abstract:
A system is described for providing feeds for entities not associated with feed services. The system may include a processor, a memory and an interface. The memory may store an identifier of an entity, an update condition and a feed. The entity may include content, and the update condition may describe an update to the content. The interface may communicate with a device of the user. The processor may receive the identifier of the entity and the update condition of the entity via the interface. The processor may generate a feed for the entity and the processor may add the content to the feed when the content is updated in accordance with the update condition. The processor may then provide the feed to the device of the user via the interface.
Abstract:
A system is described for associating data items with context. The system may include a processor, a memory and an interface. The processor may identify an action performed by a user and may determine the spatial, temporal, social and topical attributes of the action. The spatial attribute of the action may relate to the user's location, the temporal attribute may relate to the time the action was performed, the social attribute may relate to a social relation of the user, and the topical attribute may relate to a topic of interest to the user. The processor may store an association between a descriptor of the action, the spatial attribute of the action, the temporal attribute of the action, the social attribute of the action and the topical attribute of the action in the memory. The processor may use the stored association to provide a contextually relevant data item via the interface.
Abstract:
A computer program product is provided as an automatic mining system to discover terms that are relevant to a given target topic from a large databases of unstructured information such as the World Wide Web. The operation of the automatic mining system is performed in three stages: The first stage is carried out by a new terms discoverer for discovering the terms in a document, the second stage is carried out by a candidate terms discoverer for discovering potentially relevant terms, and the third stage is carried out by a relevant terms discoverer for refining or testing the discovered relevance to filter false relevance. The new terms discoverer includes a system for the automatic mining of patterns and relations, a system for the automatic mining of new relationships, and a system for selecting new terms from relations. In one embodiment, the system for the automatic mining of patterns and relations identifies a set of related terms on the WWW with a high degree of confidence, using a duality concept, and includes a terms database and two identifiers: a relation identifier and a pattern identifier. The system for the automatic mining of new relationships includes a database a knowledge module and a statistics module. The knowledge module includes a stemming unit, a synonym check unit, and a domain knowledge check unit. The candidate terms discoverer includes a metadata extractor, a document vector module, an association module, a filtering module, and a database. The relevant terms discoverer includes a stop word filter and a system for the automatic construction of generalization—specialization hierarchy of terms comprised of a terms database, an augmentation module, a generalization detection module, and a hierarchy database.
Abstract:
Systems and methods for optimizing webpage content based on a screen orientation of a device are disclosed. Generally, a plurality of content chunks comprising a webpage to be displayed on a device is identified. An indication of a screen orientation of the device is received and webpage content to be displayed on the device is modified based at least in part on the screen orientation of the device, the identified plurality of content chunks, and a focus priority associated with each content chunk of the plurality of content chunks.
Abstract:
A system for sharing data within a network, the system including a first peer device coupled with the network that comprises local cache storage configured to store data comprising at least one entry designated as network accessible cache data and a cache control module operative to control access to the data stored in the local cache storage. The system further includes a second peer device coupled with the first peer device via the network where the second peer device is configured to request network accessible cache data stored in the local cache storage of the first peer device. Furthermore, the cache control module of the first peer device is configured to transmit at least a portion of the requested network accessible cache data to the second peer device in response to the request for network accessible data stored from the second peer device.
Abstract:
A method, apparatus, and article of manufacture for a computer-implemented random reliability engine for computer-implemented dimension reduction using association rules for data mining application. The data mining is performed by the computer to retrieve data from a data store stored on a data storage device coupled to the computer. The data store has records that have multiple attributes. The multiple attributes of a table are clustered to produce a plurality of sets of attributes. Each set of attributes is clustered to obtain data mining attributes.
Abstract:
A system is described for providing contextually relevant data. The system may include a processor, a memory and an interface. The processor may receive a query associated with a user. The processor may determine a spatial context, a temporal context, a social context and a topical context of the query. The spatial context may represent the location of the user, and the temporal context may represent a time of the query. The topical context may represent an item of interest to the user, and the social context may represent an item of interest to other users associated with the user. The processor may identify a plurality of search results for the query searched for. Each search result may be associated with one or more of the spatial context, the temporal context, the social context and the topical context. The processor may provide the identified search results to the user.
Abstract:
An automatic mining system that identifies a set of relevant terms from a large text database of unstructured information, such as the World Wide Web with a high degree of confidence. The automatic mining system includes a software program that enables the discovery of new relationships by association mining and refinement of co-occurrences, using automatic and iterative recognition of new binary relations through phrases that embody related pairs, by applying lexicographic and statistical techniques to classify the relations, and further by applying a minimal amount of domain knowledge of the relevance of the terms and relations. The automatic mining system includes a knowledge module and a statistics module. The knowledge module is comprised of a stemming unit, a synonym check unit, and a domain knowledge check unit. The stemming unit determines if the relation being analyzed shares a common root with a previously mined relation. The synonym check unit identifies the synonyms of the relation, and the domain knowledge check unit considers extrinsic factors for indications that would further clarify the relationship being mined. The statistics module optimizes the confidence level in the relationship.