Abstract:
Systems and methods for trend detection using frequency analysis in accordance with embodiments of the invention are disclosed. In one embodiment of the invention, trend detection includes generating a discrete time sequence of word counts for a target word using a trend detection device, performing frequency analysis of the discrete time sequence of word counts to determine contributions of frequency components within different frequency ranges to the discrete time sequence of word counts using the trend detection device, and detecting that the target word is a trending keyword based upon at least the frequency analysis of the discrete time sequence of word counts for the target word using the trend detection device.
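The frequency-analysis step can be illustrated with a minimal sketch. The abstract does not say which frequency ranges are compared or how the decision is made, so everything specific here is an assumption: the function names, the `split` and `threshold` parameters, and the rule that a word counts as trending when most of its non-constant energy lies in the lowest frequency components (a sustained rise rather than rapid fluctuation).

```python
import cmath


def band_energy_fractions(counts, split):
    """Return (low, high) shares of non-DC spectral energy.

    `counts` is the discrete time sequence of word counts. `split` is a
    hypothetical boundary: components 1..split are "low", the rest "high".
    """
    n = len(counts)
    if n == 0:
        return 0.0, 0.0
    mean = sum(counts) / n
    centered = [c - mean for c in counts]  # remove the constant (DC) part
    energies = []
    for k in range(1, n // 2 + 1):
        # Plain discrete Fourier transform coefficient for frequency index k.
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * k * t / n)
            for t, x in enumerate(centered)
        )
        energies.append(abs(coeff) ** 2)
    total = sum(energies)
    if total == 0:
        return 0.0, 0.0  # flat sequence: no frequency content at all
    low = sum(energies[:split]) / total
    return low, 1.0 - low


def is_trending(counts, split=2, threshold=0.6):
    """Assumed decision rule: trending if low-frequency energy dominates."""
    low, _ = band_energy_fractions(counts, split)
    return low >= threshold
```

Under these assumptions, a steadily rising sequence such as `list(range(16))` is flagged, while a sequence that merely alternates between two values, or stays constant, is not.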
Abstract:
An information processing apparatus that obtains intimacy degree information corresponding to identification information of a first person, specifies an extraction period based on the intimacy degree information, and extracts content in the extraction period.
Abstract:
A conversation server system having one or more processors and memory stores a plurality of index components in an index. The server associates a first message having a first term with a conversation that includes at least a second message. The first term is not included in the second message and the second message includes a second term that is not included in the first message. The server stores, in the index, a plurality of index components for a same referenced object, including an index component indicative of the first term and an index component indicative of the second term. In some embodiments the same referenced object is associated with index components for a first sender of the first message and a second sender of the second message, so that a search for a conversation with messages from the first sender and the second sender retrieves the referenced object.
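The indexing idea can be sketched as follows, assuming the referenced object is the conversation itself and that index components are simple (kind, value) keys. The class name, method names, and whitespace tokenization are hypothetical and not taken from the abstract.

```python
from collections import defaultdict


class ConversationIndex:
    """Toy index in which every component points at a conversation id."""

    def __init__(self):
        # (kind, value) -> set of conversation ids; kind is "term" or "sender".
        self._postings = defaultdict(set)

    def add_message(self, conversation_id, sender, text):
        # Terms and senders of every message are attached to the same
        # referenced object: the conversation the message belongs to.
        self._postings[("sender", sender.lower())].add(conversation_id)
        for term in text.lower().split():
            self._postings[("term", term)].add(conversation_id)

    def search(self, terms=(), senders=()):
        keys = [("term", t.lower()) for t in terms]
        keys += [("sender", s.lower()) for s in senders]
        if not keys:
            return set()
        result = set(self._postings.get(keys[0], set()))
        for key in keys[1:]:
            result &= self._postings.get(key, set())
        return result
```

Because both messages of a conversation contribute components to one object, a query that combines a term from the first message with a term from the second, or the two senders, still retrieves that conversation.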
Abstract:
Systems and methods are provided for displaying relationships between stories. In some embodiments, a plurality of referents that are each related to a first story may be caused to be displayed, where a referent is at least one of an event, a character, an object, a subject, a time, a place and a person. A user may be enabled to select one of the plurality of referents in order to view identification information identifying at least one other story, where the other story is also associated with the selected referent. In response to user selection of one of the plurality of referents, identification information identifying the at least one other story that is associated with the selected referent may be caused to be displayed.
Abstract:
An index generating unit divides each item of name data in the search target data into both words and characters, calculates start and end scores indicating the start and end of each word and of each character, links them to each entry word that makes up the name data as a list (a name ID, a position, and start and end scores), and stores this list in an index storage unit. A searching unit decomposes an input character string into partial character strings, acquires corresponding candidate entries from the index storage unit, and judges the continuity between candidate entries on the basis of the lists, adding a comparison score to a candidate entry according to that continuity.
Abstract:
One or more techniques and/or systems are provided for tagging search results, organizing tagged search results for later access from various devices, public sharing of tagged search results, and/or providing targeted content based upon search results tagged by a user. That is, a user may tag a search result (e.g., a website, an image, a social network profile, etc.), such as through a one-click user input, with a tag to create a tagged search result. The tagged search result may be organized into a public tag collection for sharing and/or exploration of tagged search results by other users. The tagged search result may be organized into a personal tag collection for later access by the user from any device. Because the tagged search result may be indicative of an interest of the user, targeted content associated with the tagged search result may be provided to the user.
Abstract:
A system and method for cataloging and indexing messages that utilizes a message reference number that may be translated among different formats for propagating through a standard network and for displaying at a terminal. The reference number may be permanently assigned for the life of the archive. In one embodiment, the reference number may be generated using system number, temporal and sequence fields. The reference number may be mapped using a reversible mapping algorithm to a standard control field format for propagation through the existing database infrastructure systems. The reference number enables a database of search results to be stored permanently indexed by the reference number. Searches may reference other search results by reference number, and queries may be related to search results by the reference number.
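A reversible mapping of this kind might look like the sketch below. The abstract names only the three fields, so the field widths, the 14-digit control field layout, the dashed display format, and all identifiers are assumptions made for illustration.

```python
from typing import NamedTuple


class MessageRef(NamedTuple):
    system: int    # system number, assumed range 0..999
    day: int       # temporal field, assumed days since an archive epoch, 0..99999
    sequence: int  # sequence number within that day, assumed 0..999999


def to_control_field(ref):
    """Pack the reference into an assumed 14-digit control field."""
    if not (0 <= ref.system < 1000
            and 0 <= ref.day < 100000
            and 0 <= ref.sequence < 1000000):
        raise ValueError("field out of range for the assumed layout")
    return f"{ref.system:03d}{ref.day:05d}{ref.sequence:06d}"


def from_control_field(field):
    """Inverse of to_control_field: recover the original reference."""
    if len(field) != 14 or not (field.isascii() and field.isdigit()):
        raise ValueError("not a 14-digit control field")
    return MessageRef(int(field[:3]), int(field[3:8]), int(field[8:]))


def to_display(ref):
    """Human-readable form for showing at a terminal (assumed format)."""
    return f"{ref.system:03d}-{ref.day:05d}-{ref.sequence:06d}"
```

Fixed-width fields keep the mapping lossless in both directions, so the same reference number can serve as a permanent key for stored search results regardless of which format carried it.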
Abstract:
Embodiments of the invention provide a system, method and computer program products for information retrieval from multiple documents by proximity searching for search queries. A method includes generating an index for the multiple documents, wherein the index includes words in snippets in the documents. An input search query is processed against the index by searching query terms over the snippets to introduce term proximity information implicitly in the information retrieval. Results of multiple sentence level search operations are combined as output.
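As a rough illustration, the sketch below treats each sentence as a snippet, requires all query terms to occur in the same sentence, and combines sentence-level hits into a per-document count. The sentence splitting rule, the scoring by hit count, and the function names are assumptions, not details from the abstract.

```python
import re
from collections import defaultdict


def build_snippet_index(documents):
    """Map each word to the (doc_id, sentence_no) snippets that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
        for sentence_no, sentence in enumerate(sentences):
            for word in re.findall(r"\w+", sentence.lower()):
                index[word].add((doc_id, sentence_no))
    return index


def search(index, query):
    """Return {doc_id: number of sentences containing every query term}."""
    terms = re.findall(r"\w+", query.lower())
    if not terms:
        return {}
    hits = set(index.get(terms[0], set()))
    for term in terms[1:]:
        hits &= index.get(term, set())
    # Combine the sentence-level results into one score per document.
    scores = defaultdict(int)
    for doc_id, _sentence_no in hits:
        scores[doc_id] += 1
    return dict(scores)
```

Proximity is never computed explicitly here: terms that match only in different sentences of a document simply produce no hit, which is how the snippet granularity introduces proximity implicitly.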
Abstract:
The present invention pertains to extraction of text from an index of a search engine starting at an arbitrary position in the text, and to analysis of texts for co-occurrence of words, and to the use of said extraction and analysis for inferring implicit (causal, associative, etc.) relationships among objects in sequences thereof. The present invention is particularly useful in extending the utility of search engines to index and retrieve information represented by a sequence of objects different from text information objects. The present invention extends the basic hit with at least two wordIDs, one for the “previous word”, whose position in the document is position−1, and one for the “next word”, whose position in the document is position+1.
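The extended hit can be sketched as a posting that stores, for each occurrence of a word, the wordIDs of its neighbours. Text is then read back from the index alone by following the "next word" links. The data layout, the sentinel value, and the function names are hypothetical.

```python
from collections import defaultdict

NONE_ID = -1  # hypothetical sentinel meaning "no neighbouring word"


def build_index(docs):
    """Return (vocab, postings) with hits extended by prev/next wordIDs."""
    vocab = {}                    # word -> wordID
    postings = defaultdict(dict)  # wordID -> {(doc_id, position): (prev, next)}
    for doc_id, text in docs.items():
        ids = [vocab.setdefault(w, len(vocab)) for w in text.lower().split()]
        for pos, word_id in enumerate(ids):
            prev_id = ids[pos - 1] if pos > 0 else NONE_ID
            next_id = ids[pos + 1] if pos + 1 < len(ids) else NONE_ID
            postings[word_id][(doc_id, pos)] = (prev_id, next_id)
    return vocab, postings


def extract(vocab, postings, doc_id, word, position, length):
    """Read up to `length` words starting at `position`, using only the index."""
    words = {word_id: w for w, word_id in vocab.items()}
    word_id = vocab.get(word, NONE_ID)
    out = []
    while (word_id != NONE_ID and len(out) < length
           and (doc_id, position) in postings[word_id]):
        out.append(words[word_id])
        # Follow the "next word" link to the hit at position + 1.
        word_id = postings[word_id][(doc_id, position)][1]
        position += 1
    return out
```

The same neighbour links give adjacent co-occurrence pairs directly, since each hit already names the words at position−1 and position+1.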
Abstract:
A method and search engine for classifying a source publishing a document on a portion of a network include the steps of electronically receiving a document, determining, based on the document, a source which published the document, and assigning a code to the document based on whether data associated with the document published by the source matches data contained in a database. An intelligent geographic- and business topic-specific resource discovery system facilitates local commerce on the World-Wide Web and also reduces search time by accurately isolating information for end-users. Distinguishing and classifying business pages on the Web by business categories using Standard Industrial Classification (SIC) codes is achieved through an automatic iterative process.