Abstract:
A method and system for generating accurate search results using a content-index is provided. In a preferred embodiment, a content-index search system is invoked in response to a query on a collection of objects. The collection of objects is indexed by the content-index and may, for example, be a corpus of documents indexed by the terms contained in the documents. The content-index search system uses the content-index to generate and store an initial search result in response to the query. Because the content-index is typically out of date with respect to a dynamically changing collection of objects, the content-index search system invokes search result correction routines to remove from the stored search result references that were incorrectly included and to add to the stored search result references that were incorrectly excluded. References that were incorrectly included include those that refer to objects that no longer exist and those that refer to objects that have been modified since the content-index was last updated and that no longer match the search criteria. References that were incorrectly excluded include those that refer to new objects that were not indexed and match the search criteria and those that refer to objects that have been modified since the content-index was last updated and that now match the search criteria.
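The correction pass described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the patented implementation: the `Doc` class, the logical `modified` timestamps, the `indexed_at` value, and the word-containment test in `matches` are all hypothetical stand-ins.

```python
from dataclasses import dataclass


@dataclass
class Doc:
    text: str
    modified: int  # hypothetical logical timestamp of the last change


def matches(doc: Doc, term: str) -> bool:
    """Assumed search criterion: the document contains the term as a word."""
    return term in doc.text.lower().split()


def corrected_search(index, indexed_at, docs, term):
    # Initial result taken straight from the (possibly stale) content-index.
    result = set(index.get(term, set()))

    # Remove references that were incorrectly included.
    for doc_id in list(result):
        doc = docs.get(doc_id)
        if doc is None:
            result.discard(doc_id)  # the object no longer exists
        elif doc.modified > indexed_at and not matches(doc, term):
            result.discard(doc_id)  # modified since indexing, no longer matches

    # Add references that were incorrectly excluded.
    for doc_id, doc in docs.items():
        if doc.modified > indexed_at and matches(doc, term):
            result.add(doc_id)  # new or modified since indexing, now matches

    return result
```

Only objects changed after `indexed_at` are re-checked against the criteria, so the cost of the correction grows with the amount of change rather than with the size of the collection, provided the changed objects can be enumerated cheaply (the sketch simply scans `docs`).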
Abstract:
A scheduler manages execution of a plurality of data-collection jobs, assigns individual jobs to specific forwarders in a set of forwarders, and generates and transmits tokens (e.g., pairs of data-collection tasks and target sources) to the assigned forwarders. Each forwarder uses its tokens, along with stored information applicable across jobs, to collect data from the target source and forward it to an indexer for processing. For example, the indexer can then break a data stream into discrete events, extract a timestamp from each event, and index (e.g., store) the event based on the timestamp. The scheduler can monitor forwarders' job performance and use it to influence subsequent job assignments. Thus, data-collection jobs can be efficiently assigned to and executed by a group of forwarders, where the group can potentially be diverse and dynamic in size.
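As a rough illustration of performance-aware assignment, the sketch below picks the forwarder with the lowest average job duration and hands it a token. The `Forwarder` and `Scheduler` classes, the use of average duration as the performance measure, and the tie-break by name are all assumptions made for this example; the abstract does not specify them.

```python
from dataclasses import dataclass, field


@dataclass
class Forwarder:
    name: str
    durations: list = field(default_factory=list)  # seconds per finished job

    def avg_duration(self) -> float:
        # A forwarder with no history counts as 0.0 so that it gets tried.
        return sum(self.durations) / len(self.durations) if self.durations else 0.0


class Scheduler:
    def __init__(self, forwarders):
        self.forwarders = {f.name: f for f in forwarders}

    def assign(self, task: str, target: str):
        """Pick the best-performing forwarder and build a token for it."""
        best = min(self.forwarders.values(),
                   key=lambda f: (f.avg_duration(), f.name))
        token = (task, target)  # token = (data-collection task, target source)
        return best.name, token

    def report(self, name: str, seconds: float):
        """Record observed performance to influence later assignments."""
        self.forwarders[name].durations.append(seconds)
```

Because forwarders are kept in a dictionary, adding or removing one between assignments is enough to model a group that changes size over time.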
Abstract:
Data storage is improved by combining content indexing and data reduction in text-containing files by using common word elimination. Raw data is processed by finding words in selected files, creating an index of found words, and replacing the words in the raw data with pointers to the corresponding words in the index. Each word appears only once in the index. Consequently, the index is relatively small and the procedure is completely reversible. In particular, the index is small relative to other methods because the data is transformed in place, and the transformed data and index are used together to capture the total information about the data.
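A minimal sketch of the idea follows: each distinct word is stored once in an index, and the text is replaced by pointers into that index. One assumption goes beyond the abstract: runs of non-word characters (spaces, punctuation) are indexed in the same way as words, so that decoding reproduces the input exactly. The function names are hypothetical.

```python
import re

# Alternating runs of word characters and non-word characters; together
# they cover every character of the input.
TOKEN = re.compile(r"\w+|\W+")


def encode(text):
    index = []     # each distinct token is stored exactly once
    position = {}  # token -> its pointer (offset into index)
    pointers = []  # the transformed data: one pointer per token
    for tok in TOKEN.findall(text):
        if tok not in position:
            position[tok] = len(index)
            index.append(tok)
        pointers.append(position[tok])
    return index, pointers


def decode(index, pointers):
    # The index and the pointers together hold the full information.
    return "".join(index[p] for p in pointers)
```

The index doubles as a content index: a word occurs in the data exactly when it appears in `index`, and its pointer value locates every occurrence in `pointers`.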
Abstract:
An electronic device including a sensor, a first processor, and a second processor, and a method for associating data with time information, are provided. The method includes receiving a notification signal corresponding to the data from the first processor, determining time information and first identification information that correspond to the notification signal in response to the reception, receiving the data and second identification information corresponding to the notification signal from the first processor, associating the data with the time information based at least on the first identification information and the second identification information, and providing the data associated with the time information to an application. Various other embodiments are also possible.
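The matching step can be illustrated from the second processor's point of view. The abstract does not say how the identification information is produced; this sketch assumes both processors number notifications with the same running counter, so the data's second identifier equals the first identifier recorded when the notification arrived. The `TimeAssociator` class and its method names are hypothetical.

```python
import itertools
import time


class TimeAssociator:
    def __init__(self, clock=time.monotonic):
        self._clock = clock
        self._counter = itertools.count()  # assumed shared numbering scheme
        self._pending = {}  # first identification -> time information

    def on_notification(self) -> int:
        """Record the arrival time of a notification under a fresh ID."""
        first_id = next(self._counter)
        self._pending[first_id] = self._clock()
        return first_id

    def on_data(self, data, second_id: int):
        """Attach the stored time to data whose ID matches a notification."""
        # pop() uses each timestamp once; an unknown ID raises KeyError.
        timestamp = self._pending.pop(second_id)
        return {"data": data, "time": timestamp}
```

Matching by identifier rather than by arrival order means the data can arrive late or out of order and still receive the time of its own notification.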
Abstract:
Embodiments of the invention provide a system and method for searching and reporting on semistructured data that can include dynamic metadata. One embodiment can comprise providing a user interface to a user based on an object type definition for an object type that allows the user to specify search criteria associated with a set of metadata, mapping the user search criteria to a query that comprises at least one structured query constraint and at least one unstructured query constraint, processing the query to search a set of data objects containing semistructured data associated with the object type according to the query and returning a set of results to the user. The search results can be returned to a user based on user-specified reporting parameters. Additionally, the reporting definition can be saved as an object for future execution.
Abstract:
A client device can be configured to perform a local index search and a server index search to automatically identify and upload content items on the client device that have not been uploaded to an online content management system. A local index search can include creating a unique local identifier of a content item and searching a local upload index that includes the unique local identifier of each content item that has been uploaded. A server index search can include creating a unique server identifier of the content item and searching a server upload index that includes the unique server identifier of each content item stored on the online content management system. Content items that the two searches show have not been uploaded to the online content management system can then be uploaded by the client device.
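The two-stage check might look like the sketch below. The identifier schemes are assumptions chosen for illustration: the local identifier is built from cheap file metadata, and the server identifier is a SHA-256 hash of the content. The abstract does not specify either, and the item dictionary layout is hypothetical.

```python
import hashlib


def local_id(path: str, size: int, mtime: int) -> str:
    # Assumed local identifier: metadata only, no need to read the content.
    return f"{path}:{size}:{mtime}"


def server_id(content: bytes) -> str:
    # Assumed server identifier: a hash of the content itself.
    return hashlib.sha256(content).hexdigest()


def items_to_upload(items, local_index, server_index):
    pending = []
    for item in items:
        if local_id(item["path"], item["size"], item["mtime"]) in local_index:
            continue  # local upload index says this device already uploaded it
        if server_id(item["content"]) in server_index:
            continue  # already stored on the server, e.g. by another device
        pending.append(item["path"])
    return pending
```

Running the cheap local check first means the content only has to be hashed for items the local index does not recognize.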
Abstract:
A method for searching encrypted data includes identifying, with a client, a plurality of values within a predetermined search range in a search index stored within a memory of the client, each value in the plurality of values being present in a plaintext representation of at least one encrypted file in a plurality of encrypted files stored in a server. The method further includes generating and transmitting at least one search query to the server through a data network, and receiving, with the client, at least one response from the server through the data network, the response including the encrypted keyword corresponding to the value in the plurality of values and an identifier of at least one file in the plurality of encrypted files stored on the server that includes the value.
Abstract:
Methods and systems to build and utilize a search infrastructure are described. The system generates index information components in real-time based on a database that is time-stamped. The system updates index information at a plurality of query node servers based on the index information components. A query engine receives a search query from a client machine and identifies search results based on the query and the index information. The system communicates the search results, over the network, to the client machine.
Abstract:
A method for providing an encrypted search index for performing searches on encrypted documents, the method comprising: (i) providing a set of documents, the documents comprising a plurality of unencrypted phrases; (ii) providing a master key; (iii) providing, based on the master key, for each phrase a set of encryption keys comprising one or more encryption keys; (iv) selecting, for each phrase, one encryption key of the set of encryption keys; (v) encrypting each phrase with the selected encryption key; and (vi) building an index based on the encrypted phrases, the index comprising information regarding which encrypted phrases are contained in which document.
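Steps (iii) to (vi) can be sketched with the Python standard library alone. Several choices here are assumptions, not part of the claimed method: HMAC-SHA256 stands in for the encryption of a phrase (it yields a keyed, non-reversible token rather than a decryptable ciphertext), keys are derived from the master key with HMAC, the key set has a fixed size `n`, and a key is selected at random for each phrase in each document. All function names are hypothetical.

```python
import hashlib
import hmac
import random


def phrase_keys(master_key: bytes, phrase: str, n: int = 3):
    """Step (iii): derive a set of n keys for a phrase from the master key."""
    return [
        hmac.new(master_key, f"{i}:{phrase}".encode(), hashlib.sha256).digest()
        for i in range(n)
    ]


def token(key: bytes, phrase: str) -> str:
    """Step (v), with HMAC-SHA256 standing in for encryption."""
    return hmac.new(key, phrase.encode(), hashlib.sha256).hexdigest()


def build_index(master_key, documents, rng=None):
    rng = rng or random.SystemRandom()
    index = {}
    for doc_id, phrases in documents.items():
        for phrase in set(phrases):
            key = rng.choice(phrase_keys(master_key, phrase))  # step (iv)
            index.setdefault(token(key, phrase), set()).add(doc_id)  # step (vi)
    return index


def search(master_key, index, phrase):
    """Look the phrase up under every key it could have been protected with."""
    hits = set()
    for key in phrase_keys(master_key, phrase):
        hits |= index.get(token(key, phrase), set())
    return hits
```

Spreading one phrase over several keys means the same phrase can appear under different tokens in the index, which blurs phrase frequencies for anyone who sees the index without the master key; the holder of the master key recomputes all `n` tokens to search.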
Abstract:
Methods, computer program products, and computer systems for configuration management are disclosed. Such methods, computer program products, and computer systems include identifying an associative template node and setting a configuration parameter to a parameter value, based on a template association. The associative template node is a node in a hierarchy of an associative template. The identifying indicates a template association between the associative template node and a node in a hierarchy of hierarchically-organized unstructured data (HOUD). The parameter value is maintained in the associative template node, and the node comprises the configuration parameter.