Abstract:
A system and method for evaluating claims from sources to update database records. A trust score is developed for each source. If a source submits a claim, the trust score for that source and the value of the claim are evaluated against prior conflicting claims. If the current claim is deemed the most likely, then it is adopted as provisional “truth”. If not, the current claim is rejected.
Abstract:
A system determines a count of each item in each item set, sorts each count into ascending order, assigns an ascending identifier to each item corresponding to each sorted count, and sorts each identifier in each item set in descending order. The system partitions item sets into a first group of item sets and a second group of item sets, each item set in the first group including a common largest identifier, determines a count for each subset of each item set of the first group, and determines a count of each subset of each item set by summing each count for each subset of each item set of the first group with each corresponding count for each corresponding subset of each item set of the second group. The system outputs a recommended item set based on the count of each subset of each item set.
Abstract:
A system receives an association of first item with first system user, generates first hash value by applying first hash function associated with first system user to first item identifier associated with first item, and sets a bit corresponding to first hash value in array. The system receives an association of second item with second system user, generates second hash value by applying second hash function associated with second user to second item identifier associated with second item, and sets a bit corresponding to second hash value in array. The system receives a request to determine whether third item is associated with first system user, generates third hash value by applying first hash function to third item identifier associated with third item, and outputs message that third item is not associated with first user if a bit corresponding to third hash value is not set in array.
Abstract:
Suggesting query items based on database fields is described. A database system receives a character sequence entered in a search box. The database system identifies a first distribution of first field-based items that include the character sequence, and a second distribution of second field-based items that include the character sequence. The database system identifies a first item based on combining the first distribution with a distribution of queried fields, and a second item based on combining the second distribution with the distribution of queried fields. The database system outputs the first item and the second item to a location associated with the search box. The database system executes a search based on any requested item, in response to receiving a request to search for any item output to the location associated with the search box.
Abstract:
Word phrases are stored in a phrase structure. Each word is stored as a keyword in a keyword structure. Each keyword is associated with usage attributes identifying use of a word in a word phrase. Any preceding words associated with a keyword, and a mapping from any preceding words to a word phrase, is stored for each word. A word string is input. Match attributes are updated in a match structure if a word in the word string matches any keyword and if any preceding words associated with any matching keyword includes a preceding word which precedes the word in the word string. The match attributes indicate use of the matching word in the word string and in a word phrase. Whether a word phrase is present in the word string is determined based on the usage attributes and the match attributes associated with multiple matching words.
Abstract:
User scores based on bulk record updates is described. A system receives record updates submitted by a user. The system subtracts a penalty debit from a user score, which corresponds to the user, for each record which corresponds to at least one of the record updates and which is removed from purchasing availability. The system adds a full credit to the user score for each record which corresponds to at least one of the record updates and which is purchased. The system adds a partial credit to the user score for each record which corresponds to at least one of the record updates and which is yet to be purchased and which is yet to be removed from purchasing availability, wherein the partial credit is a positive value that is less than the full credit. The system enables the user to access records, based on the user score.
Abstract:
A system receives a character sequence entered in a search box, identifies a first item that includes the character sequence and a second item that includes the character sequence, identifies a first item set that includes the first item and a second item set that includes the second item; and outputs the first item set and the second item set to a location associated with the search box. The system receives a selection of a third item from the first item set, identifies a third item set that includes the third item and a fourth item set that includes the third item, and outputs the third item set and the fourth item set to the location associated with the search box. The system receives a selection of any item set from the location associated with the search box, and executes a search based on the selected item set.
Abstract:
The technology disclosed describes systems and methods for generating feature vectors from resource description framework (RDF) graphs. Machine learning tasks frequently operate on vectors of features. Available systems for parsing multiple documents often generate RDF graphs. Once a set of interesting features to be considered has been established, the disclosed technology describes systems and methods for generating feature vectors from the RDF graphs for the documents. In one example setting, a machine learning system can use generated feature vectors to determine how interesting a news article might be, or to learn information-of-interest about a specific subject reported in multiple articles. In another example setting, viable interview candidates for a particular job opening can be identified using feature vectors generated from a resume database, using the disclosed systems and methods for generating feature vectors from RDF graphs.