SYSTEM TO LABEL K-MEANS CLUSTERS WITH HUMAN UNDERSTANDABLE LABELS

    公开(公告)号:US20210357429A1

    公开(公告)日:2021-11-18

    申请号:US15931233

    申请日:2020-05-13

    Abstract: Disclosed herein are system, method, and apparatus for generating labels for k-means clusters. The method includes accessing a plurality of data records from a database repository, and storing the plurality of data records into at least one of primary or secondary memory associated with at least one computer processor performing the method, along with a cluster number for each data record. All data records having a same cluster number form a cluster, and each record has been categorized or designated a cluster number out of a total K number of clusters. The method includes for each of a plurality of classification features, performing cluster-based analysis for a first cluster with respect to a single feature to generate a single feature overlap score. The method includes sorting, grouping, and generating a naming label for the first cluster based on the predetermined number of features having the lowest overlap scores.

    SORTING DATA ELEMENTS OF A GIVEN SET OF DATA ELEMENTS

    公开(公告)号:US20210357183A1

    公开(公告)日:2021-11-18

    申请号:US16876573

    申请日:2020-05-18

    Abstract: A computer implemented method is used for sorting data elements of a given set. The method includes performing an evaluation of a first type of usage of each data element. The method includes determining a set of data element candidates dependent on the evaluation of the first type of usage. The method includes performing an evaluation of a second type of usage of each data element of the set of data element candidates. The method includes sorting the data elements of the set of data element candidates dependent on the evaluation of the second type of usage of each data element of the set of data element candidates. The method includes providing the sorted data elements of the set of data element candidates, and in response, receiving a request for a data processing based on the provided sorted data elements of the set of data element candidates.

    Building and matching electronic standards profiles using machine learning

    公开(公告)号:US11176486B2

    公开(公告)日:2021-11-16

    申请号:US15857103

    申请日:2017-12-28

    Abstract: Method and apparatus for generating profiles using machine learning and influencing online interactions are provided. The methods include generating a user profile specifying a plurality of attribute values for a plurality of principle attributes, by processing a corpus of electronic documents using a first trained machine learning model. In an embodiment, the method further comprises generating a provider profile specifying a plurality of attribute values for the plurality of principle attributes, for each of a plurality of providers, by processing a respective corpus of electronic documents associated with each respective provider using a second trained machine learning model. A plurality of match coefficients based on comparing the user profile and the plurality of provider profiles are determined. Finally, one or more online interactions between the user and the target provider are influenced based on the determined match coefficients.

    System and method for vending consumer goods in a vehicle

    公开(公告)号:US11170600B2

    公开(公告)日:2021-11-09

    申请号:US16870474

    申请日:2020-05-08

    Abstract: A system for vending consumer goods in a vehicle includes a container for storing consumable units of at least one sort of consumer goods, a sensor system including at least one first sensor configured to detect removal of a consumable unit from the container by a consumer, and a computer system connected to the sensor system via a data network. The computer system includes a user database storing a user account assigned to the consumer and a service database storing a price for each consumable unit. Each user account comprises a service payment account listing services and a corresponding amount of money due for each service. The computer system is configured to retrieve from the service database a price for the removed consumable unit and to charge the service payment account of the consumer with an amount of money corresponding to the price of the removed consumable unit.

    Reclustering of database tables based on peaks and widths

    公开(公告)号:US11163746B2

    公开(公告)日:2021-11-02

    申请号:US17249796

    申请日:2021-03-12

    Applicant: Snowflake Inc.

    Abstract: The subject technology determines whether a table is sufficiently clustered. The subject technology in response to determining the table is not sufficiently clustered, selects one or more micro-partitions of the table to be reclustered. The subject technology constructs a data structure for the table. The subject technology extracts minimum and maximum endpoints for each micro-partition in the data structure. The subject technology sorts each of one or more peaks in the data structure based on height. The subject technology sorts overlapping micro-partitions based on width. The subject technology selects based on which micro-partitions are within the tallest peaks of the one or more peaks and further based on which of the overlapping micro-partitions have the widest widths.

    SYSTEMS AND METHODS FOR ADVANCED VELOCITY PROFILE PREPARATION AND ANALYSIS

    公开(公告)号:US20210312450A1

    公开(公告)日:2021-10-07

    申请号:US16837673

    申请日:2020-04-01

    Abstract: A system is provided. The system includes a computing device including at least one processor in communication with at least one memory device. The at least one processor is programmed to receive a plurality of data points. The at least one processor is also programmed to sort the plurality of data points into chronological order. The at least one processor is further programmed to divide the plurality of data points into a plurality of subsets. Each subset of the plurality of subsets represents a period of time. In addition, the at least one processor is programmed to process each subset to determine a velocity value for the individual subset. Moreover, the at least one processor is programmed to combine the plurality of velocity values to determine a final velocity value.

    COMPUTERIZED SYSTEMS AND METHODS FOR PRODUCT CATEGORIZATION USING ARTIFICIAL INTELLIGENCE

    公开(公告)号:US20210295185A1

    公开(公告)日:2021-09-23

    申请号:US17225056

    申请日:2021-04-07

    Applicant: Coupang Corp.

    Abstract: Systems and methods are provided for categorizing products using AI. One method comprises retrieving initial training data including products associated with one or more categories; pre-processing the initial training data to generate synthesized training data; generating a hierarchical model using the synthesized training data, the hierarchical model containing at least two layers of nodes below a root node; receiving information associated with a first uncategorized product; and receiving a request to predict a set of N categories with the highest N total probability scores. The method may further comprise predicting, using the hierarchical model, N categories of the first uncategorized product, by calculating total probability scores, and determining the N categories with the highest N total probability scores; sorting the first uncategorized product into the N categories associated with the nodes from the first and second layers having the highest total probability scores; and displaying the sorted first uncategorized product and its associated N categories on a user device associated with a user.

Patent Agency Ranking