DISTRIBUTED DATA PROCESSING IN MULTI-TENANT ENVIRONMENTS

    Publication number: US20200004858A1

    Publication date: 2020-01-02

    Application number: US16024264

    Application date: 2018-06-29

    Abstract: Methods, systems, and devices for data processing within a distributed data system are described. In a multi-tenant distributed data system, a provider may supply executable code for processing data using declarative processing instructions received from a tenant. For example, a tenant may provide tenant-specific processing instructions for a requested set of data. The processing instructions may indicate input information (e.g., a data structure, tenant-specific fields, etc.), transformation information (e.g., from a set of pre-defined transformations), and output information. The provider-supplied code may use the tenant-specific processing instructions to process and generate the requested set of data, where the code may be executed by multiple nodes within the system. As such, the code executed by multiple nodes may utilize the input information, transformation information, and output information from the tenant-specific processing instructions to generate the requested data and provide the data to the tenant.
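    Illustrative sketch (not taken from the patent): the abstract describes provider-supplied code that interprets tenant-specific declarative instructions made up of input, transformation, and output information. A minimal Python sketch of that idea follows. The `TRANSFORMS` registry, the instruction layout, the `run_instructions` helper, and the field names are all hypothetical, chosen only for illustration.

```python
from typing import Any, Callable

# Hypothetical registry of pre-defined transformations offered by the provider.
TRANSFORMS: dict[str, Callable[[Any], Any]] = {
    "strip": lambda v: str(v).strip(),
    "upper": lambda v: str(v).upper(),
    "to_int": lambda v: int(v),
}


def run_instructions(instructions: dict, records: list[dict]) -> list[dict]:
    """Provider-supplied code: apply one tenant's declarative instructions."""
    fields = instructions["input"]["fields"]
    steps = instructions["transformations"]
    rename = instructions["output"]["rename"]
    results = []
    for record in records:
        # Input information: keep only the fields the tenant asked for.
        row = {name: record[name] for name in fields}
        # Transformation information: only pre-defined operations are allowed.
        for step in steps:
            row[step["field"]] = TRANSFORMS[step["op"]](row[step["field"]])
        # Output information: map tenant-specific field names to output names.
        results.append({rename.get(k, k): v for k, v in row.items()})
    return results


# Hypothetical tenant-specific instructions and input data.
tenant_instructions = {
    "input": {"fields": ["name__c", "seats__c"]},
    "transformations": [
        {"field": "name__c", "op": "strip"},
        {"field": "name__c", "op": "upper"},
        {"field": "seats__c", "op": "to_int"},
    ],
    "output": {"rename": {"name__c": "name", "seats__c": "seats"}},
}
records = [{"name__c": " acme ", "seats__c": "12", "other": "x"}]
processed = run_instructions(tenant_instructions, records)
```

    Because the tenant supplies only data and never code, the same function could in principle run unchanged on each node against that node's partition of the records.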

    HIGH-DIMENSIONAL DATA MANAGEMENT AND PRESENTATION

    Publication number: US20190236191A1

    Publication date: 2019-08-01

    Application number: US15885499

    Application date: 2018-01-31

    Abstract: An online system manages data by determining relevance of data dimensions to users. The online system determines which data dimensions a user is likely to be interested in. If a user requests to access a data set that includes data of different dimensions, the online system analyzes the dimensions' relevance to the user before providing the data set to the user. The online system provides the data to the user by prioritizing data dimensions that are more relevant to the user. As such, the online system improves the user experience by allowing users to conveniently and quickly locate relevant data and minimizing the distraction caused by irrelevant data. The online system may create and provide a user interface to present data dimensions that are determined to be relevant.
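    Illustrative sketch (not taken from the patent): the abstract describes ordering data dimensions by their relevance to a user before presenting a data set. One simple way to picture this in Python is shown below. The relevance scores, the `prioritize_dimensions` helper, and the column names are hypothetical; the abstract does not say how relevance is computed.

```python
from typing import Optional


def prioritize_dimensions(
    columns: list[str],
    relevance: dict[str, float],
    top_k: Optional[int] = None,
) -> list[str]:
    """Order dimensions from most to least relevant for one user.

    Dimensions with no score are treated as relevance 0.0. The sort is
    stable, so ties keep their original column order.
    """
    ranked = sorted(columns, key=lambda d: relevance.get(d, 0.0), reverse=True)
    return ranked if top_k is None else ranked[:top_k]


# Hypothetical per-user relevance scores (for example, derived from past usage).
user_relevance = {"revenue": 0.9, "region": 0.6, "sku": 0.2}
columns = ["region", "sku", "revenue", "fax"]

ordered = prioritize_dimensions(columns, user_relevance)
most_relevant = prioritize_dimensions(columns, user_relevance, top_k=2)
```

    A user interface could then show the `most_relevant` dimensions first and place the remaining ones behind an expandable control.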

    High-dimensional data management and presentation

    Publication number: US10831757B2

    Publication date: 2020-11-10

    Application number: US15885499

    Application date: 2018-01-31

    Abstract: An online system manages data by determining relevance of data dimensions to users. The online system determines which data dimensions a user is likely to be interested in. If a user requests to access a data set that includes data of different dimensions, the online system analyzes the dimensions' relevance to the user before providing the data set to the user. The online system provides the data to the user by prioritizing data dimensions that are more relevant to the user. As such, the online system improves the user experience by allowing users to conveniently and quickly locate relevant data and minimizing the distraction caused by irrelevant data. The online system may create and provide a user interface to present data dimensions that are determined to be relevant.

    Distributed data processing in multi-tenant environments

    Publication number: US10853358B2

    Publication date: 2020-12-01

    Application number: US16024264

    Application date: 2018-06-29

    Abstract: Methods, systems, and devices for data processing within a distributed data system are described. In a multi-tenant distributed data system, a provider may supply executable code for processing data using declarative processing instructions received from a tenant. For example, a tenant may provide tenant-specific processing instructions for a requested set of data. The processing instructions may indicate input information (e.g., a data structure, tenant-specific fields, etc.), transformation information (e.g., from a set of pre-defined transformations), and output information. The provider-supplied code may use the tenant-specific processing instructions to process and generate the requested set of data, where the code may be executed by multiple nodes within the system. As such, the code executed by multiple nodes may utilize the input information, transformation information, and output information from the tenant-specific processing instructions to generate the requested data and provide the data to the tenant.

    Interactive dataflow preview
    Type: Invention grant

    Publication number: US11755608B2

    Publication date: 2023-09-12

    Application number: US16740918

    Application date: 2020-01-13

    CPC classification number: G06F16/254 G06F16/258

    Abstract: Described herein are systems, apparatus, methods and computer program products for implementing design time and batch time for an extract, transform, load (ETL) process. When a session is established, a Kubernetes pod instance may be exclusively associated with a user for the user's sessions. Design time and batch time may both be performed within the Kubernetes pod instance. As such, a service provider may provide a secure dataflow preview. Furthermore, the dataflow may be cached. Caching of the dataflow allows the service provider to more quickly provide follow up previews, decreasing latency.
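    Illustrative sketch (not taken from the patent): the abstract says a per-user instance serves dataflow previews and that caching makes follow-up previews faster. The Python sketch below models only the caching idea with an in-memory dictionary. The `PreviewSession` class, the dataflow layout (`select`, `limit`), and the assumption that source rows do not change during a session are hypothetical; no Kubernetes API is used.

```python
import hashlib
import json


class PreviewSession:
    """Hypothetical stand-in for the per-user instance that serves previews."""

    def __init__(self, user_id: str) -> None:
        self.user_id = user_id
        self._cache: dict[str, list[dict]] = {}
        self.computed = 0  # number of previews actually computed

    def preview(self, dataflow: dict, rows: list[dict]) -> list[dict]:
        # Key the cache on the dataflow definition only; this assumes the
        # source rows stay fixed for the lifetime of the session.
        key = hashlib.sha256(
            json.dumps(dataflow, sort_keys=True).encode("utf-8")
        ).hexdigest()
        if key not in self._cache:
            self.computed += 1
            keep = dataflow["select"]
            sample = rows[: dataflow["limit"]]
            self._cache[key] = [{c: r[c] for c in keep} for r in sample]
        return self._cache[key]


session = PreviewSession("user-1")
flow = {"select": ["id"], "limit": 2}
rows = [{"id": 1, "x": "a"}, {"id": 2, "x": "b"}, {"id": 3, "x": "c"}]

first = session.preview(flow, rows)
second = session.preview(flow, rows)  # served from the cache
```

    Keeping the cache inside the session object mirrors the exclusive association between a user and an instance: one user's cached preview is never visible to another user's session.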

    ORCHESTRATION FOR DATA PIPELINE EXECUTION PLANS

    Publication number: US20210240519A1

    Publication date: 2021-08-05

    Application number: US16779040

    Application date: 2020-01-31

    Abstract: Methods, systems, and devices supporting dynamic process orchestration are described. An orchestration server may receive a request defining a data modification process from a user device. The orchestration server may generate an execution file based on the request, and the execution file may include a set of tasks for performing the data modification process and an order for performing the set of tasks. The orchestration server may execute, for the execution file, a first subset of tasks of the set of tasks according to the order for performing the set of tasks and, in some cases, may update the execution file based on executing the first subset of tasks. For example, updating the execution file may involve modifying a second subset of tasks of the set of tasks. The orchestration server may execute, for the updated execution file, the modified second subset of tasks according to the order for performing the set of tasks.
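    Illustrative sketch (not taken from the patent): the abstract describes running a first subset of ordered tasks, updating the execution file based on the result, and then running the modified second subset. A minimal Python sketch of that control flow follows. The list-of-dicts execution file, the `extract` and `load` tasks, and the batch-size rule are hypothetical.

```python
def extract(state: dict) -> dict:
    """Hypothetical first task: produce ten rows."""
    return {**state, "rows": list(range(10))}


def load(state: dict, batch_size: int) -> dict:
    """Hypothetical second task: split the rows into batches."""
    rows = state["rows"]
    batches = [rows[i:i + batch_size] for i in range(0, len(rows), batch_size)]
    return {**state, "batches": batches}


def execute(tasks: list[dict], state: dict) -> dict:
    """Run tasks in list order, threading the state through each one."""
    for task in tasks:
        state = task["run"](state, **task.get("params", {}))
    return state


# Execution file: the set of tasks, with list position as the execution order.
execution_file = [
    {"name": "extract", "run": extract},
    {"name": "load", "run": load, "params": {"batch_size": 100}},
]

# Execute the first subset of tasks.
state = execute(execution_file[:1], {})

# Update the execution file based on what the first subset produced:
# here the second subset's batch size is adjusted to the observed row count.
new_size = max(1, len(state["rows"]) // 2)
execution_file[1] = {**execution_file[1], "params": {"batch_size": new_size}}

# Execute the modified second subset, still in the original order.
state = execute(execution_file[1:], state)
```

    Only the parameters of the remaining task change; the order recorded in the execution file is preserved across the update.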
