Techniques and architectures for managing cascading mutations in an environment having a data lake

    Publication number: US11314767B2

    Publication date: 2022-04-26

    Application number: US16943314

    Application date: 2020-07-30

    Abstract: Managing mutations in a data lake environment. A mutation request to cause write operations that modify data objects or structures within an environment for collecting unformatted raw data is received. The environment has at least a data table and a notification table. An entry is written to the data table with a streaming job configured to receive and process the mutation request. Entries to the data table specify at least records indicating changes to objects in the environment based on ingestion processing for the environment for collecting unformatted raw data and based on the mutation request. A corresponding entry is written to the notification table in response to a successful write attempt to the data table. The notification table entry has information about data table entries for a specified period. At least one data consumer is notified that the data table has been modified.
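
The flow in this abstract can be pictured with a small in-memory sketch. It is only an illustration under stated assumptions: the `DataLake` class, the `apply_mutation` method, the field names, and the rule for rejecting a write are all hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class DataLake:
    """Toy in-memory stand-in for the data table and notification table."""
    data_table: List[Dict] = field(default_factory=list)
    notification_table: List[Dict] = field(default_factory=list)
    consumers: List[Callable[[Dict], None]] = field(default_factory=list)

    def apply_mutation(self, request: Dict, period: str) -> bool:
        """Record a mutation, then publish a notification for its period."""
        # Assumption: a request missing either field counts as a failed write.
        if "object_id" not in request or "change" not in request:
            return False  # no data-table entry, so no notification entry
        self.data_table.append({"period": period, **request})
        # The notification entry is written only after the data write succeeded.
        # It lists the data-table rows that belong to the given period.
        entry_ids = [i for i, row in enumerate(self.data_table)
                     if row["period"] == period]
        notification = {"period": period, "data_entries": entry_ids}
        self.notification_table.append(notification)
        for notify in self.consumers:
            notify(notification)
        return True


lake = DataLake()
seen: List[Dict] = []
lake.consumers.append(seen.append)
lake.apply_mutation({"object_id": "acct-1", "change": "delete"}, period="2020-07-30T10")
lake.apply_mutation({"change": "delete"}, period="2020-07-30T10")  # rejected
```

In this sketch a consumer never hears about a mutation whose data-table write failed, which is the ordering the abstract describes.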

    TECHNIQUES AND ARCHITECTURES FOR MANAGING CASCADING MUTATIONS IN AN ENVIRONMENT HAVING A DATA LAKE

    Publication number: US20220035829A1

    Publication date: 2022-02-03

    Application number: US16943314

    Application date: 2020-07-30

    Abstract: Managing mutations in a data lake environment. A mutation request to cause write operations that modify data objects or structures within an environment for collecting unformatted raw data is received. The environment has at least a data table and a notification table. An entry is written to the data table with a streaming job configured to receive and process the mutation request. Entries to the data table specify at least records indicating changes to objects in the environment based on ingestion processing for the environment for collecting unformatted raw data and based on the mutation request. A corresponding entry is written to the notification table in response to a successful write attempt to the data table. The notification table entry has information about data table entries for a specified period. At least one data consumer is notified that the data table has been modified.

    Techniques and Architectures for Providing Atomic Transactions Across Multiple Data Sources

    Publication number: US20220012259A1

    Publication date: 2022-01-13

    Application number: US16922989

    Application date: 2020-07-07

    Abstract: Techniques and mechanisms for ingesting data through an atomic transaction are disclosed. Raw data is received from multiple disparate sources to be consumed in an environment that does not support atomic write operations to data consumers. The environment has at least a data table and a notification table. A write to an entry in the data table having an associated version is attempted. The data table entry corresponds to the data to be consumed. A write to a corresponding entry to the notification table is attempted in response to a successful write attempt to the data table. The notification table entry includes information about the corresponding data table entry. The version associated with the data table is modified in response to successful writes of both the data table entry and the notification table entry. At least one data consumer is notified that the data table version has been modified.
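
One way to read this abstract is as a version-gated commit: rows become visible to consumers only after both the data write and the notification write succeed. The sketch below illustrates that reading. The `VersionedTable` class, its methods, and the `fail_notification` flag used to simulate a failure are hypothetical names, not the patented implementation.

```python
from typing import Callable, Dict, List, Optional


class VersionedTable:
    """Toy store where the version number acts as the commit point."""

    def __init__(self) -> None:
        self.version = 0
        self.data: Dict[int, List[dict]] = {}        # version -> rows
        self.notifications: Dict[int, dict] = {}     # version -> notification entry
        self.consumers: List[Callable[[int], None]] = []

    def ingest(self, rows: List[dict], fail_notification: bool = False) -> Optional[int]:
        next_version = self.version + 1
        self.data[next_version] = list(rows)          # step 1: data-table write
        if fail_notification:
            # Simulated failure of step 2: drop the staged rows, keep the version.
            del self.data[next_version]
            return None
        self.notifications[next_version] = {          # step 2: notification write
            "version": next_version,
            "row_count": len(rows),
        }
        self.version = next_version                   # step 3: both writes succeeded
        for notify in self.consumers:
            notify(next_version)
        return next_version

    def read(self) -> List[dict]:
        """Return only rows at or below the committed version."""
        return [row for v in sorted(self.data) if v <= self.version
                for row in self.data[v]]


table = VersionedTable()
notified: List[int] = []
table.consumers.append(notified.append)
table.ingest([{"id": 1}])
table.ingest([{"id": 2}], fail_notification=True)  # leaves version at 1
```

Because readers filter on the committed version, a half-finished ingest is never observable, even though the underlying store has no atomic multi-table write.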

    TECHNIQUES AND ARCHITECTURES FOR SHARING REMOTE RESOURCES AMONG A TRUSTED GROUP OF USERS

    Publication number: US20220345458A1

    Publication date: 2022-10-27

    Application number: US17238180

    Application date: 2021-04-22

    Abstract: Various techniques and mechanisms for sharing remote resources among a trusted group are disclosed. A credential management agent utilizes a resource credential for a first user to access a secure resource corresponding to the first user for a second user by at least validating a second user and validating a consent of the first user to allow the second user to access the secure resource using the resource credential for the first user. The secure resource resides on a remote server system accessible via one or more application program interfaces (APIs). A platform management agent provides an interface for shared resource-agnostic credential sharing. The platform management agent validates credentials for the second user as belonging to a trusted group and forwards a request for access to the secure resource for the second user to the credential management agent. In response to receiving the resource credential for the first user to access the secure resource, the platform management agent accesses the secure resource on behalf of the second user to allow the second user to utilize the secure resource.
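
The two-agent check described here (trusted-group membership first, then the owner's consent) can be sketched as follows. All class, method, and parameter names, including the `call_api` callback that stands in for the remote API, are assumptions made for illustration only.

```python
from typing import Callable, Dict, Set, Tuple


class CredentialAgent:
    """Hypothetical credential management agent: holds credentials and consents."""

    def __init__(self) -> None:
        self._credentials: Dict[Tuple[str, str], str] = {}   # (owner, resource)
        self._consents: Set[Tuple[str, str, str]] = set()    # (owner, grantee, resource)

    def store(self, owner: str, resource: str, credential: str) -> None:
        self._credentials[(owner, resource)] = credential

    def consent(self, owner: str, grantee: str, resource: str) -> None:
        self._consents.add((owner, grantee, resource))

    def credential_for(self, owner: str, grantee: str, resource: str) -> str:
        if (owner, grantee, resource) not in self._consents:
            raise PermissionError("owner has not consented")
        return self._credentials[(owner, resource)]


class PlatformAgent:
    """Hypothetical platform management agent: checks group membership first."""

    def __init__(self, agent: CredentialAgent, trusted_group: Set[str],
                 call_api: Callable[[str, str], str]) -> None:
        self.agent = agent
        self.trusted_group = trusted_group
        self.call_api = call_api

    def access(self, owner: str, grantee: str, resource: str) -> str:
        if grantee not in self.trusted_group:
            raise PermissionError("requester is not in the trusted group")
        credential = self.agent.credential_for(owner, grantee, resource)
        # The platform uses the owner's credential on the grantee's behalf.
        return self.call_api(resource, credential)


agent = CredentialAgent()
agent.store("alice", "calendar", "token-123")
agent.consent("alice", "bob", "calendar")
platform = PlatformAgent(agent, {"alice", "bob", "carol"},
                         call_api=lambda resource, cred: f"{resource}:ok")
result = platform.access("alice", "bob", "calendar")
```

In this sketch the second user never receives the credential itself; only the platform agent handles it.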

    Techniques and Architectures for Providing an Isolated Runtime Context in a Shared Environment

    Publication number: US20220171696A1

    Publication date: 2022-06-02

    Application number: US17108875

    Application date: 2020-12-01

    Abstract: Mechanisms and techniques for providing an isolated runtime environment are disclosed. Scoped properties are generated utilizing a namespace identifier. The namespace identifier is an identifier used to define a scope of at least an isolated application runtime context. Scoped objects are generated corresponding to one or more dependent services utilizing the one or more scoped properties and the namespace identifier. A target application is deployed to a container corresponding to the isolated application runtime context. The target application interacts with at least some of the dependent services utilizing the namespace identifier. At least one integration test is run on the target application in the container. The integration tests refer to the isolated application runtime context utilizing the namespace identifier.
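
A minimal sketch of namespace scoping, assuming the scoped property is formed by prefixing a resource name with the namespace identifier. The `scope_properties` function and the prefixing scheme are hypothetical; the abstract does not say how scoped properties are derived.

```python
from typing import Dict


def scope_properties(namespace: str, properties: Dict[str, str]) -> Dict[str, str]:
    """Prefix each resource name with the namespace identifier."""
    return {key: f"{namespace}-{value}" for key, value in properties.items()}


# Two test runs share the same base configuration but get disjoint resources.
base = {"queue": "orders", "database": "inventory"}
run_a = scope_properties("test-run-a", base)
run_b = scope_properties("test-run-b", base)
```

With names scoped this way, two integration test runs in the same shared environment would address different queues and databases and so would not interfere with each other.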

    Monitoring and verification system for end-to-end distribution of messages

    Publication number: US11488082B2

    Publication date: 2022-11-01

    Application number: US16366922

    Application date: 2019-03-27

    Abstract: Described is a method and system for verifying the end-to-end distribution of messages within an on-demand services platform. To provide the ability to monitor and trace such messages, the system may include a specialized and queryable datastore (or database) that allows the system to track the distribution and acknowledgement of messages throughout the services platform. Accordingly, a specialized verification process may be initiated to query the database to verify that the receipt of a message has been acknowledged by various components within the system. In addition, the verification process may automatically retry the distribution of the messages to ensure particular downstream components have received the message. Accordingly, the system alleviates the need to manually re-crawl various message sources to ensure the end-to-end distribution of such messages.
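
The verification pass can be sketched as a query over recorded acknowledgements followed by a single retry for each gap. The `verify_and_retry` function, the shape of the `acks` mapping, and the `resend` callback are assumptions for illustration, with a plain dictionary standing in for the queryable datastore.

```python
from typing import Callable, Dict, List, Set


def verify_and_retry(
    acks: Dict[str, Set[str]],
    components: List[str],
    resend: Callable[[str, str], bool],
) -> Dict[str, List[str]]:
    """Return, per message, the components still unacknowledged after one retry."""
    still_missing: Dict[str, List[str]] = {}
    for message_id, acked in acks.items():
        for component in components:
            if component in acked:
                continue  # this component already acknowledged the message
            if resend(message_id, component):
                acked.add(component)  # record the acknowledgement from the retry
            else:
                still_missing.setdefault(message_id, []).append(component)
    return still_missing


# Message m2 was never acknowledged by the archiver, so it is redistributed.
acks = {"m1": {"indexer", "archiver"}, "m2": {"indexer"}}
resent: List[str] = []


def resend(message_id: str, component: str) -> bool:
    resent.append(f"{message_id}->{component}")
    return True  # assume the retry is acknowledged


missing = verify_and_retry(acks, ["indexer", "archiver"], resend)
```

Only the unacknowledged pair is retried, which is what removes the need to re-crawl every message source.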

    PERFORMING CUSTOMIZED DATA COMPACTION FOR EFFICIENT PARALLEL DATA PROCESSING AMONGST A SET OF COMPUTING RESOURCES

    Publication number: US20200250007A1

    Publication date: 2020-08-06

    Application number: US16264399

    Application date: 2019-01-31

    Abstract: Described is a system and method for compacting data into customized (e.g. optimal) file sizes for processing by computing resources. The mechanism may leverage various computing resources such as a cluster computing framework combined with a stream processing platform to efficiently process the activity data. For example, activity data of an organization may be processed by a set of jobs (or sub-jobs) as part of a data stream by a set of distributed computing resources. In order to efficiently process such data, the mechanism may compact the data into customized (e.g. optimal) file sizes. For example, the customized file sizes may provide an optimal (or near optimal) amount of data to be processed by each job, for example, to improve performance.
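
As a rough illustration of compacting toward a target file size, the sketch below groups small files in arrival order until adding the next one would exceed the target. This greedy grouping is a stand-in chosen for simplicity, not the method claimed in the patent, and `plan_compaction` and its parameters are hypothetical.

```python
from typing import List, Tuple


def plan_compaction(files: List[Tuple[str, int]], target_bytes: int) -> List[List[str]]:
    """Group files in order so each group stays within target_bytes when possible."""
    groups: List[List[str]] = []
    current: List[str] = []
    current_size = 0
    for name, size in files:
        # Close the current group if this file would push it past the target.
        if current and current_size + size > target_bytes:
            groups.append(current)
            current, current_size = [], 0
        current.append(name)
        current_size += size
    if current:
        groups.append(current)
    return groups


# A file larger than the target ("d") ends up in a group of its own.
plan = plan_compaction(
    [("a", 40), ("b", 50), ("c", 30), ("d", 120), ("e", 10)],
    target_bytes=100,
)
```

Each resulting group would be written out as one compacted file, so that every downstream job receives a similar amount of data.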
