TECHNIQUES AND ARCHITECTURES FOR MANAGING CASCADING MUTATIONS IN AN ENVIRONMENT HAVING A DATA LAKE

    公开(公告)号:US20220035829A1

    公开(公告)日:2022-02-03

    申请号:US16943314

    申请日:2020-07-30

    Abstract: Managing mutations in a data lake environment. A mutation request to cause write operations that modify data objects or structures within an environment for collecting unformatted raw data is received. The environment has at least a data table and a notification table. An entry is written to the data table with a streaming job configured to receive and process the mutation request. Entries to the data table specify at least records indicating changes to objects in the environment based on ingestion processing for the environment for collecting unformatted raw data and based on the mutation request. A corresponding entry is written to the notification table in response to a successful write attempt to the data table. The notification table entry has information about data table entries for a specified period. At least one data consumer is notified that the data table has been modified.

Patent Agency Ranking