Managed Tables for Data Lakes
    3.
    发明申请

    公开(公告)号:US20250077477A1

    公开(公告)日:2025-03-06

    申请号:US18389331

    申请日:2023-11-14

    Applicant: Google LLC

    Abstract: Aspects of the disclosure are directed to merging data lake openness with scalable metadata for managed tables in a cloud database platform, allowing for atomicity, consistency, isolation, and durability (ACID) transactions, performant data manipulation language (DML), higher throughput stream ingestion, data consistency, schema evolution, time travel, clustering, fine-grained security, and/or automatic storage optimization. Table data is stored in various open-source file formats in cloud storage while physical metadata of the table data is stored in a scalable metadata storage system.

    Scalable Dataset Sharing With Linked Datasets

    公开(公告)号:US20240193295A1

    公开(公告)日:2024-06-13

    申请号:US18080178

    申请日:2022-12-13

    Applicant: Google LLC

    CPC classification number: G06F21/6227 G06F16/2471

    Abstract: Aspects of the disclosure relate to managing access to published data by different groups of users through linked datasets. A subscriber system generates a linked dataset that links the subscriber system to a source dataset published by a publisher system. The subscriber system queries the linked dataset. Queries to the linked dataset are redirected to the source dataset. The subscriber system manages access control to the linked dataset instead of the publisher system managing access control for the subscriber system directly to the source dataset. The source dataset does not need to be copied to the subscriber system. From the perspective of the subscriber system, changes to the source dataset appear instantly, as subscribers may query the source dataset through the linked dataset without waiting for copies of the source dataset to propagate.

Patent Agency Ranking