Simplified schema generation for data ingestion

    公开(公告)号:US12124480B2

    公开(公告)日:2024-10-22

    申请号:US18060136

    申请日:2022-11-30

    CPC classification number: G06F16/285 G06F16/211

    Abstract: A dataset is received from a data source. A first plurality of string similarities between metadata of the dataset with a plurality of attributes of a plurality of data classes in a target schema are calculated to determine a data class. A set of relationships are assigned to the data class based on relationships between the plurality of data classes in the target schema. A second plurality of string similarities between a plurality of attributes of the dataset and a plurality of attributes of the data class are calculated. Datatypes and measurement units are assigned to the plurality of attributes of the dataset according to the second plurality of string similarities. A source schema is generated based on the data class, the set of relationships, the plurality of attributes of the data class and the measurement units.

    DIGITAL SEISMIC FILE INGESTION
    2.
    发明公开

    公开(公告)号:US20240230938A1

    公开(公告)日:2024-07-11

    申请号:US18537958

    申请日:2023-12-13

    CPC classification number: G01V1/301 G01V2210/512 G01V2210/514

    Abstract: A method includes obtaining a digital seismic file, obtaining a digital seismic file, and performing autodetection of parameters of the digital seismic file. The method further includes extracting seismic data from the digital seismic file according to the parameters to generate normalized seismic data. The method further includes scanning the normalized seismic data to obtain metadata that includes geographic file boundaries and mapping the normalized seismic data to a parent virtual survey based at least in part on the geographic file boundaries being in a geographic region of a parent virtual survey. The method additionally includes storing, in a target store, the normalized seismic data and metadata, the normalized seismic data in a stored relationship with the parent virtual survey in the target store.

    DIGITAL SEISMIC FILE SCANNER
    3.
    发明公开

    公开(公告)号:US20240184009A1

    公开(公告)日:2024-06-06

    申请号:US18556432

    申请日:2022-04-20

    CPC classification number: G01V1/362 G01V2210/512 G01V2210/514

    Abstract: A method includes obtaining a digital seismic file, performing autodetection of parameters of the digital seismic file, and registering the parameters of the digital seismic file with the digital seismic file. Performing autodetection comprises a computer processor, repetitively until a candidate template successfully extracts the parameters, selecting a target candidate template, attempting extraction of a binary header using the target candidate template, attempting extraction of a trace header using the target candidate template, attempting extraction of the plurality of parameters when the target candidate template extracts the binary header and the trace header, and moving to a next target candidate template when extraction of the plurality of headers is unsuccessful.

    Digital seismic file scanner
    4.
    发明授权

    公开(公告)号:US12099154B2

    公开(公告)日:2024-09-24

    申请号:US18556432

    申请日:2022-04-20

    CPC classification number: G01V1/362 G01V2210/512 G01V2210/514

    Abstract: A method includes obtaining a digital seismic file, performing autodetection of parameters of the digital seismic file, and registering the parameters of the digital seismic file with the digital seismic file. Performing autodetection comprises a computer processor, repetitively until a candidate template successfully extracts the parameters, selecting a target candidate template, attempting extraction of a binary header using the target candidate template, attempting extraction of a trace header using the target candidate template, attempting extraction of the plurality of parameters when the target candidate template extracts the binary header and the trace header, and moving to a next target candidate template when extraction of the plurality of headers is unsuccessful.

    SIMPLIFIED SCHEMA GENERATION FOR DATA INGESTION

    公开(公告)号:US20240176803A1

    公开(公告)日:2024-05-30

    申请号:US18060136

    申请日:2022-11-30

    CPC classification number: G06F16/285 G06F16/211

    Abstract: A dataset is received from a data source. A first plurality of string similarities between metadata of the dataset with a plurality of attributes of a plurality of data classes in a target schema are calculated to determine a data class. A set of relationships are assigned to the data class based on relationships between the plurality of data classes in the target schema. A second plurality of string similarities between a plurality of attributes of the dataset and a plurality of attributes of the data class are calculated. Datatypes and measurement units are assigned to the plurality of attributes of the dataset according to the second plurality of string similarities. A source schema is generated based on the data class, the set of relationships, the plurality of attributes of the data class and the measurement units.

Patent Agency Ranking