JOINING DATA ACROSS A PARALLEL DATABASE AND A DISTRIBUTED PROCESSING SYSTEM
    1.
    Invention application
    JOINING DATA ACROSS A PARALLEL DATABASE AND A DISTRIBUTED PROCESSING SYSTEM (status: in force)

    Publication number: US20160103877A1

    Publication date: 2016-04-14

    Application number: US14511345

    Filing date: 2014-10-10

    CPC classification number: G06F17/30445 G06F17/30545

    Abstract: Embodiments relate to joining data across a parallel database and a distributed processing system. Aspects include receiving a query on data T stored in a parallel database and data L stored in a distributed processing system, applying local query predicates and projection to T to create T′, and applying local query predicates and projection to L to create L′. Based on determining that the size of L′ is less than the size of T′ and that the size of L′ is less than a first threshold, L′ is transmitted to the parallel database and a join between T′ and L′ is executed. Based on determining that the number of nodes n of the distributed processing system multiplied by the size of T′ is less than the size of L′ and that the size of T′ is less than a second threshold, T′ is transmitted to the distributed processing system and a join between T′ and L′ is executed.
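
The two size tests in the abstract can be sketched as a small decision function. This is only an illustrative sketch under stated assumptions, not the patented implementation: the names `JoinSite`, `choose_join_site`, and all parameter names are hypothetical, sizes and thresholds are assumed to be in the same unit, and the `UNDECIDED` result stands in for whatever the full specification does when neither condition holds, which the abstract does not describe.

```python
from enum import Enum


class JoinSite(Enum):
    """Where the join runs (hypothetical labels, not from the patent)."""
    PARALLEL_DATABASE = "transmit L' to the parallel database"
    DISTRIBUTED_SYSTEM = "transmit T' to the distributed processing system"
    UNDECIDED = "case not covered by the abstract"


def choose_join_site(size_t, size_l, n_nodes, first_threshold, second_threshold):
    """Pick a join site from the sizes of the filtered inputs T' and L'.

    size_t / size_l are the sizes of T' and L' after local predicates
    and projection have been applied; n_nodes is the number of nodes n
    of the distributed processing system.
    """
    # Condition 1: L' is smaller than T' and below the first threshold,
    # so L' is transmitted to the parallel database.
    if size_l < size_t and size_l < first_threshold:
        return JoinSite.PARALLEL_DATABASE

    # Condition 2: n * |T'| is smaller than |L'| and T' is below the
    # second threshold, so T' is transmitted to the distributed system.
    if n_nodes * size_t < size_l and size_t < second_threshold:
        return JoinSite.DISTRIBUTED_SYSTEM

    # Assumption: the abstract gives no rule for the remaining cases.
    return JoinSite.UNDECIDED
```

For n of at least 1 and positive sizes the two conditions cannot both hold, since the first requires L′ to be smaller than T′ and the second requires L′ to be larger than n times T′, so the order of the checks does not affect the result.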


    Joining data across a parallel database and a distributed processing system

    Publication number: US09767149B2

    Publication date: 2017-09-19

    Application number: US14511345

    Filing date: 2014-10-10

    CPC classification number: G06F17/30445 G06F17/30545

