SECONDARY DATA STRUCTURES FOR STORAGE CLASS MEMORY (SCM) ENABLES MAIN-MEMORY DATABASES

    公开(公告)号:US20170109295A1

    公开(公告)日:2017-04-20

    申请号:US14887578

    申请日:2015-10-20

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for accessing and managing storage class memory (SCM) enabled main-memory database structures. An embodiment operates by traversing a first node to find a location of a second node corresponding to a search key, calculating a hash value for the search key, comparing the calculated hash value with at least one fingerprint value stored in the second node, wherein the fingerprint value is determined by hashing a stored key, accessing at least one key-value pair having a matching hash value, and returning a value associated with the matching key-value pair, wherein at least one of the traversing, calculating, comparing, accessing, and returning are performed by one or more computers.

    Constraint data statistics
    4.
    发明授权

    公开(公告)号:US11468035B2

    公开(公告)日:2022-10-11

    申请号:US15796326

    申请日:2017-10-27

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for using a data statistic as a dynamic data integrity constraint. An embodiment operates by defining a data statistic for a column or a set of columns of a partition of a plurality of partitions of a database table. The embodiment creates a constraint data statistics object based on the data statistic. The embodiment receives a query for the database table. The embodiment determines the constraint data statistics object is consistent with a data state of the partition. The embodiment derives an implied constraint based on the constraint data statistics object. The embodiment processes the query for the partition based on the implied constraint.

    Constructing join histograms from histograms with Q-error guarantees

    公开(公告)号:US10565286B2

    公开(公告)日:2020-02-18

    申请号:US15392107

    申请日:2016-12-28

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for generating join histograms. An embodiment operates by a computer implemented method that includes determining, by at least one processor, a first interval associated with a first histogram of a first data structure and a first estimate frequency associated with the first interval. The method further includes determining, by the at least one processor, a second interval associated with a second histogram of a second data structure and a second estimate frequency associated with the second interval. The method further includes determining, by the at least one processor, a join interval based on the first and second intervals by calculating an intersection of the first and second intervals. The method further includes calculating, by the at least one processor, a join estimate frequency based on the first and second estimate frequencies.

    Secondary data structures for storage class memory (scm) enables main-memory databases

    公开(公告)号:US10019382B2

    公开(公告)日:2018-07-10

    申请号:US14887578

    申请日:2015-10-20

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for accessing and managing storage class memory (SCM) enabled main-memory database structures. An embodiment operates by traversing a first node to find a location of a second node corresponding to a search key, calculating a hash value for the search key, comparing the calculated hash value with at least one fingerprint value stored in the second node, wherein the fingerprint value is determined by hashing a stored key, accessing at least one key-value pair having a matching hash value, and returning a value associated with the matching key-value pair, wherein at least one of the traversing, calculating, comparing, accessing, and returning are performed by one or more computers.

    DISTRIBUTED RESOURCE-AWARE TASK SCHEDULING WITH REPLICATED DATA PLACEMENT IN PARALLEL DATABASE CLUSTERS
    7.
    发明申请
    DISTRIBUTED RESOURCE-AWARE TASK SCHEDULING WITH REPLICATED DATA PLACEMENT IN PARALLEL DATABASE CLUSTERS 审中-公开
    分布式资源 - 特征任务调度与并行数据库中的替换数据放置

    公开(公告)号:US20170039239A1

    公开(公告)日:2017-02-09

    申请号:US14816681

    申请日:2015-08-03

    Applicant: SAP SE

    Abstract: A method and system, the system including a plurality of machines each having a processor and a main memory component; a shared distributed storage facility storing a set of data and accessible by the plurality of machines over a communication network; a controller to select, in response to a state of a query execution plan comprising a plurality of executable jobs for the set of data, which one of a set of scheduling algorithms to execute; an execution engine to execute the selected scheduling algorithm to determine, for each job in the plurality of jobs, which server to schedule to execute the respective job; and providing an indication of the scheduling of the servers determined to be schedules for the execution of the jobs.

    Abstract translation: 一种方法和系统,所述系统包括多个机器,每个机器具有处理器和主存储器部件; 共享分布式存储设备,其存储一组数据,并且可由所述多个机器通过通信网络访问; 控制器,响应于包括用于所述数据集合的多个可执行作业的查询执行计划的状态,以执行一组调度算法中的哪一个; 执行引擎,用于执行所选择的调度算法,以针对所述多个作业中的每个作业来确定哪个服务器计划执行相应的作业; 以及提供被确定为用于执行作业的日程表的服务器的调度的指示。

    Constraint Data Statistics
    8.
    发明申请

    公开(公告)号:US20220222244A1

    公开(公告)日:2022-07-14

    申请号:US17711608

    申请日:2022-04-01

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for using a data statistic as a dynamic data integrity constraint. An embodiment operates by defining a data statistic for a column or a set of columns of a partition of a plurality of partitions of a database table. The embodiment creates a constraint data statistics object based on the data statistic. The embodiment receives a query for the database table. The embodiment determines the constraint data statistics object is consistent with a data state of the partition. The embodiment derives an implied constraint based on the constraint data statistics object. The embodiment processes the query for the partition based on the implied constraint.

    Optimization of continuous queries in hybrid database and stream processing systems

    公开(公告)号:US10216793B2

    公开(公告)日:2019-02-26

    申请号:US14931005

    申请日:2015-11-03

    Applicant: SAP SE

    Abstract: The present disclosure involves systems, software, and computer implemented methods for optimizing continuous queries for hybrid execution over a stream processing engine and an in-memory database. In one example, a method may include identifying a continuous query executed at a stream processing engine, the continuous query including a plurality of operators. An optimal plan for execution of the identified continuous query at the stream processing engine is determined. For each of the plurality of operators in the determined optimal plan, an optimized placement decision for executing a particular operator in the stream processing engine or at a database system is determined. An optimized continuous query is generated from the identified continuous query based on the determined optimal placement decisions for each of the plurality of operators in the determined optimal plan. The optimized continuous query is then executed at the stream processing engine and the database system.

    OPTIMIZATION OF CONTINUOUS QUERIES IN HYBRID DATABASE AND STREAM PROCESSING SYSTEMS

    公开(公告)号:US20170124151A1

    公开(公告)日:2017-05-04

    申请号:US14931005

    申请日:2015-11-03

    Applicant: SAP SE

    CPC classification number: G06F17/30463 G06F17/30516

    Abstract: The present disclosure involves systems, software, and computer implemented methods for optimizing continuous queries for hybrid execution over a stream processing engine and an in-memory database. In one example, a method may include identifying a continuous query executed at a stream processing engine, the continuous query including a plurality of operators. An optimal plan for execution of the identified continuous query at the stream processing engine is determined. For each of the plurality of operators in the determined optimal plan, an optimized placement decision for executing a particular operator in the stream processing engine or at a database system is determined. An optimized continuous query is generated from the identified continuous query based on the determined optimal placement decisions for each of the plurality of operators in the determined optimal plan. The optimized continuous query is then executed at the stream processing engine and the database system.

Patent Agency Ranking