Abstract:
Metadata can be provided to multiple processing units of a database system by using local storages respectively provided for the processing units, such that a local storage is accessible only to its respective processing unit. As a result, processing units can access metadata when needed (e.g., when needed to process a database request at runtime) without having to access a source external to the database system. In addition, metadata (e.g., an XML object, XML schema, XSLT stylesheets, XQuery modules) can be provided using a database request or command, for example, by using a register statement.
Abstract:
Various embodiments herein each include at least one of systems, methods, and software for generating, storing, and using non-unique secondary indexes of semi-structured data in database tables. One method embodiment includes creating and storing a non-unique secondary index (NUSI) for a database table based on column data stored in rows of the database table. The NUSI in some such embodiments identifying an offset to specific data included in the column data of at least some rows of the database table where the specific data is located. Other embodiments include building a NUSI index that includes values embedded therein.
Abstract:
Streaming data is populated to an in-memory data table and a continuous query is executed against an in-memory data table using a database interface to perform analytical operations on the populated in-memory data table. Results from the analytical operations performed are streamed to consuming applications.
Abstract:
Various embodiments herein each include at least one of systems, methods, and software for generating, storing, and using non-unique secondary indexes of semi-structured data in database tables. One method embodiment includes creating and storing a non-unique secondary index (NUSI) for a database table based on column data stored in rows of the database table. The NUSI in some such embodiments identifying an offset to specific data included in the column data of at least some rows of the database table where the specific data is located. Other embodiments include building a NUSI index that includes values embedded therein.
Abstract:
Metadata can be provided to multiple processing units of a database system by using local storages respectively provided for the processing units, such that a local storage is accessible only to its respective processing unit. As a result, processing units can access metadata when needed (e.g., when needed to process a database request at runtime) without having to access a source external to the database system. In addition, metadata (e.g., an XML object, XML schema, XSLT stylesheets, XQuery modules) can be provided using a database request or command, for example, by using a register statement.
Abstract:
A system and method for storing time series data in a distributed relational database. A SQL CREATE TABLE statement including a PRIMARY INDEX CLAUSE creates a primary index and associated time series table which organizes time series data records by time, while simultaneously storing the time series data records in an easily queryable time sequential manner
Abstract:
Router threads read real-time stream data as that data is received at processing nodes along a processing path for the data. The data is initially parsed into workloads. Each router thread maintains real-time analytics for the workloads and reports deviations in the analytics to a workload coordinator. The workload coordinator dynamically and in real time provides updated data distribution maps based on the reported deviations, and the router threads use the data distribution maps to determine a next processing unit to forward the workloads, where the next processing unit includes an instance of a next processing node in the processing path for the workload. The next processing node performs additional processing on the workloads along the processing path.
Abstract:
Improved techniques for performing Spatial Joins multi-processing computing systems and environments are disclosed. One or more intersection of bounds (or limits) of data sets is determined as a join bounding space. The join bounding space is in a space (Global space or Global universe) where a spatial join between (or for) the data can be performed. The determined join bounding space can be partitioned into sub-partitions of the join bounding space. The sub-partitions of the join bounding space can assigned respectively to multiple processing unit for processing in parallel in. In addition, distribution cost information associated with the cost of distribution of the datasets (and/or their components) to the processing units of a multi-processing system can be provided and/or used to effectively distribute and/or redistribute processing of the Spatial Join between the processing units of a multi-processing system.
Abstract:
A data store system may include a storage device configured to store a plurality of data store tables. The data store may further include a plurality of processing units. At least one processing unit from the plurality of processing units may receive an analytic function call. The at least one processing unit may further identify, in the analytic function call, at least one column of a data store table on which to execute an analytic function in the analytic function call and may further identify, in the analytic function call, an identifier column of the data store table. Each row of the at least one column may be associated with a common row value of the identifier column. The at least one processing unit may further identify, in the analytic function call, at least one index column of the data store table. Each value in each at the least one index column may identify an index value on which to index each value of the at least one column with respect to each value of the identifier column. The at least one processing unit may further order values of the at least one column in accordance with the identifier column and the at least one index column, execute the analytic function on the ordered values to generate a result set, and order the result set in accordance with the identifier column and the at least one index column. A computer-readable medium and method are also disclosed.
Abstract:
Router threads read real-time stream data as that data is received at processing nodes along a processing path for the data. The data is initially parsed into workloads. Each router thread maintains real-time analytics for the workloads and reports deviations in the analytics to a workload coordinator. The workload coordinator dynamically and in real time provides updated data distribution maps based on the reported deviations, and the router threads use the data distribution maps to determine a next processing unit to forward the workloads, where the next processing unit includes an instance of a next processing node in the processing path for the workload. The next processing node performs additional processing on the workloads along the processing path.