摘要:
Several methods and a system of a replicated service for write ahead logs are disclosed. In one embodiment, a method includes persisting a state of a distributed system through a write ahead log (WAL) interface. The method also includes maintaining a set of replicas of a WAL through a consensus protocol. In addition, the method includes providing a set of mechanisms for at least one of detection and a recovery from a hardware failure. The method further includes recovering a persistent state of a set of applications. In addition, the method includes maintaining the persistent state across a set of nodes through the hardware failover. In one embodiment, the system may include a WAL interface to persist a state of a distributed system. The system may also include a WAL replication servlet to maintain and/or recover a set of replicas of a WAL.
摘要:
Techniques for finding a column with column partitioning are provided. Metadata for a container row is expanded to include information for searching ranges of partitioned column values. The metadata identifies offsets to specific ranges and specific columns within a specific range. The offsets also identify where compressed data for a desired column resides. Thereby, permitting partitioned columns having compressed data to be located without being decompressed and decompressed on demand as needed.
摘要:
A computer running a database system receives one or more queries, each query comprised of parallel threads of execution working towards the common goal of completing a user request. These threads are grouped into a schedulable object called a task group. The task groups are placed within a specific multiple tier hierarchy, and database system resources allocated to the task groups according to their placement within the hierarchy. Beginning with the top tier of the hierarchy, resources remaining after allocations to each task group within a tier are passed to the next lower tier for allocation.
摘要:
A disk-error detection system may include a storage array having a plurality of storage disks configured to store data. The disk-error detection system may include a processor in communication with the storage disks. The processor may detect presence of a media error in a storage disk from the plurality of storage disks during a read operation of the data stored on the storage disk. The processor may perform a media-error detection scan of a predetermined portion of the storage disk in response to the detection of the media error. The location on the storage disk of the predetermined portion of the storage disk may be based on location of the media error on the storage disk. A method and computer-readable medium may also implement disk-error detection.
摘要:
Statistics for a Relational Database Management System (RDBMS) set of tables are normalized and a grid computed. A particular grid cell having above threshold number of entries from the tables is determined and an R-Tree is dynamically constructed for that grid cell for use when performing a geospatial join operation against the tables.
摘要:
Techniques for data integration are provided. Source attributes for source data are interactively mapped to target attributes for target data. Rules define how records from the source data are merged, selected, and for duplication detection. The mappings and rules are recorded as a profile for the source data and processed against the source data to transform the source attributes to the target attributes.
摘要:
Techniques for data modeling are provided. Enterprise data is organized into reference data for entities that an enterprise wants to track and monitor. Relationship data is created that establishes relationships among the various entities within the enterprise data. The reference data and the relationship data are published within an enterprise data warehouse for accessing the enterprise data.
摘要:
Techniques for accessing a parallel database system via an external program using vertical and/or horizontal partitioning are provided. An external program to a database management system (DBMS) configures external mappers to process a specific portion of query results on specific access module processors of the DBMS that are to house query results. The query is submitted by the external program to the DBMS and the DBMS is directed to organize the query results in a vertical or horizontal manner. Each external mapper accesses its portion of the query results for processing in parallel on its designated AMP or set of AMPS to process the query results.
摘要:
An apparatus, method and computer program product for query optimization in a Relational Database Management System (RDBMS), wherein an optimizer accesses a query expression repository (QER) storing planning and execution information for QEs from previous queries, wherein the QEs comprise table relations, intermediate results and/or final results of operations in the previous queries. Additionally, dynamic join indexes representing QE results are created for high-value QEs selected from the QER and maintained within a DJI repository. During query plan creation for a current or subsequent query, the optimizer searches the QER and DJI repository for DJIs created for high-value QEs corresponding to QEs contained in the current or subsequent query. DJIs corresponding to the matching QEs are used in the query planning phase to rewrite the current or subsequent user query so that stored QE results are used to answer QEs contained in the current or subsequent query.
摘要:
A system may include a storage device. The storage device may store a plurality of user-defined functions (“UDFs”). Each of the plurality of UDFs may be containerized to allow each UDF to be executed using content unshared with other UDFs. The storage device may also include a plurality of data objects. The system may further include a plurality of processing nodes. At least one processing node may receive a call to execute one of the plurality of UDFs on at least one of the plurality of data objects. The at least one processing node may execute the called UDF on the at least one of the plurality of data objects. A method and computer-readable medium are also disclosed.