摘要:
Implementations of the present disclosure include methods for providing transparency in streaming event data. In some implementations, methods include receiving a plurality of events, each event comprising event data and being generated by an event source in response to a real-world activity, processing the plurality of events using one or more complex event processing (CEP) rules to generate a complex event, in response to generating the complex event, generating at least one lineage event that comprises lineage information, the lineage information comprising information corresponding to one or more source events, each of the one or more source events contributing to the complex event, and storing the lineage event and the one or more source events in an event archive provided as a computer-readable storage medium.
摘要:
Implementations of the present disclosure include methods for managing data quality for event data. In some implementations, methods include receiving, at one or more computing devices, a plurality of events, each event comprising event data and being generated by an event source in response to a real-world activity, processing, using the one or more computing devices, the plurality of events using one or more complex event processing (CEP) rules to generate a complex event, in response to generating the complex event, annotating the complex event with quality information corresponding to each of the plurality of events to provide a quality-enriched event, and publishing the quality-enriched event.
摘要:
Implementations of the present disclosure include methods for managing data quality for event data. In some implementations, methods include receiving, at one or more computing devices, a plurality of events, each event comprising event data and being generated by an event source in response to a real-world activity, processing, using the one or more computing devices, the plurality of events using one or more complex event processing (CEP) rules to generate a complex event, in response to generating the complex event, annotating the complex event with quality information corresponding to each of the plurality of events to provide a quality-enriched event, and publishing the quality-enriched event.
摘要:
Implementations of the present disclosure include methods for providing transparency in streaming event data. In some implementations, methods include receiving a plurality of events, each event comprising event data and being generated by an event source in response to a real-world activity, processing the plurality of events using one or more complex event processing (CEP) rules to generate a complex event, in response to generating the complex event, generating at least one lineage event that comprises lineage information, the lineage information comprising information corresponding to one or more source events, each of the one or more source events contributing to the complex event, and storing the lineage event and the one or more source events in an event archive provided as a computer-readable storage medium.
摘要:
A pipelined computing device is provided that is designed i) to generate a list of coordinates of starting points and endpoints of chains and to store these in a memory, ii) for each starting point and endpoint, to search the list of coordinates of starting points and endpoints for the last occurrence of the same coordinates or coordinates lying within a neighborhood of a specified size, and iii) to allocate to each starting point or endpoint a vertex index and an instance index, wherein the vertex index is a running index of vertices and the instance index represents a running index of the starting points and endpoints belonging to a vertex, wherein associated points from the set of starting points and endpoints receive the same vertex index and are those points that have the same coordinates or coordinates that lie within the neighborhood of a specified size.