Hardware optimized string table for accelerated relational database queries

    公开(公告)号:US12242481B1

    公开(公告)日:2025-03-04

    申请号:US18423196

    申请日:2024-01-25

    Abstract: Data structures and methods are described to enable a hardware optimized dynamic string table for accelerating relational database queries. A method comprises retrieving a lookup key for a query against a dynamic string table, the lookup key associated with a key length and a key hash. The method further comprises configuring a formatted lookup key as in-line or out-of-line based on whether the key length exceeds a maximum inline key size. The method further comprises replicating, into a first plurality of single instruction, multiple data (SIMD) lanes, the formatted lookup key. The method further comprises writing a candidate bucket, selected from the dynamic string table based on the key hash, into a second plurality of SIMD lanes. The method further comprises performing a SIMD compare of the first plurality of SIMD lanes and the second plurality of SIMD lanes, and returning an associated code when the lookup key is matched.

    RESUMABILITY SUPPORT FOR GRAPH EXECUTION ON SINGLE-INSTRUCTION-MULTIPLE-THREAD ARCHITECTURE

    公开(公告)号:US20250138830A1

    公开(公告)日:2025-05-01

    申请号:US18384734

    申请日:2023-10-27

    Abstract: An approach is provided for a thorough and clean way of handling graph overflows in graph execution on single instruction, multiple threads (SIMT) hardware with resumable graph support. The solution does not assume that the input and output fit in the buffers allocated in the SIMT hardware. The approach maintains state of the execution for each kernel and uses multiple iterations of graph execution, making progress in each iteration until all data items are processed through the graph on SIMT hardware. This iterative processing of the graph is transparent to the end user. For resumability, the approach treats buffers as circular buffers instead of serial buffers. With the help of counters, the approach keeps track of the start and end indexes of input and output buffers, thus achieving seamless graph resumability when re-execution is required for only a subset of kernels.

Patent Agency Ranking