REGULAR EXPRESSION MATCHING IN DICTIONARY-ENCODED STRINGS

    公开(公告)号:US20240176823A1

    公开(公告)日:2024-05-30

    申请号:US18070791

    申请日:2022-11-29

    IPC分类号: G06F16/903 G06F9/448

    CPC分类号: G06F16/90344 G06F9/4498

    摘要: Techniques are described for generating an encoded-string automaton for a regex pattern from a decoded-string automaton of the regex pattern. In an embodiment, the process obtains a decoded-string automaton of the regex pattern and applies unique decoded string value(s) from the dictionary of the encoding. When applied at a selected state in the decoded-string automaton, the application may yield a transition to at least one target state in the decoded-string automaton for a unique dictionary value. Such a transition generates a transition in the encoded-string automaton from an encoded state corresponding to the selected state in the decoded-string automaton to a target state in the encoded-string automaton corresponding to the target state in the decoded-string automaton. The generated transition in the encoded-string automaton is conditioned on the token of the unique decoded string value in the dictionary.