Invention Application
- Patent Title: TOKENIZING PROGRAMMING CODE WITH CANONICAL REPRESENTATIONS
-
Application No.: US18749483Application Date: 2024-06-20
-
Publication No.: US20240427993A1Publication Date: 2024-12-26
- Inventor: Carmit Sahar , Daniel Yellin , Stojancho Ganchev , Zohar Fox
- Applicant: Aurora Labs Ltd.
- Applicant Address: IL Tel Aviv
- Assignee: Aurora Labs Ltd.
- Current Assignee: Aurora Labs Ltd.
- Current Assignee Address: IL Tel Aviv
- Main IPC: G06F40/284
- IPC: G06F40/284

Abstract:
Disclosed herein are techniques for creating and using tokens representing portions of programming code. Techniques include identifying a body of programming code; associating a plurality of tokens with respective portions of the body of programming code to generate a token-based representation of the body of programming code, wherein the associating comprises determining at least one canonical representation of at least one of the respective portions of the body of programming code; providing the token-based representation of the body of programming code to an emulator, the emulator being configured to interpret token-based representations; and receiving, from the emulator, an emulation result.
Information query