CHARACTER-LEVEL ATTENTION NEURAL NETWORKS
    Invention publication

    Publication number: US20240289552A1

    Publication date: 2024-08-29

    Application number: US18564859

    Filing date: 2022-05-27

    Applicant: Google LLC

    CPC classification number: G06F40/284

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing a machine learning task on an input sequence of characters that has a respective character at each of a plurality of character positions to generate a network output. One of the systems includes a neural network configured to perform the machine learning task, the neural network comprising a gradient-based sub-word tokenizer and an output neural network. The gradient-based sub-word tokenizer is configured to apply a learned, i.e., flexible, sub-word tokenization strategy to the input sequence of characters to generate a sequence of latent sub-word representations. The output neural network is configured to process the sequence of latent sub-word representations to generate the network output for the task.
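
The abstract does not say how the tokenizer computes its latent sub-word representations, so the following is only an illustrative sketch of one way a "soft", differentiable tokenization could work. It assumes mean-pooled candidate blocks of a few fixed sizes, a linear scorer, a softmax over block sizes at each character position, and mean-pool downsampling. None of these choices, nor the names `soft_subword_tokenize`, `embed_table`, `score_weights`, `block_sizes`, and `downsample`, come from the record above; they are hypothetical. Only the forward pass is shown, with random stand-ins for parameters that would be learned by gradient descent.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def pool(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]


def repeat_pooled_blocks(vectors, block_size):
    """Mean-pool non-overlapping blocks and repeat each pooled vector so
    that every character position has one candidate vector."""
    out = []
    for start in range(0, len(vectors), block_size):
        block = vectors[start:start + block_size]
        out.extend([pool(block)] * len(block))
    return out


def soft_subword_tokenize(text, embed_table, score_weights,
                          block_sizes=(1, 2, 3, 4), downsample=2):
    """Hypothetical forward pass: characters -> latent sub-word vectors."""
    if not text:
        return []
    # Assumed character embedding lookup (code points hashed into the table).
    char_vecs = [embed_table[ord(ch) % len(embed_table)] for ch in text]
    dim = len(char_vecs[0])
    # One candidate representation per position for every block size.
    candidates = [repeat_pooled_blocks(char_vecs, b) for b in block_sizes]
    mixed = []
    for pos in range(len(char_vecs)):
        # Assumed linear scorer: one scalar score per candidate block size.
        scores = [sum(w * x for w, x in zip(score_weights, cand[pos]))
                  for cand in candidates]
        weights = softmax(scores)  # soft choice among block sizes
        mixed.append([sum(wt * cand[pos][d]
                          for wt, cand in zip(weights, candidates))
                      for d in range(dim)])
    # Shorten the sequence by mean-pooling fixed windows of positions.
    return [pool(mixed[i:i + downsample])
            for i in range(0, len(mixed), downsample)]


# Random stand-ins for learned parameters (illustration only).
rng = random.Random(0)
embed_table = [[rng.uniform(-1.0, 1.0) for _ in range(4)] for _ in range(64)]
score_weights = [rng.uniform(-1.0, 1.0) for _ in range(4)]

latents = soft_subword_tokenize("attention", embed_table, score_weights)
print(len(latents), len(latents[0]))
```

Because every step is a weighted average, the block-size choice stays continuous, which is what would let an automatic-differentiation framework train the scorer jointly with the output neural network. The resulting `latents` list plays the role of the sequence that the output network would consume.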
