DETECTING OBJECTS IN IMAGES BY GENERATING SEQUENCES OF TOKENS

    公开(公告)号:US20250139959A1

    公开(公告)日:2025-05-01

    申请号:US18690550

    申请日:2022-09-19

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for object detection using neural networks. In one aspect, one of the methods includes obtaining an input image; processing the input image using an object detection neural network to generate an output sequence that comprises respective token at each of a plurality of time steps, wherein each token is selected from a vocabulary of tokens that comprises (i) a first set of tokens that each represent a respective discrete number from a set of discretized numbers and (ii) a second set of tokens that each represent a respective object category from a set of object categories; and generating, from the tokens in the output sequence, an object detection output for the input image.

Patent Agency Ranking