Patent search ap:("Google LLC") AND inv:"David James Fleet" Page 1

1.

发明申请
DETECTING OBJECTS IN IMAGES BY GENERATING SEQUENCES OF TOKENS 有权

公开(公告)号：US20250139959A1

公开(公告)日：2025-05-01

申请号：US18690550

申请日：2022-09-19

Applicant: Google LLC

Inventor： Ting Chen , Saurabh Saxena , Yi Li , Geoffrey E. Hinton , David James Fleet

IPC: G06V10/82 , G06V10/764 , G06V10/774 , G06V10/776 , G06V20/70

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for object detection using neural networks. In one aspect, one of the methods includes obtaining an input image; processing the input image using an object detection neural network to generate an output sequence that comprises respective token at each of a plurality of time steps, wherein each token is selected from a vocabulary of tokens that comprises (i) a first set of tokens that each represent a respective discrete number from a set of discretized numbers and (ii) a second set of tokens that each represent a respective object category from a set of object categories; and generating, from the tokens in the output sequence, an object detection output for the input image.

2.

发明公开
GENERATING IMAGES USING SEQUENCES OF GENERATIVE NEURAL NETWORKS 审中-公开

公开(公告)号：US20240249456A1

公开(公告)日：2024-07-25

申请号：US18624960

申请日：2024-04-02

Applicant: Google LLC

Inventor： Chitwan Saharia , William Chan , Mohammad Norouzi , Saurabh Saxena , Yi Li , Jay Ha Whang , David James Fleet , Jonathan Ho

IPC: G06T11/60 , G06F40/284 , G06F40/40 , G06N3/08 , G06T3/4053 , G06T5/70

CPC classification number: G06T11/60 , G06F40/284 , G06F40/40 , G06N3/08 , G06T3/4053 , G06T5/70

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.

3.

发明公开
GENERATING VIDEOS USING DIFFUSION MODELS 审中-公开

公开(公告)号：US20240338936A1

公开(公告)日：2024-10-10

申请号：US18296938

申请日：2023-04-06

Applicant: Google LLC

Inventor： Jonathan Ho , Tim Salimans , Alexey Alexeevich Gritsenko , William Chan , Mohammad Norouzi , David James Fleet

IPC: G06V10/82 , G06V10/771 , H04N7/01

CPC classification number: G06V10/82 , G06V10/771 , H04N7/0117 , H04N7/013

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an output video conditioned on an input. In one aspect, a method comprises receiving the input; initializing a current intermediate representation; generating an output video by updating the current intermediate representation at each of a plurality of iterations, wherein the updating comprises, at each iteration: processing an intermediate input for the iteration comprising the current intermediate representation using a diffusion model that is configured to process the intermediate input to generate a noise output; and updating the current intermediate representation using the noise output for the iteration.

4.

发明授权
Generating images using sequences of generative neural networks 有权

公开(公告)号：US11978141B2

公开(公告)日：2024-05-07

申请号：US18199883

申请日：2023-05-19

Applicant: Google LLC

Inventor： Chitwan Saharia , William Chan , Mohammad Norouzi , Saurabh Saxena , Yi Li , Jay Ha Whang , David James Fleet , Jonathan Ho

IPC: G06T11/60 , G06F40/284 , G06F40/40 , G06N3/08 , G06T3/40 , G06T3/4053 , G06T5/00

CPC classification number: G06T11/60 , G06F40/284 , G06F40/40 , G06N3/08 , G06T3/4053 , G06T5/002

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.

5.

发明公开
GENERATING IMAGES USING SEQUENCES OF GENERATIVE NEURAL NETWORKS 审中-公开

公开(公告)号：US20230377226A1

公开(公告)日：2023-11-23

申请号：US18199883

申请日：2023-05-19

Applicant: Google LLC

Inventor： Chitwan Saharia , William Chan , Mohammad Norouzi , Saurabh Saxena , Yi Li , Jay Ha Whang , David James Fleet , Jonathan Ho

IPC: G06T11/60 , G06T3/40 , G06T5/00 , G06F40/40 , G06F40/284 , G06N3/08

CPC classification number: G06T11/60 , G06T3/4053 , G06T5/002 , G06F40/40 , G06F40/284 , G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.

Patent Agency Ranking