-
公开(公告)号:US20240386215A1
公开(公告)日:2024-11-21
申请号:US18319249
申请日:2023-05-17
Applicant: Google LLC
Inventor: Julian Martin Eisenschlos , Francesco Piccinno , Yasemin Altun , Syrine Krichene , Kenton Chiu Tsun Lee , Fangyu Liu , Mandar Joshi , Chenxi Pang , Wenhu Chen
Abstract: Provided is a one-shot solution to visual language reasoning. Example systems described herein decompose the challenge of visual language reasoning into two steps: translation of a graphical depiction of data (e.g., a plot or chart) into text; followed by reasoning over the translated text. In particular, example systems described herein can include a machine-learned visual-to-language conversion model that translates a graphical depiction of a dataset to a set of text descriptive of the dataset. The output of visual-to-language conversion model can then be directly used to prompt a language model, (e.g., a pretrained large language model (LLM)), exploiting the few-shot reasoning capabilities of the language model.
-
公开(公告)号:US20240378375A1
公开(公告)日:2024-11-14
申请号:US18650501
申请日:2024-04-30
Applicant: Google LLC
Inventor: Killian Robert Coate , Alexander Burmistrov , Aliya Aliya , Dmitriy Brezhnev , Elliott Malkin , Gaye Oncul Kok , Kester Christopher Tong , Lauren Nicole DeNaut , Shira Gilboa , Aleksandr Sinayev , Victoria Mary Taylor , Julian Martin Eisenschlos , Chenxi Pang
IPC: G06F40/18 , G06F40/40 , G06N3/0475 , G06N3/08
Abstract: Systems and methods for generating structured content using a collaborative generator provide a user interface to a user computing system and receive a prompt from the user computing system via the user interface, the prompt including existing content within an integrated development environment. The systems and methods provide the prompt to a generative model, with the generative model being a machine-learned model trained to process language input prompts to generate a language output. The systems and methods receive a generative output generated by the generative model in response to the prompt, the generative output including generative content divided into one or more generative content cells. Additionally, the systems and methods provide the generative output via the user interface.
-