-
公开(公告)号:US20240233569A9
公开(公告)日:2024-07-11
申请号:US17969303
申请日:2022-10-19
Applicant: Google LLC
Inventor: Jessica Lee , David Trotter Oleson , Fabian Roth , Nils Grimsmo
IPC: G09B7/04 , G06F3/04845 , G06F40/205 , G06T11/60 , G06V10/94 , G06V20/70 , G06V30/12 , G06V30/19
CPC classification number: G09B7/04 , G06F3/04845 , G06F40/205 , G06T11/60 , G06V10/945 , G06V20/70 , G06V30/127 , G06V30/19133 , G06V30/19147
Abstract: Systems and methods for augmented-reality tutoring can utilize optical character recognition, natural language processing, and/or augmented-reality rendering for providing real-time notifications for completing a determined task. The systems and methods can include utilizing one or more machine-learned models trained for quantitative reasoning and can include providing a plurality of different user interface elements at different times.
-
公开(公告)号:US20240135835A1
公开(公告)日:2024-04-25
申请号:US17969303
申请日:2022-10-18
Applicant: Google LLC
Inventor: Jessica Lee , David Trotter Oleson , Fabian Roth , Nils Grimsmo
IPC: G09B7/04 , G06F3/04845 , G06F40/205 , G06T11/60 , G06V10/94 , G06V20/70 , G06V30/12 , G06V30/19
CPC classification number: G09B7/04 , G06F3/04845 , G06F40/205 , G06T11/60 , G06V10/945 , G06V20/70 , G06V30/127 , G06V30/19133 , G06V30/19147
Abstract: Systems and methods for augmented-reality tutoring can utilize optical character recognition, natural language processing, and/or augmented-reality rendering for providing real-time notifications for completing a determined task. The systems and methods can include utilizing one or more machine-learned models trained for quantitative reasoning and can include providing a plurality of different user interface elements at different times.
-
公开(公告)号:US12254785B2
公开(公告)日:2025-03-18
申请号:US17969303
申请日:2022-10-19
Applicant: Google LLC
Inventor: Jessica Lee , David Trotter Oleson , Fabian Roth , Nils Grimsmo
IPC: G09B7/04 , G06F3/04845 , G06F40/205 , G06T11/60 , G06V10/94 , G06V20/70 , G06V30/12 , G06V30/19
Abstract: Systems and methods for augmented-reality tutoring can utilize optical character recognition, natural language processing, and/or augmented-reality rendering for providing real-time notifications for completing a determined task. The systems and methods can include utilizing one or more machine-learned models trained for quantitative reasoning and can include providing a plurality of different user interface elements at different times.
-
公开(公告)号:US20250087207A1
公开(公告)日:2025-03-13
申请号:US18736113
申请日:2024-06-06
Applicant: Google LLC
Inventor: Harshit Kharbanda , Jessica Lee , Christopher James Kelley , Fabian Roth , Dounia Berrada , Samer Hassan Hassan , Afroz Mohiuddin , Misha Khalman , Ali Essam Ali Elqursh , Belinda Luna Zeng
IPC: G10L15/183 , G06F16/583 , G06V10/778 , G06V30/14 , G06V30/148 , G10L15/22 , G10L15/30
Abstract: The present disclosure provides computer-implemented methods, systems, and devices for responding to requests associated with an image. A computing system obtains, wherein the image depicts a first set of textual content. The computing system determines one or more characteristics of the first set of textual content. The computing system determines a response type from a plurality of response types based on the one or more characteristics. The computing system generates a model input, wherein the model input comprises data descriptive of the first set of textual content and a prompt associated with the response type. The computing system provides providing the model input as an input to a machine-learned language model. The computing system receives a second set of text as an output of the machine-learned language model as a result of the machine-learned language model processing the model input. The computing system provides the second set of text for display to a user, wherein the second set of textual content is associated with the response type.
-
公开(公告)号:US12033620B1
公开(公告)日:2024-07-09
申请号:US18463951
申请日:2023-09-08
Applicant: Google LLC
Inventor: Harshit Kharbanda , Jessica Lee , Christopher James Kelley , Fabian Roth , Dounia Berrada , Samer Hassan Hassan , Afroz Mohiuddin , Mikhail Khalman , Ali Essam Ali Elqursh , Belinda Luna Zeng
IPC: G06F3/0483 , G06F16/30 , G06F16/33 , G06F16/583 , G06V10/778 , G06V30/14 , G06V30/148 , G10L15/183 , G10L15/22 , G10L15/30
CPC classification number: G10L15/183 , G06F16/5846 , G06V10/778 , G06V30/1456 , G06V30/153 , G10L15/22 , G10L15/30
Abstract: The present disclosure provides computer-implemented methods, systems, and devices for responding to requests associated with an image. A computing system obtains, wherein the image depicts a first set of textual content. The computing system determines one or more characteristics of the first set of textual content. The computing system determines a response type from a plurality of response types based on the one or more characteristics. The computing system generates a model input, wherein the model input comprises data descriptive of the first set of textual content and a prompt associated with the response type. The computing system provides providing the model input as an input to a machine-learned language model. The computing system receives a second set of text as an output of the machine-learned language model as a result of the machine-learned language model processing the model input. The computing system provides the second set of text for display to a user, wherein the second set of textual content is associated with the response type.
-
-
-
-