-
Publication number: US20240362279A1
Publication date: 2024-10-31
Application number: US18306638
Application date: 2023-04-25
Applicant: Google LLC
Inventor: Harshit Kharbanda, Belinda Luna Zeng, Viviana Caso Corella, Christopher James Kelley, Jessica Lee, Pendar Yousefi, Dounia Berrada, Sundeep Vaddadi, Kai Yu, Balint Miklos, Severin Heiniger, Louis Wang
IPC: G06F16/9532, G06F16/538, G06F40/40
CPC classification number: G06F16/9532, G06F16/538, G06F40/40
Abstract: A multimodal search system is described. The system can receive image data captured by a camera of a user device. Additionally, the system can receive audio data associated with the image data. The audio data can be captured by a microphone of the user device. Moreover, the system can process the image data to generate visual features. Furthermore, the system can process the audio data to generate a plurality of words. The system can generate a plurality of search terms based on the plurality of words and the visual features. Subsequently, the system can determine one or more search results associated with the plurality of search terms and provide the one or more search results as an output.
-
Publication number: USD1048067S1
Publication date: 2024-10-22
Application number: US29866692
Application date: 2022-09-23
Applicant: Google LLC
Designer: Christopher Kelley, Minsang Choi, Pritam Singh Pebam, Caroline Chilton, Carrie Linda Bisazza, Matthew Roth, Sabrina Curry, Natalie Michele Salaets, Jongwon Yu, Belinda Zeng, Harshit Kharbanda, Louis Wang, Austin Wu, Nishant Ranka, Morgane Magali Laure Sanglier
Abstract: The sole FIGURE is a front view of a display screen or portion thereof with graphical user interface showing the claimed design.
The outermost evenly spaced broken lines in the drawings show the electronic device, which is the environment of the design and forms no part of the claimed design. The dot-dash broken lines showing the display screen or portion thereof form no part of the claimed design. The remaining broken lines showing portions of the graphical user interface form no part of the claimed design.
-
Publication number: USD1048066S1
Publication date: 2024-10-22
Application number: US29866691
Application date: 2022-09-23
Applicant: Google LLC
Designer: Christopher Kelley, Minsang Choi, Pritam Singh Pebam, Caroline Chilton, Carrie Linda Bisazza, Matthew Roth, Sabrina Curry, Natalie Michele Salaets, Jongwon Yu, Belinda Zeng, Harshit Kharbanda, Louis Wang, Austin Wu, Nishant Ranka, Morgane Magali Laure Sanglier
Abstract: FIG. 1 is a front view of a display screen or portion thereof with transitional graphical user interface showing a first image of the claimed design.
FIG. 2 is a second image thereof; and,
FIG. 3 is a third image thereof.
The outermost evenly spaced broken lines in the drawings show the electronic device, which is the environment of the design and forms no part of the claimed design. The dot-dash broken lines showing the display screen or portion thereof form no part of the claimed design. The remaining broken lines showing portions of the graphical user interface form no part of the claimed design.
The appearance of the transitional image sequentially transitions between the views of FIGS. 1-3. The process or period in which an image transitions to another forms no part of the claimed design.
-
Publication number: USD1048063S1
Publication date: 2024-10-22
Application number: US29866687
Application date: 2022-09-23
Applicant: Google LLC
Designer: Christopher Kelley, Minsang Choi, Pritam Singh Pebam, Caroline Chilton, Carrie Linda Bisazza, Matthew Roth, Sabrina Curry, Natalie Michele Salaets, Jongwon Yu, Belinda Zeng, Harshit Kharbanda, Louis Wang, Austin Wu, Nishant Ranka, Morgane Magali Laure Sanglier
Abstract: FIG. 1 is a front view of a first embodiment of a display screen or portion thereof with transitional graphical user interface showing a first image of the claimed design.
FIG. 2 is a second image thereof;
FIG. 3 is a third image thereof;
FIG. 4 is a front view of a second embodiment of a display screen or portion thereof with transitional graphical user interface showing a first image of the claimed design.
FIG. 5 is a second image thereof; and,
FIG. 6 is a third image thereof.
The outermost evenly spaced broken lines in the drawings show the electronic device, which is the environment of the design and forms no part of the claimed design. The dot-dash broken lines showing the display screen or portion thereof form no part of the claimed design. The remaining broken lines showing portions of the graphical user interface form no part of the claimed design.
The appearance of the transitional image sequentially transitions between the views of FIGS. 1-3 and FIGS. 4-6. The process or period in which an image transitions to another forms no part of the claimed design.
-
Publication number: US20230259993A1
Publication date: 2023-08-17
Application number: US18165084
Application date: 2023-02-06
Applicant: Google LLC
Inventor: Harshit Kharbanda, Christopher Kelley, Louis Wang
IPC: G06Q30/0282, G06F16/953, G06V20/20, G06F3/0482, G06F18/40, G06F18/2113, G06V20/00
CPC classification number: G06Q30/0282, G06F16/953, G06V20/20, G06F3/0482, G06F18/40, G06F18/2113, G06V20/00, G06V30/10
Abstract: In a general aspect, a method can include receiving, by an electronic device, a visual scene; identifying, by the electronic device, a plurality of elements of the visual scene; and determining, based on the plurality of elements identified in the visual scene, a context of the visual scene. The method can further include applying, based on the determined context of the visual scene, at least one filter to identify at least one element of the plurality of elements corresponding with the at least one filter; and visually indicating, in the visual scene on a display of the electronic device, the at least one element identified using the at least one filter.
-
Publication number: US12266065B1
Publication date: 2025-04-01
Application number: US18409268
Application date: 2024-01-10
Applicant: Google LLC
Inventor: Harshit Kharbanda, Louis Wang, Christopher James Kelley, Jessica Lee, Igor Bonaci, Daniel Valcarce Silva
Abstract: Systems and methods for providing visual indications of generative model responses can include obtaining a user input and processing the user input with a generative model to generate a model-generated response. The systems and methods can process the model-generated response and an image of an environment to generate an augmented image. The augmented image can include visual indicators of the model-generated response, which can include annotating the image based on detected features within the image. Generation of the augmented image can include object detection and annotation based on the content of the model-generated response.
-
Publication number: USD1048064S1
Publication date: 2024-10-22
Application number: US29866689
Application date: 2022-09-23
Applicant: Google LLC
Designer: Christopher Kelley, Minsang Choi, Pritam Singh Pebam, Caroline Chilton, Carrie Linda Bisazza, Matthew Roth, Sabrina Curry, Natalie Michele Salaets, Jongwon Yu, Belinda Zeng, Harshit Kharbanda, Louis Wang, Austin Wu, Nishant Ranka, Morgane Magali Laure Sanglier
Abstract: FIG. 1 is a front view of a display screen or portion thereof with transitional graphical user interface showing a first image of the claimed design.
FIG. 2 is a second image thereof; and,
FIG. 3 is a third image thereof.
The outermost evenly spaced broken lines in the drawings show the electronic device, which is the environment of the design and forms no part of the claimed design. The dot-dash broken lines showing the display screen or portion thereof form no part of the claimed design. The remaining broken lines showing portions of the graphical user interface form no part of the claimed design.
The appearance of the transitional image sequentially transitions between the views of FIGS. 1-3. The process or period in which an image transitions to another forms no part of the claimed design.
-
Publication number: US20220121331A1
Publication date: 2022-04-21
Application number: US17563695
Application date: 2021-12-28
Applicant: Google LLC
Inventor: Charles Yang, Louis Wang, Charles J. Rosenberg
IPC: G06F3/0482, G06F16/583, G06F16/951, G06F40/205, G06F3/04845
Abstract: The subject matter of this specification generally relates to providing content related to text depicted in images. In one aspect, a system includes a data processing apparatus configured to extract text from an image. The extracted text is partitioned into multiple blocks. The multiple blocks are presented as respective first user-selectable targets on a user interface at a first zoom level. A user selection of a first block of the multiple blocks is detected. In response to detecting the user selection of the first block, portions of the extracted text in the first block are presented as respective second user-selectable targets on the user interface at a second zoom level greater than the first zoom level. In response to detecting a user selection of a portion of the extracted text within the first block, an action is initiated based on content of the user-selected text.
-
Publication number: US12277635B1
Publication date: 2025-04-15
Application number: US18532470
Application date: 2023-12-07
Applicant: Google LLC
Inventor: Harshit Kharbanda, Louis Wang, Christopher James Kelley, Jessica Lee
Abstract: A multimodal search system is described. The system can receive image data from a user device. Additionally, the system can receive a prompt associated with the image data. Moreover, the system can determine, using a computer vision model, a first object in the image data that is associated with the prompt. Furthermore, the system can receive, from the user device, a user indication on whether the image data includes the first object. Subsequently, in response to receiving the user indication, the system can generate a response using a large language model.
-
Publication number: US20240403362A1
Publication date: 2024-12-05
Application number: US18326496
Application date: 2023-05-31
Applicant: Google LLC
Inventor: Harshit Kharbanda, Belinda Luna Zeng, Viviana Caso Corella, Aashi Jain, David William Hendon, Christopher James Kelley, Jessica Lee, Dounia Berrada, Kai Yu, Louis Wang, Thomas J. Duerig, Radu Soricut, Robin Dua
IPC: G06F16/735, G06F16/732, G06F16/783, G06T7/70, G06V10/62, G06V10/774, G06V20/40
Abstract: A multimodal search system using a video query is described. The system can receive video data captured by a camera of a user device. The video data can have a sequence of image frames. Additionally, the system can receive audio data associated with the video data captured by the user device. Moreover, the system can process, using one or more machine-learned models, the sequence of image frames to generate video embeddings related to the sequence of the image frames. The video embeddings can have a plurality of image embeddings associated with the sequence of image frames. Furthermore, the system can determine one or more video results based on the video embeddings and the audio data. Subsequently, the system can transmit, to the user device, the one or more video results.