Patent search ap:("Google LLC") AND inv:"Louis Wang" Page 1

1.

Invention publication
Visual and Audio Multimodal Searching System (status: under examination, published)

Publication (announcement) number: US20240362279A1

Publication (announcement) date: 2024-10-31

Application number: US18306638

Application date: 2023-04-25

Applicant: Google LLC

Inventor: Harshit Kharbanda , Belinda Luna Zeng , Viviana Caso Corella , Christopher James Kelley , Jessica Lee , Pendar Yousefi , Dounia Berrada , Sundeep Vaddadi , Kai Yu , Balint Miklos , Severin Heiniger , Louis Wang

IPC: G06F16/9532 , G06F16/538 , G06F40/40

CPC classification number: G06F16/9532 , G06F16/538 , G06F40/40

Abstract: A multimodal search system is described. The system can receive image data captured by a camera of a user device. Additionally, the system can receive audio data associated with the image data. The audio data can be captured by a microphone of the user device. Moreover, the system can process the image data to generate visual features. Furthermore, the system can process the audio data to generate a plurality of words. The system can generate a plurality of search terms based on the plurality of words and the visual features. Subsequently, the system can determine one or more search results associated with the plurality of search terms and provide the one or more search results as an output.
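The abstract does not say how the words and visual features are combined into search terms. As a minimal sketch of one possible approach, the Python below assumes the visual features have already been reduced to text labels and resolves pointing words such as "this" to those labels. The function name `generate_search_terms`, the `STOPWORDS` and `DEICTIC` sets, and the merging rule are all hypothetical and are not taken from the patent.

```python
from typing import List

# Hypothetical word lists; the abstract does not specify any.
STOPWORDS = {"what", "is", "the", "a", "an", "of", "for", "to", "how", "do", "i"}
DEICTIC = {"this", "that", "it", "these", "those"}


def generate_search_terms(words: List[str], visual_labels: List[str]) -> List[str]:
    """Combine spoken words with visual labels into an ordered, de-duplicated list."""
    terms: List[str] = []
    for word in words:
        token = word.lower().strip("?.,!")
        if not token or token in STOPWORDS:
            continue
        # A pointing word ("this", "that") is replaced by what the camera saw.
        candidates = visual_labels if token in DEICTIC else [token]
        for term in candidates:
            term = term.lower()
            if term not in terms:
                terms.append(term)
    # Assumption: visual labels are always included, even if speech never
    # referred to the image explicitly.
    for label in visual_labels:
        if label.lower() not in terms:
            terms.append(label.lower())
    return terms


if __name__ == "__main__":
    spoken = "How do I repot this plant?".split()
    print(generate_search_terms(spoken, ["monstera"]))
```

In this sketch the spoken question supplies the intent ("repot") and the image supplies the subject ("monstera"); a real system would presumably use learned models for both steps rather than word lists.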

2.

Design
Display screen or portion thereof with graphical user interface (status: in force)

Publication (announcement) number: USD1048067S1

Publication (announcement) date: 2024-10-22

Application number: US29866692

Application date: 2022-09-23

Applicant: Google LLC

Designer: Christopher Kelley , Minsang Choi , Pritam Singh Pebam , Caroline Chilton , Carrie Linda Bisazza , Matthew Roth , Sabrina Curry , Natalie Michele Salaets , Jongwon Yu , Belinda Zeng , Harshit Kharbanda , Louis Wang , Austin Wu , Nishant Ranka , Morgane Magali Laure Sanglier

Abstract: The sole FIGURE is a front view of a display screen or portion thereof with graphical user interface showing the claimed design.
The outermost evenly spaced broken lines in the drawings show the electronic device, which is the environment of the design and forms no part of the claimed design. The dot-dash broken lines showing the display screen or portion thereof forms no part of the claimed design. The remaining broken lines showing portions of the graphical user interface form no part of the claimed design.

3.

Design
Display screen or portion thereof with transitional graphical user interface (status: in force)

Publication (announcement) number: USD1048066S1

Publication (announcement) date: 2024-10-22

Application number: US29866691

Application date: 2022-09-23

Applicant: Google LLC

Designer: Christopher Kelley , Minsang Choi , Pritam Singh Pebam , Caroline Chilton , Carrie Linda Bisazza , Matthew Roth , Sabrina Curry , Natalie Michele Salaets , Jongwon Yu , Belinda Zeng , Harshit Kharbanda , Louis Wang , Austin Wu , Nishant Ranka , Morgane Magali Laure Sanglier

Abstract: FIG. 1 is a front view of a display screen or portion thereof with transitional graphical user interface showing a first image of the claimed design.
FIG. 2 is a second image thereof; and,
FIG. 3 is a third image thereof.
The outermost evenly spaced broken lines in the drawings show the electronic device, which is the environment of the design and forms no part of the claimed design. The dot-dash broken lines showing the display screen or portion thereof forms no part of the claimed design. The remaining broken lines showing portions of the graphical user interface form no part of the claimed design.
The appearance of the transitional image sequentially transitions between the views of FIGS. 1-3. The process or period in which an image transitions to another forms no part of the claimed design.

4.

Design
Display screen or portion thereof with transitional graphical user interface (status: in force)

Publication (announcement) number: USD1048063S1

Publication (announcement) date: 2024-10-22

Application number: US29866687

Application date: 2022-09-23

Applicant: Google LLC

Designer: Christopher Kelley , Minsang Choi , Pritam Singh Pebam , Caroline Chilton , Carrie Linda Bisazza , Matthew Roth , Sabrina Curry , Natalie Michele Salaets , Jongwon Yu , Belinda Zeng , Harshit Kharbanda , Louis Wang , Austin Wu , Nishant Ranka , Morgane Magali Laure Sanglier

Abstract: FIG. 1 is a front view of a first embodiment of a display screen or portion thereof with transitional graphical user interface showing a first image of the claimed design.
FIG. 2 is a second image thereof;
FIG. 3 is a third image thereof;
FIG. 4 is a front view of a second embodiment of a display screen or portion thereof with transitional graphical user interface showing a first image of the claimed design.
FIG. 5 is a second image thereof; and,
FIG. 6 is a third image thereof.
The outermost evenly spaced broken lines in the drawings show the electronic device, which is the environment of the design and forms no part of the claimed design. The dot-dash broken lines showing the display screen or portion thereof forms no part of the claimed design. The remaining broken lines showing portions of the graphical user interface form no part of the claimed design.
The appearance of the transitional image sequentially transitions between the views of FIGS. 1-3 and FIGS. 4-6. The process or period in which an image transitions to another forms no part of the claimed design.

5.

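Invention publication
Finding and Filtering Elements of a Visual Scene (status: under examination, published)

Publication (announcement) number: US20230259993A1

Publication (announcement) date: 2023-08-17

Application number: US18165084

Application date: 2023-02-06

Applicant: Google LLC

Inventor: Harshit Kharbanda , Christopher Kelley , Louis Wang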

IPC: G06Q30/0282 , G06F16/953 , G06V20/20 , G06F3/0482 , G06F18/40 , G06F18/2113 , G06V20/00

CPC classification number: G06Q30/0282 , G06F16/953 , G06V20/20 , G06F3/0482 , G06F18/40 , G06F18/2113 , G06V20/00 , G06V30/10

Abstract: In a general aspect, a method can include receiving, by an electronic device, a visual scene; identifying, by the electronic device, a plurality of elements of the visual scene; and determining, based on the plurality of elements identified in the visual scene, a context of the visual scene. The method can further include applying, based on the determined context of the visual scene, at least one filter to identify at least one element of the plurality of elements corresponding with the at least one filter; and visually indicate, in the visual scene on a display of the electronic device, the at least one element identified using the at least one filter.
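The steps in this abstract (identify elements, determine a context, apply a context-dependent filter, visually indicate the matches) can be sketched as a small pipeline. The Python below is an illustration under stated assumptions, not the patented method: the `Element` type, the `CONTEXT_FILTERS` mapping, the majority-vote context rule, and the text markers standing in for on-screen highlighting are all hypothetical.

```python
from collections import Counter
from dataclasses import dataclass
from typing import FrozenSet, List


@dataclass(frozen=True)
class Element:
    """One element identified in the scene (hypothetical structure)."""
    name: str
    category: str                      # e.g. "menu_item"
    tags: FrozenSet[str] = frozenset()


# Hypothetical mapping from a scene context to the filter applied in it.
CONTEXT_FILTERS = {"menu_item": "vegetarian", "product": "on_sale"}


def determine_context(elements: List[Element]) -> str:
    """Assumption: the context is the most common element category."""
    if not elements:
        return "unknown"
    return Counter(e.category for e in elements).most_common(1)[0][0]


def apply_context_filter(elements: List[Element]) -> List[Element]:
    """Return the elements matching the filter chosen for the scene context."""
    tag = CONTEXT_FILTERS.get(determine_context(elements))
    if tag is None:
        return []
    return [e for e in elements if tag in e.tags]


def render(elements: List[Element], matched: List[Element]) -> List[str]:
    """Stand-in for visual indication: mark matched elements with [*]."""
    matched_names = {e.name for e in matched}
    return [("[*] " if e.name in matched_names else "[ ] ") + e.name
            for e in elements]


if __name__ == "__main__":
    scene = [
        Element("Margherita", "menu_item", frozenset({"vegetarian"})),
        Element("Pepperoni", "menu_item"),
        Element("Table number", "sign"),
    ]
    for line in render(scene, apply_context_filter(scene)):
        print(line)
```

Here the scene is treated as a menu because most elements are menu items, so the hypothetical "vegetarian" filter is applied and only the matching dish is marked.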

6.

Invention grant
Visual indicators of generative model response details (status: in force)

Publication (announcement) number: US12266065B1

Publication (announcement) date: 2025-04-01

Application number: US18409268

Application date: 2024-01-10

Applicant: Google LLC

Inventor: Harshit Kharbanda , Louis Wang , Christopher James Kelley , Jessica Lee , Igor Bonaci , Daniel Valcarce Silva

IPC: G06T19/00 , G06V20/20

Abstract: Systems and methods for providing visual indications of generative model responses can include obtaining a user input and processing the user input with a generative model to generate a model-generated-response. The systems and methods can process the model-generated response and an image of an environment to generate an augmented image. The augmented image can include visual indicators of the model-generated response, which can include annotating the image based on detected features within the image. Generation of the augmented image can include object detection and annotation based on the content of the model-generated response.

7.

Design
Display screen or portion thereof with transitional graphical user interface (status: in force)

Publication (announcement) number: USD1048064S1

Publication (announcement) date: 2024-10-22

Application number: US29866689

Application date: 2022-09-23

Applicant: Google LLC

Designer: Christopher Kelley , Minsang Choi , Pritam Singh Pebam , Caroline Chilton , Carrie Linda Bisazza , Matthew Roth , Sabrina Curry , Natalie Michele Salaets , Jongwon Yu , Belinda Zeng , Harshit Kharbanda , Louis Wang , Austin Wu , Nishant Ranka , Morgane Magali Laure Sanglier

Abstract: FIG. 1 is a front view of a display screen or portion thereof with transitional graphical user interface showing a first image of the claimed design.
FIG. 2 is a second image thereof; and,
FIG. 3 is a third image thereof.
The outermost evenly spaced broken lines in the drawings show the electronic device, which is the environment of the design and forms no part of the claimed design. The dot-dash broken lines showing the display screen or portion thereof forms no part of the claimed design. The remaining broken lines showing portions of the graphical user interface form no part of the claimed design.
The appearance of the transitional image sequentially transitions between the views of FIGS. 1-3. The process or period in which an image transitions to another forms no part of the claimed design.

8.

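Invention application
EFFICIENTLY AUGMENTING IMAGES WITH RELATED CONTENT (status: in force)

Publication (announcement) number: US20220121331A1

Publication (announcement) date: 2022-04-21

Application number: US17563695

Application date: 2021-12-28

Applicant: Google LLC

Inventor: Charles Yang , Louis Wang , Charles J. Rosenberg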

IPC: G06F3/0482 , G06F16/583 , G06F16/951 , G06F40/205 , G06F3/04845

Abstract: The subject matter of this specification generally relates to providing content related to text depicted in images. In one aspect, a system includes a data processing apparatus configured to extract text from an image. The extracted text is partitioned into multiple blocks. The multiple blocks are presented as respective first user-selectable targets on a user interface at a first zoom level. A user selection of a first block of the multiple blocks is detected. In response to detecting the user selection of the first block, portions of the extracted text in the first block are presented as respective second user-selectable targets on the user interface at a second zoom level greater than the first zoom level. In response to detecting a user selection of a portion of the extracted text within the first block, an action is initiated based on content of the user-selected text.

9.

Invention grant
User verification of a generative response to a multimodal query (status: in force)

Publication (announcement) number: US12277635B1

Publication (announcement) date: 2025-04-15

Application number: US18532470

Application date: 2023-12-07

Applicant: Google LLC

Inventor: Harshit Kharbanda , Louis Wang , Christopher James Kelley , Jessica Lee

IPC: G06T11/60 , G06T13/80

Abstract: A multimodal search system is described. The system can receive image data from a user device. Additionally, the system can receive a prompt associated with the image data. Moreover, the system can determine, using a computer vision model, a first object in the image data that is associated with the prompt. Furthermore, the system can receive, from the user device, a user indication on whether the image data includes the first object. Subsequently, in response to receiving the user indication, the system can generate a response using a large language model.

10.

Invention application
Video and Audio Multimodal Searching System (status: in force)

Publication (announcement) number: US20240403362A1

Publication (announcement) date: 2024-12-05

Application number: US18326496

Application date: 2023-05-31

Applicant: Google LLC

Inventor: Harshit Kharbanda , Belinda Luna Zeng , Viviana Caso Corella , Aashi Jain , David William Hendon , Christopher James Kelley , Jessica Lee , Dounia Berrada , Kai Yu , Louis Wang , Thomas J. Duerig , Radu Soricut , Robin Dua

IPC: G06F16/735 , G06F16/732 , G06F16/783 , G06T7/70 , G06V10/62 , G06V10/774 , G06V20/40

Abstract: A multimodal search system using a video query is described. The system can receive video data captured by a camera of a user device. The video data can have a sequence of image frames. Additionally, the system can receive audio data associated with the video data captured by the user device. Moreover, the system can process, using one or more machine-learned models, the sequence of image frames to generate video embeddings related to the sequence of the image frames. The video embeddings can have a plurality of image embeddings associated with the sequence of image frames. Furthermore, the system can determine one or more video results based on the video embeddings and the audio data. Subsequently, the system can transmit, to the user device, the one or more video results.
