Patent search ap:("Google LLC") AND inv:"Louis Wang" Page 2

11.

发明授权
Efficiently augmenting images with related content 有权

公开(公告)号：US11231832B2

公开(公告)日：2022-01-25

申请号：US16069071

申请日：2017-09-13

Applicant: Google LLC

Inventor： Charles Yang , Louis Wang , Charles J. Rosenberg

IPC: G06F17/00 , G06F3/0482 , G06F16/583 , G06F16/951 , G06F40/205 , G06F3/0484

Abstract: The subject matter of this specification generally relates to providing content related to text depicted in images. In one aspect, a system includes a data processing apparatus configured to extract text from an image. The extracted text is partitioned into multiple blocks. The multiple blocks are presented as respective first user-selectable targets on a user interface at a first zoom level. A re user selection of a first block of the multiple blocks is detected. In response to detecting the user selection of the first block, portions of the extracted text in the first block are presented as respective second user-selectable targets on the user interface at a second zoom level greater than the first zoom level. In response to detecting a user selection of a portion of the extracted text within the first block, an action is initiated based on content of the user-selected text.

12.

发明申请
EFFICIENTLY AUGMENTING IMAGES WITH RELATED CONTENT 有权

公开(公告)号：US20210208741A1

公开(公告)日：2021-07-08

申请号：US16069071

申请日：2017-09-13

Applicant: Google LLC

Inventor： Charles Yang , Louis Wang , Charles J. Rosenberg

IPC: G06F3/0482 , G06F3/0484 , G06F40/205 , G06F16/951 , G06F16/583

Abstract: The subject matter of this specification generally relates to providing content related to text depicted in images. In one aspect, a system includes a data processing apparatus configured to extract text from an image. The extracted text is partitioned into multiple blocks. The multiple blocks are presented as respective first user-selectable targets on a user interface at a first zoom level. A re user selection of a first block of the multiple blocks is detected. In response to detecting the user selection of the first block, portions of the extracted text in the first block are presented as respective second user-selectable targets on the user interface at a second zoom level greater than the first zoom level. In response to detecting a user selection of a portion of the extracted text within the first block, an action is initiated based on content of the user-selected text.

13.

发明申请
Instance Level Scene Recognition with a Vision Language Model 有权

公开(公告)号：US20250140006A1

公开(公告)日：2025-05-01

申请号：US18620136

申请日：2024-03-28

Applicant: Google LLC

Inventor： Harshit Kharbanda , Boris Bluntschli , Vibhuti Mahajan , Louis Wang

IPC: G06V20/70 , G06V10/764 , G06V20/40

Abstract: Systems and methods for image understanding can include one or more object recognition systems and one or more vision language models to generate an augmented language output that can be both scene-aware and object-aware. The systems and methods can process an input image with an object recognition model to generate an object recognition output descriptive of identification details for an object depicted in the input image. The systems and methods can include processing the input image with a vision language model to generate a language output descriptive of a predicted scene description. The object recognition output can then be utilized to augment the language output to generate an augmented language output that includes the scene understanding of the language output with the specificity of the object recognition output.

14.

发明申请
Efficiently Augmenting Images with Related Content 有权

公开(公告)号：US20250013351A1

公开(公告)日：2025-01-09

申请号：US18887662

申请日：2024-09-17

Applicant: Google LLC

Inventor： Charles Yang , Louis Wang , Charles J. Rosenberg

IPC: G06F3/0482 , G06F3/04845 , G06F16/583 , G06F16/951 , G06F40/205

Abstract: The subject matter of this specification generally relates to providing content related to text depicted in images. In one aspect, a system includes a data processing apparatus configured to extract text from an image. The extracted text is partitioned into multiple blocks. The multiple blocks are presented as respective first user-selectable targets on a user interface at a first zoom level. A user selection of a first block of the multiple blocks is detected. In response to detecting the user selection of the first block, portions of the extracted text in the first block are presented as respective second user-selectable targets on the user interface at a second zoom level greater than the first zoom level. In response to detecting a user selection of a portion of the extracted text within the first block, an action is initiated based on content of the user-selected text.

15.

发明申请
Visual Citations for Information Provided in Response to Multimodal Queries 有权

公开(公告)号：US20240378237A1

公开(公告)日：2024-11-14

申请号：US18314663

申请日：2023-05-09

Applicant: Google LLC

Inventor： Harshit Kharbanda , Jessica Lee , Christopher James Kelley , Belinda Luna Zeng , Louis Wang

IPC: G06F16/583 , G06V10/74

Abstract: Result images are retrieved based on a similarity to a query image. A set of textual inputs is processed with a machine-learned language model to obtain a language output comprising textual content, wherein the set of textual inputs comprises textual content from source documents that include the result images, and a prompt associated with the query image. The language output and the result images are provided to a user computing device. Information is received descriptive of an indication by a user that a first result image is visually dissimilar to the query image. Textual content associated with the source document that includes the first result image from the set of textual inputs is removed. The set of textual inputs is processed with the machine-learned language model to obtain a refined language output. The refined language output is provided to the user computing device.

16.

外观设计
Display screen or portion thereof with graphical user interface 有权

公开(公告)号：USD1048065S1

公开(公告)日：2024-10-22

申请号：US29866690

申请日：2022-09-23

Applicant: Google LLC

Designer： Christopher Kelley , Minsang Choi , Pritam Singh Pebam , Caroline Chilton , Carrie Linda Bisazza , Matthew Roth , Sabrina Curry , Natalie Michele Salaets , Jongwon Yu , Belinda Zeng , Harshit Kharbanda , Louis Wang , Austin Wu , Nishant Ranka , Morgane Magali Laure Sanglier

Abstract: The sole FIGURE is a front view of a display screen or portion thereof with graphical user interface showing the claimed design.
The outermost evenly spaced broken lines in the drawings show the electronic device, which is the environment of the design and forms no part of the claimed design. The dot-dash broken lines showing the display screen or portion thereof forms no part of the claimed design. The remaining broken lines showing portions of the graphical user interface form no part of the claimed design.

17.

外观设计
Display screen or portion thereof with graphical user interface 有权

公开(公告)号：USD1048062S1

公开(公告)日：2024-10-22

申请号：US29866685

申请日：2022-09-23

Applicant: Google LLC

Designer： Christopher Kelley , Minsang Choi , Pritam Singh Pebam , Caroline Chilton , Carrie Linda Bisazza , Matthew Roth , Sabrina Curry , Natalie Michele Salaets , Jongwon Yu , Belinda Zeng , Harshit Kharbanda , Louis Wang , Austin Wu , Nishant Ranka , Morgane Magali Laure Sanglier

Abstract: FIG. 1 is a front view of a first embodiment of a display screen or portion thereof with graphical user interface showing the claimed design;
FIG. 2 is a second embodiment thereof;
FIG. 3 is a third embodiment thereof; and,
FIG. 4 is a fourth embodiment thereof.
The outermost evenly spaced broken lines in the drawings show the electronic device, which is the environment of the design and forms no part of the claimed design. The dot-dash broken lines showing the display screen or portion thereof forms no part of the claimed design. The remaining broken lines showing portions of the graphical user interface form no part of the claimed design.

18.

发明公开
Medical Condition Visual Search 审中-公开

公开(公告)号：US20240339217A1

公开(公告)日：2024-10-10

申请号：US18620434

申请日：2024-03-28

Applicant: Google LLC

Inventor： Peggy Yen Phuong Bui , Bianca Madalina Buisman , Quang Anh Duong , Anastasia Martynova , Ayush Jain , Yuan Liu , Jonathan David Krause , Amit Sanjay Talreja , Rajeev Vijay Rikhye , Mahvish A. Nagda , Pinal Bavishi , Christopher James Eicher , Abigail Ward , Jieming Yu , Louis Wang , Dounia Berrada , Dale Richard Webster , Harshit Kharbanda , Igor Bonaci , Kai Yu , Ke Lan , Kaan Yücer , Willa Angel Chen Miller , Lars Thomas Hansen

IPC: G16H50/20 , G06T7/00 , G16H30/40

CPC classification number: G16H50/20 , G06T7/0012 , G16H30/40 , G06T2207/20104 , G06T2207/30088

Abstract: Systems and methods for diagnostic visual search can include processing a search query with a plurality of classification models to determine a search query intent and predict potential diagnosis. The search query can include an image that is processed to determine the presence of a body part and may be processed to determine if the search query is descriptive of a diagnostic search query. Based on the intent determination, the image may then be processed by a conditions classification model to determine one or more predicted condition classifications. Condition information can then be obtained and provided based on the one or more predicted condition classifications.

19.

发明授权
Instance level scene recognition with a vision language model 有权

公开(公告)号：US11978271B1

公开(公告)日：2024-05-07

申请号：US18496402

申请日：2023-10-27

Applicant: Google LLC

Inventor： Harshit Kharbanda , Boris Bluntschli , Vibhuti Mahajan , Louis Wang

IPC: G06V20/70 , G06V10/764 , G06V20/40

CPC classification number: G06V20/70 , G06V10/764 , G06V20/41

Abstract: Systems and methods for image understanding can include one or more object recognition systems and one or more vision language models to generate an augmented language output that can be both scene-aware and object-aware. The systems and methods can process an input image with an object recognition model to generate an object recognition output descriptive of identification details for an object depicted in the input image. The systems and methods can include processing the input image with a vision language model to generate a language output descriptive of a predicted scene description. The object recognition output can then be utilized to augment the language output to generate an augmented language output that includes the scene understanding of the language output with the specificity of the object recognition output.

20.

发明授权
Efficiently augmenting images with related content 有权

公开(公告)号：US11747960B2

公开(公告)日：2023-09-05

申请号：US17563695

申请日：2021-12-28

Applicant: Google LLC

Inventor： Charles Yang , Louis Wang , Charles J. Rosenberg

IPC: G06F17/00 , G06F3/0482 , G06F16/583 , G06F16/951 , G06F40/205 , G06F3/04845

CPC classification number: G06F3/0482 , G06F3/04845 , G06F16/583 , G06F16/951 , G06F40/205 , G06F2203/04803 , G06F2203/04806

Abstract: The subject matter of this specification generally relates to providing content related to text depicted in images. In one aspect, a system includes a data processing apparatus configured to extract text from an image. The extracted text is partitioned into multiple blocks. The multiple blocks are presented as respective first user-selectable targets on a user interface at a first zoom level. A user selection of a first block of the multiple blocks is detected. In response to detecting the user selection of the first block, portions of the extracted text in the first block are presented as respective second user-selectable targets on the user interface at a second zoom level greater than the first zoom level. In response to detecting a user selection of a portion of the extracted text within the first block, an action is initiated based on content of the user-selected text.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification