-
Publication number: US20240330357A1
Publication date: 2024-10-03
Application number: US18743754
Filing date: 2024-06-14
Applicant: Google LLC
Inventor: Brett Aladdin Barros , Joshua Lee Fleetwood , Paulo Jose Telo Coelho
IPC: G06F16/532 , G06F16/535 , G06F16/538 , G06F16/55 , G06F16/583 , G06F16/901
CPC classification number: G06F16/532 , G06F16/535 , G06F16/538 , G06F16/55 , G06F16/583 , G06F16/9024
Abstract: A user can submit a visual query that includes one or more images. Various processing techniques such as optical character recognition (OCR) techniques can be used to recognize text (e.g., in the image, surrounding image(s), etc.) and/or various object detection techniques (e.g., machine-learned object detection models, etc.) may be used to detect objects (e.g., products, landmarks, animals, humans, etc.) within or related to the visual query. Content related to the detected text or object(s) can be identified and potentially provided to a user as search results or a proactive content feed. As such, aspects of the present disclosure enable the visual search system to more intelligently process a visual query to provide improved search results and content feeds, including those search results which are more personalized and/or consider contextual signals to account for implicit characteristics of the visual query and/or user's search intent.
-
Publication number: US20240135126A1
Publication date: 2024-04-25
Application number: US18397918
Filing date: 2023-12-27
Applicant: Google LLC
Inventor: Alexander James Faaborg , Brett Aladdin Barros
CPC classification number: G06K7/1443 , G06K7/1447 , G06K19/06103 , G06V10/255
Abstract: The present disclosure relates generally to the processing of machine-readable visual encodings in view of contextual information. One embodiment of aspects of the present disclosure comprises obtaining image data descriptive of a scene that includes a machine-readable visual encoding; processing the image data with a first recognition system configured to recognize the machine-readable visual encoding; processing the image data with a second, different recognition system configured to recognize a surrounding portion of the scene that surrounds the machine-readable visual encoding; identifying a stored reference associated with the machine-readable visual encoding based at least in part on one or more first outputs generated by the first recognition system based on the image data and based at least in part on one or more second outputs generated by the second recognition system based on the image data; and performing one or more actions responsive to identification of the stored reference.
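
The abstract above combines the output of a code reader with the output of a second recognizer that looks at the surrounding scene. The following is a minimal sketch of that idea, not the patented method: the `StoredReference` type, the `identify_reference` function, the equal 0.5/0.5 weighting, and the threshold are all assumptions introduced for illustration.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, Iterable, Optional


@dataclass(frozen=True)
class StoredReference:
    """Hypothetical stored reference: a payload plus the scene labels expected around it."""
    name: str
    payload: str
    context_labels: FrozenSet[str]


def identify_reference(
    code_scores: Dict[str, float],      # first recognizer: candidate payload -> confidence
    scene_labels: FrozenSet[str],       # second recognizer: labels for the surrounding scene
    references: Iterable[StoredReference],
    threshold: float = 0.5,             # assumed cut-off, not taken from the source
) -> Optional[StoredReference]:
    """Return the best-matching stored reference, or None if nothing scores above threshold."""
    best, best_score = None, threshold
    for ref in references:
        code_score = code_scores.get(ref.payload, 0.0)
        if ref.context_labels:
            overlap = len(ref.context_labels & scene_labels) / len(ref.context_labels)
        else:
            overlap = 0.0
        # Assumed equal weighting of the two recognizers' evidence.
        score = 0.5 * code_score + 0.5 * overlap
        if score > best_score:
            best, best_score = ref, score
    return best


refs = [
    StoredReference("cafe menu", "MENU-17", frozenset({"cafe", "table"})),
    StoredReference("museum guide", "MENU-11", frozenset({"museum", "exhibit"})),
]
# The code reader is unsure between two payloads; the scene context breaks the tie.
ambiguous = {"MENU-17": 0.6, "MENU-11": 0.6}
match = identify_reference(ambiguous, frozenset({"cafe", "table"}), refs)
```

In this sketch the code reader alone cannot separate the two candidates, and neither clears the threshold without scene evidence, so context is what makes the identification possible.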
-
Publication number: US11289084B2
Publication date: 2022-03-29
Application number: US16689784
Filing date: 2019-11-20
Applicant: Google LLC
Inventor: Tim Wantland , Donald A. Barnett , David Matthew Jones , Christopher Breithaupt , Brett Aladdin Barros , Allison Lee Stanfield , Nicholas Aceves , Megan Elizabeth Fazio , Christopher Robert Conover
Abstract: Provided are methods, systems, and devices for generating semantic objects and an output based on the detection or recognition of the state of an environment that includes objects. State data, based in part on sensor output, can be received from one or more sensors that detect a state of an environment including objects. Based in part on the state data, semantic objects are generated. The semantic objects can correspond to the objects and include a set of attributes. Based in part on the set of attributes of the semantic objects, one or more operating modes, associated with the semantic objects can be determined. Based in part on the one or more operating modes, object outputs associated with the semantic objects can be generated. The object outputs can include one or more visual indications or one or more audio indications.
-
Publication number: US11836553B2
Publication date: 2023-12-05
Application number: US17894411
Filing date: 2022-08-24
Applicant: Google LLC
Inventor: Alexander James Faaborg , Brett Aladdin Barros
CPC classification number: G06K7/1443 , G06K7/1447 , G06K19/06103 , G06V10/255
Abstract: The present disclosure relates generally to the processing of machine-readable visual encodings in view of contextual information. One embodiment of aspects of the present disclosure comprises obtaining image data descriptive of a scene that includes a machine-readable visual encoding; processing the image data with a first recognition system configured to recognize the machine-readable visual encoding; processing the image data with a second, different recognition system configured to recognize a surrounding portion of the scene that surrounds the machine-readable visual encoding; identifying a stored reference associated with the machine-readable visual encoding based at least in part on one or more first outputs generated by the first recognition system based on the image data and based at least in part on one or more second outputs generated by the second recognition system based on the image data; and performing one or more actions responsive to identification of the stored reference.
-
Publication number: US11429804B2
Publication date: 2022-08-30
Application number: US17025597
Filing date: 2020-09-18
Applicant: Google LLC
Inventor: Alexander James Faaborg , Brett Aladdin Barros
Abstract: The present disclosure relates generally to the processing of machine-readable visual encodings in view of contextual information. One embodiment of aspects of the present disclosure comprises obtaining image data descriptive of a scene that includes a machine-readable visual encoding; processing the image data with a first recognition system configured to recognize the machine-readable visual encoding; processing the image data with a second, different recognition system configured to recognize a surrounding portion of the scene that surrounds the machine-readable visual encoding; identifying a stored reference associated with the machine-readable visual encoding based at least in part on one or more first outputs generated by the first recognition system based on the image data and based at least in part on one or more second outputs generated by the second recognition system based on the image data; and performing one or more actions responsive to identification of the stored reference.
-
Publication number: US20220092284A1
Publication date: 2022-03-24
Application number: US17025597
Filing date: 2020-09-18
Applicant: Google LLC
Inventor: Alexander James Faaborg , Brett Aladdin Barros
Abstract: The present disclosure relates generally to the processing of machine-readable visual encodings in view of contextual information. One embodiment of aspects of the present disclosure comprises obtaining image data descriptive of a scene that includes a machine-readable visual encoding; processing the image data with a first recognition system configured to recognize the machine-readable visual encoding; processing the image data with a second, different recognition system configured to recognize a surrounding portion of the scene that surrounds the machine-readable visual encoding; identifying a stored reference associated with the machine-readable visual encoding based at least in part on one or more first outputs generated by the first recognition system based on the image data and based at least in part on one or more second outputs generated by the second recognition system based on the image data; and performing one or more actions responsive to identification of the stored reference.
-
Publication number: US12014727B2
Publication date: 2024-06-18
Application number: US17305806
Filing date: 2021-07-14
Applicant: Google LLC
Inventor: Brett Aladdin Barros , James Flynn , Theo Goguely
CPC classification number: G10L15/08 , G10L15/22 , G10L2015/088
Abstract: A method for a soft acceptance of a hotword receives audio data characterizing a soft hotword event detected by a hotword detector in streaming audio captured by a user device. The method also processes the audio data to determine that the audio data corresponds to a query specifying an action to perform on the user device. Without triggering performance of the action on the user device or the other device, the method provides a notification for output from the user device where the notification prompts a user associated with the user device to provide an affirmative input indication in order to trigger performance of the action on the user device or the other device and, when the user fails to provide the affirmative input indication, instructs the user device or the other device to not perform the action specified by the query.
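
The soft-acceptance flow described above (detect a low-confidence hotword, parse the query, ask for confirmation, and act only on an affirmative response) can be sketched as follows. This is an illustrative sketch only: `parse_action`, `handle_soft_hotword`, the "ok device" prefix, and the callback signatures are hypothetical and do not come from the patent or any real assistant API.

```python
from typing import Callable, Optional


def parse_action(transcript: str) -> Optional[str]:
    """Hypothetical parser: treat anything after 'ok device' as the requested action."""
    prefix = "ok device"
    text = transcript.strip().lower()
    if text.startswith(prefix) and len(text) > len(prefix):
        return text[len(prefix):].strip(" ,")
    return None


def handle_soft_hotword(
    transcript: str,
    confirm: Callable[[str], bool],   # shows a notification, returns True on affirmative input
    perform: Callable[[str], None],   # actually carries out the action
) -> str:
    """Handle a soft hotword event without performing the action up front."""
    action = parse_action(transcript)
    if action is None:
        return "ignored"              # audio did not correspond to an actionable query
    if confirm(f"Did you mean: {action}?"):
        perform(action)
        return "performed"
    return "suppressed"               # no affirmative input: do not perform the action


performed = []
accepted = handle_soft_hotword("OK device, turn off the lights", lambda _: True, performed.append)
declined = handle_soft_hotword("OK device, call Alex", lambda _: False, performed.append)
```

The key design point in this sketch is that `perform` is never called before `confirm` returns `True`, which mirrors the abstract's requirement that the action is not triggered until the user affirms it.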
-
Publication number: US20230042878A1
Publication date: 2023-02-09
Application number: US17822900
Filing date: 2022-08-29
Applicant: Google LLC
Inventor: Brett Aladdin Barros , Theophile Vincent Goguely
IPC: G06F3/04845 , G06F3/01
Abstract: A method for dynamically changing a graphical user interface element occurs in response to detecting that a temporal user interface element is displayed on a user interface of a user device. The method includes receiving, at the user device, a contextual signal characterizing a state of a user. The method further includes determining, by the user device, that the contextual signal characterizing the state of the user is indicative of the user intending to interact with the temporal user interface element. The method also includes, in response to determining that the contextual signal characterizing the state of the user is indicative of the user intending to interact with the temporal user interface element, modifying a respective state of the temporal user interface element displayed on the user interface of the user device.
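
As a rough illustration of the steps in this abstract, the sketch below models a temporal element as something that dismisses itself after a timeout and modifies its state when a contextual signal suggests the user intends to interact. The `TemporalElement` type, the signal keys (`gaze_on_element`, `hand_near_screen`), and the specific modification (extending the timeout and enlarging the element) are assumptions for illustration, not details from the source.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class TemporalElement:
    """Hypothetical UI element that disappears when remaining_s reaches zero."""
    label: str
    remaining_s: float
    enlarged: bool = False


def indicates_intent(signal: Dict[str, bool]) -> bool:
    """Assumed heuristic: gaze on the element or a hand near the screen implies intent."""
    return signal.get("gaze_on_element", False) or signal.get("hand_near_screen", False)


def apply_contextual_signal(
    element: TemporalElement,
    signal: Dict[str, bool],
    extend_by_s: float = 5.0,   # assumed extension, not taken from the source
) -> TemporalElement:
    """Modify the element's state only when the signal indicates intent to interact."""
    if indicates_intent(signal):
        element.remaining_s += extend_by_s
        element.enlarged = True
    return element


toast = TemporalElement("Incoming call", remaining_s=3.0)
apply_contextual_signal(toast, {"gaze_on_element": True})
untouched = apply_contextual_signal(TemporalElement("Timer done", 3.0), {})
```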
-
Publication number: US20230042215A1
Publication date: 2023-02-09
Application number: US17971135
Filing date: 2022-10-21
Applicant: Google LLC
Inventor: Alexander James Faaborg , Brett Aladdin Barros
Abstract: The present disclosure relates generally to the processing of machine-readable visual encodings in view of contextual information. One embodiment of aspects of the present disclosure comprises obtaining image data descriptive of a scene that includes a machine-readable visual encoding; processing the image data with a first recognition system configured to recognize the machine-readable visual encoding; processing the image data with a second, different recognition system configured to recognize a surrounding portion of the scene that surrounds the machine-readable visual encoding; identifying a stored reference associated with the machine-readable visual encoding based at least in part on one or more first outputs generated by the first recognition system based on the image data and based at least in part on one or more second outputs generated by the second recognition system based on the image data; and performing one or more actions responsive to identification of the stored reference.
-
Publication number: US12045278B2
Publication date: 2024-07-23
Application number: US17025435
Filing date: 2020-09-18
Applicant: Google LLC
Inventor: Brett Aladdin Barros , Joshua Lee Fleetwood , Paulo Jose Telo Coelho
IPC: G06F16/538 , G06F16/532 , G06F16/535 , G06F16/55 , G06F16/583 , G06F16/901
CPC classification number: G06F16/532 , G06F16/535 , G06F16/538 , G06F16/55 , G06F16/583 , G06F16/9024
Abstract: A user can submit a visual query that includes one or more images. Various processing techniques such as optical character recognition (OCR) techniques can be used to recognize text (e.g., in the image, surrounding image(s), etc.) and/or various object detection techniques (e.g., machine-learned object detection models, etc.) may be used to detect objects (e.g., products, landmarks, animals, humans, etc.) within or related to the visual query. Content related to the detected text or object(s) can be identified and potentially provided to a user as search results or a proactive content feed. As such, aspects of the present disclosure enable the visual search system to more intelligently process a visual query to provide improved search results and content feeds, including those search results which are more personalized and/or consider contextual signals to account for implicit characteristics of the visual query and/or user's search intent.