Intelligent Systems and Methods for Visual Search Queries

    Publication No.: US20240330357A1

    Publication Date: 2024-10-03

    Application No.: US18743754

    Filing Date: 2024-06-14

    Applicant: Google LLC

    Abstract: A user can submit a visual query that includes one or more images. Various processing techniques such as optical character recognition (OCR) techniques can be used to recognize text (e.g., in the image, surrounding image(s), etc.) and/or various object detection techniques (e.g., machine-learned object detection models, etc.) may be used to detect objects (e.g., products, landmarks, animals, humans, etc.) within or related to the visual query. Content related to the detected text or object(s) can be identified and potentially provided to a user as search results or a proactive content feed. As such, aspects of the present disclosure enable the visual search system to more intelligently process a visual query to provide improved search results and content feeds, including those search results which are more personalized and/or consider contextual signals to account for implicit characteristics of the visual query and/or the user's search intent.

    Platform for Registering and Processing Visual Encodings

    Publication No.: US20240135126A1

    Publication Date: 2024-04-25

    Application No.: US18397918

    Filing Date: 2023-12-27

    Applicant: Google LLC

    CPC classification number: G06K7/1443 G06K7/1447 G06K19/06103 G06V10/255

    Abstract: The present disclosure relates generally to the processing of machine-readable visual encodings in view of contextual information. One embodiment of aspects of the present disclosure comprises obtaining image data descriptive of a scene that includes a machine-readable visual encoding; processing the image data with a first recognition system configured to recognize the machine-readable visual encoding; processing the image data with a second, different recognition system configured to recognize a surrounding portion of the scene that surrounds the machine-readable visual encoding; identifying a stored reference associated with the machine-readable visual encoding based at least in part on one or more first outputs generated by the first recognition system based on the image data and based at least in part on one or more second outputs generated by the second recognition system based on the image data; and performing one or more actions responsive to identification of the stored reference.
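As a rough illustration of the flow this abstract describes, the sketch below combines the output of an encoding recognizer with the output of a separate scene recognizer to pick a stored reference. It is a minimal sketch, not the patented implementation: `StoredReference`, `recognize_encoding`, `recognize_context`, and `identify_reference` are hypothetical names, images are modeled as plain dictionaries, and requiring an exact match on both outputs is an assumption made only to keep the example small.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class StoredReference:
    """Hypothetical registry entry for a registered visual encoding."""
    encoding_value: str  # payload decoded from the visual encoding
    context_label: str   # expected label for the surrounding scene
    action: str          # action to perform once the reference is identified


def recognize_encoding(image: dict) -> Optional[str]:
    """Stand-in for the first recognition system (e.g., a code decoder)."""
    return image.get("encoding")


def recognize_context(image: dict) -> Optional[str]:
    """Stand-in for the second, different recognition system (scene context)."""
    return image.get("context")


def identify_reference(
    image: dict, registry: Iterable[StoredReference]
) -> Optional[StoredReference]:
    """Return the stored reference that agrees with both recognizer outputs."""
    value = recognize_encoding(image)
    context = recognize_context(image)
    if value is None or context is None:
        return None
    for ref in registry:
        # Assumption: both outputs must match exactly; a real system
        # would more likely score or rank candidate references.
        if ref.encoding_value == value and ref.context_label == context:
            return ref
    return None
```

In this toy model, the same encoding payload can map to different actions depending on the recognized surroundings, which is the role contextual information plays in the abstract.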

    Platform for registering and processing visual encodings

    Publication No.: US11836553B2

    Publication Date: 2023-12-05

    Application No.: US17894411

    Filing Date: 2022-08-24

    Applicant: Google LLC

    CPC classification number: G06K7/1443 G06K7/1447 G06K19/06103 G06V10/255

    Abstract: The present disclosure relates generally to the processing of machine-readable visual encodings in view of contextual information. One embodiment of aspects of the present disclosure comprises obtaining image data descriptive of a scene that includes a machine-readable visual encoding; processing the image data with a first recognition system configured to recognize the machine-readable visual encoding; processing the image data with a second, different recognition system configured to recognize a surrounding portion of the scene that surrounds the machine-readable visual encoding; identifying a stored reference associated with the machine-readable visual encoding based at least in part on one or more first outputs generated by the first recognition system based on the image data and based at least in part on one or more second outputs generated by the second recognition system based on the image data; and performing one or more actions responsive to identification of the stored reference.

    Platform for registering and processing visual encodings

    Publication No.: US11429804B2

    Publication Date: 2022-08-30

    Application No.: US17025597

    Filing Date: 2020-09-18

    Applicant: Google LLC

    Abstract: The present disclosure relates generally to the processing of machine-readable visual encodings in view of contextual information. One embodiment of aspects of the present disclosure comprises obtaining image data descriptive of a scene that includes a machine-readable visual encoding; processing the image data with a first recognition system configured to recognize the machine-readable visual encoding; processing the image data with a second, different recognition system configured to recognize a surrounding portion of the scene that surrounds the machine-readable visual encoding; identifying a stored reference associated with the machine-readable visual encoding based at least in part on one or more first outputs generated by the first recognition system based on the image data and based at least in part on one or more second outputs generated by the second recognition system based on the image data; and performing one or more actions responsive to identification of the stored reference.

    Platform for Registering and Processing Visual Encodings

    Publication No.: US20220092284A1

    Publication Date: 2022-03-24

    Application No.: US17025597

    Filing Date: 2020-09-18

    Applicant: Google LLC

    Abstract: The present disclosure relates generally to the processing of machine-readable visual encodings in view of contextual information. One embodiment of aspects of the present disclosure comprises obtaining image data descriptive of a scene that includes a machine-readable visual encoding; processing the image data with a first recognition system configured to recognize the machine-readable visual encoding; processing the image data with a second, different recognition system configured to recognize a surrounding portion of the scene that surrounds the machine-readable visual encoding; identifying a stored reference associated with the machine-readable visual encoding based at least in part on one or more first outputs generated by the first recognition system based on the image data and based at least in part on one or more second outputs generated by the second recognition system based on the image data; and performing one or more actions responsive to identification of the stored reference.

    Hotwording by degree
    Granted invention patent

    Publication No.: US12014727B2

    Publication Date: 2024-06-18

    Application No.: US17305806

    Filing Date: 2021-07-14

    Applicant: Google LLC

    CPC classification number: G10L15/08 G10L15/22 G10L2015/088

    Abstract: A method for a soft acceptance of a hotword receives audio data characterizing a soft hotword event detected by a hotword detector in streaming audio captured by a user device. The method also processes the audio data to determine that the audio data corresponds to a query specifying an action to perform on the user device. Without triggering performance of the action on the user device or the other device, the method provides a notification for output from the user device where the notification prompts a user associated with the user device to provide an affirmative input indication in order to trigger performance of the action on the user device or the other device and, when the user fails to provide the affirmative input indication, instructs the user device or the other device to not perform the action specified by the query.
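The soft-acceptance flow in this abstract can be sketched as a small decision function. This is a hedged illustration under stated assumptions: `handle_soft_hotword`, `Decision`, and the callback parameters `parse_query`, `notify`, and `await_confirmation` are hypothetical names, audio is represented by an already-transcribed string, and the `IGNORE` outcome for audio that is not a query is an assumption the abstract does not spell out.

```python
from enum import Enum
from typing import Callable, Optional, Tuple


class Decision(Enum):
    PERFORM = "perform"
    DO_NOT_PERFORM = "do_not_perform"
    IGNORE = "ignore"  # assumption: audio did not correspond to a query


def handle_soft_hotword(
    transcript: str,
    parse_query: Callable[[str], Optional[str]],
    notify: Callable[[str], None],
    await_confirmation: Callable[[], bool],
) -> Tuple[Decision, Optional[str]]:
    """Decide what to do after a soft hotword event (illustrative only)."""
    # Determine whether the audio corresponds to a query specifying an action.
    action = parse_query(transcript)
    if action is None:
        return Decision.IGNORE, None

    # Prompt the user without triggering the action.
    notify(f"Did you want to: {action}?")

    # Perform only on an affirmative input; otherwise instruct not to perform.
    if await_confirmation():
        return Decision.PERFORM, action
    return Decision.DO_NOT_PERFORM, action
```

Passing the notification and confirmation steps in as callbacks keeps the sketch independent of any particular device or user interface.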

    Preserving Engagement State Based On Contextual Signals

    Publication No.: US20230042878A1

    Publication Date: 2023-02-09

    Application No.: US17822900

    Filing Date: 2022-08-29

    Applicant: Google LLC

    Abstract: A method for dynamically changing a graphical user interface element occurs in response to detecting that a temporal user interface element is displayed on a user interface of a user device. The method includes receiving, at the user device, a contextual signal characterizing a state of a user. The method further includes determining, by the user device, that the contextual signal characterizing the state of the user is indicative of the user intending to interact with the temporal user interface element. The method also includes, in response to determining that the contextual signal characterizing the state of the user is indicative of the user intending to interact with the temporal user interface element, modifying a respective state of the temporal user interface element displayed on the user interface of the user device.
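One way to picture the method in this abstract is a timed on-screen element whose lifetime is extended when a contextual signal suggests the user is about to interact with it. The sketch below is only an assumption-laden example: `TemporalElement`, `indicates_intent`, `on_contextual_signal`, the signal keys `gazing_at_screen` and `approaching_device`, and the choice of "extend the timeout" as the state modification are all hypothetical and not taken from the patent.

```python
from dataclasses import dataclass


@dataclass
class TemporalElement:
    """Hypothetical UI element that disappears when its timer runs out."""
    remaining_s: float
    visible: bool = True


def indicates_intent(signal: dict) -> bool:
    """Assumed rule: gaze or approach suggests the user intends to interact."""
    return bool(signal.get("gazing_at_screen")) or bool(signal.get("approaching_device"))


def on_contextual_signal(
    element: TemporalElement, signal: dict, extension_s: float = 5.0
) -> TemporalElement:
    """Modify the element's state when the signal indicates intent."""
    if element.visible and indicates_intent(signal):
        # Assumed modification: keep the element on screen longer.
        element.remaining_s += extension_s
    return element
```

Other state changes, such as pausing the timer or enlarging the element, would fit the same structure.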

    Platform for Registering and Processing Visual Encodings

    Publication No.: US20230042215A1

    Publication Date: 2023-02-09

    Application No.: US17971135

    Filing Date: 2022-10-21

    Applicant: Google LLC

    Abstract: The present disclosure relates generally to the processing of machine-readable visual encodings in view of contextual information. One embodiment of aspects of the present disclosure comprises obtaining image data descriptive of a scene that includes a machine-readable visual encoding; processing the image data with a first recognition system configured to recognize the machine-readable visual encoding; processing the image data with a second, different recognition system configured to recognize a surrounding portion of the scene that surrounds the machine-readable visual encoding; identifying a stored reference associated with the machine-readable visual encoding based at least in part on one or more first outputs generated by the first recognition system based on the image data and based at least in part on one or more second outputs generated by the second recognition system based on the image data; and performing one or more actions responsive to identification of the stored reference.

    Intelligent systems and methods for visual search queries

    Publication No.: US12045278B2

    Publication Date: 2024-07-23

    Application No.: US17025435

    Filing Date: 2020-09-18

    Applicant: Google LLC

    Abstract: A user can submit a visual query that includes one or more images. Various processing techniques such as optical character recognition (OCR) techniques can be used to recognize text (e.g., in the image, surrounding image(s), etc.) and/or various object detection techniques (e.g., machine-learned object detection models, etc.) may be used to detect objects (e.g., products, landmarks, animals, humans, etc.) within or related to the visual query. Content related to the detected text or object(s) can be identified and potentially provided to a user as search results or a proactive content feed. As such, aspects of the present disclosure enable the visual search system to more intelligently process a visual query to provide improved search results and content feeds, including those search results which are more personalized and/or consider contextual signals to account for implicit characteristics of the visual query and/or the user's search intent.
