-
公开(公告)号:US20250054246A1
公开(公告)日:2025-02-13
申请号:US18707075
申请日:2022-10-14
Applicant: Google LLC
Inventor: Ruofei Du , Alex Olwal
Abstract: A user can interact with sounds and speech in an environment using an augmented reality device. The augmented reality device can be configured to identify objects in the environment and display messages beside the object that are related to sounds produced by the object. For example, the messages may include sound statistics, transcripts of speech, and/or sound detection events. The disclosed approach enables a user to interact with these messages using a gaze and a gesture.
-
公开(公告)号:US12008204B2
公开(公告)日:2024-06-11
申请号:US18001830
申请日:2021-10-04
Applicant: Google LLC
Inventor: Alex Olwal , Thad Eugene Starner
IPC: G06F3/044 , G06F3/0488
CPC classification number: G06F3/0446 , G06F3/0488
Abstract: This document describes techniques directed to a scalable gesture sensor for wearable and soft electronic devices. The scalable gesture sensor is integrated into an object such as a wearable garment or a large-surface embedded system to provide a touch-sensitive surface for the object. The sensor includes a repeated localized crossover pattern formed by the same few sensor lines, resulting in the same two conductive lines having multiple crossover points across the touch-sensitive surface. The repeated crossover pattern enables detection of the occurrence and relative direction of a swipe gesture based at least on a repeated sequence of capacitance changes over a set of conductive lines in the repeated crossover pattern. Also, the scalable gesture sensor is computationally simple, uses low power, and is uniquely scalable to cover a large area with few electrodes.
-
公开(公告)号:US20230367960A1
公开(公告)日:2023-11-16
申请号:US18315113
申请日:2023-05-10
Applicant: Google LLC
Inventor: Boris Smus , Vikas Bahirwani , Ruofei Du , Christopher Ross , Alex Olwal
IPC: G06F40/20 , G06F40/166 , G10L15/26
CPC classification number: G06F40/20 , G06F40/166 , G10L15/26
Abstract: A method performed by a computing system comprises generating text from audio data and determining an end portion of the text to include in a summarization of the text based on a length of a portion of the audio data from which the text was generated and which ends with a proposed end portion and a time value associated with the proposed end portion, the proposed end portion including a word from the text.
-
公开(公告)号:US11656579B2
公开(公告)日:2023-05-23
申请号:US17023784
申请日:2020-09-17
Applicant: Google LLC
Inventor: Alex Olwal
CPC classification number: G04B45/0061 , G04B19/04 , G04C3/14
Abstract: Aspects of the disclosure provide a hybrid smartwatch that incorporates digital technology with an analog timepiece in a wristwatch form factor. A digital display layer of a non-emissive material is configured to present notices, data, content and other information. An analog display layer includes one or more hands of the timepiece, and overlies the digital display layer. The hands may be controlled by a processor through micro-stepper motors or other actuators. Physical motion of the hands provides expressivity, for instance via visual mechatronic effects. This may include buzzing, clapping, providing stylized visual features, hiding or minimizing information, and revealing information. The information presented on the digital display layer is presented concurrently with the hand movement, in a manner that complements the hand motion. This provides a rich, symbiotic dual-display layer arrangement that enhances the capabilities of the digital and analog display layers.
-
公开(公告)号:US20230136553A1
公开(公告)日:2023-05-04
申请号:US18050329
申请日:2022-10-27
Applicant: GOOGLE LLC
Inventor: Alex Olwal , Ruofei Du
Abstract: Smart devices can be configured to collect and share various forms of context data about where a user is located (e.g., location), what a user will be doing (e.g., schedule), and what a user is currently doing (e.g., activity). This context data may be combined with fingerprint data (e.g., biometrics) to help identify the fingerprint data. For example, a location of a user may help associated speech detected at that location with the user. These associations may be stored in a mapping database that can be updated over time to reduce ambiguities in identification. The mappings in the database may be used to train a machine learning model to recognize fingerprints as identities, which may be useful in applications, such as speaker identification.
-
公开(公告)号:US20230132041A1
公开(公告)日:2023-04-27
申请号:US18047494
申请日:2022-10-18
Applicant: GOOGLE LLC
Inventor: Alex Olwal , Ruofei Du
Abstract: The disclosed systems and method correlates user behaviors with audio processing to achieve more accurate conclusions about sounds in a user's environment. These conclusions may, in turn, be used to adjust the way a device, such as AR glasses, operate or respond to the sounds. For example, audio events determined from processing speech can be correlated with behavior events determined by sensing a user to improve a speech-to-text transcript of the speech by separating, or otherwise altering, the text in the transcript by speaker.
-
公开(公告)号:US20230071778A1
公开(公告)日:2023-03-09
申请号:US17446877
申请日:2021-09-03
Applicant: Google LLC
Inventor: Ruofei Du , Hendrik Wagenaar , Alex Olwal
IPC: G10L21/10 , G06K9/00 , G06F3/01 , H04R3/00 , H04R1/40 , G06T7/70 , G10L15/26 , G10L15/22 , H04R1/08 , G10L15/25 , G10L21/18 , G06T7/50
Abstract: An augmented reality (AR) device, such as AR glasses, may include a microphone array. The sensitivity of the microphone array can be directed to a target by beamforming, which includes combining the audio of each microphone of the array in a particular way based on a location of the target. The present disclosure describes systems and methods to determine the location of the target based on a gaze of a user and beamform the audio accordingly. This eye-tracked beamforming (i.e., foveated beamforming) can be used by AR applications to enhance sounds from a gaze direction and to suppress sounds from other directions. Additionally, the gaze information can be used to help visualize the results of an AR application, such as speech-to-text.
-
公开(公告)号:US20230005354A1
公开(公告)日:2023-01-05
申请号:US17366890
申请日:2021-07-02
Applicant: Google LLC
Inventor: Franziska Schlagenhauf , Nina R. Sinatra , Kurt Gross , Kelly Elizabeth Dobson , Artem Dementyev , Alex Olwal , Richard Francis Lyon
Abstract: A wearable device can include a wearable band configured to contact a user of the wearable device, an actuator, a sensor, and one or more processors in communication with the actuator and the sensor. The processors can be configured to measure a back electromotive force (“EMF”) of the actuator; determine, based on the measured back EMF, data that describes a contact force between the wearable band and the user; and determine, based on the data that describes the contact force, a quality metric describing a data quality of sensor data collected by the sensor. In some embodiments, the processor(s) can determine, generate sensor output data based on the sensor data and based at least in part on the data describing the contact force between the wearable band and the user. For example, one or more machine-learned models maybe leveraged to generate sensor output data that is compensated for the wearable band being too tight or too loose.
-
公开(公告)号:US11144014B2
公开(公告)日:2021-10-12
申请号:US16487541
申请日:2019-04-17
Applicant: Google LLC
Inventor: Alex Olwal , Philip Dam Roadley-Battin , Tyler Gough
Abstract: Aspects of the disclosure provide a hybrid smartwatch that incorporates digital technology with an analog timepiece in a wristwatch form factor. A digital display layer of a non-emissive material is configured to present notices, data, content and other information. An analog display layer includes one or more hands of the timepiece, and overlies the digital display layer. The hands may be controlled by a processor through micro-stepper motors or other actuators. Physical motion of the hands is simultaneously coupled with arrangement of content or other elements on the display layer, which results in numerous types of hybrid visualizations. This includes temporal presentations using hourly, daily, monthly or other time scales. Shape-shifting of the watch hands between 2D and 1D arrangements can linearly focus on certain information. And various content-aware layouts can highlight, bracket, occlude or otherwise emphasize or deemphasize displayed information.
-
10.
公开(公告)号:US20250165728A1
公开(公告)日:2025-05-22
申请号:US18862092
申请日:2023-05-10
Applicant: GOOGLE LLC
Inventor: Ruofei Du , Alex Olwal , Vikas Bahirwani , Boris Smus , Christopher Ross
IPC: G06F40/58
Abstract: Described techniques may be utilized to process transcribed text of a transcription stream using a summarization ML model to obtain a summary stream. In this way, a live conversation or other live speech may be provided to a user in real-time, using a display of a head-mounted device (HMD) or other suitable device. A user interface may be provided with a toggle or other stream selector for selecting either or both of the transcription stream or the summary stream at a given point in time. Actionable items may be identified within either or both of the transcription stream or the summary stream, and a scroll bar or other action selector may be provided in the user interface to execute corresponding actions.
-
-
-
-
-
-
-
-
-