-
公开(公告)号:US20250118292A1
公开(公告)日:2025-04-10
申请号:US18891045
申请日:2024-09-20
Applicant: Google LLC
Inventor: Yiling Huang , Weiran Wang , Quan Wang , Guanlong Zhao , Hank Liao , Han Lu
Abstract: A method includes obtaining labeled training data including a plurality of spoken terms spoken during a conversation. For each respective spoken term, the method includes generating a corresponding sequence of intermediate audio encodings from a corresponding sequence of acoustic frames, generating a corresponding sequence of final audio encodings from the corresponding sequence of intermediate audio encodings, generating a corresponding speech recognition result, and generating a respective speaker token representing a predicted identity of a speaker for each corresponding speech recognition result. The method also includes training the joint speech recognition and speaker diarization model jointly based on a first loss derived from the generated speech recognition results and the corresponding transcriptions and a second loss derived from the generated speaker tokens and the corresponding speaker labels.
-
公开(公告)号:US20250118291A1
公开(公告)日:2025-04-10
申请号:US18832864
申请日:2023-01-30
Applicant: Google LLC
Inventor: Chung-Cheng CHIU , Weikeng QIN , Jiahui YU , Yonghui WU , Yu ZHANG
Abstract: Methods, computer systems, and apparatus, including computer programs encoded on computer storage media, for training an audio-processing neural network that includes at least (1) a first encoder network having a first set of encoder network parameters and (2) a decoder network having a set of decoder network parameters. The system obtains a set of un-labeled audio data segments, and generates, from the set of un-labeled audio data segments, a set of encoder training examples. The system performs training of a second encoder neural network that includes at least the first encoder neural network on the set of generated encoder training examples. The system also obtains one or more labeled training examples, and performs training of the audio-processing neural network on the labeled training examples.
-
公开(公告)号:US20250118064A1
公开(公告)日:2025-04-10
申请号:US18913134
申请日:2024-10-11
Applicant: Google LLC
Inventor: Noam M. Shazeer , Lukasz Mieczyslaw Kaiser , Jakob D. Uszkoreit , Niki J. Parmar , Ashish Teku Vaswani
IPC: G06V10/82 , G06F18/21 , G06F18/213 , G06F18/28 , G06N3/04 , G06N3/084 , G06T3/4053 , G06V10/56 , G06V10/77
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating an output image. In one aspect, one of the methods includes generating the output image intensity value by intensity value according to a generation order of pixel-color channel pairs from the output image, comprising, for each particular generation order position in the generation order: generating a current output image representation of a current output image, processing the current output image representation using a decoder neural network to generate a probability distribution over possible intensity values for the pixel—color channel pair at the particular generation order position, wherein the decoder neural network includes one or more local masked self-attention sub-layers; and selecting an intensity value for the pixel—color channel pair at the particular generation order position using the probability distribution.
-
公开(公告)号:US20250117234A1
公开(公告)日:2025-04-10
申请号:US18903803
申请日:2024-10-01
Applicant: Google LLC
Inventor: Indu Ramamurthi , Ryan Kam Wang Tai
Abstract: This document describes systems and techniques for implementing personalized suggestions for a user interacting with a facility management system based on contextual metadata to assist the user in controlling the facility management system. For example, a system includes a request module configured to receive a request from a user. A metadata module is configured to access and identify metadata related to a content or context of the request. A large language model (LLM) module is configured to receive the request and the metadata and to generate a suggestion relevant to the content or context of the request. A suggestion module is configured to present the suggestion to the user.
-
公开(公告)号:US20250116732A1
公开(公告)日:2025-04-10
申请号:US18565891
申请日:2022-09-22
Applicant: Google LLC
Inventor: Shandor Glenn Dektor , Martin Johannes Kraemer , Mark Fralick , Chuck Tally
IPC: G01R33/00
Abstract: Methods, systems, and apparatus, for calibration quality control using multiple magnetometers. One of the methods includes: receiving measurements by two or more magnetic field sensors of a device over a period of time, wherein each measurement measures a magnetic field at each magnetic field sensor, wherein each measurement at each time point over the period of time includes a vector in one or more spatial axes of a three-dimensional space; computing a difference between the measurements over the period of time, wherein the difference at each time point over the period of time is a result of computing a difference based on one or more pairs of the vectors at the time point; determining that the difference does not remain within a predetermined range over the period of time; and in response, classifying calibration quality of the device as unsuitable for computing a heading of the device.
-
公开(公告)号:USD1069827S1
公开(公告)日:2025-04-08
申请号:US29957361
申请日:2024-08-13
Applicant: Google LLC
Designer: Christopher Norman
-
公开(公告)号:US12274079B2
公开(公告)日:2025-04-08
申请号:US18244716
申请日:2023-09-11
Applicant: Google LLC
Inventor: Nam Hoon Kim , Teckgyu Kang , Scott Lee Kirkman , Woon-Seong Kwon
Abstract: This disclosure relates to deep trench capacitors embedded in a package substrate on which an integrated circuit is mounted. In some aspects, a chip package includes an integrated circuit die that has a power distribution circuit for one or more circuits of the integrated circuit. The chip package also includes a substrate different from the integrated circuit and having a first surface on which the integrated circuit die is mounted and a second surface opposite the first surface. The substrate includes one or more cavities formed in at least one of the first surface or the second surface. The chip package also includes one or more deep trench capacitors disposed in at least one of the one or more cavities. Each deep trench capacitor is connected to the power distribution circuit by conductors.
-
公开(公告)号:US12273697B2
公开(公告)日:2025-04-08
申请号:US18042258
申请日:2020-08-26
Applicant: Google LLC
Inventor: Aren Jansen , Manoj Plakal , Dan Ellis , Shawn Hershey , Richard Channing Moore, III
Abstract: A computer-implemented method for upmixing audiovisual data can include obtaining audiovisual data including input audio data and video data accompanying the input audio data. Each frame of the video data can depict only a portion of a larger scene. The input audio data can have a first number of audio channels. The computer-implemented method can include providing the audiovisual data as input to a machine-learned audiovisual upmixing model. The audiovisual upmixing model can include a sequence-to-sequence model configured to model a respective location of one or more audio sources within the larger scene over multiple frames of the video data. The computer-implemented method can include receiving upmixed audio data from the audiovisual upmixing model. The upmixed audio data can have a second number of audio channels. The second number of audio channels can be greater than the first number of audio channels.
-
公开(公告)号:US12273167B2
公开(公告)日:2025-04-08
申请号:US17636887
申请日:2020-08-31
Applicant: Google LLC
Inventor: Erik Richard Stauffer , Jibing Wang , Aamir Akram , Vijay L. Asrani
Abstract: A user equipment (UE) manages thermal levels of antenna modules with reference to a temperature threshold. The UE includes multiple antenna modules having a first antenna module and a second antenna module and at least one wireless transceiver coupled to the multiple antenna modules. The UE also includes a processor and memory system implementing an antenna module thermal manager. The manager is configured to obtain a first temperature indication corresponding to the first antenna module of the multiple antenna modules. The manager is also configured to perform a comparison of the first temperature indication to at least one temperature threshold. The manager is further configured to switch, based on the comparison, from using the first antenna module to using the second antenna module for wireless communication with the at least one wireless transceiver.
-
公开(公告)号:US12272096B2
公开(公告)日:2025-04-08
申请号:US18335614
申请日:2023-06-15
Applicant: Google LLC
Inventor: Jianing Wei , Matthias Grundmann
Abstract: The present disclosure provides systems and methods for calibration-free instant motion tracking useful, for example, for rending virtual content in augmented reality settings. In particular, a computing system can iteratively augment image frames that depict a scene to insert virtual content at an anchor region within the scene, including situations in which the anchor region moves relative to the scene. To do so, the computing system can estimate, for each of a number of sequential image frames: a rotation of an image capture system that captures the image frames; and a translation of the anchor region relative to an image capture system, thereby providing sufficient information to determine where and at what orientation to render the virtual content within the image frame.
-
-
-
-
-
-
-
-
-