WORD-LEVEL END-TO-END NEURAL SPEAKER DIARIZATION WITH AUXNET

    Publication No.: US20250118292A1

    Publication Date: 2025-04-10

    Application No.: US18891045

    Application Date: 2024-09-20

    Applicant: Google LLC

    Abstract: A method includes obtaining labeled training data including a plurality of spoken terms spoken during a conversation. For each respective spoken term, the method includes generating a corresponding sequence of intermediate audio encodings from a corresponding sequence of acoustic frames, generating a corresponding sequence of final audio encodings from the corresponding sequence of intermediate audio encodings, generating a corresponding speech recognition result, and generating a respective speaker token representing a predicted identity of a speaker for each corresponding speech recognition result. The method also includes training the joint speech recognition and speaker diarization model jointly based on a first loss derived from the generated speech recognition results and the corresponding transcriptions and a second loss derived from the generated speaker tokens and the corresponding speaker labels.
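
The joint training objective in this abstract combines a speech recognition loss with a speaker token loss. The following is a minimal sketch of one way such a combination could look, not the applicant's implementation. It assumes that both losses are mean cross-entropies and that they are combined by a weighted sum; the function names and the `spk_weight` parameter are hypothetical and do not come from the abstract.

```python
import math

def cross_entropy(probs, target_index):
    """Negative log-likelihood of the target class under a probability vector."""
    return -math.log(probs[target_index])

def joint_loss(asr_probs, asr_targets, spk_probs, spk_targets, spk_weight=1.0):
    """Weighted sum of a recognition loss and a speaker-token loss.

    asr_probs / spk_probs: one probability vector per spoken term.
    asr_targets / spk_targets: index of the reference word / speaker label.
    spk_weight: hypothetical weighting factor (an assumption, not from the source).
    """
    # First loss: generated recognition results vs. reference transcriptions.
    asr = sum(cross_entropy(p, t) for p, t in zip(asr_probs, asr_targets)) / len(asr_targets)
    # Second loss: generated speaker tokens vs. reference speaker labels.
    spk = sum(cross_entropy(p, t) for p, t in zip(spk_probs, spk_targets)) / len(spk_targets)
    return asr + spk_weight * spk
```

In this sketch a single scalar drives the update of one shared model, which is what makes the training joint rather than two separate training runs.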

    SELF-SUPERVISED LEARNING FOR AUDIO PROCESSING

    Publication No.: US20250118291A1

    Publication Date: 2025-04-10

    Application No.: US18832864

    Application Date: 2023-01-30

    Applicant: Google LLC

    Abstract: Methods, computer systems, and apparatus, including computer programs encoded on computer storage media, for training an audio-processing neural network that includes at least (1) a first encoder network having a first set of encoder network parameters and (2) a decoder network having a set of decoder network parameters. The system obtains a set of un-labeled audio data segments, and generates, from the set of un-labeled audio data segments, a set of encoder training examples. The system performs training of a second encoder neural network that includes at least the first encoder neural network on the set of generated encoder training examples. The system also obtains one or more labeled training examples, and performs training of the audio-processing neural network on the labeled training examples.

    ATTENTION-BASED IMAGE GENERATION NEURAL NETWORKS

    Publication No.: US20250118064A1

    Publication Date: 2025-04-10

    Application No.: US18913134

    Application Date: 2024-10-11

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating an output image. In one aspect, one of the methods includes generating the output image intensity value by intensity value according to a generation order of pixel-color channel pairs from the output image, comprising, for each particular generation order position in the generation order: generating a current output image representation of a current output image, processing the current output image representation using a decoder neural network to generate a probability distribution over possible intensity values for the pixel-color channel pair at the particular generation order position, wherein the decoder neural network includes one or more local masked self-attention sub-layers; and selecting an intensity value for the pixel-color channel pair at the particular generation order position using the probability distribution.

    Personalized Suggestion Manager

    Publication No.: US20250117234A1

    Publication Date: 2025-04-10

    Application No.: US18903803

    Application Date: 2024-10-01

    Applicant: Google LLC

    Abstract: This document describes systems and techniques for implementing personalized suggestions for a user interacting with a facility management system based on contextual metadata to assist the user in controlling the facility management system. For example, a system includes a request module configured to receive a request from a user. A metadata module is configured to access and identify metadata related to a content or context of the request. A large language model (LLM) module is configured to receive the request and the metadata and to generate a suggestion relevant to the content or context of the request. A suggestion module is configured to present the suggestion to the user.

    CALIBRATION QUALITY CONTROL USING MULTIPLE MAGNETOMETERS

    Publication No.: US20250116732A1

    Publication Date: 2025-04-10

    Application No.: US18565891

    Application Date: 2022-09-22

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, for calibration quality control using multiple magnetometers. One of the methods includes: receiving measurements by two or more magnetic field sensors of a device over a period of time, wherein each measurement measures a magnetic field at each magnetic field sensor, wherein each measurement at each time point over the period of time includes a vector in one or more spatial axes of a three-dimensional space; computing a difference between the measurements over the period of time, wherein the difference at each time point over the period of time is a result of computing a difference based on one or more pairs of the vectors at the time point; determining that the difference does not remain within a predetermined range over the period of time; and in response, classifying calibration quality of the device as unsuitable for computing a heading of the device.
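
The check described in this abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the claimed method: it assumes exactly two sensors, uses the Euclidean distance between their vectors as the "difference", and treats the predetermined range as a pair of bounds. The names `calibration_is_suitable`, `low`, and `high` are hypothetical.

```python
import math

def calibration_is_suitable(readings_a, readings_b, low, high):
    """Return False if the sensor-to-sensor difference leaves [low, high].

    readings_a / readings_b: equal-length sequences of field vectors,
    one per time point, from two magnetometers on the same device.
    low / high: assumed bounds of the predetermined range.
    """
    for vec_a, vec_b in zip(readings_a, readings_b):
        # Assumption: the difference is the Euclidean distance between the vectors.
        diff = math.dist(vec_a, vec_b)
        if not (low <= diff <= high):
            # Difference did not stay within range: unsuitable for heading.
            return False
    return True
```

For example, two sensors that agree closely at every time point pass the check, while a single time point with a large disagreement marks the calibration as unsuitable for computing a heading.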

    Deep trench capacitors embedded in package substrate

    Publication No.: US12274079B2

    Publication Date: 2025-04-08

    Application No.: US18244716

    Application Date: 2023-09-11

    Applicant: Google LLC

    Abstract: This disclosure relates to deep trench capacitors embedded in a package substrate on which an integrated circuit is mounted. In some aspects, a chip package includes an integrated circuit die that has a power distribution circuit for one or more circuits of the integrated circuit. The chip package also includes a substrate different from the integrated circuit and having a first surface on which the integrated circuit die is mounted and a second surface opposite the first surface. The substrate includes one or more cavities formed in at least one of the first surface or the second surface. The chip package also includes one or more deep trench capacitors disposed in at least one of the one or more cavities. Each deep trench capacitor is connected to the power distribution circuit by conductors.

    Systems and methods for upmixing audiovisual data

    Publication No.: US12273697B2

    Publication Date: 2025-04-08

    Application No.: US18042258

    Application Date: 2020-08-26

    Applicant: Google LLC

    Abstract: A computer-implemented method for upmixing audiovisual data can include obtaining audiovisual data including input audio data and video data accompanying the input audio data. Each frame of the video data can depict only a portion of a larger scene. The input audio data can have a first number of audio channels. The computer-implemented method can include providing the audiovisual data as input to a machine-learned audiovisual upmixing model. The audiovisual upmixing model can include a sequence-to-sequence model configured to model a respective location of one or more audio sources within the larger scene over multiple frames of the video data. The computer-implemented method can include receiving upmixed audio data from the audiovisual upmixing model. The upmixed audio data can have a second number of audio channels. The second number of audio channels can be greater than the first number of audio channels.

    Thermal management with antenna modules

    Publication No.: US12273167B2

    Publication Date: 2025-04-08

    Application No.: US17636887

    Application Date: 2020-08-31

    Applicant: Google LLC

    Abstract: A user equipment (UE) manages thermal levels of antenna modules with reference to a temperature threshold. The UE includes multiple antenna modules having a first antenna module and a second antenna module and at least one wireless transceiver coupled to the multiple antenna modules. The UE also includes a processor and memory system implementing an antenna module thermal manager. The manager is configured to obtain a first temperature indication corresponding to the first antenna module of the multiple antenna modules. The manager is also configured to perform a comparison of the first temperature indication to at least one temperature threshold. The manager is further configured to switch, based on the comparison, from using the first antenna module to using the second antenna module for wireless communication with the at least one wireless transceiver.
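
The compare-and-switch behavior in this abstract can be illustrated with a short sketch. It assumes a single threshold and a switch when the active module's temperature meets or exceeds it; the abstract only says the switch is "based on the comparison", so that rule, along with the `AntennaModule` class and `select_module` function, is hypothetical.

```python
from dataclasses import dataclass

@dataclass
class AntennaModule:
    name: str
    temperature_c: float  # latest temperature indication for this module

def select_module(active, standby, threshold_c):
    """Return the module to use for wireless communication.

    Assumed rule: switch to the standby module once the active module's
    temperature indication reaches the threshold; otherwise keep the active one.
    """
    if active.temperature_c >= threshold_c:
        return standby
    return active
```

A real manager would likely also check the standby module's temperature before switching; that step is left out here because the abstract does not describe it.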

    Calibration-free instant motion tracking for augmented reality

    Publication No.: US12272096B2

    Publication Date: 2025-04-08

    Application No.: US18335614

    Application Date: 2023-06-15

    Applicant: Google LLC

    Abstract: The present disclosure provides systems and methods for calibration-free instant motion tracking useful, for example, for rendering virtual content in augmented reality settings. In particular, a computing system can iteratively augment image frames that depict a scene to insert virtual content at an anchor region within the scene, including situations in which the anchor region moves relative to the scene. To do so, the computing system can estimate, for each of a number of sequential image frames: a rotation of an image capture system that captures the image frames; and a translation of the anchor region relative to an image capture system, thereby providing sufficient information to determine where and at what orientation to render the virtual content within the image frame.
