Home appliance having speech recognition function

    公开(公告)号:US12225338B2

    公开(公告)日:2025-02-11

    申请号:US18103171

    申请日:2023-01-30

    Abstract: A home appliance includes an electrical equipment compartment disposed in an upper portion of the home appliance, and including an upper side that is open, an electrical equipment compartment cover to cover the open upper side of the electrical equipment compartment, and including a speaker hole, a microphone accommodating portion which protrudes upward from an upper side of the electrical equipment compartment cover and including an accommodating space and a front portion that includes microphone holes laterally spaced apart from each other and which face toward a front of the home appliance, a microphone unit including a printed circuit board (PCB) disposed in the accommodation space behind the microphone holes and including microphone chips mounted on the PCB, and a speaker unit disposed in the electrical equipment compartment to correspond to the speaker hole.

    Always-on audio control for mobile device

    公开(公告)号:US12211506B2

    公开(公告)日:2025-01-28

    申请号:US18501786

    申请日:2023-11-03

    Applicant: Apple Inc.

    Abstract: In an embodiment, an integrated circuit may include one or more CPUs, a memory controller, and a circuit configured to remain powered on when the rest of the SOC is powered down. The circuit may be configured to receive audio samples from a microphone, and match those audio samples against a predetermined pattern to detect a possible command from a user of the device that includes the SOC. In response to detecting the predetermined pattern, the circuit may cause the memory controller to power up so that audio samples may be stored in the memory to which the memory controller is coupled. The circuit may also cause the CPUs to be powered on and initialized, and the operating system (OS) may boot. During the time that the CPUs are initializing and the OS is booting, the circuit and the memory may be capturing the audio samples.

    Transcription presentation of communication sessions

    公开(公告)号:US12190889B2

    公开(公告)日:2025-01-07

    申请号:US18316427

    申请日:2023-05-12

    Abstract: A system is provided that includes a first network interface for a first network type and a second network interface for a second network type that is different from the first network type. The system also includes at least one processor configured to cause the system to perform operations. The operations may include obtaining, from the first network interface, audio from a communication session with a remote device established over the first network and obtaining an indication of a communication device available to participate in the communication session and direct audio obtained from the communication session to a remote transcription system. The operations may also include directing the audio to the second network interface for transmission to the communication device, obtaining transcript data from the remote transcription system based on the audio, and directing the transcript data to the second network interface for transmission to the communication device.

    AUTOMATED ASSISTANT PERFORMANCE OF A NON-ASSISTANT APPLICATION OPERATION(S) IN RESPONSE TO A USER INPUT THAT CAN BE LIMITED TO A PARAMETER(S)

    公开(公告)号:US20240361982A1

    公开(公告)日:2024-10-31

    申请号:US18765101

    申请日:2024-07-05

    Applicant: GOOGLE LLC

    CPC classification number: G06F3/167 G10L15/22 G10L15/28 G10L2015/223

    Abstract: Implementations set forth herein relate to an automated assistant that can provide a selectable action intent suggestion when a user is accessing a third party application that is controllable via the automated assistant. The action intent can be initialized by the user without explicitly invoking the automated assistant using, for example, an invocation phrase (e.g., “Assistant . . . ”). Rather, the user can initialize performance of the corresponding action by identifying one or more action parameters. In some implementations, the selectable suggestion can indicate that a microphone is active for the user to provide a spoken utterance that identifies a parameter(s). When the action intent is initialized in response to the spoken utterance from the user, the automated assistant can control the third party application according to the action intent and any identified parameter(s).

    Real-time name mispronunciation detection

    公开(公告)号:US12020683B2

    公开(公告)日:2024-06-25

    申请号:US17513335

    申请日:2021-10-28

    CPC classification number: G10L13/08 G10L13/04 G10L15/083 G10L15/187 G10L15/285

    Abstract: A real-time name mispronunciation detection feature can enable a user to receive instant feedback anytime they have mispronounced another person's name in an online meeting. The feature can receive audio input of a speaker and obtain a transcript of the audio input; identify a name from text of the transcript based on names of meeting participants; and extract a portion of the audio input corresponding to the name identified from the text of the transcript. The feature can obtain a reference pronunciation for the name using a user identifier associated with the name; and can obtain a pronunciation score for the name based on a comparison between the reference pronunciation for the name and the portion of the audio input corresponding to the name. The feature can then determine whether the pronunciation score is below a threshold; and in response, notify the speaker of a pronunciation error.

Patent Agency Ranking