DEVICE ARBITRATION FOR DIGITAL ASSISTANT-BASED INTERCOM SYSTEMS

    公开(公告)号:US20210350810A1

    公开(公告)日:2021-11-11

    申请号:US17073092

    申请日:2020-10-16

    Applicant: Apple Inc.

    Abstract: Systems and processes for operating an intercom system via a digital assistant are provided. The intercom system is trigger-free, in that users communicate, in real-time, via devices without employing a trigger to speak. Acoustic fingerprints are employed to associate users with devices. Acoustic fingerprints include vector embeddings of speech input in an acoustic-feature vector space. Speech heard at multiple devices, as embedded in a fingerprint, may be clustered in the vector space, and the structure of the clusters is employed to associate users and devices. Based on the fingerprints, a device is mapped to a user, and the user employs that device to participate in a conversation, via the intercom service.

    REDUCING DEVICE PROCESSING OF UNINTENDED AUDIO

    公开(公告)号:US20220093095A1

    公开(公告)日:2022-03-24

    申请号:US17123428

    申请日:2020-12-16

    Applicant: Apple Inc.

    Abstract: An example process includes: receiving an audio stream; determining a plurality of acoustic representations of the audio stream, where each acoustic representation of the plurality of acoustic representations corresponds to a respective frame of the audio stream; obtaining a respective plurality of scores indicating whether each respective frame of the audio stream is directed to an electronic device, where the obtaining includes: determining, using a triggering model operating on the electronic device, for each acoustic representation, a score indicating whether the respective frame of the audio stream is directed to the electronic device; determining, based on the respective plurality of scores, a likelihood that the audio stream is directed to the electronic device; determining whether the likelihood is above or below a threshold; and in response to determining that the likelihood is below the threshold, ceasing to process the audio stream.

    ROBUST END-POINTING OF SPEECH SIGNALS USING SPEAKER RECOGNITION
    5.
    发明申请
    ROBUST END-POINTING OF SPEECH SIGNALS USING SPEAKER RECOGNITION 审中-公开
    使用扬声器识别的语音信号的稳健终点

    公开(公告)号:US20150371665A1

    公开(公告)日:2015-12-24

    申请号:US14701147

    申请日:2015-04-30

    Applicant: Apple Inc.

    CPC classification number: G10L25/87 G10L17/00 G10L17/22 G10L25/78

    Abstract: Systems and processes for robust end-pointing of speech signals using speaker recognition are provided. In one example process, a stream of audio having a spoken user request can be received. A first likelihood that the stream of audio includes user speech can be determined. A second likelihood that the stream of audio includes user speech spoken by an authorized user can be determined. A start-point or an end-point of the spoken user request can be determined based at least in part on the first likelihood and the second likelihood.

    Abstract translation: 提供了使用说话人识别的语音信号的鲁棒终端指向的系统和过程。 在一个示例过程中,可以接收具有口头用户请求的音频流。 可以确定音频流包括用户语音的第一可能性。 可以确定音频流包括授权用户说出的用户语音的第二可能性。 可以至少部分地基于第一可能性和第二似然性来确定口头用户请求的起始点或终点。

Patent Agency Ranking