Reducing device processing of unintended audio

    公开(公告)号:US11620999B2

    公开(公告)日:2023-04-04

    申请号:US17123428

    申请日:2020-12-16

    Applicant: Apple Inc.

    Abstract: An example process includes: receiving an audio stream; determining a plurality of acoustic representations of the audio stream, where each acoustic representation of the plurality of acoustic representations corresponds to a respective frame of the audio stream; obtaining a respective plurality of scores indicating whether each respective frame of the audio stream is directed to an electronic device, where the obtaining includes: determining, using a triggering model operating on the electronic device, for each acoustic representation, a score indicating whether the respective frame of the audio stream is directed to the electronic device; determining, based on the respective plurality of scores, a likelihood that the audio stream is directed to the electronic device; determining whether the likelihood is above or below a threshold; and in response to determining that the likelihood is below the threshold, ceasing to process the audio stream.

    Personalization of media streams
    13.
    发明授权

    公开(公告)号:US10187440B2

    公开(公告)日:2019-01-22

    申请号:US15167898

    申请日:2016-05-27

    Applicant: APPLE INC.

    Abstract: In some implementations, a user device can personalize a media stream by converting notifications into audio speech data and presenting the audio speech data at locations within the media stream that do not interrupt the enjoyment of the media stream by the user. In some implementations, the user device can receive notifications from various communication services, applications installed on the user device, and/or other sources, determine information describing the notifications, and present the information to the user using the audio speech data. In some implementations, the user device can generate personalized notifications based on the media stream and/or media items selected by the user. The user device can generate personalized notifications based on the user's context (e.g., environment, location, activity, etc.). The personalized notifications can then be presented to the user using audio speech data at appropriate locations in the media stream.

    Robust end-pointing of speech signals using speaker recognition

    公开(公告)号:US10186282B2

    公开(公告)日:2019-01-22

    申请号:US14701147

    申请日:2015-04-30

    Applicant: Apple Inc.

    Abstract: Systems and processes for robust end-pointing of speech signals using speaker recognition are provided. In one example process, a stream of audio having a spoken user request can be received. A first likelihood that the stream of audio includes user speech can be determined. A second likelihood that the stream of audio includes user speech spoken by an authorized user can be determined. A start-point or an end-point of the spoken user request can be determined based at least in part on the first likelihood and the second likelihood.

    PERSONALIZATION OF MEDIA STREAMS
    16.
    发明申请

    公开(公告)号:US20170346872A1

    公开(公告)日:2017-11-30

    申请号:US15167898

    申请日:2016-05-27

    Applicant: APPLE INC.

    Abstract: In some implementations, a user device can personalize a media stream by converting notifications into audio speech data and presenting the audio speech data at locations within the media stream that do not interrupt the enjoyment of the media stream by the user. In some implementations, the user device can receive notifications from various communication services, applications installed on the user device, and/or other sources, determine information describing the notifications, and present the information to the user using the audio speech data. In some implementations, the user device can generate personalized notifications based on the media stream and/or media items selected by the user. The user device can generate personalized notifications based on the user's context (e.g., environment, location, activity, etc.). The personalized notifications can then be presented to the user using audio speech data at appropriate locations in the media stream.

    Method for extracting salient dialog usage from live data

    公开(公告)号:US11314370B2

    公开(公告)日:2022-04-26

    申请号:US16144871

    申请日:2018-09-27

    Applicant: Apple Inc.

    Abstract: Systems and processes are disclosed for virtual assistant request recognition using live usage data and data relating to future events. User requests that are received but not recognized can be used to generate candidate request templates. A count can be associated with each candidate request template and can be incremented each time a matching candidate request template is received. When a count reaches a threshold level, the corresponding candidate request template can be used to train a virtual assistant to recognize and respond to similar user requests in the future. In addition, data relating to future events can be mined to extract relevant information that can be used to populate both recognized user request templates and candidate user request templates. Populated user request templates (e.g., whole expected utterances) can then be used to recognize user requests and disambiguate user intent as future events become relevant.

    Social reminders
    18.
    发明授权

    公开(公告)号:US10390213B2

    公开(公告)日:2019-08-20

    申请号:US15988887

    申请日:2018-05-24

    Applicant: Apple Inc.

    Abstract: Techniques for providing reminders based on social interactions between users of electronic devices are described. Social reminders can be set to trigger based on social interactions of users. For example, a user may request to be reminded to discuss a certain discussion topic with a particular phonebook contact, when the user next encounters the contact.

Patent Agency Ranking