Systems and methods for name pronunciation

    Publication No.: US11069336B2

    Publication Date: 2021-07-20

    Application No.: US16048043

    Filing Date: 2018-07-27

    Applicant: Apple Inc.

    Inventor: Devang K. Naik

    Abstract: Systems and methods are provided for associating a phonetic pronunciation with a name by receiving the name, mapping the name to a plurality of monosyllabic components that are combinable to construct the phonetic pronunciation of the name, receiving a user input to select one or more of the plurality, and combining the selected one or more of the plurality of monosyllabic components to construct the phonetic pronunciation of the name.
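
The flow in this abstract (map the name to candidate monosyllabic components, let the user choose, combine the choices) can be illustrated with a minimal Python sketch. It is not the patented method: the lookup table, the greedy longest-match segmentation, the one-choice-per-segment selection, and the function names `candidate_components` and `build_pronunciation` are all assumptions made for illustration.

```python
from typing import Dict, List, Sequence

# Hypothetical table: spelling fragment -> candidate monosyllabic pronunciations.
SYLLABLE_TABLE: Dict[str, List[str]] = {
    "mi": ["mee", "my"],
    "ra": ["rah", "ray"],
}


def candidate_components(name: str, table: Dict[str, List[str]]) -> List[List[str]]:
    """Split the name into table fragments (greedy, longest match first)
    and return the candidate pronunciations for each fragment."""
    name = name.lower()
    result: List[List[str]] = []
    i = 0
    while i < len(name):
        for j in range(len(name), i, -1):
            chunk = name[i:j]
            if chunk in table:
                result.append(table[chunk])
                i = j
                break
        else:
            # No fragment in the table matches at this position.
            raise ValueError(f"no components for {name[i:]!r}")
    return result


def build_pronunciation(candidates: Sequence[Sequence[str]],
                        selections: Sequence[int]) -> str:
    """Combine the user's selection (one index per fragment) into a pronunciation."""
    if len(candidates) != len(selections):
        raise ValueError("one selection is required per fragment")
    return "-".join(options[choice] for options, choice in zip(candidates, selections))


options = candidate_components("Mira", SYLLABLE_TABLE)
pronunciation = build_pronunciation(options, [0, 0])
```

Here the `selections` list stands in for the user input described in the abstract; a real interface would present the options for each fragment and record the user's choice.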

    Method for extracting salient dialog usage from live data

    Publication No.: US10296160B2

    Publication Date: 2019-05-21

    Application No.: US14099776

    Filing Date: 2013-12-06

    Applicant: Apple Inc.

    Abstract: Systems and processes are disclosed for virtual assistant request recognition using live usage data and data relating to future events. User requests that are received but not recognized can be used to generate candidate request templates. A count can be associated with each candidate request template and can be incremented each time a matching candidate request template is received. When a count reaches a threshold level, the corresponding candidate request template can be used to train a virtual assistant to recognize and respond to similar user requests in the future. In addition, data relating to future events can be mined to extract relevant information that can be used to populate both recognized user request templates and candidate user request templates. Populated user request templates (e.g., whole expected utterances) can then be used to recognize user requests and disambiguate user intent as future events become relevant.
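
The count-and-threshold step described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the normalization in `to_template`, the `CandidateTracker` class, and the threshold value are all hypothetical.

```python
import re
from collections import Counter


def to_template(utterance: str) -> str:
    """Hypothetical normalization: lowercase, replace digit runs with a slot,
    and collapse whitespace so similar requests share one template."""
    text = utterance.lower().strip()
    text = re.sub(r"\d+", "<num>", text)
    return re.sub(r"\s+", " ", text)


class CandidateTracker:
    """Counts unrecognized requests per candidate template and flags a
    template for training once its count reaches the threshold."""

    def __init__(self, threshold: int = 3) -> None:  # threshold value is assumed
        self.threshold = threshold
        self.counts: Counter = Counter()
        self.promoted: set = set()

    def record_unrecognized(self, utterance: str) -> bool:
        """Return True exactly once, when the template first reaches the threshold."""
        template = to_template(utterance)
        self.counts[template] += 1
        if self.counts[template] >= self.threshold and template not in self.promoted:
            self.promoted.add(template)
            return True
        return False


tracker = CandidateTracker(threshold=2)
first = tracker.record_unrecognized("Set timer 5 minutes")
second = tracker.record_unrecognized("set timer 10 minutes")
```

Templates in `tracker.promoted` are the ones that would be handed to training in this sketch; how the virtual assistant is actually retrained is outside what the abstract specifies.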

    System and method for updating an adaptive speech recognition model

    Publication No.: US09697822B1

    Publication Date: 2017-07-04

    Application No.: US14263869

    Filing Date: 2014-04-28

    Applicant: Apple Inc.

    Abstract: A method for updating an adaptive speech recognition model is provided. In some implementations, the method is performed at a communications device including one or more processors and memory storing instructions for execution by the one or more processors. The method includes determining that a first user of a first mobile communication device is engaged in a call over a communications network and providing an adaptive speech recognition model. The method also includes analyzing an outbound audio channel of the first mobile communication device to obtain a call audio signal corresponding to audio input from one or more microphones of the first mobile communication device and updating the adaptive speech recognition model with training data derived from the call audio signal.

    Reducing device processing of unintended audio

    Publication No.: US11620999B2

    Publication Date: 2023-04-04

    Application No.: US17123428

    Filing Date: 2020-12-16

    Applicant: Apple Inc.

    Abstract: An example process includes: receiving an audio stream; determining a plurality of acoustic representations of the audio stream, where each acoustic representation of the plurality of acoustic representations corresponds to a respective frame of the audio stream; obtaining a respective plurality of scores indicating whether each respective frame of the audio stream is directed to an electronic device, where the obtaining includes: determining, using a triggering model operating on the electronic device, for each acoustic representation, a score indicating whether the respective frame of the audio stream is directed to the electronic device; determining, based on the respective plurality of scores, a likelihood that the audio stream is directed to the electronic device; determining whether the likelihood is above or below a threshold; and in response to determining that the likelihood is below the threshold, ceasing to process the audio stream.
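
The score-then-threshold decision in this abstract can be sketched in a few lines of Python. The abstract does not say how per-frame scores are combined into a likelihood, so averaging them is an assumption here, as are the function name `should_keep_processing`, the `score_frame` callback standing in for the triggering model, and the default threshold.

```python
from statistics import fmean
from typing import Callable, Sequence, TypeVar

Frame = TypeVar("Frame")


def should_keep_processing(frames: Sequence[Frame],
                           score_frame: Callable[[Frame], float],
                           threshold: float = 0.5) -> bool:
    """Score each frame, aggregate the scores into one likelihood (mean, by
    assumption), and report whether processing should continue."""
    scores = [score_frame(frame) for frame in frames]
    if not scores:
        # No frames means no evidence the audio is device-directed.
        return False
    likelihood = fmean(scores)
    # Below the threshold, the caller would cease processing the stream.
    return likelihood >= threshold


# Toy stand-in for the triggering model: each "frame" is already a score.
def identity_score(frame: float) -> float:
    return frame


directed = should_keep_processing([0.9, 0.8, 0.7], identity_score)
undirected = should_keep_processing([0.1, 0.2, 0.3], identity_score)
```

In a real system the callback would run an on-device model over an acoustic representation of each frame; the toy scorer above exists only to make the sketch runnable.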

    Personalization of media streams
    Type: Granted invention patent

    Publication No.: US10187440B2

    Publication Date: 2019-01-22

    Application No.: US15167898

    Filing Date: 2016-05-27

    Applicant: APPLE INC.

    Abstract: In some implementations, a user device can personalize a media stream by converting notifications into audio speech data and presenting the audio speech data at locations within the media stream that do not interrupt the enjoyment of the media stream by the user. In some implementations, the user device can receive notifications from various communication services, applications installed on the user device, and/or other sources, determine information describing the notifications, and present the information to the user using the audio speech data. In some implementations, the user device can generate personalized notifications based on the media stream and/or media items selected by the user. The user device can generate personalized notifications based on the user's context (e.g., environment, location, activity, etc.). The personalized notifications can then be presented to the user using audio speech data at appropriate locations in the media stream.
