Outcome-oriented dialogs on a speech recognition platform

    公开(公告)号:US11915707B1

    公开(公告)日:2024-02-27

    申请号:US17346916

    申请日:2021-06-14

    CPC classification number: G10L17/00 G06F3/167 G10L15/22 G10L2015/223

    Abstract: A speech recognition platform configured to receive an audio signal that includes speech from a user and perform automatic speech recognition (ASR) on the audio signal to identify ASR results. The platform may identify: (i) a domain of a voice command within the speech based on the ASR results and based on context information associated with the speech or the user, and (ii) an intent of the voice command. In response to identifying the intent, the platform may perform multiple actions corresponding to this intent. The platform may select a target action to perform, and may engage in a back-and-forth dialog to obtain information for completing the target action. The action may include streaming audio to the device, setting a reminder for the user, purchasing an item on behalf of the user, making a reservation for the user or launching an application for the user.

    MESSAGE PLAYBACK USING A SHARED DEVICE

    公开(公告)号:US20210110823A1

    公开(公告)日:2021-04-15

    申请号:US17030445

    申请日:2020-09-24

    Abstract: Methods and systems for providing message playback using a shared electronic device is described herein. In response to receiving a request to output messages, a speech-processing system may determine a group account associated with a requesting device, and may determine messages stored by a message data store for the group account. Speaker identification processing may also be performed to determine a speaker of the request. A user account associated with the speaker, and messages stored for the user account, may be determined. A summary response indicating the user account's messages and the group account's message may then be generated such that the user account messages are identified prior to the group account's messages. The messages may then be analyzed to determine an appropriate voice user interface for the requester such that the playback of the messages using a shared electronic device is more natural and conversational.

Patent Agency Ranking