Abstract:
Methods, computer program products, and systems are described for receiving, by a speech recognition engine, audio data that encodes an utterance and determining, by the speech recognition engine, that a transcription of the utterance includes one or more keywords associated with a command, and a pronoun. In addition, the methods, computer program products, and systems described herein pertain to transmitting a disambiguation request to an application, wherein the disambiguation request identifies the pronoun, receiving, by the speech recognition engine, a response to the disambiguation request, wherein the response references an item of content identified by the application, and generating, by the speech recognition engine, the command using the keywords and the response.
Abstract:
A computing device displays a call history graphical user interface (GUI). The call history GUI includes a new list and an old list. The new list may include new missed call elements and missed call elements associated with new unopened voicemails. The old list may include other call history GUI elements, such as old missed call elements and missed call elements associated with opened voicemails.
Abstract:
A computing device is configured to initiate actions in response to speech input that includes a name or other indication of an entity, in a first spoken utterance, followed by an action, in a second spoken utterance. The computing device receives the first spoken utterance, identifies an entity based on the first spoke utterance, and indicates a plurality of available actions based on the identified entity. The computing device then receives the second spoken utterance and identifies a selection of at least one of the available actions based on the second spoken utterance. The computing device then initiates the at least one selected action.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating direct speech messages based on voice commands that include indirect speech messages. In one aspect, a method includes receiving a voice input corresponding to an utterance. A determination is made whether a transcription of the utterance includes a command to initiate a communication to a user and a segment that is classified as indirect speech. In response to determining that the transcription of the utterance includes the command and the segment that is classified as indirect speech, the segment that is classified as indirect speech is provided as input to a machine translator. In response to providing the segment that is classified as indirect speech to the machine translator, a direct speech segment is received from the machine translator. A communication is initiated that includes the direct speech segment.
Abstract:
The present application describes systems, articles of manufacture, and methods for continuous speech recognition for mobile computing devices. One embodiment includes determining whether a mobile computing device is receiving operating power from an external power source or a battery power source, and activating a trigger word detection subroutine in response to determining that the mobile computing device is receiving power from the external power source. In some embodiments, the trigger word detection subroutine operates continually while the mobile computing device is receiving power from the external power source. The trigger word detection subroutine includes determining whether a plurality of spoken words received via a microphone includes one or more trigger words, and in response to determining that the plurality of spoken words includes at least one trigger word, launching an application corresponding to the at least one trigger word included in the plurality of spoken words.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for audible alert tones are disclosed. The methods, systems, and apparatus include actions of determining whether audio input data received after ceasing output of a first instance of an audible alert tone includes voice activity and determining whether to delay a successive instance of the audible alert tone based on determining whether the audio input data includes voice activity.
Abstract:
In an example implementation of the disclosed technology, a method includes receiving an indication of a gesture of an input object moving, at a rate of movement, from a first location of a presence-sensitive input device toward a second location of the presence-sensitive input device. The method also includes, responsive to determining that the rate of movement does not exceed a predetermined rate of movement, outputting, for display, a visual indicator moving from a first location of a display toward a second location of the display. The method also includes, responsive to determining that the rate of movement exceeds the predetermined rate of movement, outputting, for display, the visual indicator moving at a rate of movement that does not exceed the predetermined rate of movement.
Abstract:
Methods, computer program products, and systems are described for receiving, by a speech recognition engine, audio data that encodes an utterance and determining, by the speech recognition engine, that a transcription of the utterance includes one or more keywords associated with a command, and a pronoun. In addition, the methods, computer program products, and systems described herein pertain to transmitting a disambiguation request to an application, wherein the disambiguation request identifies the pronoun, receiving, by the speech recognition engine, a response to the disambiguation request, wherein the response references an item of content identified by the application, and generating, by the speech recognition engine, the command using the keywords and the response.
Abstract:
In an example implementation of the disclosed technology, a method includes receiving an indication of a gesture of an input object moving, at a rate of movement, from a first location of a presence-sensitive input device toward a second location of the presence-sensitive input device. The method also includes, responsive to determining that the rate of movement does not exceed a predetermined rate of movement, outputting, for display, a visual indicator moving from a first location of a display toward a second location of the display. The method also includes, responsive to determining that the rate of movement exceeds the predetermined rate of movement, outputting, for display, the visual indicator moving at a rate of movement that does not exceed the predetermined rate of movement.
Abstract:
A computing device receives an audio input from a user. The computing device determines a series of words from the audio input. The computing device outputs, for display, one or more substituted symbols. The one or more substituted symbols correspond to at least a portion of the series of words. In response to determining that receipt of the audio input has completed, the computing device outputs, for display, alphanumeric characters comprising the series of words in place of the one or more substituted symbols.