Abstract:
A computing device receives an audio input from a user. The computing device determines a series of words from the audio input. The computing device outputs, for display, one or more substituted symbols. The one or more substituted symbols correspond to at least a portion of the series of words. In response to determining that receipt of the audio input has completed, the computing device outputs, for display, alphanumeric characters comprising the series of words in place of the one or more substituted symbols.
Abstract:
Methods, computer program products, and systems are described for receiving, by a speech recognition engine, audio data that encodes an utterance and determining, by the speech recognition engine, that a transcription of the utterance includes one or more keywords associated with a command, and a pronoun. In addition, the methods, computer program products, and systems described herein pertain to transmitting a disambiguation request to an application, wherein the disambiguation request identifies the pronoun, receiving, by the speech recognition engine, a response to the disambiguation request, wherein the response references an item of content identified by the application, and generating, by the speech recognition engine, the command using the keywords and the response.
Abstract:
A computing device displays a call history graphical user interface (GUI). The call history GUI includes a new list and an old list. The new list may include new missed call elements and missed call elements associated with new unopened voicemails. The old list may include other call history GUI elements, such as old missed call elements and missed call elements associated with opened voicemails.
Abstract:
In an example implementation of the disclosed technology, a method includes receiving an indication of a gesture of an input object moving, at a rate of movement, from a first location of a presence-sensitive input device toward a second location of the presence-sensitive input device. The method also includes, responsive to determining that the rate of movement does not exceed a predetermined rate of movement, outputting, for display, a visual indicator moving from a first location of a display toward a second location of the display. The method also includes, responsive to determining that the rate of movement exceeds the predetermined rate of movement, outputting, for display, the visual indicator moving at a rate of movement that does not exceed the predetermined rate of movement.
Abstract:
The present application describes systems, articles of manufacture, and methods for continuous speech recognition for mobile computing devices. One embodiment includes determining whether a mobile computing device is receiving operating power from an external power source or a battery power source, and activating a trigger word detection subroutine in response to determining that the mobile computing device is receiving power from the external power source. In some embodiments, the trigger word detection subroutine operates continually while the mobile computing device is receiving power from the external power source. The trigger word detection subroutine includes determining whether a plurality of spoken words received via a microphone includes one or more trigger words, and in response to determining that the plurality of spoken words includes at least one trigger word, launching an application corresponding to the at least one trigger word included in the plurality of spoken words.
Abstract:
In an example implementation of the disclosed technology, a method includes setting a command entry timer based on a predetermined command entry threshold. The method also includes receiving an indication of user input to prompt receipt of an audio command and, responsive to determining that the command entry timer has expired since receiving the indication of user input, outputting, for display, at least one indicator associated with at least one supported audio command that is associated with at least one function of the computing device. The method also includes receiving a plurality of audio input samples and updating the command entry threshold based on plurality of audio input samples.
Abstract:
A computing device displays a call history graphical user interface (GUI). The call history GUI includes a new list and an old list. The new list may include new missed call elements and missed call elements associated with new unopened voicemails. The old list may include other call history GUI elements, such as old missed call elements and missed call elements associated with opened voicemails.
Abstract:
Cloud-based media can be locally cached at a vehicle in which a user will travel. The media may be accessed by an authorized user during an authorized period that can be based upon vehicle status, such as vehicle speed or altitude. When the authorized period expires, the media stored at the vehicle can be deleted.
Abstract:
A computing device may receive an incoming communication and, in response, generate a notification that indicates that the incoming communication can be accessed using a particular application on the communication device. The computing device may further provide an audio signal indicative of the notification and automatically activate a listening mode. The computing device may receive a voice input during the listening mode, and an input text may be obtained based on speech recognition performed upon the voice input. A command may be detected in the input text. In response to the command, the computing device may generate an output text that is based on at least the notification and provide a voice output that is generated from the output text via speech synthesis. The voice output identifies at least the particular application.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating direct speech messages based on voice commands that include indirect speech messages. In one aspect, a method includes receiving a voice input corresponding to an utterance. A determination is made whether a transcription of the utterance includes a command to initiate a communication to a user and a segment that is classified as indirect speech. In response to determining that the transcription of the utterance includes the command and the segment that is classified as indirect speech, the segment that is classified as indirect speech is provided as input to a machine translator. In response to providing the segment that is classified as indirect speech to the machine translator, a direct speech segment is received from the machine translator. A communication is initiated that includes the direct speech segment.