Abstract:
This relates to systems and processes for using a virtual assistant to control electronic devices. In one example process, a user can speak an input in natural language form to a user device to control one or more electronic devices. The user device can transmit the user speech to a server to be converted into a textual representation. The server can identify the one or more electronic devices and appropriate commands to be performed by the one or more electronic devices based on the textual representation. The identified one or more devices and commands to be performed can be transmitted back to the user device, which can forward the commands to the appropriate one or more electronic devices for execution. In response to receiving the commands, the one or more electronic devices can perform the commands and transmit their current states to the user device.
Abstract:
Systems and processes for structured dictation using intelligent automated assistants are provided. In one example process, a speech input representing a user request can be received. In addition, metadata associated with the speech input can be received. A text string corresponding to the speech input can be determined. The process can determine whether to perform natural language processing on the text string and whether the metadata identifies one or more domains corresponding to the user request. In response to the determination that natural language processing is to be performed on the text string and that the metadata identifies one or more domains corresponding to the user request, natural language processing of the text string can be constrained to the one or more domains. A result can be obtained based on the one or more domains and the result can be outputted from the electronic device.
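The domain-constraining step described above can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the abstract: the `DOMAIN_KEYWORDS` table stands in for real per-domain language models, the metadata is assumed to carry a `"domains"` list, and keyword matching is a stand-in for actual natural language processing.

```python
from typing import Dict, List, Optional

# Hypothetical keyword tables standing in for real per-domain models.
DOMAIN_KEYWORDS: Dict[str, List[str]] = {
    "calendar": ["meeting", "schedule", "appointment"],
    "messaging": ["text", "message", "send"],
    "media": ["play", "song", "movie"],
}


def process_dictation(
    text: str, metadata: Dict[str, List[str]], run_nlp: bool
) -> Optional[str]:
    """Return the matched domain, or None if NLP is skipped or nothing matches."""
    if not run_nlp:
        # Plain dictation: the text string is used as-is, with no interpretation.
        return None
    hinted = metadata.get("domains")
    if hinted:
        # Metadata identifies domains: constrain processing to those only.
        candidates = [d for d in hinted if d in DOMAIN_KEYWORDS]
    else:
        # No hint: consider every known domain.
        candidates = list(DOMAIN_KEYWORDS)
    words = text.lower().split()
    for domain in candidates:
        if any(keyword in words for keyword in DOMAIN_KEYWORDS[domain]):
            return domain
    return None
```

With the utterance "send a text to play later", an unconstrained pass in this sketch matches the messaging domain first, whereas metadata of `{"domains": ["media"]}` restricts the match to the media domain.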
Abstract:
Systems and processes are disclosed for real-time updating of virtual assistant media knowledge. Virtual assistant knowledge can be updated with timely information associated with playing media (e.g., a sporting event, a television show, or the like). A data feed can be received that includes data relating events to particular times in a media stream. A user request can be received based on speech input, and the user request can be associated with an event in a media stream or show. In response to receiving the request, the media stream can be cued to commence playback at a time in the media stream associated with the event referred to in the request. In another example, a response to the user request can be generated based on the data relating to the events. The response can then be delivered to the user (e.g., spoken aloud, displayed, etc.).
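As a rough illustration of cueing playback from a data feed, the sketch below maps a request to a time in the stream. The `MediaEvent` record, the `find_cue_time` function, and the rule of picking the most recent matching event are all assumptions made for this example; the abstract does not specify how a request is matched to an event.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class MediaEvent:
    description: str  # e.g. "goal by team a" (hypothetical feed entry)
    time_s: float     # offset into the media stream, in seconds


def find_cue_time(feed: List[MediaEvent], request: str) -> Optional[float]:
    """Return the offset of the latest event whose description contains
    every word of the request, or None if no event matches."""
    wanted = request.lower().split()
    matches = [
        event for event in feed
        if all(word in event.description.lower().split() for word in wanted)
    ]
    if not matches:
        return None
    # Assumption: the user most likely means the most recent matching event.
    return max(matches, key=lambda event: event.time_s).time_s
```

A player could then seek to the returned offset, or fall back to a spoken or displayed response when the function returns None.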
Abstract:
Systems and processes for operating an intelligent automated assistant to explore media items are provided. In one example process, a speech input representing a request for one or more media items is received from a user. The process determines whether the speech input corresponds to a user intent of obtaining personalized recommendations for media items. In response to determining that the speech input corresponds to a user intent of obtaining personalized recommendations for media items, at least one media item is obtained from a user-specific corpus of media items. The user-specific corpus of media items is generated based on data associated with the user. The at least one media item is provided.
Abstract:
This relates to systems and processes for using a virtual assistant to arbitrate among and/or control electronic devices. In one example process, a first electronic device samples an audio input using a microphone. The first electronic device broadcasts a first set of one or more values based on the sampled audio input. Furthermore, the first electronic device receives a second set of one or more values, which are based on the audio input, from a second electronic device. Based on the first set of one or more values and the second set of one or more values, the first electronic device determines whether to respond to the audio input or forgo responding to the audio input.
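One way the arbitration decision might look is sketched below. The abstract does not say what the broadcast values contain or how they are compared, so this example assumes each device broadcasts a single numeric score (for instance, a measure of how clearly it heard the audio) and that ties are broken by device identifier. The function name and parameters are hypothetical.

```python
from typing import Dict


def should_respond(own_id: str, own_score: float, received: Dict[str, float]) -> bool:
    """Decide whether this device responds to the audio input.

    `received` maps the identifiers of other devices to the scores they
    broadcast. This device responds only if its (score, id) pair is
    strictly greater than every other device's pair, so that exactly
    one device responds even when two scores are equal.
    """
    own_key = (own_score, own_id)
    return all(own_key > (score, dev_id) for dev_id, score in received.items())
```

Because every device runs the same comparison on the same set of values, no central coordinator is needed in this sketch: the device with the highest pair responds and the others stay silent.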
Abstract:
Systems and processes are disclosed for operating a digital assistant for media search and playback. In an exemplary process, an audio input containing a media search request can be received. A primary user intent corresponding to the media search request can be determined and one or more secondary user intents based on one or more previous user intents can be determined. A primary set of media items corresponding to the primary user intent can be displayed and one or more secondary sets of media items corresponding to the one or more secondary user intents can be displayed.
Abstract:
An exemplary method for identifying media may include receiving user input associated with a request for media, where that user input includes unstructured natural language speech including one or more words; identifying at least one context associated with the user input; causing a search for the media based on the at least one context and the user input; determining, based on the at least one context and the user input, at least one media item that satisfies the request; and in accordance with a determination that the at least one media item satisfies the request, obtaining the at least one media item.