Abstract:
Systems and processes are disclosed for controlling television user interactions using a virtual assistant. In an example process, a virtual assistant can interact with a television set-top box to control content shown on a television display. Speech input for the virtual assistant can be received from a device with a microphone. The speech input can comprise a query associated with content shown on the television display. A user intent of the query can be determined based on one or more of the content shown on the television display and a viewing history of media content. A result of the query can be caused to be displayed based on the determined user intent.
Abstract:
Systems and processes are disclosed for controlling television user interactions using a virtual assistant. In an example process, a virtual assistant can interact with a television set-top box to control content shown on a television display. Speech input for the virtual assistant can be received from a device with a microphone. The speech input can comprise a query associated with content shown on the television display. A user intent of the query can be determined based on one or more of the content shown on the television display and a viewing history of media content. A result of the query can be caused to be displayed based on the determined user intent.
Abstract:
An example process includes while an electronic device is engaged in a communication session with external device(s): receiving, from a first user of the electronic device, input to invoke a first digital assistant; receiving, from the first user, a natural language input corresponding to a task; in accordance with invoking the first digital assistant, generating, by the first digital assistant, a prompt for further user input about the task; transmitting, to the external device(s), the prompt for further user input about the task; after transmitting the prompt for further user input, receiving, from an external device of the external device(s), a response to the prompt for further user input; initiating, by the first digital assistant, based on the response and information corresponding to the first user stored on the electronic device, the task; and transmitting, to the external device(s), an output indicative of the initiated task.
Abstract:
Systems and processes for application integration with a digital assistant are provided. In accordance with one example, a method includes, at an electronic device having one or more processors and memory, receiving a natural-language user input; identifying, with the one or more processors, an intent object of a set of intent objects and a parameter associated with the intent, where the intent object and the parameter are derived from the natural-language user input. The method further includes identifying a software application associated with the intent object of the set of intent objects; and providing the intent object and the parameter to the software application.
Abstract:
Systems and processes are disclosed for operating a digital assistant in a media environment. In an example process, a primary set of media items can be displayed. An audio input containing a media-related request can be received. A primary user intent corresponding to the media-related request can be determined. In accordance with a determination that the primary user intent comprises a user intent to narrow the primary media search query, a second primary media search query corresponding to the primary user intent can be generated. The second primary media search query can be based on the media-related request and the primary media search query. The second primary media search query can be performed to obtain a second primary set of media items. Display of the primary set of media items can be replaced with display of the second primary set of media items.
Abstract:
The present disclosure relates to resolving natural language ambiguities with respect to a simulated reality setting. In an exemplary embodiment, a simulated reality setting having one or more virtual objects is displayed. A stream of gaze events is generated from the simulated reality setting and a stream of gaze data. A speech input is received within a time period and a domain is determined based on a text representation of the speech input. Based on the time period and a plurality of event times for the stream of gaze events, one or more gaze events are identified from the stream of gaze events. The identified one or more gaze events is used to determine a parameter value for an unresolved parameter of the domain. A set of tasks representing a user intent for the speech input is determined based on the parameter value and the set of tasks is performed.
Abstract:
Systems and processes are disclosed for operating a digital assistant in a media environment. In an example process, a primary set of media items can be displayed. An audio input containing a media-related request can be received. A primary user intent corresponding to the media-related request can be determined. In accordance with a determination that the primary user intent comprises a user intent to narrow the primary media search query, a second primary media search query corresponding to the primary user intent can be generated. The second primary media search query can be based on the media-related request and the primary media search query. The second primary media search query can be performed to obtain a second primary set of media items. Display of the primary set of media items can be replaced with display of the second primary set of media items.