-
Publication number: US20240347059A1
Publication date: 2024-10-17
Application number: US18739167
Filing date: 2024-06-10
Applicant: Apple Inc.
Inventor: Niranjan MANJUNATH, Willem MATTELAER, Jessica PECK, Lily Shuting ZHANG
CPC classification number: G10L15/22, G10L15/083, G10L15/1815, G10L15/26, G10L2015/223
Abstract: This relates to an intelligent automated assistant in a video communication session environment. An example method includes, during a video communication session between at least two user devices, and at a first user device: receiving a first user voice input; in accordance with a determination that the first user voice input represents a communal digital assistant request, transmitting a request to provide context information associated with the first user voice input to the first user device; receiving context information associated with the first user voice input; obtaining a first digital assistant response based at least on a portion of the context information received from the second user device and at least a portion of context information associated with the first user voice input that is stored on the first user device; providing the first digital assistant response to the second user device; and outputting the first digital assistant response.
-
Publication number: US20240281109A1
Publication date: 2024-08-22
Application number: US18439679
Filing date: 2024-02-12
Applicant: Apple Inc.
Inventor: Niranjan MANJUNATH, Martynas LAURITA, Arun K. THAMPI, Haishan YE
IPC: G06F3/04815, G01C21/36, G06T19/00
CPC classification number: G06F3/04815, G01C21/365, G06T19/006
Abstract: Some examples of the disclosure are directed to systems and methods for transitioning display of user interfaces in an extended reality environment based on tilt of an electronic device. In some examples, an electronic device presents an extended reality environment that includes a virtual object in a first visual state within the extended reality environment. In some examples, if the electronic device detects a first input that includes movement of the viewpoint, in accordance with a determination that the movement of the viewpoint exceeds a threshold movement, the electronic device displays the virtual object in a second visual state, different from the first visual state. In some examples, while displaying the virtual object in the second visual state, if the electronic device detects a second input that satisfies one or more first criteria, the electronic device displays the virtual object in the first visual state.
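The state transitions described in this abstract can be sketched as a small state machine. This is a minimal illustration, not the method claimed in the application: the class name `VirtualObjectDisplay`, the use of degrees as the unit of viewpoint movement, the 15-degree threshold, and the boolean `satisfies_first_criteria` flag are all assumptions made for the example.

```python
class VirtualObjectDisplay:
    """Tracks which visual state a virtual object is displayed in (hypothetical model)."""

    FIRST_STATE = "first"
    SECOND_STATE = "second"

    def __init__(self, threshold_degrees: float) -> None:
        # Assumption: viewpoint movement is measured as a tilt angle in degrees.
        self.threshold_degrees = threshold_degrees
        self.state = self.FIRST_STATE

    def on_viewpoint_movement(self, movement_degrees: float) -> None:
        # First input: switch to the second state only if the movement
        # exceeds the threshold movement.
        if (self.state == self.FIRST_STATE
                and abs(movement_degrees) > self.threshold_degrees):
            self.state = self.SECOND_STATE

    def on_second_input(self, satisfies_first_criteria: bool) -> None:
        # Second input: return to the first state only while in the second
        # state and only if the first criteria are satisfied.
        if self.state == self.SECOND_STATE and satisfies_first_criteria:
            self.state = self.FIRST_STATE


display = VirtualObjectDisplay(threshold_degrees=15.0)

display.on_viewpoint_movement(5.0)    # below the threshold, state unchanged
state_after_small_tilt = display.state

display.on_viewpoint_movement(20.0)   # exceeds the threshold
state_after_large_tilt = display.state

display.on_second_input(satisfies_first_criteria=True)
state_after_second_input = display.state
```

How the criteria for the second input are evaluated is not stated in the abstract, so the sketch takes the result as a precomputed flag.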
-
Publication number: US20230386464A1
Publication date: 2023-11-30
Application number: US18232267
Filing date: 2023-08-09
Applicant: Apple Inc.
Inventor: Niranjan MANJUNATH, Willem MATTELAER, Jessica PECK, Lily Shuting ZHANG
CPC classification number: G10L15/22, G10L15/26, G10L15/1815, G10L15/083, G10L2015/223
Abstract: Embodiments provide a context-aware digital assistant at multiple user devices participating in a video communication session by using context information from a first user device to determine a digital assistant response at a second user device. In this manner, users participating in the video communication session may interact with the digital assistant during the video communication session as if the digital assistant is another participant in the video communication session. Embodiments further describe automatically determining candidate digital assistant tasks based on a shared transcription of user voice inputs received at user devices participating in a video communication session. In this manner, a digital assistant of a user device participating in a video communication session may proactively determine one or more tasks that a user of the user device may want the digital assistant to perform based on conversations held during the video communication session.
-
Publication number: US20230215435A1
Publication date: 2023-07-06
Application number: US18115721
Filing date: 2023-02-28
Applicant: Apple Inc.
Inventor: Niranjan MANJUNATH, Willem MATTELAER, Jessica PECK, Lily Shuting ZHANG
CPC classification number: G10L15/22, G10L15/26, G10L15/1815, G10L15/083, G10L2015/223
Abstract: This relates to an intelligent automated assistant in a video communication session environment. An example method includes, during a video communication session between at least two user devices, and at a first user device: receiving a first user voice input; in accordance with a determination that the first user voice input represents a communal digital assistant request, transmitting a request to provide context information associated with the first user voice input to the first user device; receiving context information associated with the first user voice input; obtaining a first digital assistant response based at least on a portion of the context information received from the second user device and at least a portion of context information associated with the first user voice input that is stored on the first user device; providing the first digital assistant response to the second user device; and outputting the first digital assistant response.
-
Publication number: US20230042836A1
Publication date: 2023-02-09
Application number: US17969601
Filing date: 2022-10-19
Applicant: Apple Inc.
Inventor: Niranjan MANJUNATH, Scott M. ANDRUS, Xinyuan HUANG, William W. LUCIW, Jonathan H. RUSSELL
Abstract: The present disclosure relates to resolving natural language ambiguities with respect to a simulated reality setting. In an exemplary embodiment, a simulated reality setting having one or more virtual objects is displayed. A stream of gaze events is generated from the simulated reality setting and a stream of gaze data. A speech input is received within a time period and a domain is determined based on a text representation of the speech input. Based on the time period and a plurality of event times for the stream of gaze events, one or more gaze events are identified from the stream of gaze events. The identified one or more gaze events is used to determine a parameter value for an unresolved parameter of the domain. A set of tasks representing a user intent for the speech input is determined based on the parameter value and the set of tasks is performed.
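The step of identifying gaze events from the speech time period can be sketched as a filter over event times. This is a minimal illustration under stated assumptions: the `GazeEvent` type, the function names, and the rule of taking the most recent gaze event in the window as the parameter value are hypothetical, since the abstract does not say how one event is chosen among several.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class GazeEvent:
    time: float      # event time in seconds (assumed unit)
    object_id: str   # virtual object the gaze landed on (hypothetical field)


def events_in_window(events: List[GazeEvent], start: float, end: float) -> List[GazeEvent]:
    """Return the gaze events whose event time falls within [start, end]."""
    return [e for e in events if start <= e.time <= end]


def resolve_parameter(events: List[GazeEvent], start: float, end: float) -> Optional[str]:
    """Resolve an unresolved parameter to a gazed-at object, or None.

    Assumption: when several gaze events fall inside the speech time period,
    the most recent one supplies the parameter value.
    """
    candidates = events_in_window(events, start, end)
    if not candidates:
        return None
    return max(candidates, key=lambda e: e.time).object_id


events = [
    GazeEvent(0.5, "lamp"),
    GazeEvent(1.8, "chair"),
    GazeEvent(4.2, "table"),
]

# Speech such as "move that" received between t=1.0 s and t=3.0 s.
resolved = resolve_parameter(events, start=1.0, end=3.0)
unresolved = resolve_parameter(events, start=5.0, end=6.0)
```

Here only the gaze at `"chair"` falls inside the speech window, so it fills the missing parameter; a window with no gaze events leaves the parameter unresolved.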
-
Publication number: US20210249009A1
Publication date: 2021-08-12
Application number: US17158703
Filing date: 2021-01-26
Applicant: Apple Inc.
Inventor: Niranjan MANJUNATH, Willem MATTELAER, Jessica PECK, Lily Shuting ZHANG
Abstract: This relates to an intelligent automated assistant in a video communication session environment. An example method includes, during a video communication session between at least two user devices, and at a first user device of the at least two user devices: receiving a first user input; obtaining a first digital assistant response based on the first user input; providing, to a second user device of the at least two user devices, the first digital assistant response and context information associated with the first user input; outputting the first digital assistant response; receiving a second digital assistant response and context information associated with a second user input, wherein the second user input is received at the second user device, and wherein the second digital assistant response is determined based on the second user input and the context information associated with the first user input; and outputting the second digital assistant response.
-
Publication number: US20250157467A1
Publication date: 2025-05-15
Application number: US19022810
Filing date: 2025-01-15
Applicant: Apple Inc.
Inventor: Rae L. LASKO, German W. BAUER, Felicia W. EDWARDS, Niranjan MANJUNATH, Kurt W. PIERSOL, Jonathan H. RUSSELL, Lynn I. STREJA, Keith C. STRICKLING, Garrett L. WEINBERG
Abstract: An example process includes while an electronic device is engaged in a communication session with external device(s): receiving, from a first user of the electronic device, input to invoke a first digital assistant; receiving, from the first user, a natural language input corresponding to a task; in accordance with invoking the first digital assistant, generating, by the first digital assistant, a prompt for further user input about the task; transmitting, to the external device(s), the prompt for further user input about the task; after transmitting the prompt for further user input, receiving, from an external device of the external device(s), a response to the prompt for further user input; initiating, by the first digital assistant, based on the response and information corresponding to the first user stored on the electronic device, the task; and transmitting, to the external device(s), an output indicative of the initiated task.
-
Publication number: US20230058929A1
Publication date: 2023-02-23
Application number: US17866984
Filing date: 2022-07-18
Applicant: Apple Inc.
Inventor: Rae L. LASKO, German W. BAUER, Felicia W. EDWARDS, Niranjan MANJUNATH, Jonathan H. RUSSELL, Lynn I. STREJA, Keith C. STRICKLING, Garrett L. WEINBERG
Abstract: An example process includes while an electronic device is engaged in a communication session with external device(s): receiving, from a first user of the electronic device, input to invoke a first digital assistant; receiving, from the first user, a natural language input corresponding to a task; in accordance with invoking the first digital assistant, generating, by the first digital assistant, a prompt for further user input about the task; transmitting, to the external device(s), the prompt for further user input about the task; after transmitting the prompt for further user input, receiving, from an external device of the external device(s), a response to the prompt for further user input; initiating, by the first digital assistant, based on the response and information corresponding to the first user stored on the electronic device, the task; and transmitting, to the external device(s), an output indicative of the initiated task.
-
Publication number: US20210224031A1
Publication date: 2021-07-22
Application number: US17056126
Filing date: 2019-04-24
Applicant: Apple Inc.
Inventor: Rahul NAIR, Golnaz ABDOLLAHIAN, Avi BAR-ZEEV, Niranjan MANJUNATH
Abstract: In an exemplary technique for providing audio information, an input is received, and audio information responsive to the received input is provided using a speaker. While providing the audio information responsive to received input information, an external sound is detected. If it is determined that the external sound is a communication of a first type, then the provision of the audio information is stopped. If it is determined that the external sound is a communication of a second type, then the provision of the audio information continues.
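The stop-or-continue decision in this abstract can be sketched as a branch on the type of the detected external sound. This is a minimal illustration under stated assumptions: the `CommunicationType` enum and `AudioProvider` class are hypothetical names, and the classification of an external sound into a type is taken as already done, since the abstract does not describe how it is determined.

```python
from enum import Enum


class CommunicationType(Enum):
    FIRST = "first"    # a communication that should interrupt the audio
    SECOND = "second"  # a communication that should not interrupt the audio


class AudioProvider:
    """Models whether audio information is currently being provided."""

    def __init__(self) -> None:
        self.playing = False

    def start(self) -> None:
        # Begin providing audio information in response to a received input.
        self.playing = True

    def on_external_sound(self, sound_type: CommunicationType) -> None:
        # Stop only for a communication of the first type; for the second
        # type, provision of the audio information continues unchanged.
        if self.playing and sound_type is CommunicationType.FIRST:
            self.playing = False


provider = AudioProvider()
provider.start()

provider.on_external_sound(CommunicationType.SECOND)
playing_after_second_type = provider.playing

provider.on_external_sound(CommunicationType.FIRST)
playing_after_first_type = provider.playing
```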
-
Publication number: US20210089124A1
Publication date: 2021-03-25
Application number: US17022680
Filing date: 2020-09-16
Applicant: Apple Inc.
Inventor: Niranjan MANJUNATH, Scott M. ANDRUS, Xinyuan HUANG, William W. LUCIW, Jonathan H. RUSSELL
Abstract: The present disclosure relates to resolving natural language ambiguities with respect to a simulated reality setting. In an exemplary embodiment, a simulated reality setting having one or more virtual objects is displayed. A stream of gaze events is generated from the simulated reality setting and a stream of gaze data. A speech input is received within a time period and a domain is determined based on a text representation of the speech input. Based on the time period and a plurality of event times for the stream of gaze events, one or more gaze events are identified from the stream of gaze events. The identified one or more gaze events is used to determine a parameter value for an unresolved parameter of the domain. A set of tasks representing a user intent for the speech input is determined based on the parameter value and the set of tasks is performed.