-
公开(公告)号:US11609739B2
公开(公告)日:2023-03-21
申请号:US17056126
申请日:2019-04-24
Applicant: Apple Inc.
Inventor: Rahul Nair , Golnaz Abdollahian , Avi Bar-Zeev , Niranjan Manjunath
Abstract: In an exemplary technique for providing audio information, an input is received, and audio information responsive to the received input is provided using a speaker. While providing the audio information, an external sound is detected. If it is determined that the external sound is a communication of a first type, then the provision of the audio information is stopped. If it is determined that the external sound is a communication of a second type, then the provision of the audio information continues.
-
公开(公告)号:US11507183B2
公开(公告)日:2022-11-22
申请号:US17022680
申请日:2020-09-16
Applicant: Apple Inc.
Inventor: Niranjan Manjunath , Scott M. Andrus , Xinyuan Huang , William W. Luciw , Jonathan H. Russell
Abstract: The present disclosure relates to resolving natural language ambiguities with respect to a simulated reality setting. In an exemplary embodiment, a simulated reality setting having one or more virtual objects is displayed. A stream of gaze events is generated from the simulated reality setting and a stream of gaze data. A speech input is received within a time period and a domain is determined based on a text representation of the speech input. Based on the time period and a plurality of event times for the stream of gaze events, one or more gaze events are identified from the stream of gaze events. The identified one or more gaze events is used to determine a parameter value for an unresolved parameter of the domain. A set of tasks representing a user intent for the speech input is determined based on the parameter value and the set of tasks is performed.
-
公开(公告)号:US11861265B2
公开(公告)日:2024-01-02
申请号:US18123886
申请日:2023-03-20
Applicant: Apple Inc.
Inventor: Rahul Nair , Golnaz Abdollahian , Avi Bar-Zeev , Niranjan Manjunath
Abstract: In an exemplary technique, speech input including one or more instructions is received. After the speech input has stopped, if it is determined that one or more visual characteristics indicate that further speech input is not expected, a response to the one or more instructions is provided. If it is determined that one or more visual characteristics indicate that further speech input is expected, a response to the one or more instructions is not provided.
-
公开(公告)号:US12147733B2
公开(公告)日:2024-11-19
申请号:US18389485
申请日:2023-11-14
Applicant: Apple Inc.
Inventor: Rahul Nair , Golnaz Abdollahian , Avi Bar-Zeev , Niranjan Manjunath
Abstract: In an exemplary technique, audio information responsive to received input is provided. While providing the audio information, one or more conditions for stopping the provision of audio information are detected, and in response, the provision of the audio information is stopped. After stopping the provision of the audio information, if the one or more conditions for stopping the provision of audio information have ceased, then resumed audio information is provided, where the resumed audio information includes a rephrased version of a previously provided segment of the audio information.
-
公开(公告)号:US11837232B2
公开(公告)日:2023-12-05
申请号:US18115721
申请日:2023-02-28
Applicant: Apple Inc.
Inventor: Niranjan Manjunath , Willem Mattelaer , Jessica Peck , Lily Shuting Zhang
CPC classification number: G10L15/22 , G10L15/083 , G10L15/1815 , G10L15/26 , G10L2015/223
Abstract: This relates to an intelligent automated assistant in a video communication session environment. An example method includes, during a video communication session between at least two user devices, and at a first user device: receiving a first user voice input; in accordance with a determination that the first user voice input represents a communal digital assistant request, transmitting a request to provide context information associated with the first user voice input to the first user device; receiving context information associated with the first user voice input; obtaining a first digital assistant response based at least on a portion of the context information received from the second user device and at least a portion of context information associated with the first user voice input that is stored on the first user device; providing the first digital assistant response to the second user device; and outputting the first digital assistant response.
-
公开(公告)号:US11769497B2
公开(公告)日:2023-09-26
申请号:US17158703
申请日:2021-01-26
Applicant: Apple Inc.
Inventor: Niranjan Manjunath , Willem Mattelaer , Jessica Peck , Lily Shuting Zhang
CPC classification number: G10L15/22 , G10L15/083 , G10L15/1815 , G10L15/26 , G10L2015/223
Abstract: Embodiments provide a context-aware digital assistant at multiple user devices participating in a video communication session by using context information from a first user device to determine a digital assistant response at a second user device. In this manner, users participating in the video communication session may interact with the digital assistant during the video communication session as if the digital assistant is another participant in the video communication session. Embodiments further describe automatically determining candidate digital assistant tasks based on a shared transcription of user voice inputs received at user devices participating in a video communication session. In this manner, a digital assistant of a user device participating in a video communication session may proactively determine one or more tasks that a user of the user device may want the digital assistant to perform based on conversations held during the video communication session.
-
公开(公告)号:US12033636B2
公开(公告)日:2024-07-09
申请号:US18232267
申请日:2023-08-09
Applicant: Apple Inc.
Inventor: Niranjan Manjunath , Willem Mattelaer , Jessica Peck , Lily Shuting Zhang
CPC classification number: G10L15/22 , G10L15/083 , G10L15/1815 , G10L15/26 , G10L2015/223
Abstract: This relates to an intelligent automated assistant in a video communication environment. An example includes, during a video communication session between at least two devices, receiving a voice input at one device, generating and transmitting to a server a textual representation of the voice input, receiving from the server a shared transcription including both the textual representation of the voice input and one or more additional textual representations generated by another device, and determining and presenting one or more candidate tasks based on the shared transcription.
-
公开(公告)号:US12230264B2
公开(公告)日:2025-02-18
申请号:US17866984
申请日:2022-07-18
Applicant: Apple Inc.
Inventor: Rae L. Lasko , German W. Bauer , Felicia W. Edwards , Niranjan Manjunath , Jonathan H. Russell , Lynn Streja , Keith C. Strickling , Garrett L Weinberg
Abstract: An example process includes while an electronic device is engaged in a communication session with external device(s): receiving, from a first user of the electronic device, input to invoke a first digital assistant; receiving, from the first user, a natural language input corresponding to a task; in accordance with invoking the first digital assistant, generating, by the first digital assistant, a prompt for further user input about the task; transmitting, to the external device(s), the prompt for further user input about the task; after transmitting the prompt for further user input, receiving, from an external device of the external device(s), a response to the prompt for further user input; initiating, by the first digital assistant, based on the response and information corresponding to the first user stored on the electronic device, the task; and transmitting, to the external device(s), an output indicative of the initiated task.
-
9.
公开(公告)号:US20230336689A1
公开(公告)日:2023-10-19
申请号:US17989972
申请日:2022-11-18
Applicant: Apple Inc.
Inventor: Jessica J. Peck , Niranjan Manjunath , Willem Mattelaer
IPC: H04N7/15 , H04L12/18 , G06F3/14 , G06V40/20 , G06F3/16 , G06T7/70 , H04N7/14 , G06F3/01 , G06T19/00
CPC classification number: H04N7/152 , H04N7/157 , H04L12/1822 , G06F3/14 , G06V40/20 , G06F3/16 , G06T7/70 , H04N7/147 , G06F3/017 , G06T19/006 , G06F3/013
Abstract: A method of invoking public and private interactions during a multiuser communication session includes presenting a multiuser communication session; detecting a user invocation input that corresponds to a trigger to a digital assistant; detecting a user search input that corresponds to a request for information; obtaining the information based on the request; presenting the information; in accordance with a determination that at least one of the user invocation input and the user search input satisfy first input criteria associated with a first request type: transmitting the information to other electronic devices for presentation to other users; and in accordance with a determination that at least one of the user invocation input and the user search input satisfy second input criteria associated with a second request type: forgoing transmitting the information to the other electronic devices for presentation to other users.
-
-
-
-
-
-
-
-