-
1.
Publication No.: US12118999B2
Publication date: 2024-10-15
Application No.: US18231135
Application date: 2023-08-07
Applicant: Apple Inc.
Inventor: Philippe P. Piernot, Justin G. Binder
CPC classification number: G10L15/22, G06F3/013, G06F3/167, G10L15/26, H04W4/025, G06F2203/0381, G10L15/1815, G10L15/1822, G10L2015/223, G10L2015/227, G10L2015/228, G10L17/00
Abstract: Systems and processes for selectively processing and responding to a spoken user input are provided. In one example, audio input containing a spoken user input can be received at a user device. The spoken user input can be identified from the audio input by identifying start and end-points of the spoken user input. It can be determined whether or not the spoken user input was intended for a virtual assistant based on contextual information. The determination can be made using a rule-based system or a probabilistic system. If it is determined that the spoken user input was intended for the virtual assistant, the spoken user input can be processed and an appropriate response can be generated. If it is instead determined that the spoken user input was not intended for the virtual assistant, the spoken user input can be ignored and/or no response can be generated.
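The determination described in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the contextual signals in `Context`, the rule conditions, the weights, and the 0.5 threshold are all assumptions chosen for illustration, since the abstract does not enumerate them.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Context:
    """Hypothetical contextual signals; the abstract does not list them."""
    user_facing_device: bool
    seconds_since_last_response: float
    matches_expected_followup: bool


def intended_by_rules(ctx: Context) -> bool:
    """Rule-based variant: any single rule firing is sufficient."""
    if ctx.user_facing_device and ctx.seconds_since_last_response < 5.0:
        return True
    return ctx.matches_expected_followup


def intended_by_score(ctx: Context, threshold: float = 0.5) -> bool:
    """Probabilistic-style variant: sum weighted evidence, then threshold.

    The weights are illustrative assumptions, not values from the patent.
    """
    score = 0.0
    score += 0.4 if ctx.user_facing_device else 0.0
    score += 0.3 if ctx.seconds_since_last_response < 5.0 else 0.0
    score += 0.3 if ctx.matches_expected_followup else 0.0
    return score >= threshold


def respond(utterance: str, ctx: Context, use_rules: bool = True) -> Optional[str]:
    """Process the utterance only if it was judged to be meant for the assistant."""
    intended = intended_by_rules(ctx) if use_rules else intended_by_score(ctx)
    if not intended:
        return None  # ignore the input and generate no response
    return f"Processing: {utterance}"
```

Returning `None` stands in for the "ignored and/or no response" branch; a real system would also need the start and end-point detection that the abstract mentions, which this sketch omits.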
-
2.
Publication No.: US11127397B2
Publication date: 2021-09-21
Application No.: US16139648
Application date: 2018-09-24
Applicant: Apple Inc.
Inventor: Philippe P. Piernot, Justin G. Binder
Abstract: Systems and processes for device voice control are provided. An example process includes, at an electronic device, receiving a spoken user input and interpreting the spoken user input to derive a representation of user intent. The process further includes determining whether a task may be identified based on the representation of user intent. In accordance with a determination that a task may be identified based on the representation of user intent, the task is performed, and in accordance with a determination that a task may not be identified based on the representation of user intent, the spoken user input is disambiguated.
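The perform-or-disambiguate branch in this abstract can be sketched as follows. This is a toy illustration only: the `TASKS` table, the keyword-based `interpret` function, and the wording of the disambiguation prompt are hypothetical and stand in for whatever intent derivation the patent actually claims.

```python
from typing import Callable, Dict, Optional

# Hypothetical table mapping a derived intent to a task.
TASKS: Dict[str, Callable[[], str]] = {
    "volume_up": lambda: "volume raised",
    "pause": lambda: "playback paused",
}


def interpret(spoken_text: str) -> Optional[str]:
    """Toy interpreter: keyword matching stands in for real language understanding."""
    text = spoken_text.lower()
    if "louder" in text or "volume up" in text:
        return "volume_up"
    if "pause" in text:
        return "pause"
    return None  # no representation of user intent could be derived


def handle(spoken_text: str) -> str:
    """Perform the task if one can be identified; otherwise disambiguate."""
    intent = interpret(spoken_text)
    task = TASKS.get(intent) if intent is not None else None
    if task is not None:
        return task()
    # Disambiguation step: ask the user to choose among the known tasks.
    return f"Did you mean one of: {', '.join(sorted(TASKS))}?"
```

For example, `handle("make it louder")` performs the volume task, while an input that maps to no task falls through to the disambiguation prompt.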
-
3.
Publication No.: US09715875B2
Publication date: 2017-07-25
Application No.: US14502737
Application date: 2014-09-30
Applicant: Apple Inc.
Inventor: Philippe P. Piernot, Justin G. Binder
CPC classification number: G10L15/22, G06F3/167, G10L15/1815, G10L15/1822, G10L15/26, G10L17/00, G10L2015/223, G10L2015/227, G10L2015/228, H04W4/025
Abstract: Systems and processes for selectively processing and responding to a spoken user input are provided. In one example, audio input containing a spoken user input can be received at a user device. The spoken user input can be identified from the audio input by identifying start and end-points of the spoken user input. It can be determined whether or not the spoken user input was intended for a virtual assistant based on contextual information. The determination can be made using a rule-based system or a probabilistic system. If it is determined that the spoken user input was intended for the virtual assistant, the spoken user input can be processed and an appropriate response can be generated. If it is instead determined that the spoken user input was not intended for the virtual assistant, the spoken user input can be ignored and/or no response can be generated.
-
4.
Publication No.: US09620104B2
Publication date: 2017-04-11
Application No.: US14298690
Application date: 2014-06-06
Applicant: Apple Inc.
Inventor: Devang K. Naik, Thomas R. Gruber, Liam Weiner, Justin G. Binder, Charles Srisuwananukorn, Gunnar Evermann, Shaun Eric Williams, Hong Chen, Lia T. Napolitano
CPC classification number: G10L13/027, G10L13/04, G10L13/08, G10L15/063, G10L15/22, G10L15/26, G10L15/265, G10L2015/0631, G10L2015/0638
Abstract: The method is performed at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors. A first speech input including at least one word is received. A first phonetic representation of the at least one word is determined, the first phonetic representation comprising a first set of phonemes selected from a speech recognition phonetic alphabet. The first set of phonemes is mapped to a second set of phonemes to generate a second phonetic representation, where the second set of phonemes is selected from a speech synthesis phonetic alphabet. The second phonetic representation is stored in association with a text string corresponding to the at least one word.
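The mapping step in this abstract, from a recognition phoneme set to a synthesis phoneme set, can be sketched in Python. The symbols in `RECOGNITION_TO_SYNTHESIS` are invented for illustration and do not correspond to any real phonetic alphabet named in the patent; the in-memory `pronunciations` dictionary is likewise an assumed stand-in for the storage step.

```python
from typing import Dict, List

# Hypothetical symbol table: recognition-alphabet phoneme -> synthesis-alphabet phoneme.
RECOGNITION_TO_SYNTHESIS: Dict[str, str] = {
    "f": "F",
    "ih": "IH",
    "l": "L",
    "ax": "AH",
}

# Stored pronunciations, keyed by the text string for the spoken word.
pronunciations: Dict[str, List[str]] = {}


def map_phonemes(recognized: List[str]) -> List[str]:
    """Map each recognition phoneme to its synthesis counterpart.

    Raises KeyError for a symbol that has no entry in the table.
    """
    return [RECOGNITION_TO_SYNTHESIS[p] for p in recognized]


def store_pronunciation(text: str, recognized: List[str]) -> List[str]:
    """Generate the second phonetic representation and store it with the text."""
    synthesized = map_phonemes(recognized)
    pronunciations[text] = synthesized
    return synthesized
```

A one-to-one table is the simplest possible mapping; the abstract does not say whether the real mapping is one-to-one, so treat that as a simplifying assumption.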
-
5.
Publication No.: US12254887B2
Publication date: 2025-03-18
Application No.: US17543292
Application date: 2021-12-06
Applicant: Apple Inc.
Inventor: Yoon Kim, Charles Srisuwananukorn, David A. Carson, Thomas R. Gruber, Justin G. Binder
Abstract: Systems and processes for operating an intelligent automated assistant to provide extension of digital assistant services are provided. An example method includes, at an electronic device having one or more processors, receiving, from a first user, a first speech input representing a user request. The method further includes obtaining an identity of the first user; and in accordance with the user identity, providing a representation of the user request to at least one of a second electronic device or a third electronic device. The method further includes receiving, based on a determination of whether the second electronic device or the third electronic device, or both, is to provide the response to the first electronic device, the response to the user request from the second electronic device or the third electronic device. The method further includes providing a representation of the response to the first user.
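The flow in this abstract (identify the user, forward the request to a second or third device, relay the response) can be sketched roughly as below. Every name here is hypothetical: `phone_responder` plays the "second electronic device", `server_responder` the "third", and `PAIRED` is an assumed lookup from user identity to a paired device. The fallback order is also an assumption; the abstract only says that a determination is made about which device provides the response.

```python
from typing import Callable, Dict, Optional

Responder = Callable[[str], Optional[str]]


def phone_responder(request: str) -> Optional[str]:
    """Hypothetical second device holding the user's personal data."""
    if "my calendar" in request:
        return "Next event: 3 pm stand-up"
    return None  # this device cannot answer the request


def server_responder(request: str) -> Optional[str]:
    """Hypothetical third device, e.g. a remote server."""
    return f"Web answer for: {request}"


# Assumed mapping from user identity to that user's paired second device.
PAIRED: Dict[str, Responder] = {"alice": phone_responder}


def extend_service(request: str, user_id: str) -> Optional[str]:
    """Run on the first device: route the request and relay the response."""
    second = PAIRED.get(user_id)
    if second is not None:
        response = second(request)
        if response is not None:
            return response  # the second device provides the response
    return server_responder(request)  # otherwise the third device does
```

Here the user identity decides whether a second device is consulted at all, which is one plausible reading of "in accordance with the user identity".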
-
6.
Publication No.: US11810562B2
Publication date: 2023-11-07
Application No.: US17461018
Application date: 2021-08-30
Applicant: Apple Inc.
Inventor: Philippe P. Piernot, Justin G. Binder
CPC classification number: G10L15/22, G06F3/013, G06F3/167, G10L15/26, H04W4/025, G06F2203/0381, G10L15/1815, G10L15/1822, G10L17/00, G10L2015/223, G10L2015/227, G10L2015/228
Abstract: Systems and processes for selectively processing and responding to a spoken user input are provided. In one example, audio input containing a spoken user input can be received at a user device. The spoken user input can be identified from the audio input by identifying start and end-points of the spoken user input. It can be determined whether or not the spoken user input was intended for a virtual assistant based on contextual information. The determination can be made using a rule-based system or a probabilistic system. If it is determined that the spoken user input was intended for the virtual assistant, the spoken user input can be processed and an appropriate response can be generated. If it is instead determined that the spoken user input was not intended for the virtual assistant, the spoken user input can be ignored and/or no response can be generated.
-
7.
Publication No.: US10748546B2
Publication date: 2020-08-18
Application No.: US16267146
Application date: 2019-02-04
Applicant: Apple Inc.
Inventor: Yoon Kim, Charles Srisuwananukorn, David A. Carson, Thomas R. Gruber, Justin G. Binder
Abstract: Systems and processes for operating an intelligent automated assistant to provide extension of digital assistant services are provided. An example method includes, at an electronic device having one or more processors, receiving, from a first user, a first speech input representing a user request. The method further includes obtaining an identity of the first user; and in accordance with the user identity, providing a representation of the user request to at least one of a second electronic device or a third electronic device. The method further includes receiving, based on a determination of whether the second electronic device or the third electronic device, or both, is to provide the response to the first electronic device, the response to the user request from the second electronic device or the third electronic device. The method further includes providing a representation of the response to the first user.
-
8.
Publication No.: US09966060B2
Publication date: 2018-05-08
Application No.: US15445863
Application date: 2017-02-28
Applicant: Apple Inc.
Inventor: Devang K. Naik, Thomas R. Gruber, Liam Weiner, Justin G. Binder, Charles Srisuwananukorn, Gunnar Evermann, Shaun Eric Williams, Hong Chen, Lia T. Napolitano
CPC classification number: G10L13/027, G10L13/04, G10L13/08, G10L15/063, G10L15/22, G10L15/26, G10L15/265, G10L2015/0631, G10L2015/0638
Abstract: The method is performed at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors. A first speech input including at least one word is received. A first phonetic representation of the at least one word is determined, the first phonetic representation comprising a first set of phonemes selected from a speech recognition phonetic alphabet. The first set of phonemes is mapped to a second set of phonemes to generate a second phonetic representation, where the second set of phonemes is selected from a speech synthesis phonetic alphabet. The second phonetic representation is stored in association with a text string corresponding to the at least one word.
-
9.
Publication No.: US12267623B2
Publication date: 2025-04-01
Application No.: US17751405
Application date: 2022-05-23
Applicant: Apple Inc.
Inventor: Justin G. Binder, Abhimanyu Yadav, Ahmed S. Hussen Abdelaziz, Abhishek Walia, Anushree Prasanna Kumar
Abstract: An example process includes receiving, from a user, an input corresponding to a request to render, without using a camera, and during a communication session with an external electronic device, an avatar associated with the user; and in accordance with receiving the input: in accordance with a determination that the electronic device is coupled to an external accessory device: during the communication session with the external electronic device, and while a camera corresponding to the communication session is disabled: receiving, from the external accessory device, a first data stream detected by a first type of sensor of the external accessory device; determining, based on the first data stream, a first set of data representing a first type of visual feature of the avatar; and rendering the avatar using the first set of data.
-
10.
Publication No.: US11217255B2
Publication date: 2022-01-04
Application No.: US15679108
Application date: 2017-08-16
Applicant: Apple Inc.
Inventor: Yoon Kim, Charles Srisuwananukorn, David A. Carson, Thomas R. Gruber, Justin G. Binder
Abstract: Systems and processes for operating an intelligent automated assistant to provide extension of digital assistant services are provided. An example method includes, at an electronic device having one or more processors, receiving, from a first user, a first speech input representing a user request. The method further includes obtaining an identity of the first user; and in accordance with the user identity, providing a representation of the user request to at least one of a second electronic device or a third electronic device. The method further includes receiving, based on a determination of whether the second electronic device or the third electronic device, or both, is to provide the response to the first electronic device, the response to the user request from the second electronic device or the third electronic device. The method further includes providing a representation of the response to the first user.