-
公开(公告)号:US12027166B2
公开(公告)日:2024-07-02
申请号:US17402328
申请日:2021-08-13
Applicant: Apple Inc.
Inventor: Hong Yu , Saurabh Adya , Shruti Bhargava , Myra C. Lukens , Jianpeng Cheng , Lin Li , Alkeshkumar M. Patel , Dhivya Piraviperumal , Stephen G. Pulman
CPC classification number: G10L15/22 , G06F3/013 , G06F3/017 , G10L15/1815 , G10L2015/228
Abstract: Systems and processes for operating a digital assistant are provided. An example process for performing a task includes, at an electronic device having one or more processors and memory, receiving a spoken input including a request, receiving an image input including a plurality of objects, selecting a reference resolution module of a plurality of reference resolution modules based on the request and the image input, determining, with the selected reference resolution module, whether the request references a first object of the plurality of objects based on at least the spoken input, and in accordance with a determination that the request references the first object of the plurality of objects, determining a response to the request including information about the first object.
-
公开(公告)号:US12190873B2
公开(公告)日:2025-01-07
申请号:US17952005
申请日:2022-09-23
Applicant: Apple Inc.
Inventor: Ahmed S. Hussen Abdelaziz , Saurabh Adya , Alexander W. Churchill , Pranay Dighe , Sachin S. Kajarekar , Chaitanya Mannemala , Erik Marchi , Seyedmahdad Mirsamadi , Ognjen Rudovic , Ahmed H. Tewfik , Barry-John Theobald , Srikanth Vishnubhotla
Abstract: An example process includes: receiving a speech input representing a user utterance; determining, based on a textual representation of the speech input, a first score corresponding to a type of the user utterance; determining, based on the textual representation of the speech input, a second score representing a correspondence between the user utterance and a domain recognized by a digital assistant; determining, based on the first score and the second score, whether the speech input is intended for the digital assistant; in accordance with a determination that the speech input is intended for the digital assistant: initiating, by the digital assistant, a task based on the speech input; and providing an output indicative of the initiated task.
-
公开(公告)号:US12073831B1
公开(公告)日:2024-08-27
申请号:US17576419
申请日:2022-01-14
Applicant: Apple Inc.
Inventor: Saurabh Adya , Sameer Badaskar , Akanksha Bindal , Ahmed S. Hussen Abdelaziz , Xiaochuan Niu , Alkeshkumar M. Patel , Srikanth Vishnubhotla
CPC classification number: G10L15/22 , G06F18/214 , G06V10/82 , G06V20/50 , G10L15/063 , G10L15/16 , G10L15/18 , G10L15/24
Abstract: Systems and processes for operating a digital assistant are provided. An example method for processing an image include receiving an image, generating, based on the image, a question corresponding to a first object in the image, generating, based on the image, a caption corresponding to a second object of the image, receiving an utterance from a user, and determining a plurality of speech recognition results from the utterance based on the question and the caption.
-
-