Patent search ap:("Apple Inc.") AND inv:"Saurabh Adya" Page 1

1.

Granted invention patent
Digital assistant reference resolution (in force)

Publication number: US12027166B2

Publication date: 2024-07-02

Application number: US17402328

Application date: 2021-08-13

Applicant: Apple Inc.

Inventor: Hong Yu, Saurabh Adya, Shruti Bhargava, Myra C. Lukens, Jianpeng Cheng, Lin Li, Alkeshkumar M. Patel, Dhivya Piraviperumal, Stephen G. Pulman

IPC: G10L15/22, G06F3/01, G10L15/18

CPC classification number: G10L15/22, G06F3/013, G06F3/017, G10L15/1815, G10L2015/228

Abstract: Systems and processes for operating a digital assistant are provided. An example process for performing a task includes, at an electronic device having one or more processors and memory, receiving a spoken input including a request, receiving an image input including a plurality of objects, selecting a reference resolution module of a plurality of reference resolution modules based on the request and the image input, determining, with the selected reference resolution module, whether the request references a first object of the plurality of objects based on at least the spoken input, and in accordance with a determination that the request references the first object of the plurality of objects, determining a response to the request including information about the first object.
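The flow in this abstract (select one of several reference resolution modules, resolve the request against the objects in the image, respond only if an object is referenced) can be illustrated with a minimal sketch. It is not the patented method: the two resolvers, the selection rule, and all function names are hypothetical, and the image input is assumed to have already been reduced to a list of object labels.

```python
from typing import Callable, List, Optional

# A resolver takes the request text and the object labels found in the image,
# and returns the referenced label or None. (Hypothetical interface.)
Resolver = Callable[[str, List[str]], Optional[str]]


def resolve_by_name(request: str, objects: List[str]) -> Optional[str]:
    """Return the first object whose label appears verbatim in the request."""
    text = request.lower()
    for label in objects:
        if label.lower() in text:
            return label
    return None


def resolve_by_deixis(request: str, objects: List[str]) -> Optional[str]:
    """Assumption: 'this'/'that' refers to the first (most salient) object."""
    words = request.lower().split()
    if ("this" in words or "that" in words) and objects:
        return objects[0]
    return None


def select_resolver(request: str, objects: List[str]) -> Resolver:
    """Pick a module based on both the request and the image contents."""
    text = request.lower()
    if any(label.lower() in text for label in objects):
        return resolve_by_name
    return resolve_by_deixis


def respond(request: str, objects: List[str]) -> Optional[str]:
    """Build a response only when the request references an object."""
    resolver = select_resolver(request, objects)
    target = resolver(request, objects)
    if target is None:
        return None
    return f"Information about the {target}"
```

For example, `respond("what breed is the dog", ["cat", "dog"])` selects the name-based resolver, while `respond("what is that", ["lamp", "chair"])` falls back to the deixis-based one.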

2.

Granted invention patent
Determining whether speech input is intended for a digital assistant (in force)

Publication number: US12190873B2

Publication date: 2025-01-07

Application number: US17952005

Application date: 2022-09-23

Applicant: Apple Inc.

Inventor: Ahmed S. Hussen Abdelaziz, Saurabh Adya, Alexander W. Churchill, Pranay Dighe, Sachin S. Kajarekar, Chaitanya Mannemala, Erik Marchi, Seyedmahdad Mirsamadi, Ognjen Rudovic, Ahmed H. Tewfik, Barry-John Theobald, Srikanth Vishnubhotla

IPC: G10L15/22, G06T7/70, G06V40/16, G10L15/16, G10L15/197, G10L25/78, G10L15/08

Abstract: An example process includes: receiving a speech input representing a user utterance; determining, based on a textual representation of the speech input, a first score corresponding to a type of the user utterance; determining, based on the textual representation of the speech input, a second score representing a correspondence between the user utterance and a domain recognized by a digital assistant; determining, based on the first score and the second score, whether the speech input is intended for the digital assistant; in accordance with a determination that the speech input is intended for the digital assistant: initiating, by the digital assistant, a task based on the speech input; and providing an output indicative of the initiated task.
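The two-score decision described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how either score is computed or combined, so the keyword heuristics, the averaging rule, the 0.5 threshold, and every function name here are hypothetical stand-ins.

```python
from typing import Dict, Set

# Hypothetical set of words that typically open a device-directed command.
COMMAND_WORDS = {"set", "play", "call", "what", "turn", "remind"}


def score_utterance_type(text: str) -> float:
    """First score: how command-like the utterance is (toy heuristic)."""
    words = text.lower().split()
    if not words:
        return 0.0
    return 1.0 if words[0] in COMMAND_WORDS else 0.2


def score_domain_match(text: str, domains: Dict[str, Set[str]]) -> float:
    """Second score: best keyword coverage over the assistant's domains."""
    words = set(text.lower().split())
    best = 0.0
    for keywords in domains.values():
        if keywords:
            best = max(best, len(words & keywords) / len(keywords))
    return best


def is_intended_for_assistant(
    text: str, domains: Dict[str, Set[str]], threshold: float = 0.5
) -> bool:
    """Combine both scores (assumed: simple average) and apply a threshold."""
    first = score_utterance_type(text)
    second = score_domain_match(text, domains)
    return (first + second) / 2 >= threshold
```

With `domains = {"timer": {"set", "timer"}, "music": {"play", "song"}}`, the utterance "set a timer for five minutes" passes the threshold, whereas "i think it rained yesterday" does not. A real system would presumably use learned models for both scores.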

3.

Granted invention patent
Using visual context to improve a virtual assistant (in force)

Publication number: US12073831B1

Publication date: 2024-08-27

Application number: US17576419

Application date: 2022-01-14

Applicant: Apple Inc.

Inventor: Saurabh Adya, Sameer Badaskar, Akanksha Bindal, Ahmed S. Hussen Abdelaziz, Xiaochuan Niu, Alkeshkumar M. Patel, Srikanth Vishnubhotla

IPC: G10L15/22, G06F18/214, G06V10/82, G06V20/50, G10L15/06, G10L15/16, G10L15/18, G10L15/24

CPC classification number: G10L15/22, G06F18/214, G06V10/82, G06V20/50, G10L15/063, G10L15/16, G10L15/18, G10L15/24

Abstract: Systems and processes for operating a digital assistant are provided. An example method for processing an image includes receiving an image, generating, based on the image, a question corresponding to a first object in the image, generating, based on the image, a caption corresponding to a second object of the image, receiving an utterance from a user, and determining a plurality of speech recognition results from the utterance based on the question and the caption.
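One way to picture speech recognition results being determined "based on the question and the caption" is to rerank candidate transcriptions by their word overlap with that image-derived text. The sketch below assumes exactly that and nothing more: the abstract does not describe the biasing mechanism, so the overlap bonus, the `weight` parameter, and the function name `rerank` are hypothetical, and the question and caption are assumed to be already generated.

```python
from typing import List, Tuple


def rerank(
    hypotheses: List[Tuple[str, float]],
    question: str,
    caption: str,
    weight: float = 0.5,
) -> List[str]:
    """Order (text, score) hypotheses by score plus a context-overlap bonus."""
    # Vocabulary drawn from the image-derived question and caption.
    context = set(question.lower().split()) | set(caption.lower().split())

    def biased(item: Tuple[str, float]) -> float:
        text, score = item
        words = text.lower().split()
        # Fraction of hypothesis words that also occur in the visual context.
        overlap = sum(1 for w in words if w in context) / len(words) if words else 0.0
        return score + weight * overlap

    return [text for text, _ in sorted(hypotheses, key=biased, reverse=True)]
```

Given the hypotheses `("what brand is the shoe", 0.40)` and `("what bland is the shoo", 0.45)`, the question "what brand is the shoe", and the caption "a red shoe on a table", the first hypothesis is ranked on top despite its lower recognizer score, because all of its words appear in the visual context.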
