Patent search ap:("Apple Inc.") AND inv:"Xiaochuan Niu" Page 1

1.

Granted invention patent
Rank-reduced token representation for automatic speech recognition (In force)

Publication No.: US10593346B2

Publication date: 2020-03-17

Application No.: US15459481

Filing date: 2017-03-15

Applicant: Apple Inc.

Inventor: Christophe J. Van Gysel, Yi Su, Xiaochuan Niu, Ilya Oparin

IPC: G10L21/10, G10L15/16, G10L15/183

Abstract: The present disclosure generally relates to processing speech or text using rank-reduced token representation. In one example process, speech input is received. A sequence of candidate words corresponding to the speech input is determined. The sequence of candidate words includes a current word and one or more previous words. A vector representation of the current word is determined from a set of trained parameters. A number of parameters in the set of trained parameters varies as a function of one or more linguistic characteristics of the current word. Using the vector representation of the current word, a probability of a next word given the current word and the one or more previous words is determined. A text representation of the speech input is displayed based on the determined probability.
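
The idea of a per-word parameter count that depends on a linguistic characteristic can be illustrated with a minimal sketch. It is not the patented method: it assumes word frequency as the characteristic, and the bucket thresholds, ranks, dimension, and all names (`RankReducedEmbedding`, `bucket_for`, `FULL_DIM`) are hypothetical. Each word stores a short parameter vector whose length depends on its frequency bucket, and a shared per-bucket projection maps it into a common vector space.

```python
import random

random.seed(0)

FULL_DIM = 8  # dimension of the shared representation space (assumed)

# Hypothetical rank per frequency bucket: frequent words get more parameters.
RANK_BY_BUCKET = {"frequent": 8, "medium": 4, "rare": 2}


def bucket_for(count):
    """Map a corpus count to a frequency bucket (thresholds are assumptions)."""
    if count >= 1000:
        return "frequent"
    if count >= 100:
        return "medium"
    return "rare"


def rand_vec(n):
    """Return n small random values standing in for trained parameters."""
    return [random.gauss(0.0, 0.1) for _ in range(n)]


class RankReducedEmbedding:
    def __init__(self, counts):
        # One shared projection matrix (FULL_DIM rows x rank columns) per bucket.
        self.proj = {b: [rand_vec(r) for _ in range(FULL_DIM)]
                     for b, r in RANK_BY_BUCKET.items()}
        self.bucket = {w: bucket_for(c) for w, c in counts.items()}
        # Per-word parameters: the vector length depends on the word's bucket.
        self.params = {w: rand_vec(RANK_BY_BUCKET[b])
                       for w, b in self.bucket.items()}

    def num_params(self, word):
        """Number of word-specific parameters stored for this word."""
        return len(self.params[word])

    def vector(self, word):
        """Project the word's low-rank parameters up to FULL_DIM."""
        p = self.params[word]
        m = self.proj[self.bucket[word]]
        return [sum(row[i] * p[i] for i in range(len(p))) for row in m]


emb = RankReducedEmbedding({"the": 50000, "speech": 400, "zymurgy": 3})
for w in ("the", "speech", "zymurgy"):
    print(w, emb.num_params(w), len(emb.vector(w)))
```

In this sketch every word yields a vector of the same length, so a downstream language model could consume them uniformly, while rare words contribute fewer word-specific parameters than frequent ones.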

2.

Granted invention patent
Using visual context to improve a virtual assistant (In force)

Publication No.: US12073831B1

Publication date: 2024-08-27

Application No.: US17576419

Filing date: 2022-01-14

Applicant: Apple Inc.

Inventor: Saurabh Adya, Sameer Badaskar, Akanksha Bindal, Ahmed S. Hussen Abdelaziz, Xiaochuan Niu, Alkeshkumar M. Patel, Srikanth Vishnubhotla

IPC: G10L15/22, G06F18/214, G06V10/82, G06V20/50, G10L15/06, G10L15/16, G10L15/18, G10L15/24

CPC classification number: G10L15/22, G06F18/214, G06V10/82, G06V20/50, G10L15/063, G10L15/16, G10L15/18, G10L15/24

Abstract: Systems and processes for operating a digital assistant are provided. An example method for processing an image includes receiving an image, generating, based on the image, a question corresponding to a first object in the image, generating, based on the image, a caption corresponding to a second object of the image, receiving an utterance from a user, and determining a plurality of speech recognition results from the utterance based on the question and the caption.
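
One way to picture how image-derived text could influence speech recognition results is a simple reranking of candidate transcriptions. The sketch below is an assumption for illustration only and is not taken from the patent: the function names (`tokens`, `rerank`), the word-overlap bonus, and the `boost` value are all hypothetical, and the question and caption are supplied as plain strings rather than generated from an image.

```python
def tokens(text):
    """Lowercase a string and split it into a set of words without punctuation."""
    return {t.strip(".,?!").lower() for t in text.split()} - {""}


def rerank(hypotheses, question, caption, boost=0.5):
    """Rescore (text, score) hypotheses by word overlap with the visual context.

    Each word shared with the question or caption adds `boost` to the score
    (a hypothetical heuristic). Returns hypotheses sorted best first.
    """
    context = tokens(question) | tokens(caption)
    rescored = []
    for text, score in hypotheses:
        overlap = len(tokens(text) & context)
        rescored.append((text, score + boost * overlap))
    return sorted(rescored, key=lambda h: h[1], reverse=True)


# Two acoustically similar candidates with made-up recognizer scores.
hyps = [("what bread is that", -1.0), ("what breed is that", -1.2)]
ranked = rerank(
    hyps,
    question="What breed is the dog?",
    caption="A dog sitting on a couch.",
)
print(ranked[0][0])
```

Here the candidate containing "breed" overtakes the one containing "bread" because it shares one more word with the image-derived question, which is the kind of disambiguation visual context is meant to provide.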
