-
公开(公告)号:US10593346B2
公开(公告)日:2020-03-17
申请号:US15459481
申请日:2017-03-15
Applicant: Apple Inc.
Inventor: Christophe J. Van Gysel , Yi Su , Xiaochuan Niu , Ilya Oparin
IPC: G10L21/10 , G10L15/16 , G10L15/183
Abstract: The present disclosure generally relates to processing speech or text using rank-reduced token representation. In one example process, speech input is received. A sequence of candidate words corresponding to the speech input is determined. The sequence of candidate words includes a current word and one or more previous words. A vector representation of the current word is determined from a set of trained parameters. A number of parameters in the set of trained parameters varies as a function of one or more linguistic characteristics of the current word. Using the vector representation of the current word, a probability of a next word given the current word and the one or more previous words is determined. A text representation of the speech input is displayed based on the determined probability.
-
公开(公告)号:US10049668B2
公开(公告)日:2018-08-14
申请号:US15156161
申请日:2016-05-16
Applicant: Apple Inc.
Inventor: Rongqing Huang , Ilya Oparin
IPC: G10L15/16 , G10L15/197 , G10L15/28 , G10L15/14 , G10L15/08
Abstract: Systems and processes for converting speech-to-text are provided. In one example process, speech input can be received. A sequence of states and arcs of a weighted finite state transducer (WFST) can be traversed. A negating finite state transducer (FST) can be traversed. A virtual FST can be composed using a neural network language model and based on the sequence of states and arcs of the WFST. The one or more virtual states of the virtual FST can be traversed to determine a probability of a candidate word given one or more history candidate words. Text corresponding to the speech input can be determined based on the probability of the candidate word given the one or more history candidate words. An output can be provided based on the text corresponding to the speech input.
-
公开(公告)号:US10354652B2
公开(公告)日:2019-07-16
申请号:US16035513
申请日:2018-07-13
Applicant: Apple Inc.
Inventor: Rongqing Huang , Ilya Oparin
IPC: G10L15/28 , G10L15/14 , G10L15/16 , G10L15/19 , G10L15/197 , G10L15/193 , G10L15/08
Abstract: Systems and processes for converting speech-to-text are provided. In one example process, speech input can be received. A sequence of states and arcs of a weighted finite state transducer (WFST) can be traversed. A negating finite state transducer (FST) can be traversed. A virtual FST can be composed using a neural network language model and based on the sequence of states and arcs of the WFST. The one or more virtual states of the virtual FST can be traversed to determine a probability of a candidate word given one or more history candidate words. Text corresponding to the speech input can be determined based on the probability of the candidate word given the one or more history candidate words. An output can be provided based on the text corresponding to the speech input.
-
-