Rank-reduced token representation for automatic speech recognition

    公开(公告)号:US10593346B2

    公开(公告)日:2020-03-17

    申请号:US15459481

    申请日:2017-03-15

    Applicant: Apple Inc.

    Abstract: The present disclosure generally relates to processing speech or text using rank-reduced token representation. In one example process, speech input is received. A sequence of candidate words corresponding to the speech input is determined. The sequence of candidate words includes a current word and one or more previous words. A vector representation of the current word is determined from a set of trained parameters. A number of parameters in the set of trained parameters varies as a function of one or more linguistic characteristics of the current word. Using the vector representation of the current word, a probability of a next word given the current word and the one or more previous words is determined. A text representation of the speech input is displayed based on the determined probability.

    Applying neural network language models to weighted finite state transducers for automatic speech recognition

    公开(公告)号:US10049668B2

    公开(公告)日:2018-08-14

    申请号:US15156161

    申请日:2016-05-16

    Applicant: Apple Inc.

    Abstract: Systems and processes for converting speech-to-text are provided. In one example process, speech input can be received. A sequence of states and arcs of a weighted finite state transducer (WFST) can be traversed. A negating finite state transducer (FST) can be traversed. A virtual FST can be composed using a neural network language model and based on the sequence of states and arcs of the WFST. The one or more virtual states of the virtual FST can be traversed to determine a probability of a candidate word given one or more history candidate words. Text corresponding to the speech input can be determined based on the probability of the candidate word given the one or more history candidate words. An output can be provided based on the text corresponding to the speech input.

    Applying neural network language models to weighted finite state transducers for automatic speech recognition

    公开(公告)号:US10354652B2

    公开(公告)日:2019-07-16

    申请号:US16035513

    申请日:2018-07-13

    Applicant: Apple Inc.

    Abstract: Systems and processes for converting speech-to-text are provided. In one example process, speech input can be received. A sequence of states and arcs of a weighted finite state transducer (WFST) can be traversed. A negating finite state transducer (FST) can be traversed. A virtual FST can be composed using a neural network language model and based on the sequence of states and arcs of the WFST. The one or more virtual states of the virtual FST can be traversed to determine a probability of a candidate word given one or more history candidate words. Text corresponding to the speech input can be determined based on the probability of the candidate word given the one or more history candidate words. An output can be provided based on the text corresponding to the speech input.

Patent Agency Ranking