摘要:
Systems and processes for operating an intelligent automated assistant are provided. In one example process a set of words including a grammatical error is received. The process can generate, using a neural network based on the set of words including the grammatical error and a reference set of words, a transformed set of words and further determine, based on the set of words including the grammatical error and the reference set of words, a reconstructed reference set of words. The process can also determine, based on a comparison of the transformed set of words and the reconstructed reference set of words, whether the transformed set of words is grammatically correct and provide an indication of whether the transformed set of words is grammatically correct to the neural network.
摘要:
An example method includes at an electronic device with one or more processors and memory: obtaining first data representing a user-generated keyboard path for one or more words; obtaining second data representing a synthetic keyboard path for the one or more words; generating, using a first instance of a generative network, based on the first data and the second data, third data representing a modification of the synthetic keyboard path; determining whether the third data represent a second user-generated keyboard path; determining whether the third data represent the one or more words; and in accordance with a determination that the third data represent a second user-generated keyboard path and a determination that the third data represent the one or more words: training a model for keyboard path recognition based on the third data.
摘要:
Methods, systems, and computer-readable media related to a technique for providing handwriting input functionality on a user device. A handwriting recognition module is trained to have a repertoire comprising multiple non-overlapping scripts and capable of recognizing tens of thousands of characters using a single handwriting recognition model. The handwriting input module provides real-time, stroke-order and stroke-direction independent handwriting recognition for multi-character handwriting input. In particular, real-time, stroke-order and stroke-direction independent handwriting recognition is provided for multi-character, or sentence level Chinese handwriting recognition. User interfaces for providing the handwriting input functionality are also disclosed.
摘要:
Systems and processes for language identification using recurrent neural networks are provided. An example method includes, at an electronic device, receiving a first typed character of a character sequence and determining a character context of the first typed character based on the first typed character and a second typed character of the character sequence. The method further includes determining a confidence level that the character sequence is associated with a language of a plurality of languages based on the character context of the first typed character, and determining whether the confidence level exceeds a threshold, in accordance with a determination that the confidence level exceeds the threshold, providing the language as a candidate language, and in accordance with a determination that the confidence level does not exceed the threshold, forgoing providing the language as a candidate language.
摘要:
Methods, systems, and computer-readable media related to a technique for providing handwriting input functionality on a user device. A handwriting recognition module is trained to have a repertoire comprising multiple non-overlapping scripts and capable of recognizing tens of thousands of characters using a single handwriting recognition model. The handwriting input module provides real-time, stroke-order and stroke-direction independent handwriting recognition for multi-character handwriting input. In particular, real-time, stroke-order and stroke-direction independent handwriting recognition is provided for multi-character, or sentence level Chinese handwriting recognition. User interfaces for providing the handwriting input functionality are also disclosed.
摘要:
Methods, systems, and computer-readable media related to a technique for providing handwriting input functionality on a user device. A handwriting recognition module is trained to have a repertoire comprising multiple non-overlapping scripts and capable of recognizing tens of thousands of characters using a single handwriting recognition model. The handwriting input module provides real-time, stroke-order and stroke-direction independent handwriting recognition for multi-character handwriting input. In particular, real-time, stroke-order and stroke-direction independent handwriting recognition is provided for multi-character, or sentence level Chinese handwriting recognition. User interfaces for providing the handwriting input functionality are also disclosed.
摘要:
The present disclosure generally relates to updating a language model based on user feedback. Based on a user text input, a language model predicts a set of tokens and an action that will be taken by the user in response to the predicted set of tokens. If the predicted action does not match a detected actual user action, the language model is updated to reflect the user feedback by modifying an output token probability distribution based on the actual user action and updating the language model to converge with a target language model using the modified output token probability distribution.
摘要:
The embodiments set forth techniques for sorting emails within an email application. The technique can include: (1) accessing a plurality of emails, where each email is included in a respective folder of one or more folders, (2) establishing a set of n-grams based on the plurality of emails, (3) for each n-gram in the set of n-grams: calculating a respective normalized entropy for the n-gram, and calculating a respective indexing power for the n-gram based on its respective normalized entropy, (4) ranking the n-grams based on their respective indexing powers to establish a ranked list of n-grams, and (5) establishing, for each folder of the one or more folders, a respective set of high-value n-grams based on the ranked list of n-grams. In turn, the high-value n-grams can be compared against n-grams of a new email to identify a target folder into which the new email should be sorted.
摘要:
Exemplary embodiments of methods and apparatuses for automatic speech recognition are described. First model parameters associated with a first representation of an input signal are generated. The first representation of the input signal is a discrete parameter representation. Second model parameters associated with a second representation of the input signal are generated. The second representation of the input signal includes a continuous parameter representation of residuals of the input signal. The first representation of the input signal includes discrete parameters representing first portions of the input signal. The second representation includes discrete parameters representing second portions of the input signal that are smaller than the first portions. Third model parameters are generated to couple the first representation of the input signal with the second representation of the input signal. The first representation and the second representation of the input signal are mapped into a vector space.
摘要:
Systems and methods are disclosed for predicting words using a structured stem and suffix n-gram language model. The systems and methods include determining, using a first n-gram word language model, a first probability of a stem based on a first portion of a previously-input word in the received input. Using a second n-gram language model, a second probability of a first suffix may be determined based at least on a second portion the previously-input word in the received input. Further, a third probability of a second suffix different from the first suffix may be determined using a third n-gram language model based at least on a third portion of the previously-input word in the received input. A fourth probability of a predicted word may be determined based on the first, second and third probabilities. One or more predicted words may be determined and provided as an output to the user.