摘要:
An accurate grammar analyzer based on a so-called POST (part-of-speech tagged) parser and a learners' model for use in automated language learning applications such as the template-based ICALL (intelligent computer assisted language learning) system.
摘要:
A method of automatically generating a multi-variable fuzzy inference system using a Fourier series expansion. Sample sets are decomposed into a cluster of sample sets associated with given input variables. Fuzzy rules and membership functions are computed individually for each variable by solving a single input multiple outputs fuzzy system extracted from the set cluster. The resulting fuzzy rules and membership functions are composed and integrated back into the fuzzy system appropriate for the original sample set with a minimal computational cost. In addition, an overall system error can be related to errors at each stage of decomposition and composition, enabling error bounds or accuracy thresholds for each stage to be specified and ensuring the final precision of the resulting fuzzy system on the original sample set.
摘要:
To retrieve an optimum template pattern in response to an input sentence, a set of templates is arranged in a plurality of template blocks containing an arbitrary number of sentence components, including grammatically correct and/or incorrect components. A score is assigned to every word in the set of templates according to its importance. The candidate template patterns and the input sentence are retrieved, the scores of the matched words are calculated, and the total of the scores of the entire paths are calculated. Optimum level comparison values are then calculated using the score of the matching words as the numerator and the total score as the denominator. The candidate template pattern having the largest optimum level comparison value among optimum level comparison values that provide the largest numerator, is selected as the optimum template pattern. The input sentence is then corrected using this optimum template pattern.
摘要:
A learning system uses the concept of template automaton. “Various expected examples of learners” consisting of “correct” answers and “incorrect” answers are collected and a representative NLP technique such as HCS (heaviest common character string) or LCS (longest common character string) algorithm is used as an effective error diagnosis engine in the language learning system. These examples embedded in the template are used for diagnostic analysis of the answers of the learners. This diagnosis is performed by selecting a path of the highest similarity with the input sentence of the learner among a plenty of candidate paths. Thus, it is possible to automatize and simplify the time-requiring authoring task used in the language-oriented intelligent learning system.
摘要:
A computerized method for automatic document classification based on a combined use of the projection and the distance of the differential document vectors to the differential latent semantics index (DLSI) spaces. The method includes the setting up of a DLSI space-based classifier to be stored in computer storage and the use of such classifier by a computer to evaluate the possibility of a document belonging to a given cluster using a posteriori probability function and to classify the document in the cluster. The classifier is effective in operating on very large numbers of documents such as with document retrieval systems over a distributed computer network.
摘要:
An accurate grammar analyzer that works effectively even with error-ridden sentences input by learners, based on a context-free probabilistic statistical POST (part-of-speech tagged) parser, for a template-automation-based computer-assisted language learning system. For any keyed-in sentence, the parser finds a closest correct sentence to the keyed-in sentence from among the embedded template paths exploiting a highest similarity value, and generates a grammar tree for the correct sentence where some ambiguous words are preassigned by expert language teachers. The system marks the errors under the leaves of the grammar tree by identifying the differences between the keyed-in sentence and the grammar tree of the correct sentence as errors committed by learners. By identifying most frequently recurring grammatical errors of each student, the system sets up a learner's model, providing a unique level of contingent remediation most appropriate to each learner involved.
摘要:
To improve the precision in correction of an input sentence by using a template pattern for model sentence. A plurality of template patterns for the model sentence are provided beforehand. Each of the template patterns is regarded as a plurality of templates of words/phrases based on expertise of language teachers with scores assigned to the words according to their importance. The scores and subsequently the input sentence are read and analyzed in comparison with each of the template patterns and the total of scores of matching words is calculated. A template pattern having the highest total score is selected as an optimum template pattern and the input sentence is corrected using the optimum template pattern. This method improves the likelihood that a template pattern containing a larger number of important words is selected as the optimum template pattern.
摘要:
A new, more efficient memory translation algorithm facilitating the acquisition of a most appropriate translation in a target language from among those of nearly narrowed-down candidates of translation by separately applying the so-called dimension reducing functions of a template automaton and the LSI (latent semantic index) technique. Both the template automaton and the LSI principle play an important role in implementing an efficient process of narrowing down an efficient solution space from among the many example sentences of the databases in a target language by exploiting their respective unique search space reduction function. Once developed into a fully operational system, an expert editor rather than an expert translator can tune up the translation memory system, markedly widening the range of available experts who can utilize the system.
摘要:
A learning system uses the concept of template automaton. “Various expected examples of learners” consisting of “correct” answers and “incorrect” answers are collected and a representative NLP technique such as HCS (heaviest common character string) or LCS (longest common character string) algorithm is used as an effective error diagnosis engine in the language learning system. These examples embedded in the template are used for diagnostic analysis of the answers of the learners. This diagnosis is performed by selecting a path of the highest similarity with the input sentence of the learner among a plenty of candidate paths. Thus, it is possible to automatize and simplify the time-requiring authoring task used in the language-oriented intelligent learning system.
摘要:
A computer-based information search and retrieval system and method for retrieving textual digital objects that makes full use of the projections of the documents onto both the reduced document space characterized by the singular value decomposition-based latent semantic structure and its orthogonal space. The resulting system and method has increased robustness, improving the instability of the traditional keyword search engine due to synonymy and/or polysemy of a natural language, and therefore is particularly suitable for web document searching over a distributed computer network such as the Internet.