Abstract:
A grammar programming language ("GPL") compiler (300) compiles each rule in a natural language grammar (311) into a separate function (312) that can be invoked by a translation system to apply the rule to a representation of a natural language expression. The GPL compiler can output the functions for the rules as source code for a standard computer programming language to be further compiled (313) into object code that can be directly executed by a computer processor. The GPL compiler can also generate special functions for each rule to enable multi-layered operations (505, 507, 509) on the representations and to handle the processing of representations of ambiguous expressions (601, 605).
Abstract:
A method and system for reducing lexical ambiguity in an input stream (302) are described. In one embodiment, the input stream (302) is broken into tokens (308). The tokens are used to create a connection graph comprising a number of paths. Each of the paths is assigned a cost (318). At least one best path is defined based upon a corresponding cost to generate an output graph (322). The generated output graph is provided to reduce lexical ambiguity.
Abstract:
A natural language translation system contains language-neutral modules for syntactic analysis (216), transfer (222), and morphological and syntactical generation (228) of feature structures for an input expression in a source (202) and a target language (230). The language-neutral modules are driven by language-specific grammars (212, 218, 224) to translate between the specified languages so that no knowledge about the languages need be incorporated into the modules themselves. The modules interface with the grammar rules in the form of compiled grammar programming language statements that perform the required manipulation of the feature sturctures. Because the modules are language-neutral, the system is readily adaptable to new languages simply by providing a grammar for the new language. Multiple copies of each module, each interfacing with a different natural language grammar, enables simultaneous translation of multiple languages in the same system.