Abstract:
An improved method, apparatus, and computer instructions for displaying search results. A query is received in a first language. In response to identifying a document in a second language that contains a word matching a query word in the query, a snippet of the document containing that word is selected. This snippet is translated into the first language.
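The flow described above can be sketched as follows. This is a minimal illustration, not the patented method: the abstract does not say how a query word is matched against a second-language document, so the sketch assumes the query word is first translated into the document language. The names `select_snippet`, `search`, `translate_query_word`, `translate_snippet`, and the `window` parameter are all hypothetical.

```python
def select_snippet(document, matched_word, window=5):
    """Return the words around the first occurrence of matched_word, or None."""
    words = document.split()
    for i, word in enumerate(words):
        if word.strip(".,;:!?").lower() == matched_word.lower():
            start = max(0, i - window)
            return " ".join(words[start:i + window + 1])
    return None


def search(query_words, documents, translate_query_word, translate_snippet, window=5):
    """Return (original snippet, translated snippet) pairs for matching documents.

    translate_query_word and translate_snippet are caller-supplied callables
    (assumed here; the abstract does not specify the translation component).
    """
    results = []
    for document in documents:
        for query_word in query_words:
            target_word = translate_query_word(query_word)
            if target_word is None:
                continue  # no known translation for this query word
            snippet = select_snippet(document, target_word, window)
            if snippet is not None:
                # Translate only the snippet back into the query language.
                results.append((snippet, translate_snippet(snippet)))
                break
    return results
```

Only the selected snippet is translated, not the whole document, which is the point the abstract emphasizes.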
Abstract:
Techniques for translating and presenting text in a structured document are provided. A structured document including text to be translated is received. At least one portion of the text to be translated is translated into at least one representation of translated text of a specific language. The translation is carried out in accordance with a prioritization schedule. Further, the representation of translated text is provided by at least first and second servers, the first server independently providing the representation of translated text of a first quality and the second server independently providing the representation of translated text of a second quality, the first quality being different from the second quality. As the text is translated, each representation of translated text is presented as soon as it is available.
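A rough sketch of the prioritized, two-server idea is given below. It is deliberately synchronous and simplified: a real system would presumably query the servers concurrently. The numeric priority convention (lower value first), the "draft" and "refined" labels, and the names `translate_by_priority`, `fast_server`, `slow_server`, and `present` are assumptions made for illustration only.

```python
import heapq


def translate_by_priority(portions, fast_server, slow_server, present):
    """Translate document portions in priority order.

    portions: iterable of (priority, portion_id, text); lower priority first.
    fast_server, slow_server: callables returning translations of differing quality.
    present: callback invoked as soon as each representation is available.
    """
    queue = list(portions)
    heapq.heapify(queue)  # the prioritization schedule, modeled as a min-heap
    while queue:
        _, portion_id, text = heapq.heappop(queue)
        # Present the first-quality representation immediately.
        present(portion_id, "draft", fast_server(text))
        # Present the second-quality representation once it is ready.
        present(portion_id, "refined", slow_server(text))
```

The callback style keeps presentation decoupled from translation, so a caller could, for instance, overwrite a draft in the page with the refined text when it arrives.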
Abstract:
A parser is provided that parses a Chinese text stream at the character level and builds a syntactic structure of Chinese character sequences. A character-based syntactic parse tree contains word boundaries, part-of-speech tags, and phrasal structure information. Syntactic knowledge constrains the system when it determines word boundaries. A deterministic procedure is used to convert word-based parse trees into character-based trees. Character-level tags are derived from word-level part-of-speech tags, and word-boundary information is encoded with a positional tag. Word-level part-of-speech tags become constituent labels in character-based trees. A maximum entropy parser is then built and tested.
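The derivation of character-level tags from word-level tags can be illustrated with a short sketch. The abstract only says that word-boundary information is encoded with a positional tag; the specific `b`/`m`/`e`/`s` suffixes (begin, middle, end, single) and the function names used here are assumptions, not the patent's own tag set.

```python
def word_to_char_tags(tagged_words):
    """Convert (word, pos) pairs into (character, pos + positional suffix) pairs."""
    char_tags = []
    for word, pos in tagged_words:
        last = len(word) - 1
        for i, ch in enumerate(word):
            if last == 0:
                position = "s"  # single-character word
            elif i == 0:
                position = "b"  # first character of a word
            elif i == last:
                position = "e"  # last character of a word
            else:
                position = "m"  # word-internal character
            char_tags.append((ch, pos + position))
    return char_tags


def char_tags_to_words(char_tags):
    """Recover (word, pos) pairs, showing that boundaries survive the encoding."""
    words, current = [], ""
    for ch, tag in char_tags:
        pos, position = tag[:-1], tag[-1]
        current += ch
        if position in ("e", "s"):  # a word ends here
            words.append((current, pos))
            current = ""
    return words
```

Because the mapping is deterministic and invertible, a parser trained on character-level tags yields word segmentation and part-of-speech tags as a by-product of parsing.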
Abstract:
The invention proposes using statistical methods for natural language understanding. The key notion is that there are "strings" of words in the natural language that correspond to a single semantic concept. One can then define an alignment between an entire semantic meaning (consisting of a set of semantic concepts) and the English text. This is modeled using P(E, A | S). One can model P(S) separately. This allows each parameter to be modeled using many different statistical models.
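One way to read the two models together is as a source-channel decomposition. The abstract does not state a decoding rule, so the following is an interpretation: it takes E to be the English text, A the alignment, and S the semantic meaning, and assumes the alignment is summed out.

```latex
\hat{S} \;=\; \arg\max_{S} P(S \mid E)
        \;=\; \arg\max_{S} P(S)\, P(E \mid S)
        \;=\; \arg\max_{S} P(S) \sum_{A} P(E, A \mid S)
```

The second step uses Bayes' rule and drops P(E), which does not depend on S. Under this reading, P(S) acts as a prior over meanings and P(E, A | S) as the model relating a meaning to the observed words.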
Abstract:
A system for translating a first word set in a source language into a second word set in a target language, the system comprising: input means for inputting the first word set into the system; tagging means for tagging the first word set input to the system so as to at least substantially reduce non-essential variability in the first word set; translation means including a single a posteriori conditional probability model and a target candidate store for storing target language candidate word sets, wherein the translation means employs the single model to evaluate the target language candidate word sets in order to select the target language candidate word set having a best score with respect to the first word set; and output means for outputting the best scoring target language candidate word set as the second word set in the target language.
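The selection step in this claim can be sketched in a few lines. Everything concrete here is hypothetical: the abstract does not say what the tagging does beyond reducing non-essential variability, so the sketch assumes it lowercases words and collapses numbers into a `NUM` class, and the `posterior` callable stands in for the single a posteriori conditional probability model.

```python
import re


def tag(words):
    """Hypothetical tagging step: lowercase words and map numbers to NUM."""
    return ["NUM" if re.fullmatch(r"\d+([.,]\d+)?", w) else w.lower() for w in words]


def select_best(source_words, candidates, posterior):
    """Return the stored target candidate with the highest posterior score.

    posterior(candidate, tagged_source) is assumed to approximate
    P(candidate | tagged_source) under one model.
    """
    tagged_source = tag(source_words)
    return max(candidates, key=lambda cand: posterior(cand, tagged_source))
```

Scoring every stored candidate with one model avoids combining separate translation and language models, at the cost of requiring an explicit candidate store.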