摘要:
A method of statistical machine translation (SMT) is provided. The method comprises generating reordering knowledge based on the syntax of a source language (SL) and a number of alignment matrices that map sample SL sentences with sample target language (TL) sentences. The method further comprises receiving a SL word string and parsing the SL word string into a parse tree that represents the syntactic properties of the SL word string. The nodes on the parse tree are reordered based on the generated reordering knowledge in order to provide reordered word strings. The method further comprises translating a number of reordered word strings to create a number of TL word strings, and identifying a statistically preferred TL word string as a preferred translation of the SL word string.
摘要:
Concepts presented herein relate to extracting knowledge for a chatbot knowledge base from online discussion forms. Within a thread of on online discussion form, replies are selected based on structural features and content features therein. The replies can be ranked and used in a chatbot knowledge base.
摘要:
A method, computer readable medium and system are provided which retrieve hint sentences from a sentence database in response to a query. An input component receives the query having terms. A search engine expands the query by including synonyms of the terms to obtain expanded terms. The search engine then combines the expanded terms to form dependency triples from the expanded terms. From the formed dependency triples, dependency triples which are not found in a dependency triples database are discarded to obtain remaining dependency triples from the expanded terms. The search engine then searches the sentence database using the remaining dependency triples as search parameters.
摘要:
Named entity translation of a named entity in a source language is translated to a target language by combining a transliteration of the named entity with data mining in the target language.
摘要:
A sentence is accessed and at least one query is generated based on the sentence. At least one query can be compared to text within a collection of documents, for example using a web search engine. Collocation errors in the sentence can be detected and/or corrected based on the comparison of the at least one query and the text within the collection of documents.
摘要:
General information blocks of text are extracted from a document. A label is applied to each general information block and detailed information strings of text are extracted from at least one of the general information blocks based on the corresponding label of the at least one general information block.
摘要:
A computer-aided writing system offers assistance to a user writing in a non-native language, as the user needs help, without requiring the user to divert attention away from the entry task. The writing system provides a user interface (UI) that integrates writing assistance with in-line text entry. When the user is unsure of a word's spelling or whether the word is appropriate, the user may enter a corresponding native word directly in line with the ongoing sentence. An error tolerant spelling tool accepts the native word (even if it is misspelled or mistyped) and derives the most probable non-native word for the given context. The spelling tool consults a bilingual dictionary to determine possible non-native word translation candidates, a non-native language model (e.g., a trigram language model) to generate probabilities associated with the candidates given the current sentence or phrase context, and a translation model to generate probabilities of how likely a certain native word was intended given the non-native word candidates. From these probabilities, the spelling tool determines the most probable non-native word translation and substitutes the non-native word for the native input string. If the user likes the non-native word, the user may simply continue with the sentence. On the other hand, if the user is still unsure of the non-native word, the user can invoke a sentence recommendation tool that allows the user to see the non-native word in a bilingual sentence pair to learn how the word can be used.
摘要:
A method, computer readable medium and system are provided which retrieve confirming sentences from a sentence database in response to a query. A search engine retrieves confirming sentences from the sentence database in response to the query. IN retrieving the confirming sentences, the search engine defines indexing units based upon the query, with the indexing units including both lemma from the query and extended indexing units associated with the query. The search engine then retrieves a plurality of sentences from the sentence database using the defined indexing units as search parameters. A similarity between each of the plurality of retrieved sentences and the query is determined by the search engine, wherein each similarity is determined as a function of a linguistic weight of a term in the query. The search engine then ranks the plurality of retrieved sentences based upon the determined similarities.
摘要:
A file system for file sharing includes a web server that creates a temporary directory for each session between the web server and a client computer. When the client computer requests a file located in a file server, the web server creates a symbolic link to the file in the temporary directory and a web page including an URL to the symbolic link. The web server transmits the web page, and therefore the file, to the client computer. Client computer can also upload files to the web server. The web server saves the uploaded files to the temporary directory and then moves the files to the file server. At the end of the session, the web server deletes the temporary directory. Thus, files are not saved on the web server and therefore not accessible to others from the Internet. In this file system, file and directory names are orderly incremented in the file server to simplify the back up process of the file server. Furthermore, previously backed up directories are checked for their current size to determine if they should be backed up again. By backing up previously backed up directories that are now smaller speeds up the restoration of file serve when necessary.
摘要:
An enclosure for housing circuitry of an electronic device including at least one enclosure part, and a shielding layer on the part. The enclosure part has an exterior surface and an opposing interior surface defining a wall therebetween, and is formed of a plastic material which is generally transparent. The shielding layer is provided to cover at least a portion of the interior surface of the enclosure part. The layer is formulated to provide EMI shielding for the circuitry housed within the enclosure, and as coated or otherwise applied to the interior surface is visually perceptible through the exterior surface of the enclosure part for enhancing the cosmetic appearance of the enclosure.