Abstract:
A translation device comprises a character recognition unit that recognizes text data in a text region of an input image; a translator that translates the text data in the text region; and a layout configuration processor that generates data containing the translated text data in the text region and graphics in the input image, wherein a layout of the input image is maintained in a layout of the image of the data generated by the layout configuration processor.
Abstract:
An image processing device has a reading unit, a graphics area extraction unit, a writing area extraction unit, a character string extraction unit, and an association unit. The reading unit reads a document. The graphics area extraction unit extracts a graphics area from the document read by the reading unit. The writing area extraction unit extracts a writing area from the document read by the reading unit. The character string extraction unit extracts a character string presented in the graphics area. The association unit associates information of the writing area with the graphics area based on the character string extracted by the character string extraction unit.
Abstract:
A question answering system includes a question sentence analyzing unit, a question keyword identifying unit, a passage acquiring unit, and an answer generating unit. The question sentence analyzing unit determines whether or not an input question sentence is an ambiguous question. The question keyword identifying unit extracts a question keyword from the input question sentence. The passage acquiring unit executes a search process to which the question keyword is applied. The answer generating unit generates answers in the form of a list of predicates extracted in correspondence with the question keyword, based on the passages acquired by the passage acquiring unit.
Abstract:
The present invention provides a translation memory system including: a memory which stores plural pairs of a natural language sentence written in a first language and an interlingua representation of the natural language sentence; an analysis unit which performs a syntactic and semantic analysis on a natural language sentence written in a second language and translates the natural language sentence into an interlingua representation on the basis of the analysis result; a search unit which searches the memory to identify an interlingua representation which corresponds to or has a predetermined level of similarity to the interlingua representation obtained by the analysis unit, and which extracts a natural language sentence written in the first language paired with the identified interlingua representation; and an output unit which outputs the natural language sentence extracted by the search unit as a translation result.
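The search step described above can be sketched as follows. This is only an illustration under stated assumptions, not the patented method: the interlingua is modeled as a set of hypothetical `(role, concept)` pairs, the "predetermined level of similarity" is approximated with Jaccard similarity and an arbitrary threshold of 0.6, and the analysis unit is replaced by a hand-written query.

```python
from typing import FrozenSet, List, Optional, Tuple

# Assumption: an interlingua representation is a set of (role, concept) pairs.
Interlingua = FrozenSet[Tuple[str, str]]


def jaccard(a: Interlingua, b: Interlingua) -> float:
    """Similarity of two representations: shared pairs divided by all pairs."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


class TranslationMemory:
    def __init__(self) -> None:
        # Stored pairs: (first-language sentence, its interlingua representation).
        self._pairs: List[Tuple[str, Interlingua]] = []

    def add(self, sentence: str, rep: Interlingua) -> None:
        self._pairs.append((sentence, rep))

    def search(self, query: Interlingua, threshold: float = 0.6) -> Optional[str]:
        """Return the stored sentence whose representation best matches the
        query, or None if no stored representation reaches the threshold."""
        best_sentence, best_score = None, 0.0
        for sentence, rep in self._pairs:
            score = jaccard(query, rep)
            if score > best_score:
                best_sentence, best_score = sentence, score
        return best_sentence if best_score >= threshold else None


memory = TranslationMemory()
memory.add("The printer is out of paper.",
           frozenset({("pred", "LACK"), ("agent", "PRINTER"), ("theme", "PAPER")}))
memory.add("Please restart the printer.",
           frozenset({("pred", "RESTART"), ("theme", "PRINTER"), ("mood", "REQUEST")}))

# Stand-in for the analysis unit's output on a second-language sentence.
query = frozenset({("pred", "LACK"), ("agent", "PRINTER"), ("theme", "PAPER")})
result = memory.search(query)
print(result)
```

Because matching happens on the language-neutral representation rather than on surface text, a second-language sentence can retrieve a first-language sentence directly, which is the point of pairing each stored sentence with its interlingua.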
Abstract:
The present invention provides a document processing device including: a specifying unit that specifies character strings which have a common property across documents, from among character strings included in plural documents which are represented by plural corresponding document data; and a rewriting unit that rewrites, among the character strings specified by the specifying unit, character strings expressed in formats different from a defined format to character strings expressed in the defined format.
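As a minimal sketch of the specifying and rewriting units, the example below assumes that the common property is "being a calendar date" and that the defined format is `YYYY-MM-DD`. Both choices, along with the regular expression and the function names, are illustrative assumptions and do not come from the abstract.

```python
import re
from typing import List

# Assumption: the shared property is "numeric year/month/day date",
# written with '/', '.' or '-' as the separator.
DATE = re.compile(r"\b(\d{4})[/.-](\d{1,2})[/.-](\d{1,2})\b")


def to_defined_format(match: "re.Match[str]") -> str:
    """Rewrite one matched date into the defined YYYY-MM-DD format."""
    year, month, day = match.groups()
    return f"{year}-{int(month):02d}-{int(day):02d}"


def rewrite_documents(documents: List[str]) -> List[str]:
    """Specify date strings in every document and normalize their format.
    Dates already in the defined format come out unchanged."""
    return [DATE.sub(to_defined_format, text) for text in documents]


docs = ["Filed on 2006/3/14.", "Published 2007.09.20; revised 2008-01-05."]
rewritten = rewrite_documents(docs)
print(rewritten)
```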
Abstract:
The invention provides an electronic device that has an identification unit that performs character recognition processing on image data representing a text written in a first language and identifies candidate character strings representing results of the character recognition processing for each structural unit of the text, a decision unit that decides whether a second language selected by a user is different from the first language, a presentation unit that, when the first language and the second language are different, presents translations of the candidate character strings in the second language for each structural unit for which plural candidate character strings are identified, and a selection unit that allows the user to select a single translation from the translations presented by the presentation unit.
Abstract:
In response to a retrieval request given as text in a first language, appropriate text in a second language is provided as the retrieval result. A first directory storing part stores a first directory structure created for a first language. A second directory storing part stores a second directory structure created for a second language. A directory relation storing part stores correspondences between directories in the first directory structure and directories in the second directory structure. A directory retrieval part receives a retrieval request in the first language from a user and decides which directory in the first directory structure has a high degree of relation with the request. A multilingual retrieval part selects, from among the documents belonging to the directory in the second directory structure that corresponds to the decided directory, the documents having a high degree of relation with the retrieval request.
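The two-stage retrieval can be sketched roughly as below. All data, the keyword-overlap scoring, and in particular the small bilingual `glossary` used to relate the request to second-language documents are hypothetical additions for illustration. The abstract does not say how the degree of relation is computed.

```python
from typing import Dict, List, Set

# Hypothetical first-language directories with characteristic keywords.
first_dirs: Dict[str, Set[str]] = {
    "en/printers": {"printer", "toner", "paper"},
    "en/cameras": {"camera", "lens", "shutter"},
}
# Hypothetical second-language directories and the documents they hold.
second_dirs: Dict[str, List[str]] = {
    "ja/printers": ["トナーの交換方法", "用紙の補給"],
    "ja/cameras": ["レンズの手入れ"],
}
# Directory relation: first-structure directory -> second-structure directory.
relation: Dict[str, str] = {
    "en/printers": "ja/printers",
    "en/cameras": "ja/cameras",
}
# Assumption: a tiny bilingual glossary stands in for the relation measure.
glossary: Dict[str, str] = {"toner": "トナー", "paper": "用紙", "lens": "レンズ"}


def retrieve_directory(request: str) -> str:
    """Pick the first-language directory sharing the most keywords."""
    words = set(request.lower().split())
    return max(first_dirs, key=lambda d: len(words & first_dirs[d]))


def retrieve_documents(request: str) -> List[str]:
    """Rank documents in the corresponding second-language directory."""
    target = relation[retrieve_directory(request)]
    terms = [glossary[w] for w in request.lower().split() if w in glossary]
    scored = [(sum(t in doc for t in terms), doc) for doc in second_dirs[target]]
    ranked = sorted(scored, key=lambda pair: -pair[0])
    return [doc for score, doc in ranked if score > 0]


hits = retrieve_documents("replace printer toner")
print(hits)
```

Narrowing the search to one corresponding directory first means the second-language ranking only has to discriminate among documents that are already on topic.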
Abstract:
The present invention provides a document processing device including: a general feature vector memory that stores feature vectors of a shape for each of plural characters; an input unit that optically reads in a document; an extracting unit that extracts feature vectors from the shapes of characters in a document read in by the input unit; a general shape recognition unit that estimates the character to which the extracted feature vectors correspond, based on the feature vectors extracted by the extracting unit and the content stored in the general feature vector memory; and a specific feature vector memory that stores the feature vectors extracted by the extracting unit in association with an estimation result of the general shape recognition unit.
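One way to picture the interaction between the two memories is the sketch below. It assumes nearest-neighbor matching by Euclidean distance and two-dimensional toy feature vectors. The abstract names neither a distance measure nor a feature definition, so the class and its data are illustrative only.

```python
import math
from typing import Dict, List, Sequence, Tuple

Vector = Sequence[float]


class ShapeRecognizer:
    def __init__(self, general_memory: Dict[str, Vector]) -> None:
        # General memory: one reference feature vector per known character.
        self.general_memory = general_memory
        # Specific memory: vectors seen in this document, with the estimate.
        self.specific_memory: List[Tuple[Tuple[float, ...], str]] = []

    def recognize(self, features: Vector) -> str:
        """Estimate the character whose reference vector is closest
        (assumed metric: Euclidean distance), then record the result."""
        estimate = min(
            self.general_memory,
            key=lambda ch: math.dist(features, self.general_memory[ch]),
        )
        self.specific_memory.append((tuple(features), estimate))
        return estimate


# Toy reference vectors; real features would be far higher-dimensional.
recognizer = ShapeRecognizer({"O": (1.0, 0.0), "I": (0.0, 1.0)})
estimate = recognizer.recognize((0.9, 0.2))
print(estimate, recognizer.specific_memory)
```

Keeping the document's own vectors alongside their estimates gives later steps a record of how this particular document draws each character, separate from the general reference shapes.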