Abstract:
In the learning apparatus, a memory stores a dictionary in an updatable manner, and an inputting part inputs data when an instruction is input by a user. An outputting part processes the data input through the inputting part by using the dictionary stored in the memory, and outputs the result of the processing. An identifier receiver obtains an identifier of the user or of a group to which the user belongs. An updating part updates the dictionary only when the identifier obtained by the identifier receiver is pre-registered in the memory.
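The gating rule described above can be illustrated with a minimal sketch. The class name `LearningDictionary`, its methods, and the sample identifiers are hypothetical and are not taken from the abstract; the only behavior modeled is that processing is open to everyone while updates are accepted only for pre-registered identifiers.

```python
class LearningDictionary:
    """Hypothetical model of a dictionary whose updates are gated by identifier."""

    def __init__(self, registered_ids):
        # Identifiers (of users or groups) that are pre-registered in memory.
        self.registered_ids = set(registered_ids)
        self.entries = {}

    def process(self, data):
        # Process input with the stored dictionary; unknown input passes through.
        return self.entries.get(data, data)

    def update(self, identifier, key, value):
        # Update the dictionary only when the identifier is pre-registered.
        if identifier not in self.registered_ids:
            return False
        self.entries[key] = value
        return True


dictionary = LearningDictionary({"alice", "group-a"})
accepted = dictionary.update("alice", "recieve", "receive")
rejected = dictionary.update("mallory", "recieve", "wrong")
```

In this sketch a rejected update leaves the dictionary untouched, so `dictionary.process("recieve")` still returns the value stored by the registered user.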
Abstract:
A translation device has a dictionary that stores a set of words and their corresponding meanings in plural languages; an input unit that inputs a document; a recognizing unit that recognizes text in the inputted document; an analyzing unit that divides the text recognized by the recognizing unit into words; a translating unit that translates each of the words obtained by the analyzing unit into a translated term by using the dictionary; and an output unit that outputs an output image containing the translated term for a key word.
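The analyze-then-translate steps might look roughly like the following sketch. It assumes the text has already been recognized, treats a "key word" as any word that has a dictionary entry for the target language, and returns word/term pairs instead of rendering an output image. The function names and the sample dictionary are illustrative assumptions.

```python
import re


def split_words(text):
    # Analyzing step (assumed): divide recognized text into lowercase words.
    return re.findall(r"[a-z']+", text.lower())


def translate_keywords(text, dictionary, target_lang):
    # Translating step: look up each word and keep those with a known meaning.
    pairs = []
    for word in split_words(text):
        meanings = dictionary.get(word, {})
        if target_lang in meanings:
            pairs.append((word, meanings[target_lang]))
    return pairs


# Hypothetical dictionary: word -> meanings in plural languages.
SAMPLE_DICTIONARY = {
    "dog": {"fr": "chien", "de": "Hund"},
    "cat": {"fr": "chat", "de": "Katze"},
}

pairs = translate_keywords("The dog chased the cat.", SAMPLE_DICTIONARY, "fr")
```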
Abstract:
The present invention provides a document processing device including: a general feature vector memory that stores feature vectors of a shape for each of plural characters; an input unit that optically reads in a document; an extracting unit that extracts feature vectors from the shapes of characters in a document read in by the input unit; a general shape recognition unit that estimates the character corresponding to each extracted shape, based on the feature vectors extracted by the extracting unit and the content stored in the general feature vector memory; and a specific feature vector memory that stores the feature vectors extracted by the extracting unit in association with an estimation result of the general shape recognition unit.
Abstract:
The present invention provides a document processing device including: a specifying unit that specifies character strings which have a common property across documents, from among character strings included in plural documents which are represented by plural corresponding document data; and a rewriting unit that rewrites, among the character strings specified by the specifying unit, character strings expressed in formats different from a defined format to character strings expressed in the defined format.
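As one possible illustration of the rewriting unit, the sketch below takes dates as the character strings with a common property and rewrites any date in a recognized but non-standard format into a single defined format. The choice of dates, the list of known formats, and the function name `rewrite_date` are assumptions made for the example; the abstract does not name a specific property or format.

```python
from datetime import datetime

# Assumed formats in which the same kind of string (a date) may be written.
KNOWN_FORMATS = ["%Y-%m-%d", "%d/%m/%Y", "%Y.%m.%d"]
# Assumed defined format that every date should end up in.
DEFINED_FORMAT = "%Y-%m-%d"


def rewrite_date(text):
    # Try each known format; on the first match, re-emit in the defined format.
    for fmt in KNOWN_FORMATS:
        try:
            return datetime.strptime(text, fmt).strftime(DEFINED_FORMAT)
        except ValueError:
            continue
    # Strings that are not recognized as dates are left unchanged.
    return text


documents = [["05/03/2007", "invoice"], ["2007.03.05", "2007-03-05"]]
rewritten = [[rewrite_date(s) for s in doc] for doc in documents]
```

Strings already in the defined format pass through unchanged, so the rewrite can be applied repeatedly without altering the result.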
Abstract:
A translation device comprises a character recognition unit that recognizes text data in a text region of an input image; a translator that translates the text data in the text region; and a layout configuration processor that generates data containing the translated text data in the text region and graphics in the input image, wherein a layout of the input image is maintained in a layout of the image of the data generated by the layout configuration processor.
Abstract:
An image processing device has a reading unit, a graphics area extraction unit, a writing area extraction unit, a character string extraction unit and an association unit. The reading unit reads a document. The graphics area extraction unit extracts a graphics area from the document read by the reading unit. The writing area extraction unit extracts a writing area from the document read by the reading unit. The character string extraction unit extracts a character string presented in the graphics area. The association unit associates information of the writing area with the graphics area based on the character string extracted by the character string extraction unit.
Abstract:
The present invention provides a translation memory system including: a memory which stores plural pairs of a natural language sentence written in a first language and an interlingua representation of the natural language sentence; an analysis unit which performs a syntactic and semantic analysis on a natural language sentence written in a second language and translates the natural language sentence into an interlingua representation on the basis of the analysis result; a search unit which searches the memory to identify an interlingua representation which corresponds to or has a predetermined level of similarity to the interlingua representation obtained by the analysis unit, and which extracts a natural language sentence written in the first language paired with the identified interlingua representation; and an output unit which outputs the natural language sentence extracted by the search unit as a translation result.
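The search step can be sketched as follows, under strong simplifying assumptions: an interlingua representation is modeled as a set of string tokens, similarity is measured with the Jaccard index, and the syntactic and semantic analysis is assumed to have run already, so the query arrives as a ready-made representation. None of these choices come from the abstract, and the names `jaccard`, `search_memory`, and `MEMORY` are hypothetical.

```python
def jaccard(a, b):
    # Similarity of two token sets: size of intersection over size of union.
    a, b = frozenset(a), frozenset(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def search_memory(memory, query_repr, threshold=0.6):
    # Return the first-language sentence whose paired representation is most
    # similar to the query, provided the similarity reaches the threshold.
    best_sentence, best_score = None, 0.0
    for sentence, stored_repr in memory:
        score = jaccard(stored_repr, query_repr)
        if score >= threshold and score > best_score:
            best_sentence, best_score = sentence, score
    return best_sentence


# Hypothetical memory: (first-language sentence, interlingua representation).
MEMORY = [
    ("The cat sleeps.", {"SLEEP", "agent:CAT"}),
    ("The dog eats meat.", {"EAT", "agent:DOG", "patient:MEAT"}),
]

exact = search_memory(MEMORY, {"EAT", "agent:DOG", "patient:MEAT"})
near_miss = search_memory(MEMORY, {"EAT", "agent:DOG", "patient:FISH"})
```

Here the exact match scores 1.0 and is returned, while the second query shares two of four distinct tokens (0.5) and falls below the default threshold, so no translation is produced.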
Abstract:
The invention provides an electronic device that has an identification unit that performs character recognition processing on image data representing a text written in a first language and identifies candidate character strings representing results of the character recognition processing for each structural unit of the text, a decision unit that decides whether a second language selected by a user is different from the first language, a presentation unit that presents translations of the candidate character strings in the second language for each structural unit for which plural candidate character strings are identified when the first language and the second language are different, and a selection unit that allows the user to select a single translation from the translations presented by the presentation unit.
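The decision and presentation logic could be sketched as below. The function `present_choices`, the callable `translate`, and the sample data are hypothetical; character recognition itself is assumed to have produced the candidate lists already.

```python
def present_choices(units, first_lang, second_lang, translate):
    # Decision step: translations are presented only when the languages differ.
    if first_lang == second_lang:
        return {}
    choices = {}
    for index, candidates in enumerate(units):
        # Presentation step: only units with plural candidates need a choice.
        if len(candidates) > 1:
            choices[index] = [translate(c) for c in candidates]
    return choices


# Hypothetical recognition output: one candidate list per structural unit.
UNITS = [["cat"], ["dog", "dig"]]
# Hypothetical English-to-French lookup standing in for a translator.
LOOKUP = {"cat": "chat", "dog": "chien", "dig": "creuser"}

choices = present_choices(UNITS, "en", "fr", LOOKUP.get)
```

The user would then select one translation per ambiguous unit, for example `choices[1][0]`.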