摘要:
A method and apparatus for performing a document information search to uncover specified text data containing a given search subject key word from a group of document text data stored in a memory. In the document information search method, two stages of presearch are carried out to perform the document search with respect to a desired subject key word. In a first stage of presearch, a character component table is generated in which the existence of character codes for every document is set forth with respect to all the character codes contained in the group of document text data of stored documents. The character component table is searched for all the character codes comprising a designated search subject key word to thereby extract all documents containing all the character codes comprising the search subject key word. Further, in the presearch step, all texts without the possibility of containing the search subject key word are eliminated. A comprehensive, narrowed text search is thereby performed in accordance with the search subject key word.
摘要:
A system which includes a CAD system and an electronic filing system, wherein when vector data such as figures and their names are converted to image data for storage, the system automatically extracts, from the vector data coming from the CAD system, data that correspond to predetermined character attributes, maximum numeric attributes, primitives of each figure and the topology thereof. The extracted data is set as key-words and made to correspond to the image data for registration and storage in the electronic filing system. In searching for desired image data, key-words corresponding to the desired image data is input for retrieval and output of the desired image data.
摘要:
In an image filing system including an image scanner having at least a color mode and a monochrome mode as image input mode, for inputting document images; a key device for designating the image input mode of the image scanner, an image file for storing document image data inputted from the image scanner and a display screen on which the document images are displayed, a background corresponding to the image input mode is outputted when the document image inputted for the image scanner is displayed on the display screen. If a monochrome document is inputted for processing in a full color or multicolor mode, a monochrome input image and the background which is specific to the full color or the multicolor mode are outputted. Accordingly, a user which looks at the display notices that he or she should change the input mode and inputs the document image again prior to registration of the inputted image in a file.
摘要:
Method for determining and correcting the amount of an skew of image which may be read by an image reader, preferably one with with an automatic paper feeding device. Information dependent on the angle of skew of image is obtained in a plurality of the directions with respect to image data and the amount of skew is determined based on the information obtained. Measurement of the information is performed in two or more stages (or steps). At the first stage, the measurement is performed within a narrow range of angles including a reference direction. If no skew angle is detected in the narrow range, then the measurement is performed again at the second stage in a wider range of angles. If a skew angle is detected in the first stage, the second stage is omitted. With the skew angle determined, the image data is rotated in accordance with the skew angle detected to cancel the skew. No correction of the image data is performed if skew angle detection does not result in either the first or the second stage. Preferably, correction of the image data is omitted if the detected skew angle falls within a very small range of angles.
摘要:
A method and apparatus for making document information search and a magnetic disk unit to be used for realizing the method and apparatus. In the document information search method, in performing document search with respect to a desired subject key word, two stages of presearch are carried out. In a first stage of presearch (step 402), a character component table (500) in which existence of character codes for every document is stated with respect to all the character codes contained in the group of document text data of stored documents is generated, and the character component table is searched for all the character codes constituting a desiredly designated search subject key word to thereby extract all the documents each containing all the character codes constituting the search subject key word. In a second stage of presearch step 403), contracted text data for every document in which adjuncts and duplication of repeatedly stated words contained in advance in the text data are eliminated is generated, and the documents each containing the search subject key words by word are extracted from the documents extracted by the first presearch. After the second stage of presearch, text search is performed in accordance with a neighbor condition, a contextual condition, or the like (step 404). Further, as a term comparator means, hardware (1106) for exclusive use for term comparison in accordance with a finite automation is employed. Further, as for different notation and synonym, inputted terms are once subject to different notation development in a different notation development processing portion (2601), each of the different-notation developed terms is subject to synonym development in a synonym development processing portion (2602) while referring to a synonym dictionary, and then the results of synonym development are further subject to different notation development in a different notation development processing portion (2603) in accordance with a conversion rule table (2603).
摘要:
A color document image processing apparatus comprising an image input means for inputting document image data including multivalue color image, a binarizing means for binarizing input document image data by a simple binarization or artificial binary-halftone process, an image memory means for temporarily storing image data binarized by said binarizing means, a codec means for executing predetermined coding for storing image data stored in the image memory means and executing decoding to the stored document image data, an image storing means for storing document image data encoded by the codec means, a binary-halftone transducing means for transducing binary image data decoded by the codec means into multivalue image data, and an image output means for outputting multivalue image data transduced by the binary-halftone transducing means.
摘要:
An image processing system wherein for an inputted composite image composed of a line image and a dither image, both a line image processing and a dither image processing are carried out in parallel, and one of the processed results is selected in accordance with the image region discrimination result. The dither image processing is carried out through data conversion for calculating multivalued gray scale image from the inputted image data, grey scale data conversion for adjusting the gray scale image data so as to match an output device and obtaining such adjusted gray scale image data, and re-binarization for re-binarizing the gray scale image data after subjected to the grey scale conversion. The image region discrimination for discriminating if an image region is of a line image or a dither image is carried out based on a ratio of the number of black or white pixels within the region to the contour line length within the range. An ordered dither image through a screened type dither matrix is discriminated in accordance with a corelation between adjacent pixel trains each having a predetermined number of pixels.
摘要:
An original is first illuminated with a light beam of white color or a wavelength adapted for reading of the original, and an optical image of the original is focused on a photosensitive drum previously charged uniformly, to form an electrostatic latent image on the photosensitive drum. Thereafter, the photosensitive drum is scanned with a laser beam, and a discharge current attendant on the scanning is detected and converted into an electric signal representative of the optical image. Thus, a monochromatic laser can read graphic records of any color.
摘要:
Disclosed herein is a system integrally comprising a CAD system and an electronic filing system. When vector data such as figures and their names are converted to image (raster) data for storage, the system automatically extracts, from the vector data coming from the CAD system, those data that correspond to predetermined character attributes, maximum numeric attributes, primitives of each figure and the topology thereof. The data are set as key-words and are made to correspond to the image data for registration and storage into the electronic filing system. In searching for desired image data, the corresponding key-words are input for data retrieval and output.
摘要:
A drawing data obtained by a CAD device is extended once to an image data on a plane and the extended image data is coded and stored in an electronic file device so that every display can be done by merely reading data stored in the electronic file device, decoding and displaying it. The image data having a plurality of attributes is stored attribute by attribute so that the electronic file device can display only image having attribute or attributes instructed and it is possible to obtain only required image information among various informations contained in a complicated image and thus searching is facilitated.