Abstract:
An improved method and apparatus for correcting for splay is provided. A document distorted by the curvature of a page of text away from a platen is converted to a digital image. The digital image is then manipulated to remove the distortion by fitting the lines of text in an unsplayed portion to a skew line, which represents the deviation of lines of text in the digital image from horizontal. Then the splay is determined for each line of text. Once the skew and the splay are determined, an inverse transformation is applied to straighten the lines of text. A horizontal stretching is also applied to the text to correct for the projection angle of the original document.
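The skew-fitting step can be illustrated with a minimal sketch. It assumes the text baselines in the unsplayed portion have already been reduced to (x, y) sample points and uses an ordinary least-squares fit; the abstract does not say how the fit is done, so `fit_skew_line` and `deskew_point` are hypothetical helpers, and the splay and horizontal-stretch corrections are not covered.

```python
import math


def fit_skew_line(points):
    """Least-squares fit of y = slope * x + intercept through baseline points.

    Assumption: `points` are (x, y) samples taken along text baselines in the
    unsplayed portion of the page image.
    """
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    if sxx == 0:
        raise ValueError("points must not all share one x coordinate")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def deskew_point(x, y, slope):
    """Rotate (x, y) about the origin so a line with the given slope becomes horizontal."""
    angle = -math.atan(slope)
    c, s = math.cos(angle), math.sin(angle)
    return x * c - y * s, x * s + y * c
```

Applying `deskew_point` to every pixel coordinate is the inverse transformation for skew alone; a real implementation would also resample the image and handle the per-line splay.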
Abstract:
A facsimile transmission system with a first facsimile machine that includes at least a scanner for scanning documents inserted into a document feeder and transmission capabilities for sending a fax and with a second facsimile machine that includes at least reception capabilities for receiving the fax and a printer for printing a hard copy of the received fax, if necessary. The facsimile system may include functionality for securing the facsimile transmission. The facsimile system may include functionality to enable the facsimile transmission to be certified.
Abstract:
The apparatus for the recognition of speech comprises an acoustic preprocessor, a visual preprocessor, and a speech classifier that operates on the acoustic and visual preprocessed data. The acoustic preprocessor comprises a log mel spectrum analyzer that produces an equal mel bandwidth log power spectrum. The visual preprocessor detects the motion of a set of fiducial markers on the speaker's face and extracts a set of normalized distance vectors describing lip and mouth movement. The speech classifier uses a multilevel time-delay neural network operating on the preprocessed acoustic and visual data to form an output probability distribution that indicates the probability of each candidate utterance having been spoken, based on the acoustic and visual data.
Abstract:
Multiframe reconstruction combines a set of acquired images into a reconstructed image. Here, which images to acquire are selected based at least in part on the content of previously acquired images. In one approach, a set of at least three images of an object are acquired at different acquisition settings. For at least one of the images in the set, the acquisition setting for the image is determined based at least in part on the content of previously acquired images. Multiframe image reconstruction, preferably via a multi-focal display, is applied to the set of acquired images to synthesize a reconstructed image of the object.
Abstract:
An internet target marketing system, method, and computer program for distributing online advertising to viewers based upon the viewers' interests is provided. The system, method, and computer program may involve identifying one or more document-related concepts derived from analysis of the content of a web document capable of being displayed to a user, identifying one or more advertisement-related concepts relevant to an advertisement, comparing the one or more document-related concepts to the one or more advertisement-related concepts to determine a relevance, and selecting the advertisement based on the relevance.
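The compare-and-select step can be sketched as follows. The abstract does not define how relevance is computed, so this example assumes concepts are plain strings and substitutes Jaccard overlap as the relevance score; `relevance` and `select_ad` are hypothetical names.

```python
def relevance(doc_concepts, ad_concepts):
    """Jaccard overlap between two concept sets (an assumed relevance measure)."""
    doc, ad = set(doc_concepts), set(ad_concepts)
    union = doc | ad
    return len(doc & ad) / len(union) if union else 0.0


def select_ad(doc_concepts, ads):
    """Return the name of the advertisement whose concepts best match the document.

    `ads` maps an advertisement name to its concepts; returns None if `ads` is empty.
    """
    return max(
        ads,
        key=lambda name: relevance(doc_concepts, ads[name]),
        default=None,
    )
```

A weighted score, for example one that favors concepts appearing more often in the document, would fit the same interface.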
Abstract:
An automatic reading assistance application for documents available in electronic form. An automatic annotator is provided which finds concepts of interest and keywords. The operation of the annotator is personalizable for a particular user. The annotator is also capable of improving its performance over time through both automatic and manual feedback. The annotator is usable with any electronic document. Another available feature is a thumbnail image of all or part of a multi-page document wherein a currently displayed section of the document is highlighted in the thumbnail image. Movement of the highlighted area in the thumbnail image is then synchronized with scrolling through the document.
Abstract:
According to the present invention, an internet target marketing system, method and computer program for distributing online advertising to viewers based upon the viewers' interests is provided. Specific embodiments according to the present invention can use an n-way matching of a user's concepts of interest, an advertiser's concepts and a currently viewed document to target advertising to the viewer of the current document. Some embodiments can generate a contextually sensitive advertisement for each page viewed in a browser, thereby associating an advertisement with every page in a document. Specific embodiments can associate advertising with documents that are substantially free of embedded advertisements, for example. Alternative embodiments can include embedded advertising, however.
Abstract:
A facial feature extraction method and apparatus uses the variation in light intensity (gray-scale) of a frontal view of a speaker's face. The sequence of video images is sampled and quantized into a regular array of 150×150 pixels that naturally form a coordinate system of scan lines and pixel position along a scan line. Left and right eye areas and a mouth are located by thresholding the pixel gray-scale and finding the centroids of the three areas. The line segment joining the eye area centroids is bisected at a right angle to form an axis of symmetry. A straight line through the centroid of the mouth area that is at a right angle to the axis of symmetry constitutes the mouth line. Pixels along the mouth line and the axis of symmetry in the vicinity of the mouth area form a horizontal and vertical gray-scale profile, respectively. The profiles could be used as feature vectors, but it is more efficient to select peaks and valleys (maxima and minima) of the profile that correspond to the important physiological speech features such as lower and upper lip, mouth corner, and mouth area positions and pixel values and their time derivatives as visual vector components. Time derivatives are estimated by pixel position and value changes between video image frames. A speech recognition system uses the visual feature vector in combination with a concomitant acoustic vector as inputs to a time-delay neural network.
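The centroid and axis-of-symmetry geometry can be sketched as below. The sketch assumes the image is a list of rows of gray-scale values, that the features of interest are darker than the threshold, and that a search region for each feature is already known; none of these details come from the abstract, and both function names are hypothetical.

```python
import math


def centroid_below_threshold(image, threshold, region):
    """Centroid (x, y) of pixels darker than `threshold` inside `region`.

    `region` is (x0, y0, x1, y1), half-open; `image[y][x]` is a gray-scale value.
    """
    x0, y0, x1, y1 = region
    sum_x = sum_y = count = 0
    for y in range(y0, y1):
        for x in range(x0, x1):
            if image[y][x] < threshold:
                sum_x += x
                sum_y += y
                count += 1
    if count == 0:
        raise ValueError("no pixel below threshold in region")
    return sum_x / count, sum_y / count


def face_axes(left_eye, right_eye):
    """Return (midpoint, axis_direction, mouth_line_direction) as unit vectors.

    The axis of symmetry is the perpendicular bisector of the segment joining
    the eye centroids; the mouth line is perpendicular to that axis, so it is
    parallel to the eye segment and passes through the mouth centroid.
    """
    dx, dy = right_eye[0] - left_eye[0], right_eye[1] - left_eye[1]
    length = math.hypot(dx, dy)
    if length == 0:
        raise ValueError("eye centroids coincide")
    eye_dir = (dx / length, dy / length)
    midpoint = ((left_eye[0] + right_eye[0]) / 2, (left_eye[1] + right_eye[1]) / 2)
    axis_dir = (-eye_dir[1], eye_dir[0])
    return midpoint, axis_dir, eye_dir
```

Sampling pixel values along `eye_dir` through the mouth centroid, and along `axis_dir` near the mouth, would give the horizontal and vertical profiles the abstract describes.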
Abstract:
The invention provides an improved method and apparatus for compression of palettized images. Input symbols in an M-ary alphabet are binarized based on a context model of the input data, where the binarization is selected to provide good compression by a binary encoder. The particular binarization is determined from a reindexing table which maps each input symbol to a number of binary values. The mapping is determined from the images to be compressed, and is typically transmitted with the compressed images as overhead. The mapping is a local minimum of the bitwise entropy of the binarization. With or without reindexing the input, the symbols can be compressed in parallel, with the bits of the input symbols buffered and reordered as necessary to ensure that the bits needed for the context of a bit are available before the decompressor decodes that bit. The decompressor includes a means for performing the opposite reordering such that the output of the decompressor is the same as the input to the compressor.
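To make the "bitwise entropy of the binarization" concrete, here is a minimal sketch that scores a reindexing table. It simplifies heavily: each bit position is treated independently with a zero-order entropy estimate, whereas the abstract describes a context model. The names `binary_entropy` and `bitwise_entropy` and the dict-based table are illustrative assumptions.

```python
import math
from collections import Counter


def binary_entropy(p):
    """Entropy in bits of a binary source that emits 1 with probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def bitwise_entropy(symbols, table, bits):
    """Sum, over bit positions, of the entropy of that bit after reindexing.

    `table` maps each input symbol to an integer code of `bits` bits.
    Assumption: contexts are ignored, so this is only a zero-order estimate.
    """
    counts = Counter(symbols)
    total = len(symbols)
    if total == 0:
        return 0.0
    result = 0.0
    for b in range(bits):
        ones = sum(c for sym, c in counts.items() if (table[sym] >> b) & 1)
        result += binary_entropy(ones / total)
    return result
```

A local search could swap pairs of table entries and keep a swap whenever `bitwise_entropy` decreases, stopping when no swap helps; that would reach a local minimum in the sense the abstract mentions, though the actual search procedure is not given there.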