Method and apparatus for distortion correction of scanned images
    11.
    Type: Granted invention patent
    Status: Expired

    Publication No.: US5497236A

    Publication date: 1996-03-05

    Application No.: US82118

    Filing date: 1993-06-23

    Abstract: An improved method and apparatus for correcting splay is provided. A document distorted by the curvature of a page of text away from a platen is converted to a digital image. The digital image is then manipulated to remove the distortion by fitting the lines of text in an unsplayed portion to a skew line, which represents the deviation of lines of text in the digital image from horizontal. Then the splay is determined for each line of text. Once the skew and the splay are determined, an inverse transformation is applied to straighten the lines of text. A horizontal stretching is also applied to the text to correct for the projection angle of the original document.
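
The skew-fitting step can be illustrated with a minimal sketch. It assumes that baseline points of the unsplayed text lines have already been extracted as (x, y) pairs, uses an ordinary least-squares line fit, and approximates the inverse transformation by a vertical shear. The function names and sample points are hypothetical, and splay correction and horizontal stretching are not modelled; the abstract does not specify the fitting procedure.

```python
def fit_skew_line(points):
    """Least-squares fit of y = slope * x + intercept to (x, y) baseline points."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx  # assumes the points do not all share one x value
    return slope, mean_y - slope * mean_x


def remove_skew(points, slope):
    """Inverse shear: subtract the fitted tilt so the skew line becomes horizontal."""
    return [(x, y - slope * x) for x, y in points]


baseline = [(0, 10.0), (100, 12.0), (200, 14.0)]  # made-up sample points
slope, intercept = fit_skew_line(baseline)
straightened = remove_skew(baseline, slope)
```

A shear is used instead of a rotation because, for small skew angles, it keeps every pixel in its original column, which is simpler to apply to a raster image.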

    Neural network acoustic and visual speech recognition system
    13.
    Type: Granted invention patent
    Status: Expired

    Publication No.: US5586215A

    Publication date: 1996-12-17

    Application No.: US889619

    Filing date: 1992-05-26

    Abstract: The apparatus for the recognition of speech comprises an acoustic preprocessor, a visual preprocessor, and a speech classifier that operates on the acoustic and visual preprocessed data. The acoustic preprocessor comprises a log mel spectrum analyzer that produces an equal mel bandwidth log power spectrum. The visual preprocessor detects the motion of a set of fiducial markers on the speaker's face and extracts a set of normalized distance vectors describing lip and mouth movement. The speech classifier uses a multilevel time-delay neural network operating on the preprocessed acoustic and visual data to form an output probability distribution that indicates the probability of each candidate utterance having been spoken, based on the acoustic and visual data.
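
The idea of an "equal mel bandwidth log power spectrum" can be sketched as follows. The sketch assumes the widely used mel formula mel = 2595 log10(1 + f/700), which the abstract does not name, and rectangular (non-overlapping) bands; the function names and the power floor are illustrative choices, not taken from the patent.

```python
import math


def hz_to_mel(f):
    """Common mel-scale formula (an assumption; the patent may use another)."""
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def equal_mel_band_edges(f_low, f_high, n_bands):
    """Band edges in Hz that are equally spaced on the mel scale."""
    lo, hi = hz_to_mel(f_low), hz_to_mel(f_high)
    step = (hi - lo) / n_bands
    return [mel_to_hz(lo + i * step) for i in range(n_bands + 1)]


def log_band_power(power, freqs, edges, floor=1e-12):
    """Log of the summed power in each band; floor avoids log(0) for empty bands."""
    out = []
    for lo, hi in zip(edges, edges[1:]):
        total = sum(p for p, f in zip(power, freqs) if lo <= f < hi)
        out.append(math.log(max(total, floor)))
    return out
```

Because the mel scale is roughly logarithmic at high frequencies, bands of equal mel width become progressively wider in Hz.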

    Method, system and computer code for content based web advertising
    15.
    Type: Granted invention patent
    Status: In force

    Publication No.: US08620747B2

    Publication date: 2013-12-31

    Application No.: US11327087

    Filing date: 2006-01-05

    IPC classification: G06Q30/00

    Abstract: An internet target marketing system, method, and computer program for distributing online advertising to viewers based upon the viewers' interests is provided. The system, method, and computer program may involve identifying one or more document-related concepts derived from analysis of the content of a web document capable of being displayed to the user, identifying one or more advertisement-related concepts relevant to an advertisement, comparing the one or more document-related concepts to the one or more advertisement-related concepts to determine a relevance, and selecting the advertisement based on the relevance.
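
The compare-and-select step can be sketched under a simple assumption: concepts are plain strings and relevance is the Jaccard overlap of the two concept sets. The abstract does not specify the relevance measure, so the scoring function, the names, and the sample data are all hypothetical.

```python
def relevance(doc_concepts, ad_concepts):
    """Jaccard overlap of two concept sets (an assumed relevance measure)."""
    doc, ad = set(doc_concepts), set(ad_concepts)
    union = doc | ad
    return len(doc & ad) / len(union) if union else 0.0


def select_ad(doc_concepts, ads):
    """Return (ad_id, score) for the most relevant ad, or (None, 0.0) if none match.

    ads maps an ad identifier to that ad's concepts.
    """
    best_id, best_score = None, 0.0
    for ad_id, concepts in ads.items():
        score = relevance(doc_concepts, concepts)
        if score > best_score:  # ties keep the earlier ad
            best_id, best_score = ad_id, score
    return best_id, best_score


page = {"hiking", "boots", "trail"}  # made-up document concepts
ads = {"boots_ad": {"boots", "hiking"}, "car_ad": {"cars", "loans"}}
chosen = select_ad(page, ads)
```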

    Techniques for annotating portions of a document relevant to concepts of interest
    16.
    Type: Granted invention patent
    Status: In force

    Publication No.: US07395501B2

    Publication date: 2008-07-01

    Application No.: US10214380

    Filing date: 2002-08-06

    IPC classification: G06F17/00

    Abstract: An automatic reading assistance application for documents available in electronic form. An automatic annotator is provided which finds concepts of interest and keywords. The operation of the annotator is personalizable for a particular user. The annotator is also capable of improving its performance over time through both automatic and manual feedback. The annotator is usable with any electronic document. Another available feature is a thumbnail image of all or part of a multi-page document wherein a currently displayed section of the document is highlighted in the thumbnail image. Movement of the highlighted area in the thumbnail image is then synchronized with scrolling through the document.
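
The synchronization between scrolling and the thumbnail highlight can be sketched as a linear mapping between document coordinates and thumbnail coordinates. This assumes a single vertically scrolled document and a thumbnail showing the whole document; the function and parameter names are hypothetical and do not come from the patent.

```python
def thumbnail_highlight(scroll_top, viewport_height, doc_height, thumb_height):
    """Top offset and height of the highlighted band, in thumbnail pixels."""
    scale = thumb_height / doc_height
    max_scroll = max(0.0, doc_height - viewport_height)
    top = min(max(scroll_top, 0.0), max_scroll) * scale
    height = min(viewport_height, doc_height) * scale
    return top, height


def scroll_for_highlight(highlight_top, viewport_height, doc_height, thumb_height):
    """Inverse mapping: scroll offset that puts the highlight at highlight_top."""
    max_scroll = max(0.0, doc_height - viewport_height)
    scroll = highlight_top * doc_height / thumb_height
    return min(max(scroll, 0.0), max_scroll)
```

Calling the first function on every scroll event and the second whenever the user drags the highlight keeps the two views consistent in both directions.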

    Content based web advertising
    17.
    Type: Granted invention patent
    Status: In force

    Publication No.: US06804659B1

    Publication date: 2004-10-12

    Application No.: US09483092

    Filing date: 2000-01-14

    IPC classification: G06F17/60

    Abstract: According to the present invention, an internet target marketing system, method and computer program for distributing online advertising to viewers based upon the viewers' interests is provided. Specific embodiments according to the present invention can use an n-way matching of the user's concepts of interest, the advertiser's concepts and a currently viewed document to target advertising to the viewer of the current document. Some embodiments can generate a contextually sensitive advertisement for each page viewed in a browser, thereby associating an advertisement with every page in a document. For example, specific embodiments can associate advertising with documents that are substantially free of embedded advertisements. Alternative embodiments, however, can include embedded advertising.

    System to facilitate reading a document
    18.
    Type: Granted invention patent
    Status: In force

    Publication No.: US06457026B1

    Publication date: 2002-09-24

    Application No.: US09661184

    Filing date: 2000-09-13

    IPC classification: G06F17/21

    Abstract: An automatic reading assistance application for documents available in electronic form. An automatic annotator is provided which finds concepts of interest and keywords. The operation of the annotator is personalizable for a particular user. The annotator is also capable of improving its performance over time through both automatic and manual feedback. The annotator is usable with any electronic document. Another available feature is a thumbnail image of all or part of a multi-page document wherein a currently displayed section of the document is highlighted in the thumbnail image. Movement of the highlighted area in the thumbnail image is then synchronized with scrolling through the document.

    Facial feature extraction method and apparatus for a neural network acoustic and visual speech recognition system
    19.
    Type: Granted invention patent
    Status: Expired

    Publication No.: US5680481A

    Publication date: 1997-10-21

    Application No.: US488840

    Filing date: 1995-06-09

    Abstract: A facial feature extraction method and apparatus uses the variation in light intensity (gray-scale) of a frontal view of a speaker's face. The sequence of video images is sampled and quantized into a regular array of 150×150 pixels that naturally form a coordinate system of scan lines and pixel position along a scan line. Left and right eye areas and a mouth are located by thresholding the pixel gray-scale and finding the centroids of the three areas. The line segment joining the eye area centroids is bisected at a right angle to form an axis of symmetry. A straight line through the centroid of the mouth area that is at a right angle to the axis of symmetry constitutes the mouth line. Pixels along the mouth line and the axis of symmetry in the vicinity of the mouth area form a horizontal and vertical gray-scale profile, respectively. The profiles could be used as feature vectors, but it is more efficient to select the peaks and valleys (maxima and minima) of the profiles that correspond to important physiological speech features, such as the lower and upper lip, mouth corner, and mouth area, and to use their positions, pixel values, and time derivatives as visual vector components. Time derivatives are estimated from pixel position and value changes between video image frames. A speech recognition system uses the visual feature vector in combination with a concomitant acoustic vector as inputs to a time-delay neural network.
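
The centroid and symmetry-axis steps can be sketched in a few lines. The sketch assumes the image is a list of rows of gray values, that the feature areas are darker than the threshold (the abstract does not say which side of the threshold is kept), and that each area has already been cropped into its own sub-image. All names are hypothetical.

```python
def dark_centroid(image, threshold):
    """Centroid (row, col) of pixels darker than threshold, or None if there are none."""
    row_sum = col_sum = count = 0
    for r, line in enumerate(image):
        for c, value in enumerate(line):
            if value < threshold:
                row_sum += r
                col_sum += c
                count += 1
    return (row_sum / count, col_sum / count) if count else None


def perpendicular_bisector(p, q):
    """Midpoint of segment pq and a direction vector at right angles to it."""
    mid = ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)
    d_row, d_col = q[0] - p[0], q[1] - p[1]
    return mid, (-d_col, d_row)
```

Given the two eye centroids, `perpendicular_bisector` yields a point and direction for the axis of symmetry; the mouth line is then the line through the mouth centroid along the eye-to-eye direction, which is at a right angle to that axis.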

    Compression of palettized images and binarization for bitwise coding of M-ary alphabets therefor
    20.
    Type: Granted invention patent
    Status: Expired

    Publication No.: US5471207A

    Publication date: 1995-11-28

    Application No.: US200233

    Filing date: 1994-02-23

    Abstract: The invention provides an improved method and apparatus for compression of palettized images. Input symbols in an M-ary alphabet are binarized based on a context model of the input data, where the binarization is selected to provide good compression by a binary encoder. The particular binarization is determined from a reindexing table which maps each input symbol to a number of binary values. The mapping is determined from the images to be compressed, and is typically transmitted with the compressed images as overhead. The mapping is a local minimum of the bitwise entropy of the binarization. With or without reindexing the input, the symbols can be compressed in parallel, with the bits of the input symbols buffered and reordered as necessary to ensure that bits needed for context of a bit being decoded are available before the decompressor decodes the bit being decoded. The decompressor includes a means for performing the opposite reordering such that the output of the decompressor is the same as the input to the compressor.
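
The effect of a reindexing table on bitwise entropy can be illustrated with a simplified sketch. It treats each bit position independently (zero-order entropy) and ignores the context model the abstract mentions, so it is only an approximation of the quantity being minimized. The function names, the symbol counts, and both tables are made up for illustration.

```python
import math


def binary_entropy(p):
    """Entropy in bits of a binary source that emits 1 with probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


def bitwise_entropy(counts, reindex, n_bits):
    """Sum over bit positions of the zero-order entropy of that bit.

    counts maps each palette symbol to its frequency; reindex maps each
    symbol to the integer whose bits form its binarization.
    """
    total = sum(counts.values())
    entropy = 0.0
    for bit in range(n_bits):
        ones = sum(c for s, c in counts.items() if (reindex[s] >> bit) & 1)
        entropy += binary_entropy(ones / total)
    return entropy


counts = {0: 4, 3: 4}                 # only two of four palette entries occur
identity = {0: 0, 1: 1, 2: 2, 3: 3}   # symbols 0 and 3 differ in both bits
reindexed = {0: 0, 3: 1, 1: 2, 2: 3}  # frequent symbols now differ in one bit
before = bitwise_entropy(counts, identity, 2)
after = bitwise_entropy(counts, reindexed, 2)
```

In this toy case the reindexed table makes the high bit constant, so a binary coder has one informative bit per symbol instead of two.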
