System and method for web content extraction
    2.
    Type: Granted invention patent
    Status: In force

    Publication No.: US08819028B2

    Publication date: 2014-08-26

    Application No.: US13258482

    Filing date: 2009-12-14

    IPC classification: G06F17/30, G06F3/12

    CPC classification: G06F17/30896, G06F3/1246

    Abstract: A method and system for extracting Web content is disclosed. In one embodiment, Web content in a Webpage is extracted by identifying paragraphs in the Web content based on line-break node determination. A range of text-body associated with the identified paragraphs is then identified using a maximum scoring subsequence. Further, the identified text-body is refined using a heuristic rule of substantially horizontal alignment. Furthermore, one or more titles and one or more images associated with the Web content are extracted. Moreover, the Web content including the identified paragraphs, the one or more titles and the one or more images are outputted.
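
The abstract above does not say how paragraphs are scored, so the following is only a minimal sketch of a maximum scoring subsequence search. It assumes each identified paragraph already carries a hypothetical numeric score (positive for text-like paragraphs, negative for navigation or link-heavy ones). The function name and the sample scores are illustrative and are not taken from the patent.

```python
from typing import List, Tuple


def max_scoring_subsequence(scores: List[float]) -> Tuple[int, int, float]:
    """Return (start, end, total) of the contiguous run with the highest sum.

    `end` is exclusive. Returns (0, 0, 0.0) when no run has a positive sum.
    """
    best_start, best_end, best_total = 0, 0, 0.0
    run_start, run_total = 0, 0.0
    for i, score in enumerate(scores):
        # A run whose sum has dropped to zero or below cannot help any
        # later run, so restart the candidate run at this paragraph.
        if run_total <= 0:
            run_start, run_total = i, 0.0
        run_total += score
        if run_total > best_total:
            best_start, best_end, best_total = run_start, i + 1, run_total
    return best_start, best_end, best_total


# Hypothetical per-paragraph scores, in document order.
paragraph_scores = [-2.0, -1.0, 3.0, 4.0, -1.0, 5.0, -3.0, -2.0]
start, end, total = max_scoring_subsequence(paragraph_scores)
print(f"text body spans paragraphs [{start}, {end}) with score {total}")
```

A single linear pass is enough here because the text body is assumed to be one contiguous range of paragraphs; a low-scoring paragraph inside the range is tolerated as long as the surrounding paragraphs outweigh it.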

    KEYBOARD DEVICE WITH LUMINOUS KEY
    3.
    Type: Invention application
    Status: Under examination (published)

    Publication No.: US20130161170A1

    Publication date: 2013-06-27

    Application No.: US13425158

    Filing date: 2012-03-20

    IPC classification: H01H13/76

    Abstract: A keyboard device includes at least one luminous key, at least one light-emitting element, a membrane switch circuit member, an opaque seal structure, and a transparent seal structure. The luminous key has a light-transmissible zone. The light-emitting element is electrically connected with the membrane switch circuit member, and disposed under the light-transmissible zone. A top surface of the light-emitting element is encapsulated by the transparent seal structure. The transparent seal structure is partially surrounded by the opaque seal structure. Consequently, the light beam from the light-emitting element is transmissible through the transparent seal structure, and directed to the light-transmissible zone.

    Digital camera with panoramic image capture
    5.
    Type: Granted invention patent
    Status: In force

    Publication No.: US07746404B2

    Publication date: 2010-06-29

    Application No.: US10705188

    Filing date: 2003-11-10

    IPC classification: H04N5/225

    Abstract: A method for generating a panoramic image that enables a user to obtain panoramic photographs with a digital camera without the aid of a computer system or specialized lenses. A digital camera according to the present techniques captures a series of image frames as a user pans the digital camera through a panoramic image scene and combines the captured image frames while the image frames are being captured.

    Method for determining logical components of a document
    7.
    Type: Granted invention patent
    Status: In force

    Publication No.: US07386789B2

    Publication date: 2008-06-10

    Application No.: US10787971

    Filing date: 2004-02-27

    Applicants: Hui Chao, Lei He, Jian Fan

    Inventors: Hui Chao, Lei He, Jian Fan

    IPC classification: G06K15/02

    CPC classification: G06F17/24, Y10S707/99933

    Abstract: A method for determining logical components of a portable document format (PDF) document is disclosed. The method includes separating the document into a plurality of layers. A PDF document is created for each of the plurality of layers. The method also includes determining a logical structure for each layer. The logical structures of the plurality of layers are combined to determine the logical components of the PDF document.

    System and method for enhancing document images
    8.
    Type: Granted invention patent
    Status: Lapsed

    Publication No.: US06771838B1

    Publication date: 2004-08-03

    Application No.: US09684312

    Filing date: 2000-10-06

    Applicant: Jian Fan

    Inventor: Jian Fan

    IPC classification: G06K9/40

    Abstract: A system and method for enhancing digital images containing both text and pictorial content (“mixed document images”) utilizes an estimated illumination surface for a given digital image to correct the undesirable effect of non-uniform illumination. The estimated illumination surface is based on the luminance values of the edge pixels of the given image that are on the dark side of text edges. In an alternative embodiment, the luminance values of the edge pixels that are on the lighter side of the detected text edges are used to generate the estimated illumination surface. The estimated illumination surface is applied to the digital image to compensate for illumination variations in the image due to non-uniform illumination. In addition to non-uniform illumination correction, the system and method enhances the mixed document images by sharpening and/or darkening edges of text contained in the images.

    System and method of foreground-background segmentation of digitized images
    9.
    Type: Granted invention patent
    Status: In force

    Publication No.: US08792711B2

    Publication date: 2014-07-29

    Application No.: US13511806

    Filing date: 2009-12-02

    Applicant: Jian Fan

    Inventor: Jian Fan

    IPC classification: G06K9/00, G06K9/34

    Abstract: A system and method for segmenting foreground and background regions on a digitized image uses a computer, having a processor and system memory, to segment the image into initial regions and identify background regions from the initial regions. A complete background surface is estimated of the image, and pixels of the image are rectified with the estimated background surface to normalize the image. Normalized pixels are compared with a threshold color to determine a final segmentation of background regions.
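
The last two steps of the abstract (rectifying pixels with an estimated background surface, then thresholding) can be illustrated with a small sketch. It makes several assumptions that the abstract does not confirm: the image is grayscale rather than color, the background surface has already been estimated and is given as input, rectification is done by per-pixel division, and the threshold is a single intensity value. All function names and numbers are hypothetical.

```python
from typing import List

Image = List[List[float]]


def normalize(image: Image, background: Image, white: float = 255.0) -> Image:
    """Divide each pixel by the estimated background at that position.

    A pixel equal to its local background maps to `white`; darker pixels
    map proportionally lower. Results are clipped to `white`.
    """
    return [
        [min(white, white * p / max(b, 1e-6)) for p, b in zip(row, bg_row)]
        for row, bg_row in zip(image, background)
    ]


def background_mask(normalized: Image, threshold: float = 200.0) -> List[List[bool]]:
    """Mark a pixel as background when its normalized value reaches the threshold."""
    return [[p >= threshold for p in row] for row in normalized]


# Toy 2x2 image: the bottom row is unevenly lit (darker background).
image = [[200, 100], [50, 20]]
background = [[200, 200], [50, 50]]  # assumed, pre-estimated surface

normalized = normalize(image, background)
mask = background_mask(normalized)
print(normalized)
print(mask)
```

In this toy input the bottom-left pixel has a raw value of 50, which a fixed threshold on the raw image would treat as foreground; after division by the local background it normalizes to full white and is classified as background.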

    Extraction of Content from a Web Page
    10.
    Type: Invention application
    Status: Under examination (published)

    Publication No.: US20130283148A1

    Publication date: 2013-10-24

    Application No.: US13817656

    Filing date: 2010-10-26

    IPC classification: G06F17/22

    CPC classification: G06F17/2247, G06F16/986

    Abstract: A system and method are provided for extracting main content from a web page. Web page segmentation is performed on a web page to provide affinity-grouped segments. Descriptive features of at least one of the affinity-grouped segments are computed. At least one of the affinity-grouped segments is classified as a main body segment based on the computed descriptive features. Additional affinity-grouped segments are classified as to a document function based on the computed descriptive features. Classified affinity-grouped segments are assembled according to their classified document functions to provide the main content.
