PROVIDING A PARTICULAR TYPE OF UNIFORM RESOURCE LOCATOR
    1.
    发明申请
    PROVIDING A PARTICULAR TYPE OF UNIFORM RESOURCE LOCATOR 审中-公开
    提供特殊类型的统一资源定位器

    公开(公告)号:US20120246552A1

    公开(公告)日:2012-09-27

    申请号:US13052622

    申请日:2011-03-21

    IPC分类号: G06F17/00

    CPC分类号: G06F16/951

    摘要: Examples disclosed herein are example systems and methods to provide a particular type of uniform resource locator. In one example, a processor identifies webpage source code associated with a list of text associated with the type of uniform resource locator. The processor may identify a uniform resource locator within the identified webpage source code and provide the uniform resource locator.

    摘要翻译: 本文公开的示例是提供特定类型的统一资源定位符的示例系统和方法。 在一个示例中,处理器识别与与统一资源定位符的类型相关联的文本列表相关联的网页源代码。 处理器可以识别所识别的网页源代码内的统一资源定位符,并提供统一的资源定位符。

    Redigitization system and service
    2.
    发明授权
    Redigitization system and service 有权
    赎回制度和服务

    公开(公告)号:US09330323B2

    公开(公告)日:2016-05-03

    申请号:US14364743

    申请日:2012-04-29

    IPC分类号: G06K9/18 G06K9/03 G06K9/00

    CPC分类号: G06K9/18 G06K9/00442 G06K9/03

    摘要: A system and method to error correct extant electronic documents is disclosed. An electronic document may be rasterized to obtain a pixel representation of the electronic document (e.g., raster image). One or more optical character recognition (OCR) tasks may be performed on the raster image of the electronic document. Errors discovered by the OCR tasks may be corrected and a customized error corrected version of the electronic document may be created and stored. If the author of the electronic document is known, the raster image may be compared to a personalized tf*idf error dictionary associated with the author to determine known OCR errors specific to the author. The raster image may also be compared to a personalized electronic error dictionary associated with the author to determine known typographical errors specific to the author.

    摘要翻译: 公开了一种错误纠正现有电子文档的系统和方法。 电子文档可以被光栅化以获得电子文档的像素表示(例如,光栅图像)。 可以在电子文档的光栅图像上执行一个或多个光学字符识别(OCR)任务。 可能会纠正由OCR任务发现的错误,并且可以创建和存储电子文档的定制错误更正版本。 如果电子文档的作者是已知的,则光栅图像可以与与作者相关联的个性化tf * idf错误字典进行比较,以确定作者特有的已知OCR错误。 也可以将光栅图像与与作者相关联的个性化电子错误字典进行比较,以确定作者特有的已知印刷错误。

    Redigitization System and Service
    4.
    发明申请
    Redigitization System and Service 有权
    赎回制度和服务

    公开(公告)号:US20150049949A1

    公开(公告)日:2015-02-19

    申请号:US14364743

    申请日:2012-04-29

    IPC分类号: G06K9/18 G06K9/00

    CPC分类号: G06K9/18 G06K9/00442 G06K9/03

    摘要: A system and method to error correct extant electronic documents is disclosed. An electronic document may be rasterized to obtain a pixel representation of the electronic document (e.g., raster image). One or more optical character recognition (OCR) tasks may be performed on the raster image of the electronic document. Errors discovered by the OCR tasks may be corrected and a customized error corrected version of the electronic document may be created and stored. If the author of the electronic document is known, the raster image may be compared to a personalized tf*idf error dictionary associated with the author to determine known OCR errors specific to the author. The raster image may also be compared to a personalized electronic error dictionary associated with the author to determine known typographical errors specific to the author.

    摘要翻译: 公开了一种错误纠正现有电子文档的系统和方法。 电子文档可以被光栅化以获得电子文档的像素表示(例如,光栅图像)。 可以在电子文档的光栅图像上执行一个或多个光学字符识别(OCR)任务。 可能会纠正由OCR任务发现的错误,并且可以创建和存储电子文档的定制错误更正版本。 如果电子文档的作者是已知的,则光栅图像可以与与作者相关联的个性化tf * idf错误字典进行比较,以确定作者特有的已知OCR错误。 也可以将光栅图像与与作者相关联的个性化电子错误字典进行比较,以确定作者特有的已知印刷错误。

    Content grouping systems and methods
    5.
    发明授权
    Content grouping systems and methods 失效
    内容分组系统和方法

    公开(公告)号:US08577887B2

    公开(公告)日:2013-11-05

    申请号:US12639768

    申请日:2009-12-16

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30911

    摘要: A method of grouping a plurality of media content is provided. The method includes converting at least a portion of the media content into at least one document object model (“DOM”) using a processor. The DOM can include a plurality of block elements, each comprising at least one content object. The method includes apportioning the content objects into a relevant portion and an irrelevant portion and extracting a set of keywords, the set comprising at least one keyword, within the relevant portion of the content objects. The method includes apportioning the relevant portion of the content objects into a related portion and an unrelated portion using at least a portion of the set of keywords and grouping the related portion of the content to provide a group of related content.

    摘要翻译: 提供了一种分组多个媒体内容的方法。 该方法包括使用处理器将媒体内容的至少一部分转换成至少一个文档对象模型(“DOM”)。 DOM可以包括多个块元素,每个块元素包括至少一个内容对象。 该方法包括将内容对象分配到相关部分和不相关部分中,并且在内容对象的相关部分内提取一组关键字,该集合包括至少一个关键字。 该方法包括使用该组关键字的至少一部分将内容对象的相关部分分配到相关部分和不相关部分中,并且对内容的相关部分进行分组以提供一组相关内容。

    Systems and Methods for Adding Commercial Content to Printouts
    7.
    发明申请
    Systems and Methods for Adding Commercial Content to Printouts 审中-公开
    将商业内容添加到打印输出的系统和方法

    公开(公告)号:US20120150637A1

    公开(公告)日:2012-06-14

    申请号:US13391637

    申请日:2009-08-26

    IPC分类号: G06Q30/02

    摘要: In one embodiment, a system and method relate to detecting a print command received by a network browser of a client computer, the print command reflecting an interest to print content of a network page displayed in the network browser as a hard copy printout, analyzing the network page content to determine its underlying subject matter, identifying commercial content relevant to the underlying subject matter, and creating and formatting a document that includes the network page content and the identified commercial content.

    摘要翻译: 在一个实施例中,一种系统和方法涉及检测由客户端计算机的网络浏览器接收到的打印命令,该打印命令反映了将网络浏览器中显示的网页的内容打印出来的兴趣作为硬拷贝打印输出,分析 网页内容以确定其基本主题,识别与底层主题相关的商业内容,以及创建和格式化包括网页内容和所识别的商业内容的文档。

    CONTENT GROUPING SYSTEMS AND METHODS
    8.
    发明申请
    CONTENT GROUPING SYSTEMS AND METHODS 失效
    内容分组系统和方法

    公开(公告)号:US20110145249A1

    公开(公告)日:2011-06-16

    申请号:US12639768

    申请日:2009-12-16

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30911

    摘要: A method of grouping a plurality of media content is provided. The method includes converting at least a portion of the media content into at least one document object model (“DOM”) using a processor. The DOM can include a plurality of block elements, each comprising at least one content object. The method includes apportioning the content objects into a relevant portion and an irrelevant portion and extracting a set of keywords, the set comprising at least one keyword, within the relevant portion of the content objects. The method includes apportioning the relevant portion of the content objects into a related portion and an unrelated portion using at least a portion of the set of keywords and grouping the related portion of the content to provide a group of related content.

    摘要翻译: 提供了一种分组多个媒体内容的方法。 该方法包括使用处理器将媒体内容的至少一部分转换成至少一个文档对象模型(“DOM”)。 DOM可以包括多个块元素,每个块元素包括至少一个内容对象。 该方法包括将内容对象分配到相关部分和不相关部分中,并且在内容对象的相关部分内提取一组关键字,该集合包括至少一个关键字。 该方法包括使用该组关键字的至少一部分将内容对象的相关部分分配到相关部分和不相关部分中,并且对内容的相关部分进行分组以提供一组相关内容。

    SYSTEMS AND METHODS FOR ADDING COMMERCIAL CONTENT TO PRINTOUTS
    9.
    发明申请
    SYSTEMS AND METHODS FOR ADDING COMMERCIAL CONTENT TO PRINTOUTS 审中-公开
    将商业内容添加到打印机的系统和方法

    公开(公告)号:US20150138605A1

    公开(公告)日:2015-05-21

    申请号:US13821356

    申请日:2010-09-21

    IPC分类号: G06Q30/02 G06F3/12 G06K15/02

    摘要: Systems, devices and methods are provided which relate to detecting a print command on a client computer, the print command reflecting an interest to print content of an electronic document, accessible by a client computer, as a hard copy printout. One method includes analyzing the electronic document content to determine its underlying subject matter, identifying commercial content relevant to the underlying subject matter, and creating and formatting a new, printable document that includes the electronic document content and the identified commercial content.

    摘要翻译: 提供了与检测客户端计算机上的打印命令相关的系统,设备和方法,该打印命令反映了将由客户端计算机访问的电子文档的内容打印出来的兴趣,作为硬拷贝打印输出。 一种方法包括分析电子文档内容以确定其基本主题,识别与底层主题相关的商业内容,以及创建和格式化包括电子文档内容和所识别的商业内容的新的可打印文档。