Segmenting printed media pages into articles
    1.
    Granted invention patent
    Segmenting printed media pages into articles (status: in force)

    Publication No.: US08693779B1

    Publication date: 2014-04-08

    Application No.: US13612072

    Filing date: 2012-09-12

    IPC classification: G06K9/34 G06K9/46 G06K9/66

    CPC classification: G06K9/00463

    Abstract: Methods and systems for segmenting printed media pages into individual articles quickly and efficiently. A printed media based image that may include a variety of columns, headlines, images, and text is input into the system, which comprises a block segmenter and an article segmenter system. The block segmenter identifies and produces blocks of textual content from a printed media image, while the article segmenter system determines which blocks of textual content belong to one or more articles in the printed media image based on a classifier algorithm. A method for segmenting printed media pages into individual articles is also presented.
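
The two-stage design in the abstract (a block segmenter that emits text blocks, then an article segmenter that groups them using a classifier) might be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not specify the classifier, so `same_article` is a hypothetical geometric stand-in, and `Block`, `segment_articles`, and `max_gap` are names invented for this sketch. Grouping linked blocks with union-find is likewise a choice made here, not something the abstract describes.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Block:
    """A rectangular text block, as a block segmenter might emit it (assumed shape)."""
    x: int  # left edge, pixels
    y: int  # top edge, pixels
    w: int  # width, pixels
    h: int  # height, pixels
    text: str = ""


def _gap(lo1, hi1, lo2, hi2):
    """Distance between two 1-D intervals; 0 if they touch or overlap."""
    return max(0, max(lo1, lo2) - min(hi1, hi2))


def same_article(a, b, max_gap=20):
    """Hypothetical stand-in for the classifier: link two blocks when their
    bounding boxes are within max_gap pixels of each other on both axes."""
    dx = _gap(a.x, a.x + a.w, b.x, b.x + b.w)
    dy = _gap(a.y, a.y + a.h, b.y, b.y + b.h)
    return dx <= max_gap and dy <= max_gap


def segment_articles(blocks, classifier=same_article):
    """Group blocks into articles: any two blocks the classifier links end up
    in the same group (connected components via union-find)."""
    parent = list(range(len(blocks)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(blocks)), 2):
        if classifier(blocks[i], blocks[j]):
            parent[find(i)] = find(j)

    articles = {}
    for i, block in enumerate(blocks):
        articles.setdefault(find(i), []).append(block)
    return list(articles.values())


# Toy page: a two-column article on the left, a one-column article on the right.
page = [
    Block(0, 0, 200, 30, "Headline A"),
    Block(0, 40, 95, 300, "A, column 1"),
    Block(105, 40, 95, 300, "A, column 2"),
    Block(300, 0, 200, 30, "Headline B"),
    Block(300, 40, 200, 300, "B, column 1"),
]
articles = segment_articles(page)
print([len(a) for a in articles])  # [3, 2]
```

Passing the classifier as a parameter keeps the grouping step independent of how pairwise decisions are made, so a trained model could replace the geometric rule without touching `segment_articles`.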

    Segmenting Printed Media Pages Into Articles
    2.
    Invention patent application
    Segmenting Printed Media Pages Into Articles (status: in force)

    Publication No.: US20100040287A1

    Publication date: 2010-02-18

    Application No.: US12191120

    Filing date: 2008-08-13

    IPC classification: G06K9/34

    CPC classification: G06K9/00463

    Abstract: Methods and systems for segmenting printed media pages into individual articles quickly and efficiently. A printed media based image that may include a variety of columns, headlines, images, and text is input into the system, which comprises a block segmenter and an article segmenter system. The block segmenter identifies and produces blocks of textual content from a printed media image, while the article segmenter system determines which blocks of textual content belong to one or more articles in the printed media image based on a classifier algorithm. A method for segmenting printed media pages into individual articles is also presented.

    Segmenting printed media pages into articles
    3.
    Granted invention patent
    Segmenting printed media pages into articles (status: in force)

    Publication No.: US08290268B2

    Publication date: 2012-10-16

    Application No.: US12191120

    Filing date: 2008-08-13

    IPC classification: G06K9/34

    CPC classification: G06K9/00463

    Abstract: Methods and systems for segmenting printed media pages into individual articles quickly and efficiently. A printed media based image that may include a variety of columns, headlines, images, and text is input into the system, which comprises a block segmenter and an article segmenter system. The block segmenter identifies and produces blocks of textual content from a printed media image, while the article segmenter system determines which blocks of textual content belong to one or more articles in the printed media image based on a classifier algorithm. A method for segmenting printed media pages into individual articles is also presented.
