Digital conversion of imaged content
    2.
    发明授权
    Digital conversion of imaged content 有权
    成像内容的数字转换

    公开(公告)号:US09349202B1

    公开(公告)日:2016-05-24

    申请号:US13632864

    申请日:2012-10-01

    CPC classification number: G06T11/60 G06K9/00463 G06K9/481 G06K9/6828

    Abstract: A method of generating a reflowable content file from a physical text source is described. An image of the physical text source is segmented into a plurality of glyphs and a character and font is determined for each of the glyphs. The font for each of the plurality of glyphs is determined based on two or more of the glyphs.

    Abstract translation: 描述从物理文本源生成可回流内容文件的方法。 物理文本源的图像被分割成多个字形,并且为每个字形确定字符和字体。 基于两个或更多个字形来确定多个字形中的每一个的字体。

    Creating an electronic book using video-based input
    4.
    发明授权
    Creating an electronic book using video-based input 有权
    使用基于视频的输入创建电子书

    公开(公告)号:US09191554B1

    公开(公告)日:2015-11-17

    申请号:US13677096

    申请日:2012-11-14

    CPC classification number: H04N1/00198 G06F17/21 H04N5/14

    Abstract: Some implementations include using a trained classifier to identify page-turn events in a video. The video may be divided into multiple segments based on the page-turn events, with each segment of the multiple segments corresponding to a pair of adjacent pages in a book. Exemplar frames that provide non-redundant data compared to other frames may be chosen from each segment. The exemplar frames may be cropped to include content portions of pages. The exemplar frames may be aligned such that a pixel is located in a same position in each frame. Optical character recognition (OCR) may be performed on exemplar frames and the OCR for exemplar frames in each segment may be combined. The exemplar frames in each segment may be combined to create a composite image for each pair of adjacent pages in the book, and OCR may be performed on the composite image.

    Abstract translation: 一些实现包括使用经过训练的分类器来识别视频中的翻页事件。 视频可以基于翻页事件被划分成多个片段,多个片段的每个片段对应于书中的一对相邻页面。 可以从每个段选择与其他帧相比提供非冗余数据的示例帧。 可以裁剪示例帧以包括页面的内容部分。 示例性帧可以对准,使得像素位于每个帧中的相同位置。 可以在示例性帧上执行光学字符识别(OCR),并且可以组合每个段中的示例帧的OCR。 每个段中的示例帧可以被组合以为书中的每对相邻页创建合成图像,并且可以在合成图像上执行OCR。

    Efficient identification of objects in videos using motion information

    公开(公告)号:US11126854B1

    公开(公告)日:2021-09-21

    申请号:US15612651

    申请日:2017-06-02

    Abstract: Technologies are disclosed for efficiently identifying objects in videos using deep neural networks and motion information. Using the disclosed technologies, the amount of time required to identify objects in videos can be greatly reduced. Motion information for a video, such as motion vectors, are extracted during the encoding or decoding of the video. The motion information is used to determine whether there is sufficient motion between frames of the video to warrant performing object detection on the frames. If there is insufficient movement from one frame to a subsequent frame, the subsequent frame will not be processed to identify objects contained therein. In this way, object detection will not be performed on video frames that have changed minimally as compared to a previous frame, thereby reducing the amount of time and the number of processing operations required to identify the objects in the video.

    Validating digital content rendering

    公开(公告)号:US10242277B1

    公开(公告)日:2019-03-26

    申请号:US14794351

    申请日:2015-07-08

    Abstract: Devices, systems and methods are disclosed for validating an electronic publication and determining a source of identified errors in a rendering of the electronic publication. The rendering may be captured as a rendered image and rendered data may be extracted from the rendering. The rendered data may be compared to actual input data to the renderer used to generate the rendered image. If errors are visible in the rendering, a source of the errors may be identified based on the comparison between the extracted rendered data to the actual input data. If errors are not visible in the rendering, the rendering may be validated.

    Detection of layouts in electronic documents

    公开(公告)号:US10095677B1

    公开(公告)日:2018-10-09

    申请号:US14316704

    申请日:2014-06-26

    Abstract: Disclosed are techniques and systems to detect a layout of a source document. A process may include receiving content from a first page and a second page of the source document, designating sections in each page along a first direction of the page, and assigning similar sections to a group. For the group, the process may proceed by dividing sections for each page into discrete portions associated with 2D coordinate areas, and identifying sets of 2D coordinate areas for the discrete portions that contain content. The number of times each portion contains some content may be compared to a threshold to determine a layout of the group of sections.

Patent Agency Ranking