DIGITAL CONTENT LAYOUT ENCODING FOR SEARCH

    公开(公告)号:US20240419750A1

    公开(公告)日:2024-12-19

    申请号:US18822367

    申请日:2024-09-02

    Applicant: Adobe Inc.

    Abstract: Digital content layout encoding techniques for search are described. In these techniques, a layout representation is generated (using machine learning automatically and without user intervention) that describes a layout of elements included within the digital content. In an implementation, the layout representation includes a description of both spatial and structural aspects of the elements in relation to each other. To do so, a two-pathway pipeline that is configured to model layout from both spatial and structural aspects using a spatial pathway, and a structural pathway, respectively. In one example, this is also performed through use of multi-level encoding and fusion to generate a layout representation.

    Digital Content Layout Encoding for Search
    3.
    发明公开

    公开(公告)号:US20230359682A1

    公开(公告)日:2023-11-09

    申请号:US17735748

    申请日:2022-05-03

    Applicant: Adobe Inc.

    CPC classification number: G06F16/9537 G06F40/30 G06N20/00

    Abstract: Digital content layout encoding techniques for search are described. In these techniques, a layout representation is generated (using machine learning automatically and without user intervention) that describes a layout of elements included within the digital content. In an implementation, the layout representation includes a description of both spatial and structural aspects of the elements in relation to each other. To do so, a two-pathway pipeline that is configured to model layout from both spatial and structural aspects using a spatial pathway, and a structural pathway, respectively. In one example, this is also performed through use of multi-level encoding and fusion to generate a layout representation.

    Training text recognition systems

    公开(公告)号:US11810374B2

    公开(公告)日:2023-11-07

    申请号:US17240097

    申请日:2021-04-26

    Applicant: Adobe Inc.

    Abstract: In implementations of recognizing text in images, text recognition systems are trained using noisy images that have nuisance factors applied, and corresponding clean images (e.g., without nuisance factors). Clean images serve as supervision at both feature and pixel levels, so that text recognition systems are trained to be feature invariant (e.g., by requiring features extracted from a noisy image to match features extracted from a clean image), and feature complete (e.g., by requiring that features extracted from a noisy image be sufficient to generate a clean image). Accordingly, text recognition systems generalize to text not included in training images, and are robust to nuisance factors. Furthermore, since clean images are provided as supervision at feature and pixel levels, training requires fewer training images than text recognition systems that are not trained with a supervisory clean image, thus saving time and resources.

    Preserving Document Design Using Font Synthesis

    公开(公告)号:US20230326104A1

    公开(公告)日:2023-10-12

    申请号:US18333766

    申请日:2023-06-13

    Applicant: Adobe Inc.

    CPC classification number: G06T11/203 G06F40/109 G06F40/166 G06V30/245

    Abstract: Automatic font synthesis for modifying a local font to have an appearance that is visually similar to a source font is described. A font modification system receives an electronic document including the source font together with an indication of a font descriptor for the source font. The font descriptor includes information describing various font attributes for the source font, which define a visual appearance of the source font. Using the source font descriptor, the font modification system identifies a local font that is visually similar in appearance to the source font by comparing local font descriptors to the source font descriptor. A visually similar font is then synthesized by modifying glyph outlines of the local font to achieve the visual appearance defined by the source font descriptor. The synthesized font is then used to replace the source font and output in the electronic document at the computing device.

Patent Agency Ranking