-
公开(公告)号:US20200151503A1
公开(公告)日:2020-05-14
申请号:US16184779
申请日:2018-11-08
Applicant: Adobe Inc.
Inventor: Zhaowen Wang , Hailin Jin , Yang Liu
Abstract: In implementations of recognizing text in images, text recognition systems are trained using noisy images that have nuisance factors applied, and corresponding clean images (e.g., without nuisance factors). Clean images serve as supervision at both feature and pixel levels, so that text recognition systems are trained to be feature invariant (e.g., by requiring features extracted from a noisy image to match features extracted from a clean image), and feature complete (e.g., by requiring that features extracted from a noisy image be sufficient to generate a clean image). Accordingly, text recognition systems generalize to text not included in training images, and are robust to nuisance factors. Furthermore, since clean images are provided as supervision at feature and pixel levels, training requires fewer training images than text recognition systems that are not trained with a supervisory clean image, thus saving time and resources.
-
公开(公告)号:US10592787B2
公开(公告)日:2020-03-17
申请号:US15807028
申请日:2017-11-08
Applicant: Adobe Inc.
Inventor: Yang Liu , Zhaowen Wang , Hailin Jin
Abstract: The present disclosure relates to a font recognition system that employs a multi-task learning framework and adversarial training to improve font classification and remove negative side effects caused by intra-class variances of glyph content. For example, in one or more embodiments, the font recognition system adversarial trains a font recognition neural network by minimizing font classification loss while at the same time maximizing glyph classification loss. By employing an adversarially trained font classification neural network, the font recognition system can improve overall font recognition by removing the negative side effects from diverse glyph content.
-
公开(公告)号:US10515296B2
公开(公告)日:2019-12-24
申请号:US15812548
申请日:2017-11-14
Applicant: Adobe Inc.
Inventor: Yang Liu , Zhaowen Wang , I-Ming Pao , Hailin Jin
Abstract: The present disclosure relates to a font recognition system that employs a multi-task learning framework and training to improve font classification and remove negative side effects caused by intra-class variances of glyph content. For example, in one or more embodiments, the font recognition system trains a hybrid font recognition neural network that includes two or more font recognition neural networks and a weight prediction neural network. The hybrid font recognition neural network determines and generates classification weights based on which font recognition neural network within the hybrid font recognition neural network is best suited to classify the font in an input text image. By employing a hybrid trained font classification neural network, the font recognition system can improve overall font recognition as well as remove the negative side effects from diverse glyph content.
-
公开(公告)号:US20190147304A1
公开(公告)日:2019-05-16
申请号:US15812548
申请日:2017-11-14
Applicant: Adobe Inc.
Inventor: Yang Liu , Zhaowen Wang , I-Ming Pao , Hailin Jin
CPC classification number: G06K9/6828 , G06K9/6227 , G06K9/6257 , G06K9/6262 , G06K9/6277 , G06K9/628 , G06N3/0454 , G06N3/08 , G06N3/084 , G06N5/046
Abstract: The present disclosure relates to a font recognition system that employs a multi-task learning framework and training to improve font classification and remove negative side effects caused by intra-class variances of glyph content. For example, in one or more embodiments, the font recognition system trains a hybrid font recognition neural network that includes two or more font recognition neural networks and a weight prediction neural network. The hybrid font recognition neural network determines and generates classification weights based on which font recognition neural network within the hybrid font recognition neural network is best suited to classify the font in an input text image. By employing a hybrid trained font classification neural network, the font recognition system can improve overall font recognition as well as remove the negative side effects from diverse glyph content.
-
公开(公告)号:US20190138860A1
公开(公告)日:2019-05-09
申请号:US15807028
申请日:2017-11-08
Applicant: Adobe Inc.
Inventor: Yang Liu , Zhaowen Wang , Hailin Jin
CPC classification number: G06K9/6828 , G06K9/00422 , G06K9/6256 , G06K9/6273 , G06K9/6277 , G06K9/66 , G06K2209/011 , G06N3/0454 , G06N3/0472 , G06N3/084
Abstract: The present disclosure relates to a font recognition system that employs a multi-task learning framework and adversarial training to improve font classification and remove negative side effects caused by intra-class variances of glyph content. For example, in one or more embodiments, the font recognition system adversarial trains a font recognition neural network by minimizing font classification loss while at the same time maximizing glyph classification loss. By employing an adversarially trained font classification neural network, the font recognition system can improve overall font recognition by removing the negative side effects from diverse glyph content.
-
6.
公开(公告)号:US12254570B2
公开(公告)日:2025-03-18
申请号:US17661878
申请日:2022-05-03
Applicant: Adobe Inc.
Inventor: Sai Bi , Yang Liu , Zexiang Xu , Fujun Luan , Kalyan Sunkavalli
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that generate three-dimensional hybrid mesh-volumetric representations for digital objects. For instance, in one or more embodiments, the disclosed systems generate a mesh for a digital object from a plurality of digital images that portray the digital object using a multi-view stereo model. Additionally, the disclosed systems determine a set of sample points for a thin volume around the mesh. Using a neural network, the disclosed systems further generate a three-dimensional hybrid mesh-volumetric representation for the digital object utilizing the set of sample points for the thin volume and the mesh.
-
公开(公告)号:US20210241032A1
公开(公告)日:2021-08-05
申请号:US17240097
申请日:2021-04-26
Applicant: Adobe Inc.
Inventor: Zhaowen Wang , Hailin Jin , Yang Liu
Abstract: In implementations of recognizing text in images, text recognition systems are trained using noisy images that have nuisance factors applied, and corresponding clean images (e.g., without nuisance factors). Clean images serve as supervision at both feature and pixel levels, so that text recognition systems are trained to be feature invariant (e.g., by requiring features extracted from a noisy image to match features extracted from a clean image), and feature complete (e.g., by requiring that features extracted from a noisy image be sufficient to generate a clean image). Accordingly, text recognition systems generalize to text not included in training images, and are robust to nuisance factors. Furthermore, since clean images are provided as supervision at feature and pixel levels, training requires fewer training images than text recognition systems that are not trained with a supervisory clean image, thus saving time and resources.
-
公开(公告)号:US10997463B2
公开(公告)日:2021-05-04
申请号:US16184779
申请日:2018-11-08
Applicant: Adobe Inc.
Inventor: Zhaowen Wang , Hailin Jin , Yang Liu
Abstract: In implementations of recognizing text in images, text recognition systems are trained using noisy images that have nuisance factors applied, and corresponding clean images (e.g., without nuisance factors). Clean images serve as supervision at both feature and pixel levels, so that text recognition systems are trained to be feature invariant (e.g., by requiring features extracted from a noisy image to match features extracted from a clean image), and feature complete (e.g., by requiring that features extracted from a noisy image be sufficient to generate a clean image). Accordingly, text recognition systems generalize to text not included in training images, and are robust to nuisance factors. Furthermore, since clean images are provided as supervision at feature and pixel levels, training requires fewer training images than text recognition systems that are not trained with a supervisory clean image, thus saving time and resources.
-
9.
公开(公告)号:US20230360327A1
公开(公告)日:2023-11-09
申请号:US17661878
申请日:2022-05-03
Applicant: Adobe Inc.
Inventor: Sai Bi , Yang Liu , Zexiang Xu , Fujun Luan , Kalyan Sunkavalli
CPC classification number: G06T17/205 , G06T13/20 , G06T2210/21
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that generate three-dimensional hybrid mesh-volumetric representations for digital objects. For instance, in one or more embodiments, the disclosed systems generate a mesh for a digital object from a plurality of digital images that portray the digital object using a multi-view stereo model. Additionally, the disclosed systems determine a set of sample points for a thin volume around the mesh. Using a neural network, the disclosed systems further generate a three-dimensional hybrid mesh-volumetric representation for the digital object utilizing the set of sample points for the thin volume and the mesh.
-
公开(公告)号:US11810374B2
公开(公告)日:2023-11-07
申请号:US17240097
申请日:2021-04-26
Applicant: Adobe Inc.
Inventor: Zhaowen Wang , Hailin Jin , Yang Liu
IPC: G06V20/62 , G06V30/148 , G06F18/214 , G06V10/764
CPC classification number: G06V20/62 , G06F18/214 , G06V10/764 , G06V20/63 , G06V30/153 , G06V2201/01
Abstract: In implementations of recognizing text in images, text recognition systems are trained using noisy images that have nuisance factors applied, and corresponding clean images (e.g., without nuisance factors). Clean images serve as supervision at both feature and pixel levels, so that text recognition systems are trained to be feature invariant (e.g., by requiring features extracted from a noisy image to match features extracted from a clean image), and feature complete (e.g., by requiring that features extracted from a noisy image be sufficient to generate a clean image). Accordingly, text recognition systems generalize to text not included in training images, and are robust to nuisance factors. Furthermore, since clean images are provided as supervision at feature and pixel levels, training requires fewer training images than text recognition systems that are not trained with a supervisory clean image, thus saving time and resources.
-
-
-
-
-
-
-
-
-