Patent search cpc:"G06V30/1444" Page 1

1.

Invention application publication
ONE-SHOT MULTIMODAL LEARNING FOR DOCUMENT IDENTIFICATION | Status: pending (published)

Publication No.: US20240331423A1

Publication date: 2024-10-03

Application No.: US18604902

Filing date: 2024-03-14

Applicant: IRON MOUNTAIN INCORPORATED

Inventors: Zhihong Zeng, Sushant Tiwari, Jonathan Hirscher, Zhi Chen, Narasimha Goli

IPC classification: G06V30/19, G06V20/62, G06V30/14, G06V30/18, G06V30/413, G06V30/416, G06V30/42

CPC classification: G06V30/19147, G06V20/62, G06V30/1444, G06V30/18019, G06V30/413, G06V30/416, G06V30/42

Abstract: In some embodiments, techniques are provided for document identification using a multimodal model that has been trained using one-shot learning. In one example, a first method of document image processing includes generating, for each template document image of a plurality of template document images, a corresponding fingerprint of a plurality of fingerprints; and based on the plurality of fingerprints, training a multimodal model. For each template document image of the plurality of template document images, generating the corresponding fingerprint may include detecting a plurality of regions within the template document image, wherein the plurality of regions comprises a plurality of text regions; and filtering the plurality of regions to obtain a plurality of regions of interest, wherein the fingerprint is based on the plurality of regions of interest.
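
The detect, filter, and fingerprint steps named in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration, not the applicant's method: the `Region` type, the filtering rule (keep non-empty text regions above a minimum area), the coarse position grid, and the Jaccard comparison, which stands in for the trained multimodal model that the abstract actually describes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """A detected region; all fields are hypothetical, coordinates normalized to [0, 1]."""
    kind: str   # "text" or "graphic"
    text: str
    x: float
    y: float
    w: float
    h: float


def filter_regions(regions, min_area=0.001):
    """Keep only non-empty text regions that are large enough (assumed filtering rule)."""
    return [
        r for r in regions
        if r.kind == "text" and r.text.strip() and r.w * r.h >= min_area
    ]


def fingerprint(regions, grid=10):
    """Build a fingerprint from the regions of interest: normalized text plus a coarse grid cell."""
    return frozenset(
        (r.text.strip().lower(), int(r.x * grid), int(r.y * grid))
        for r in filter_regions(regions)
    )


def similarity(fp_a, fp_b):
    """Jaccard similarity between two fingerprints (a stand-in for the trained model)."""
    if not fp_a and not fp_b:
        return 1.0
    return len(fp_a & fp_b) / len(fp_a | fp_b)


template = [
    Region("text", "INVOICE", 0.12, 0.05, 0.30, 0.05),
    Region("text", "   ", 0.50, 0.50, 0.20, 0.20),      # blank text, filtered out
    Region("graphic", "", 0.70, 0.00, 0.20, 0.10),      # not a text region, filtered out
    Region("text", "Total", 0.63, 0.91, 0.10, 0.03),
]
template_fp = fingerprint(template)
```

Because positions are quantized to a grid, a scan whose text regions shift slightly still maps to the same fingerprint entries, which is one plausible reason to fingerprint a single template per document type.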

2.

Invention application publication
SYSTEM AND METHOD FOR GENERATING EMBEDDINGS IN PRE-TRAINED TEXT DETECTION AND EXTRACTION NEURAL NETWORK MODEL AND FINE-TUNING THE SAME | Status: pending (published)

Publication No.: US20240289550A1

Publication date: 2024-08-29

Application No.: US18115156

Filing date: 2023-02-28

Applicant: Singularity Systems Inc.

Inventors: Tianhao Wu

IPC classification: G06F40/284, G06N3/08, G06V10/82, G06V30/14, G06V30/19, G06V30/416

CPC classification: G06F40/284, G06N3/08, G06V10/82, G06V30/1444, G06V30/19147, G06V30/416

Abstract: A system and method for training a neural network model includes obtaining, by a processing device, a document image containing raw text, tokenizing the raw text in the document image to obtain tokens located in a plurality of rows, identifying a first token in one of the plurality of rows, calculating a horizontal language feature of the first token based on the first token and one or more tokens in the row, and encoding, using a first encoder, the horizontal language feature into a horizontal language embedding, calculating a vertical language feature of the first token based on the token and one or more tokens in rows above or below the row, and encoding, using a second encoder, the vertical language feature into a vertical language embedding, and training a neural network model using the horizontal language embeddings and the vertical language embeddings.

3.

Granted invention patent
Automated license plate recognition system and related method | Status: in force

Publication No.: US12067743B2

Publication date: 2024-08-20

Application No.: US17512870

Filing date: 2021-10-28

Applicant: GENETEC INC.

Inventors: Louis-Antoine Blais-Morin, Pablo Agustin Cassani, André Bleau

IPC classification: G06T7/70, G06T7/00, G06V10/44, G06V10/75, G06V20/62, G06V30/12, G06V30/14, G08G1/133, G06V30/10

CPC classification: G06T7/70, G06T7/97, G06V10/44, G06V10/751, G06V20/62, G06V20/625, G06V30/133, G06V30/1444, G08G1/133, G06V30/10

Abstract: Systems, methods, devices and computer readable media for determining a geographical location of a license plate are described herein. A first image of a license plate is acquired by a first image acquisition device of a camera unit and a second image of the license plate is acquired by a second image acquisition device of the camera unit. A three-dimensional position of the license plate relative to the camera unit is determined based on stereoscopic image processing of the first image and the second image. A geographical location of the camera unit is obtained. A geographical location of the license plate is determined from the three-dimensional position of the license plate relative to the camera unit and the geographical location of the camera unit. Other systems, methods, devices and computer readable media for detecting a license plate and identifying a license plate are described herein.
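
The two-stage computation in this abstract (stereo position relative to the camera, then geographic position) can be sketched as follows. This is an illustration under stated assumptions rather than the patented method: a rectified pinhole stereo pair, a known camera heading (not mentioned in the abstract), and a flat-earth approximation that is only reasonable over short distances. All function and parameter names are hypothetical.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in metres


def plate_position_camera_frame(u_left, u_right, focal_px, baseline_m, cx):
    """Return (x_right_m, z_forward_m) for a point seen in a rectified stereo pair.

    u_left / u_right are the pixel columns of the plate in each image,
    cx is the principal point column of the left camera.
    """
    disparity = u_left - u_right
    if disparity <= 0:
        raise ValueError("disparity must be positive for a point in front of the cameras")
    z = focal_px * baseline_m / disparity  # depth from disparity
    x = (u_left - cx) * z / focal_px       # lateral offset, positive to the right
    return x, z


def plate_geolocation(cam_lat, cam_lon, heading_deg, x_right_m, z_forward_m):
    """Offset the camera's latitude/longitude by the plate's position.

    heading_deg is the camera's viewing direction, clockwise from north (assumed known).
    """
    h = math.radians(heading_deg)
    north = z_forward_m * math.cos(h) - x_right_m * math.sin(h)
    east = z_forward_m * math.sin(h) + x_right_m * math.cos(h)
    lat = cam_lat + math.degrees(north / EARTH_RADIUS_M)
    lon = cam_lon + math.degrees(east / (EARTH_RADIUS_M * math.cos(math.radians(cam_lat))))
    return lat, lon


x, z = plate_position_camera_frame(u_left=660, u_right=640, focal_px=1000, baseline_m=0.2, cx=640)
lat, lon = plate_geolocation(45.5, -73.6, heading_deg=0.0, x_right_m=x, z_forward_m=z)
```

With these example numbers the disparity is 20 pixels, so the plate sits about 10 m ahead and 0.2 m to the right of the left camera; a camera facing north therefore places the plate slightly north and east of its own position.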

4.

Granted invention patent
Apparatus for generating a binary image into a white pixel, storage medium, and method | Status: in force

Publication No.: US11935314B2

Publication date: 2024-03-19

Application No.: US17125753

Filing date: 2020-12-17

Applicant: CANON KABUSHIKI KAISHA

Inventors: Satoru Yamanaka

IPC classification: G06V10/00, G06V30/14, G06V30/148, G06V30/18, G06V30/40, G06V30/10

CPC classification: G06V30/40, G06V30/1444, G06V30/155, G06V30/18105, G06V30/10

Abstract: In the present disclosure, a candidate area is determined based on a pixel having a specific color included in an input image, and an area is determined to be a processing target from the candidate area based on a pixel having a predetermined color different from the specific color included in the candidate area. Further, a second binary image in which a pixel corresponding to the pixel having the specific color is converted into a white pixel is generated by converting, in a first binary image obtained by the input image being binarized, a pixel that is included in the area determined to be the processing target and corresponds to the pixel having the specific color, into a white pixel.
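
A minimal sketch of the flow described in this abstract, using a red mark over black text as the example. The color tests, the binarization threshold, the single bounding-box candidate area, and the rule that an area becomes a processing target when it contains at least one dark pixel are all assumptions chosen for illustration; the abstract does not specify them.

```python
WHITE, BLACK = 1, 0


def is_red(pixel):
    """Assumed test for the 'specific color' (a red mark)."""
    r, g, b = pixel
    return r > 200 and g < 80 and b < 80


def is_dark(pixel):
    """Assumed test for the 'predetermined color' (dark text)."""
    return sum(pixel) < 150


def binarize(img, threshold=384):
    """First binary image: white where the summed RGB value reaches the threshold."""
    return [[WHITE if sum(p) >= threshold else BLACK for p in row] for row in img]


def bounding_box(img, predicate):
    """Bounding box (y0, x0, y1, x1) of all pixels matching predicate, or None."""
    points = [(y, x) for y, row in enumerate(img) for x, p in enumerate(row) if predicate(p)]
    if not points:
        return None
    ys = [y for y, _ in points]
    xs = [x for _, x in points]
    return min(ys), min(xs), max(ys), max(xs)


def remove_colored_marks(img):
    """Second binary image: red pixels inside a target area are turned white."""
    first = binarize(img)
    box = bounding_box(img, is_red)  # candidate area
    if box is None:
        return first
    y0, x0, y1, x1 = box
    cells = [(y, x) for y in range(y0, y1 + 1) for x in range(x0, x1 + 1)]
    if not any(is_dark(img[y][x]) for y, x in cells):
        return first                 # candidate area is not a processing target
    second = [row[:] for row in first]
    for y, x in cells:
        if is_red(img[y][x]):
            second[y][x] = WHITE
    return second


R, K, W = (255, 0, 0), (0, 0, 0), (255, 255, 255)
stamped = [[W, R, W], [R, K, R], [W, R, W]]
print(remove_colored_marks(stamped))  # [[1, 1, 1], [1, 0, 1], [1, 1, 1]]
```

In the first binary image the red pixels come out black and would merge with the text; whitening them only inside areas that also contain dark pixels leaves colored marks elsewhere on the page untouched.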

5.

Invention application publication
METHOD AND SYSTEM FOR DETECTING AND EXTRACTING PRICE REGION FROM DIGITAL FLYERS AND PROMOTIONS | Status: pending (published)

Publication No.: US20230377356A1

Publication date: 2023-11-23

Application No.: US18183411

Filing date: 2023-03-14

Applicant: Tata Consultancy Services Limited

Inventors: AMIT KUMAR AGRAWAL, MANTU PRASAD GUPTA, DEVANG JAGDISHCHANDRA PATEL, PUSHP KUMAR JAIN

IPC classification: G06V30/14, G06V20/62, G06V30/164, G06V30/18, G06V30/19

CPC classification: G06V30/1444, G06V20/62, G06V30/164, G06V30/18086, G06V30/19007

Abstract: This disclosure relates generally to a method and system for detecting and extracting a price region from digital flyers and promotions. In retail business, extracting price information from digital flyers is crucial but is complicated by the complex nature of flyers, which come in a large variety of formats, color schemes, font styles, and variable text information. The method of the present disclosure detects a text region comprising price information from a set of digital flyers and promotions received as input images. Further, each text region is converted into a two-color text comprising a set of white pixels and a set of black pixels. Further, the underlying price is detected from the price region of the two-color text, and the price is extracted from the price region of each input image. Additionally, the price region detection function detects the price region accurately and extracts price values having an irregular font size.

6.

Invention application publication
METHOD AND APPARATUS FOR EDITING AN IMAGE AND METHOD AND APPARATUS FOR TRAINING AN IMAGE EDITING MODEL, DEVICE AND MEDIUM | Status: pending (published)

Publication No.: US20230377225A1

Publication date: 2023-11-23

Application No.: US18121444

Filing date: 2023-03-14

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventors: Chengquan ZHANG, Yuechen YU, Liang WU

IPC classification: G06T11/60, G06V20/62, G06V10/82, G06V30/19, G06V30/14

CPC classification: G06T11/60, G06V20/62, G06V10/82, G06V30/19127, G06V30/1918, G06V30/1444, G06V30/19147, G06V40/10

Abstract: A method for training an image editing model includes steps described below. Covering processing is performed on a region of interest determined in an original image so that a background image sample is formed, and content corresponding to the region of interest is determined as a sample of content of interest; the background image sample and the sample of the content of interest are input into an image editing model; fusion processing is performed on a background image feature and a feature of the region of interest by using the image editing model so that a fusion feature is formed; an image reconstruction operation is performed according to the fusion feature by using the image editing model so that a reconstructed image is output; and optimization training is performed on the image editing model according to a loss relationship between the reconstructed image and the original image.

7.

Granted invention patent
Automatic image feature removal | Status: in force

Publication No.: US11783607B2

Publication date: 2023-10-10

Application No.: US17180924

Filing date: 2021-02-22

Applicant: SYNC-RX, LTD

Inventors: Nili Karmon, Sarit Semo

IPC classification: G06V30/148, G06T7/00, G06V30/10, G06V30/14, G06V40/14

CPC classification: G06V30/155, G06T7/0012, G06T2207/30004, G06V30/10, G06V30/1444, G06V40/14, G06V2201/03

Abstract: Apparatus and methods are described including receiving, via a computer processor, at least one image of a portion of a subject's body. One or more features that are present within the image of the portion of the subject's body, and that were artificially added to the image subsequent to acquisition of the image, are identified. In response thereto, an output is generated on an output device.

8.

Granted invention patent
Method of automatically extracting information of a predefined type from a document | Status: in force

Publication No.: US11783572B2

Publication date: 2023-10-10

Application No.: US17828303

Filing date: 2022-05-31

Applicant: Amadeus S.A.S.

Inventors: Sebastian Andreas Bildner, Paul Krion, Thomas Stark, Martin Christopher Stämmler, Martin Von Schledorn, Jürgen Oesterle, Renjith Karimattathil Sasidharan

IPC classification: G06V10/82, G06V30/414, G06F18/214, G06N3/08, G06V30/14, G06V30/19, G06V30/412, G06V30/10

CPC classification: G06V10/82, G06F18/214, G06N3/08, G06V30/1444, G06V30/19173, G06V30/412, G06V30/414, G06V30/10

Abstract: A method and system of automatically extracting information of a predefined type from a document are provided. The method includes identifying a location and classification of a segment of interest of a document that includes information associated with a predefined type. The method further includes identifying a location and classification of characters from the segment of interest based on characteristics associated with the predefined type. The method further includes extracting the identified characters from the segment of interest associated with the predefined type.

9.

Invention application publication
SYSTEMS AND METHODS FOR IMAGE MODIFICATION AND IMAGE BASED CONTENT CAPTURE AND EXTRACTION IN NEURAL NETWORKS | Status: pending (published)

Publication No.: US20230206619A1

Publication date: 2023-06-29

Application No.: US18179485

Filing date: 2023-03-07

Applicant: Open Text Corporation

Inventors: Christopher Dale Lund, Sreelatha Samala

IPC classification: G06V10/82, G06T3/40, G06T5/00, G06N5/046, G06N3/08, G06V20/62, G06V30/413, G06V30/414, G06F18/2413, G06N3/045, G06V30/14, G06V30/18, G06V30/19

CPC classification: G06V10/82, G06F18/24143, G06N3/08, G06N3/045, G06N5/046, G06T3/4046, G06T5/009, G06V20/62, G06V30/413, G06V30/414, G06V30/1444, G06V30/18057, G06V30/19173, G06V30/10

Abstract: Systems and methods are described for image modification to increase contrast between text and non-text pixels within an image. In one embodiment, an original document image is scaled to a predetermined size for processing by a convolutional neural network. The convolutional neural network identifies a probability that each pixel in the scaled image is text and generates a heat map of these probabilities. The heat map is then scaled back to the size of the original document image, and the probabilities in the heat map are used to adjust the intensities of the text and non-text pixels. For positive text, intensities of text pixels are reduced and intensities of non-text pixels are increased in order to increase the contrast of the text against the background of the image. Optical character recognition may then be performed on the contrast-adjusted image.
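
The heat-map rescaling and intensity adjustment for positive (dark-on-light) text can be sketched as below. The convolutional network is out of scope, so the heat map is taken as given; the nearest-neighbour rescaling, the 0.5 threshold, and the adjustment formulas are assumptions chosen for illustration, since the abstract does not specify them.

```python
def resize_nearest(grid, out_h, out_w):
    """Scale a 2D grid to out_h x out_w with nearest-neighbour sampling."""
    in_h, in_w = len(grid), len(grid[0])
    return [
        [grid[y * in_h // out_h][x * in_w // out_w] for x in range(out_w)]
        for y in range(out_h)
    ]


def enhance_contrast(gray, text_prob, threshold=0.5):
    """Darken likely text pixels and lighten likely background pixels.

    gray holds intensities in 0..255; text_prob holds per-pixel text
    probabilities in 0..1 with the same shape as gray.
    """
    out = []
    for row, prob_row in zip(gray, text_prob):
        new_row = []
        for value, p in zip(row, prob_row):
            if p >= threshold:
                new_value = value * (1.0 - p)                    # reduce text intensity
            else:
                new_value = value + (255 - value) * (1.0 - p)    # raise background intensity
            new_row.append(round(new_value))
        out.append(new_row)
    return out


heat_map = [[0.9, 0.1]]                        # low-resolution heat map (hypothetical values)
full_map = resize_nearest(heat_map, 2, 4)      # scaled back to the original image size
image = [[128, 128, 128, 128], [128, 128, 128, 128]]
print(enhance_contrast(image, full_map))       # [[13, 13, 242, 242], [13, 13, 242, 242]]
```

Starting from a uniform mid-gray image, the pixels the heat map marks as text drop to 13 and the rest rise to 242, which is the kind of separation that a later OCR pass benefits from.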

10.

Invention application publication
INFORMATION PROCESSING APPARATUS AND SYSTEM | Status: pending (published)

Publication No.: US20230196814A1

Publication date: 2023-06-22

Application No.: US18171840

Filing date: 2023-02-21

Applicant: KABUSHIKI KAISHA TOSHIBA, Toshiba Infrastructure Systems & Solutions Corporation

Inventors: Hidetaka KOJIMA, Takuma AKAGI

IPC classification: G06V30/414, G06V30/14

CPC classification: G06V30/414, G06V30/1444

Abstract: An information processing apparatus and a system capable of effectively managing slips are provided. According to an embodiment, an information processing apparatus includes a first interface, a storage unit, and a processor. The first interface acquires an instruction picture indicating an article to be picked up. The storage unit stores data. The processor acquires a first ID from the instruction picture, generates feature information indicating a feature of a slip to deliver the article, generates a second ID, associates the first ID and the second ID with the feature information, and stores them in the storage unit.
