-
Publication No.: US20240331423A1
Publication date: 2024-10-03
Application No.: US18604902
Filing date: 2024-03-14
Inventors: Zhihong Zeng, Sushant Tiwari, Jonathan Hirscher, Zhi Chen, Narasimha Goli
IPC classification: G06V30/19, G06V20/62, G06V30/14, G06V30/18, G06V30/413, G06V30/416, G06V30/42
CPC classification: G06V30/19147, G06V20/62, G06V30/1444, G06V30/18019, G06V30/413, G06V30/416, G06V30/42
Abstract: In some embodiments, techniques are provided for document identification using a multimodal model that has been trained using one-shot learning. In one example, a first method of document image processing includes generating, for each template document image of a plurality of template document images, a corresponding fingerprint of a plurality of fingerprints; and based on the plurality of fingerprints, training a multimodal model. For each template document image of the plurality of template document images, generating the corresponding fingerprint may include detecting a plurality of regions within the template document image, wherein the plurality of regions comprises a plurality of text regions; and filtering the plurality of regions to obtain a plurality of regions of interest, wherein the fingerprint is based on the plurality of regions of interest.
-
Publication No.: US20240289550A1
Publication date: 2024-08-29
Application No.: US18115156
Filing date: 2023-02-28
Inventors: Tianhao Wu
IPC classification: G06F40/284, G06N3/08, G06V10/82, G06V30/14, G06V30/19, G06V30/416
CPC classification: G06F40/284, G06N3/08, G06V10/82, G06V30/1444, G06V30/19147, G06V30/416
Abstract: A system and method for training a neural network model includes obtaining, by a processing device, a document image containing raw text, tokenizing the raw text in the document image to obtain tokens located in a plurality of rows, identifying a first token in one of the plurality of rows, calculating a horizontal language feature of the first token based on the first token and one or more tokens in the row, and encoding, using a first encoder, the horizontal language feature into a horizontal language embedding, calculating a vertical language feature of the first token based on the token and one or more tokens in rows above or below the row, and encoding, using a second encoder, the vertical language feature into a vertical language embedding, and training a neural network model using the horizontal language embeddings and the vertical language embeddings.
-
Publication No.: US12067743B2
Publication date: 2024-08-20
Application No.: US17512870
Filing date: 2021-10-28
Applicant: GENETEC INC.
IPC classification: G06T7/70, G06T7/00, G06V10/44, G06V10/75, G06V20/62, G06V30/12, G06V30/14, G08G1/133, G06V30/10
CPC classification: G06T7/70, G06T7/97, G06V10/44, G06V10/751, G06V20/62, G06V20/625, G06V30/133, G06V30/1444, G08G1/133, G06V30/10
Abstract: Systems, methods, devices and computer readable media for determining a geographical location of a license plate are described herein. A first image of a license plate is acquired by a first image acquisition device of a camera unit and a second image of the license plate is acquired by a second image acquisition device of the camera unit. A three-dimensional position of the license plate relative to the camera unit is determined based on stereoscopic image processing of the first image and the second image. A geographical location of the camera unit is obtained. A geographical location of the license plate is determined from the three-dimensional position of the license plate relative to the camera unit and the geographical location of the camera unit. Other systems, methods, devices and computer readable media for detecting a license plate and identifying a license plate are described herein.
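The geometry in this abstract can be illustrated with a minimal sketch. Everything beyond the abstract is an assumption made here for illustration: a rectified pinhole stereo pair (depth = focal length × baseline / disparity), a known camera heading measured clockwise from north, a local flat-earth approximation, and the hypothetical function names `stereo_depth` and `plate_geolocation`. It is not the patented method.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius for the flat-earth approximation


def stereo_depth(focal_px, baseline_m, disparity_px):
    """Depth in meters of a point seen by a rectified stereo pair (assumed model)."""
    if disparity_px <= 0:
        raise ValueError("disparity must be positive")
    return focal_px * baseline_m / disparity_px


def plate_geolocation(cam_lat, cam_lon, cam_heading_deg, right_m, forward_m):
    """Offset the camera's latitude/longitude by a camera-relative position.

    right_m and forward_m are the horizontal components of the plate position
    in the camera frame; cam_heading_deg is clockwise from north (assumption).
    """
    heading = math.radians(cam_heading_deg)
    # Rotate the camera-frame offsets into east/north offsets.
    east = forward_m * math.sin(heading) + right_m * math.cos(heading)
    north = forward_m * math.cos(heading) - right_m * math.sin(heading)
    # Convert metric offsets to degrees of latitude and longitude.
    dlat = math.degrees(north / EARTH_RADIUS_M)
    dlon = math.degrees(east / (EARTH_RADIUS_M * math.cos(math.radians(cam_lat))))
    return cam_lat + dlat, cam_lon + dlon
```

For example, a plate 20 m straight ahead of a north-facing camera ends up at a slightly higher latitude and the same longitude as the camera.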
-
Publication No.: US11935314B2
Publication date: 2024-03-19
Application No.: US17125753
Filing date: 2020-12-17
Inventors: Satoru Yamanaka
CPC classification: G06V30/40, G06V30/1444, G06V30/155, G06V30/18105, G06V30/10
Abstract: In the present disclosure, a candidate area is determined based on a pixel having a specific color included in an input image, and an area is determined to be a processing target from the candidate area based on a pixel having a predetermined color different from the specific color included in the candidate area. Further, a second binary image in which a pixel corresponding to the pixel having the specific color is converted into a white pixel is generated by converting, in a first binary image obtained by the input image being binarized, a pixel that is included in the area determined to be the processing target and corresponds to the pixel having the specific color, into a white pixel.
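A rough sketch of the color-removal idea described in this abstract follows. The concrete choices are assumptions for illustration only and are not taken from the patent: red as the "specific color", black as the "predetermined color", a bounding box as the candidate area, a luminance threshold for binarization, and the hypothetical helper names.

```python
def is_red(pixel):
    """Assumed test for the 'specific color' (for example, a red stamp)."""
    r, g, b = pixel
    return r > 150 and g < 100 and b < 100


def is_black(pixel):
    """Assumed test for the 'predetermined color' (for example, black text)."""
    return max(pixel) < 80


def binarize(img, thresh=128):
    """First binary image: 1 = white, 0 = black, from luminance."""
    return [[1 if 0.299 * r + 0.587 * g + 0.114 * b >= thresh else 0
             for (r, g, b) in row] for row in img]


def remove_specific_color(img):
    """Second binary image: red pixels whitened if their area also holds black pixels."""
    first = binarize(img)
    red = [(y, x) for y, row in enumerate(img)
           for x, p in enumerate(row) if is_red(p)]
    if not red:
        return first
    # Candidate area: bounding box of all red pixels (simplifying assumption).
    y0, y1 = min(y for y, _ in red), max(y for y, _ in red)
    x0, x1 = min(x for _, x in red), max(x for _, x in red)
    # The area becomes a processing target only if it also contains black pixels.
    is_target = any(is_black(img[y][x])
                    for y in range(y0, y1 + 1) for x in range(x0, x1 + 1))
    second = [row[:] for row in first]
    if is_target:
        for y, x in red:
            second[y][x] = 1  # convert the red pixel to white
    return second
```

In this sketch, a red mark overlapping black text is whitened in the binary image while the black text is kept, which is the kind of result a later character recognition step would benefit from.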
-
5.
Publication No.: US20230377356A1
Publication date: 2023-11-23
Application No.: US18183411
Filing date: 2023-03-14
IPC classification: G06V30/14, G06V20/62, G06V30/164, G06V30/18, G06V30/19
CPC classification: G06V30/1444, G06V20/62, G06V30/164, G06V30/18086, G06V30/19007
Abstract: This disclosure relates generally to a method and system for detecting and extracting a price region from digital flyers and promotions. In retail business, extracting price information from digital flyers is crucial given the complex nature of flyers, which have a large variety of formats, color schemes, font styles, and variable text information. The method of the present disclosure detects a text region comprising price information from a set of digital flyers and promotions received as input images. Further, each text region is converted into a two-color text comprising a set of white pixels and a set of black pixels. Further, the underlying price is detected from the price region of the two-color text, and the price is extracted from the price region of each input image. Additionally, the price region detection function detects the price region accurately and extracts price values having an irregular font size.
-
6.
Publication No.: US20230377225A1
Publication date: 2023-11-23
Application No.: US18121444
Filing date: 2023-03-14
Inventors: Chengquan ZHANG, Yuechen YU, Liang WU
CPC classification: G06T11/60, G06V20/62, G06V10/82, G06V30/19127, G06V30/1918, G06V30/1444, G06V30/19147, G06V40/10
Abstract: A method for training an image editing model includes steps described below. Covering processing is performed on a region of interest determined in an original image so that a background image sample is formed, and content corresponding to the region of interest is determined as a sample of content of interest; the background image sample and the sample of the content of interest are input into an image editing model; fusion processing is performed on a background image feature and a feature of the region of interest by using the image editing model so that a fusion feature is formed; an image reconstruction operation is performed according to the fusion feature by using the image editing model so that a reconstructed image is output; and optimization training is performed on the image editing model according to a loss relationship between the reconstructed image and the original image.
-
Publication No.: US11783607B2
Publication date: 2023-10-10
Application No.: US17180924
Filing date: 2021-02-22
Applicant: SYNC-RX, LTD
Inventors: Nili Karmon, Sarit Semo
IPC classification: G06V30/148, G06T7/00, G06V30/10, G06V30/14, G06V40/14
CPC classification: G06V30/155, G06T7/0012, G06T2207/30004, G06V30/10, G06V30/1444, G06V40/14, G06V2201/03
Abstract: Apparatus and methods are described including receiving, via a computer processor, at least one image of a portion of a subject's body. One or more features that are present within the image of the portion of the subject's body, and that were artificially added to the image subsequent to acquisition of the image, are identified. In response thereto, an output is generated on an output device.
-
Publication No.: US11783572B2
Publication date: 2023-10-10
Application No.: US17828303
Filing date: 2022-05-31
Applicant: Amadeus S.A.S.
Inventors: Sebastian Andreas Bildner, Paul Krion, Thomas Stark, Martin Christopher Stämmler, Martin Von Schledorn, Jürgen Oesterle, Renjith Karimattathil Sasidharan
IPC classification: G06V10/82, G06V30/414, G06F18/214, G06N3/08, G06V30/14, G06V30/19, G06V30/412, G06V30/10
CPC classification: G06V10/82, G06F18/214, G06N3/08, G06V30/1444, G06V30/19173, G06V30/412, G06V30/414, G06V30/10
Abstract: A method and system for automatically extracting information of a predefined type from a document are provided. The method includes identifying a location and classification of a segment of interest of a document that includes information associated with a predefined type. The method further includes identifying a location and classification of characters from the segment of interest based on characteristics associated with the predefined type. The method further includes extracting the identified characters from the segment of interest associated with the predefined type.
-
9.
Publication No.: US20230206619A1
Publication date: 2023-06-29
Application No.: US18179485
Filing date: 2023-03-07
IPC classification: G06V10/82, G06T3/40, G06T5/00, G06N5/046, G06N3/08, G06V20/62, G06V30/413, G06V30/414, G06F18/2413, G06N3/045, G06V30/14, G06V30/18, G06V30/19
CPC classification: G06V10/82, G06F18/24143, G06N3/08, G06N3/045, G06N5/046, G06T3/4046, G06T5/009, G06V20/62, G06V30/413, G06V30/414, G06V30/1444, G06V30/18057, G06V30/19173, G06V30/10
Abstract: Systems and methods for image modification to increase contrast between text and non-text pixels within the image. In one embodiment, an original document image is scaled to a predetermined size for processing by a convolutional neural network. The convolutional neural network identifies a probability that each pixel in the scaled image is text and generates a heat map of these probabilities. The heat map is then scaled back to the size of the original document image, and the probabilities in the heat map are used to adjust the intensities of the text and non-text pixels. For positive text, intensities of text pixels are reduced and intensities of non-text pixels are increased in order to increase the contrast of the text against the background of the image. Optical character recognition may then be performed on the contrast-adjusted image.
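The final adjustment step in this abstract can be sketched as follows, assuming the heat map has already been produced and scaled back to the original image size. The linear adjustment rule, the `gain` parameter, and the function name `adjust_contrast` are assumptions chosen for illustration; the abstract does not specify how the probabilities are mapped to intensity changes.

```python
def adjust_contrast(gray, heat, gain=1.0):
    """Darken likely-text pixels and brighten likely-background pixels.

    gray: rows of 0-255 intensities (positive text: dark text on light background).
    heat: rows of text probabilities in [0, 1], same shape as gray.
    gain: strength of the adjustment (hypothetical parameter).
    """
    out = []
    for gray_row, heat_row in zip(gray, heat):
        row = []
        for intensity, p in zip(gray_row, heat_row):
            # p > 0.5 lowers the intensity, p < 0.5 raises it (assumed linear rule).
            shifted = intensity + gain * (0.5 - p) * 255
            row.append(int(round(min(255.0, max(0.0, shifted)))))
        out.append(row)
    return out
```

With this rule, a pixel the network is undecided about (probability 0.5) keeps its intensity, while confident text and background pixels are pushed toward black and white respectively.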
-
Publication No.: US20230196814A1
Publication date: 2023-06-22
Application No.: US18171840
Filing date: 2023-02-21
Inventors: Hidetaka KOJIMA, Takuma AKAGI
IPC classification: G06V30/414, G06V30/14
CPC classification: G06V30/414, G06V30/1444
Abstract: An information processing apparatus and a system capable of effectively managing slips are provided. According to an embodiment, an information processing apparatus includes a first interface, a storage unit, and a processor. The first interface acquires an instruction picture indicating an article to be picked up. The storage unit stores data. The processor acquires a first ID from the instruction picture, generates feature information indicating a feature of a slip to deliver the article, generates a second ID, associates the first ID and the second ID with the feature information, and stores them in the storage unit.