专利检索 ipc:"G06V30/148" 第 1 页

1.

发明授权
Autonomous system and method for monitoring and improving water quality by mitigating harmful algal blooms 有权

公开(公告)号：US12129191B1

公开(公告)日：2024-10-29

申请号：US18779609

申请日：2024-07-22

申请人： Nishant Narayanan

发明人： Nishant Narayanan

IPC分类号： G06V30/148 , C02F1/50 , G06F3/02 , G06F7/24 , G06F18/20 , G06F18/213 , G06V10/82 , G06V20/56 , G06V20/62 , G06V20/69 , C02F103/00

CPC分类号： C02F1/50 , G06V10/82 , G06V20/56 , G06V20/69 , C02F2103/007 , C02F2201/008 , C02F2209/006 , G06V2201/07

摘要： A system for monitoring and improving water quality by mitigating harmful algal blooms. The system comprises an automatic detection unit that is configured to affix to an unmanned vehicle (UV). The automatic detection unit is adapted to detect harmful algal blooms in a water body when the UV flies over it. The automatic detection unit communicates to a server via a network. This automated process ensures swift identification without human intervention, enhancing efficiency. The system performs real-time data transmission that allows for analysis and response, facilitating timely decisions and interventions to mitigate algal blooms. The system is integrated with an artificial intelligence module, trained on reference data using convolution neural networks (CNNs). By automating detection, analysis, and response processes, the system optimizes operational efficiency, reducing manual effort and response times in managing algal bloom incidents.

2.

发明授权
Semantic matching between a source screen or source data and a target screen using semantic artificial intelligence 有权

公开(公告)号：US12124806B2

公开(公告)日：2024-10-22

申请号：US17494744

申请日：2021-10-05

申请人： UiPath, Inc.

发明人： Christian Mayer , Mircea Neagovici , Cosmin Voicu

IPC分类号： G06F40/30 , G06F3/0481 , G06F40/289 , G06N20/00 , G06V30/148 , G06F17/10 , G06F17/18 , G06F30/20 , G06V30/10

CPC分类号： G06F40/30 , G06F3/0481 , G06F40/289 , G06N20/00 , G06V30/153 , G06F17/10 , G06F17/18 , G06F30/20 , G06V30/10

摘要： Semantic matching between a source screen or source data and a target screen using semantic artificial intelligence (AI) for robotic process automation (RPA) workflows is disclosed. The source data or source screen and the target screen are selected on a matching interface, semantic matching is performed between the source data/screen and the target screen using an artificial intelligence/machine learning (AI/ML) model, and matching graphical elements and unmatched graphical elements are highlighted, allowing the developer to see which graphical elements match and which do not. The matching interface may also provide a confidence score of the individual matches, provide an overall mapping score, and allow the developer to hide/unhide the matched/unmatched graphical elements. Activities of an RPA workflow may be automatically created based on the semantic mapping that can be executed to perform the automation.

3.

发明授权
Continuous learning for document processing and analysis 有权

公开(公告)号：US12118813B2

公开(公告)日：2024-10-15

申请号：US17518191

申请日：2021-11-03

申请人： ABBYY Development Inc.

发明人： Stanislav Semenov

IPC分类号： G06V30/41 , G06F40/174 , G06N3/048 , G06N3/08 , G06V30/148

CPC分类号： G06V30/41 , G06F40/174 , G06N3/048 , G06N3/08 , G06V30/153

摘要： A document processing method includes receiving one or more documents, performing optical character recognition on the one or more documents to detect words comprising symbols in the one or more documents, and determining a encoding value for each of the symbols. It further includes applying a first hash function to each encoding value to generate a first set of hashed symbol values, applying a second hash function to each hashed symbol value to generate a vector array including a second set of hashed symbol values, and applying a linear transformation to each value of the second set of hashed symbol values of the vector array. The method also includes applying an irreversible non-linear activation function to the vector array to obtain abstract values associated with the symbols and saving the abstract values to train a neural network to detect fields in an input document.

4.

发明授权
Image generation method, computing device, and storage medium 有权

公开(公告)号：US12118808B2

公开(公告)日：2024-10-15

申请号：US17830518

申请日：2022-06-02

申请人： HON HAI PRECISION INDUSTRY CO., LTD.

发明人： Cheng-Feng Wang , Po-Chung Wang , Li-Che Lin

IPC分类号： G06V30/12 , G06T5/50 , G06T7/00 , G06V30/148

CPC分类号： G06V30/133 , G06T5/50 , G06T7/0002 , G06V30/153

摘要： An image generation method obtains an original image. A character area, a background area, and a position of each flawless character in the original image are determined. The character area is segmented to obtain a first image of each flawless character. A background is removed from the first image to obtain a second image. First image processing is performed on the second image to obtain a third image. Second image processing is performed on the second image to obtain fourth images. Third image processing is performed on the fourth images respectively to obtain fifth images. A similarity between each fifth image and the third image is calculated. When the similarity is greater than a defect threshold, a background image is segmented. Brightness of the background image is adjusted. The target fourth image and adjusted background image are synthesized. The method can generate images with defective characters quickly.

5.

发明授权
Remotely verifying an identity of a person 有权

公开(公告)号：US12099585B2

公开(公告)日：2024-09-24

申请号：US17275361

申请日：2019-09-12

申请人： ISX IP Ltd

发明人： Nickolas John Karantzis

IPC分类号： G06F21/32 , G06F21/35 , G06F21/40 , G06V30/10 , G06V30/148

CPC分类号： G06F21/32 , G06F21/35 , G06F21/40 , G06V30/153 , G06F2221/2121 , G06V30/10

摘要： A computer-implemented method for remotely verifying an identity of a user is presented. The method comprises a first data processing device (120) receiving a live video stream (102) of the user from a second data processing device (140) via a video data connection (108) having a video bandwidth. Establishing a separate data connection (110) between the first (120) and second (140) data processing devices, the data connection (110) having a data bandwidth. The first data processing device (120) receiving, via the data connection (110), identifying data (104) captured from an identifying means from the second data processing device (140), or another data processing device. The first data processing device (120) determining first biometric data based on the identifying data (104) and comparing to second biometric data based on the live video stream (102). The first data processing device (120) then verifying an identity of the user based on a correspondence between the first biometric data and the second biometric data.

6.

发明公开
METHOD AND SYSTEM FOR OPTICAL CHARACTER RECOGNITION (OCR)-FREE INFORMATION EXTRACTION FROM IMAGE-BASED DOCUMENTS 审中-公开

公开(公告)号：US20240312233A1

公开(公告)日：2024-09-19

申请号：US18183523

申请日：2023-03-14

申请人： Dell Product L.P.

发明人： Atul Kumar , Sailendu Kumar Patra , Saurabh Jha

IPC分类号： G06V30/416 , G06F40/103 , G06F40/169 , G06F40/205 , G06V10/82 , G06V30/148 , G06V30/18

CPC分类号： G06V30/416 , G06F40/103 , G06F40/169 , G06F40/205 , G06V10/82 , G06V30/148 , G06V30/18 , G06N3/0464

摘要： A method for information extraction from an image-based asset includes: generating, by an encoder, at least one image patch from the asset; generating, by the encoder, an input embedding for the at least one image patch; generating, by the encoder, an output embedding based on the input embedding; inferring, by a decoder, a detail of the image-based asset based on the output embedding and a formatted asset, in which the output embedding is sent by the encoder, wherein the formatted asset is sent by a parser; generating, by the decoder, a decoder output based on the detail, in which the detail comprises at least a feature and a second feature; converting, by a converter, the decoder output into an output asset, in which the decoder output is sent by the decoder; and sending, by the converter, the output asset to a user using a graphical user interface (GUI).

7.

发明公开
METHOD AND SYSTEM FOR EXTRACTING INFORMATION FROM DOCUMENTS VIA EYE GAZE TRACKING 审中-公开

公开(公告)号：US20240304012A1

公开(公告)日：2024-09-12

申请号：US18118520

申请日：2023-03-07

申请人： JPMorgan Chase Bank, N.A.

发明人： Nancy THOMAS , Daniel BORRAJO

IPC分类号： G06V30/14 , G06F3/01 , G06V30/146 , G06V30/148 , G06V30/414

CPC分类号： G06V30/1456 , G06F3/013 , G06V30/1452 , G06V30/147 , G06V30/15 , G06V30/414

摘要： A method and system for using eye gaze tracking to extract information in textual form from documents is provided. The method includes: receiving an image that corresponds to a document; receiving, from an eye-tracking sensor configured to detect a sequence of eye-gaze positions on the document as a function of time, a sequence of measurements that correspond to a human reading of the document; determining, based on the received sequence of measurements, a region of the document that is being read by a human; and extracting the textual information that corresponds to the region.

8.

发明公开
CHARACTER RECOGNITION DEVICE AND CHARACTER RECOGNITION METHOD 审中-公开

公开(公告)号：US20240296686A1

公开(公告)日：2024-09-05

申请号：US18665063

申请日：2024-05-15

申请人： Panasonic Intellectual Property Management Co., Ltd.

发明人： Kentaro MATSUMOTO , Yuma SAITO , Daiki YAMAMOTO , Rei HASEGAWA , Masashi YAMAMOTO

IPC分类号： G06V30/148 , G06V20/62

CPC分类号： G06V30/153 , G06V20/625

摘要： A character recognition device includes a recognizer that recognizes at least one character string from an image including a trailer captured by an imaging device, an attribute determinator that determines an attribute of the character string recognized by the recognition unit, and a trailer ID estimator that estimate whether the character string is a trailer ID based on the attribute of the character string determined by the attribute determinator.

9.

发明授权
Systems and methods for recovering numerical readings of cumulative flow meters based on noisy image data 有权

公开(公告)号：US12080085B2

公开(公告)日：2024-09-03

申请号：US18097961

申请日：2023-01-17

申请人： Yuri P. Garbuzov

发明人： Yuri P. Garbuzov

IPC分类号： G06F7/24 , G06V30/148

CPC分类号： G06V30/153

摘要： A meter readout on a meter has digits including a first digit, a second digit, etc. A sequence of images of the meter is obtained. The images include images of the digits in the meter readout. Automated recognition of the digits in the images result in likelihood arrays indicating the likelihoods for the digit values for the digits imaged in the meter images. Short chains of digit values are identified and spliced together to form a series of single digit, two-digit, three-digits, etc. paths that are built up based on the likelihood arrays. Various criteria are used to discard most of the chains and thereby avoid the combinatorial explosion of possible paths and thereby produce reliable meter readings without consuming considerable computational resources.

10.

发明授权
Picture processing method, and task data processing method and apparatus 有权

公开(公告)号：US12079662B2

公开(公告)日：2024-09-03

申请号：US17010812

申请日：2020-09-02

申请人： TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

发明人： Yao Xin

IPC分类号： G06F9/50 , G06F9/48 , G06F18/21 , G06F18/2415 , G06V10/24 , G06V10/426 , G06V10/82 , G06V10/94 , G06V20/62 , G06V30/148 , G06V30/19 , G06N20/00

CPC分类号： G06F9/5027 , G06F9/4881 , G06F18/2163 , G06F18/2415 , G06V10/242 , G06V10/426 , G06V10/82 , G06V10/955 , G06V20/63 , G06V30/153 , G06V30/19153 , G06F2209/486 , G06N20/00

摘要： A picture processing method is provided for a computer device. The method includes obtaining a to-be-processed picture; extracting a text feature in the to-be-processed picture using a machine learning model; and determining text box proposals at any angles in the to-be-processed picture according to the text feature. Corresponding subtasks are performed by using processing units corresponding to substructures in the machine learning model, and at least part of the processing units comprise a field-programmable gate array (FPGA) unit. The method also includes performing rotation region of interest (RROI) pooling processing on each text box proposal, and projecting the text box proposal onto a feature graph of a fixed size, to obtain a text box feature graph corresponding to the text box proposal; and recognizing text in the text box feature graph, to obtain a text recognition result.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类