专利检索 ipc:"G06V30/19" 第 1 页

1.

发明公开
PERSONALIZED FORM ERROR CORRECTION PROPAGATION 审中-公开

公开(公告)号：US20240362941A1

公开(公告)日：2024-10-31

申请号：US18140143

申请日：2023-04-27

申请人： Adobe Inc.

发明人： Silky Singh , Surgan Jandial , Shripad Vilasrao Deshmukh , Milan Aggarwal , Mausoom Sarkar , Balaji Krishnamurthy , Arneh Jain , Abhinav Java

IPC分类号： G06V30/262 , G06V30/14 , G06V30/19 , G06V30/414

CPC分类号： G06V30/274 , G06V30/1444 , G06V30/19147 , G06V30/414

摘要： A corrective noise system receives an electronic version of a fillable form generated by a segmentation network and receives a correction to a segmentation error in the electronic version of the fillable form. The corrective noise system is trained to generate noise that represents the correction and superimpose the noise on the fillable form. The corrective noise system is further trained to identify regions in a corpus of forms that are semantically similar to a region that was subject to the correction. The generated noise is propagated to the semantically similar regions in the corpus of forms and the noisy corpus of forms is provided as input to the segmentation network. The noise causes the segmentation network to accurately identify fillable regions in the corpus of forms and output a segmented version of the corpus of forms having improved fidelity without retraining or otherwise modifying the segmentation network.

2.

发明公开
CONTINUAL TEXT RECOGNITION USING PROMPT-GUIDED KNOWLEDGE DISTILLATION 审中-公开

公开(公告)号：US20240362937A1

公开(公告)日：2024-10-31

申请号：US18766599

申请日：2024-07-08

申请人： ExlService Holdings, Inc.

发明人： Ankit Malviya , Shubhanshu Kumar Singh , Vishu Mittal , Anish Goswami , Chaithanya Manda , Saurabh Khanna , Sarika Pal

IPC分类号： G06V30/146 , G06V30/16 , G06V30/18 , G06V30/19

CPC分类号： G06V30/147 , G06V30/16 , G06V30/18 , G06V30/19147

摘要： A text recognition system causes a trained region encoder to determine a region of interest of an image file. The system modifies a first image associated with the first region of interest (e.g., parsed out from the first region) to generate a data augmentation entity that includes a modified image. Using a trained instance encoder, the system generates a first set of visual instances corresponding to the first region of interest image and a second set of visual instances corresponding to the data augmentation entity. The system generates the corresponding first and second sequences. By executing a self-supervised contrastive loss function on the first and second sequences, the system automatically updates a continual knowledge distillation model of the trained region encoder. The system provides the first sequence to an instance decoder to generate output text in response to the prompt.

3.

发明授权
Systems and methods to process electronic images to provide image-based cell group targeting 有权

公开(公告)号：US12131469B2

公开(公告)日：2024-10-29

申请号：US18342032

申请日：2023-06-27

申请人： PAIGE.AI, Inc.

发明人： Rodrigo Ceballos Lentini , Christopher Kanan , Belma Dogdas

IPC分类号： G06T7/00 , G06F18/23213 , G06N20/00 , G06T7/11 , G06V10/762 , G06V30/19 , G16B40/00 , G16H50/20

CPC分类号： G06T7/0012 , G06F18/23213 , G06N20/00 , G06T7/11 , G06V10/763 , G06V30/19107 , G16B40/00 , G16H50/20 , G06T2207/20081 , G06T2207/30024 , G06T2207/30096

摘要： Systems and methods are disclosed for grouping cells in a slide image that share a similar target, comprising receiving a digital pathology image corresponding to a tissue specimen, applying a trained machine learning system to the digital pathology image, the trained machine learning system being trained to predict at least one target difference across the tissue specimen, and determining, using the trained machine learning system, one or more predicted clusters, each of the predicted clusters corresponding to a subportion of the tissue specimen associated with a target.

4.

发明公开
METHOD AND SYSTEM FOR RELEVANT DATA EXTRACTION FROM A DOCUMENT 审中-公开

公开(公告)号：US20240355136A1

公开(公告)日：2024-10-24

申请号：US18239778

申请日：2023-08-30

申请人： L&T TECHNOLOGY SERVICES LIMITED

发明人： NIRMAL RAMESH RAYULU VANAPALLI VENKATA , MADHUSUDAN SINGH , TAMILARASAN ELLAPPAN

IPC分类号： G06V30/414 , G06F40/169 , G06F40/186 , G06V10/94 , G06V20/62 , G06V30/19

CPC分类号： G06V30/414 , G06F40/169 , G06F40/186 , G06V10/945 , G06V20/62 , G06V30/19013 , G06V30/19147 , G06V30/1916

摘要： A method and system for relevant data extraction from a document is disclosed. The method includes determining first positional information corresponding to a key from a plurality of predefined keys in the document image based on a deep learning model. Further, second positional information corresponding to the key is determined based on OCR of the document image and an NLP model. Final positional information is determined based on the first positional information and the second positional information, in case a difference between the first positional information and the second positional information is minimal. Relevant data is extracted for the key in the OCR document image based on the final positional information.

5.

发明授权
Apparatus and a method for detecting fraudulent signature inputs 有权

公开(公告)号：US12125318B1

公开(公告)日：2024-10-22

申请号：US18635241

申请日：2024-04-15

申请人： Quick Quack Car Wash Holdings, LLC

发明人： Josh David Schumacher , Betsy Danielle Urschel

IPC分类号： G06V10/00 , G06V30/19 , G06V30/226 , G06V40/30

CPC分类号： G06V40/33 , G06V30/19107 , G06V30/226

摘要： An apparatus for detecting fraudulent signature inputs is disclosed. The apparatus includes at least a processor and a memory. The memory instructs the processor to receive a plurality of image data from a user. The memory instructs the processor to identify a plurality of signature elements as a function of the plurality of signature inputs. The memory instructs the processor to determine a plurality of signature scores as a function of the plurality of signature elements, wherein the plurality of signature scores comprises a first set of signature scores and a second set of signature scores. The memory instructs the processor to generate an accuracy threshold as a function of the first set of signature scores. The memory instructs the processor to determine one or more fraudulent signature inputs from the plurality of signature inputs as a function of a comparison of signature score to an accuracy threshold.

6.

发明授权
Reinforcement learning based locally interpretable models 有权

公开(公告)号：US12125265B2

公开(公告)日：2024-10-22

申请号：US17809798

申请日：2022-06-29

申请人： Google LLC

发明人： Sercan Omer Arik , Jinsung Yoon , Tomas Jon Pfister

IPC分类号： G06V10/774 , G06F18/21 , G06F18/2115 , G06F18/214 , G06N3/006 , G06N3/02 , G06N3/045 , G06N3/084 , G06N3/088 , G06N5/01 , G06N5/045 , G06N7/01 , G06N20/20 , G06V30/19

CPC分类号： G06V10/774 , G06F18/2115 , G06F18/2148 , G06F18/2193 , G06N3/006 , G06N3/02 , G06N3/045 , G06N3/084 , G06N3/088 , G06N5/045 , G06N7/01 , G06V30/19147 , G06N5/01 , G06N20/20

摘要： A method for training a locally interpretable model includes obtaining a set of training samples and training a black-box model using the set of training samples. The method also includes generating, using the trained black-box model and the set of training samples, a set of auxiliary training samples and training a baseline interpretable model using the set of auxiliary training samples. The method also includes training, using the set of auxiliary training samples and baseline interpretable model, an instance-wise weight estimator model. For each auxiliary training sample in the set of auxiliary training samples, the method also includes determining, using the trained instance-wise weight estimator model, a selection probability for the auxiliary training sample. The method also includes selecting, based on the selection probabilities, a subset of auxiliary training samples and training the locally interpretable model using the subset of auxiliary training samples.

7.

发明授权
System and method for facilitating the synchronization of written works with accompanying audio 有权

公开(公告)号：US12118814B2

公开(公告)日：2024-10-15

申请号：US17581133

申请日：2022-01-21

申请人： MEDIAMACROS, INC.

发明人： Charles Neal

IPC分类号： G06V30/00 , G06F3/04842 , G06F40/242 , G06F40/279 , G06V30/19 , G06V30/41 , G10L21/10

CPC分类号： G06V30/41 , G06F3/04842 , G06F40/242 , G06F40/279 , G06V30/19013 , G10L21/10

摘要： An interactive system for identifying and correcting inconsistencies between a written work, an audio reading of the written work, and a resulting transcription of the audio reading. The system stores on a computing device connected to a network a manuscript, an audio version of the manuscript, and a transcription of the audio version of the manuscript. Via a transcription engine, difference and comparison engine, and a user device having a visual interface, a user is visually presented via the display the inconsistencies between the transcript and the manuscript, the user can amend the manuscript and/or the transcript to reconcile the works, the user can listen to a corresponding section of the corresponding audio file, and the user can interact with collaborators in a context aware interface. Upon the user processing, the manuscript may be read and listened to simultaneously as an enhanced e-book through a separate software tool.

8.

发明授权
Copying shared content using machine vision 有权

公开(公告)号：US12112532B2

公开(公告)日：2024-10-08

申请号：US18313631

申请日：2023-05-08

申请人： Zoom Video Communications, Inc.

发明人： Shane Paul Springer

IPC分类号： G06V10/94 , G06T7/13 , G06V20/40 , G06V30/19 , H04L12/18 , H04N5/77

CPC分类号： G06V10/945 , G06T7/13 , G06V20/41 , G06V30/19 , H04L12/1813 , H04N5/77 , G06T2200/24 , G06T2207/10016

摘要： Selections of content shared from a remote device during a video conference are copied to a destination of a computing device connected to the video conference live or at which a recording of the video conference is viewed. The content shared from the remote device during the video conference is output at a display of the computing device. A portion of the content is selected according to an instruction received from a user of the computing device while output at the display of the computing device to copy to a destination associated with software running at the computing device. The portion of the content is identified using a machine vision process performed against the content while output at the display of the computing device. The portion of the content is then copied to the destination.

9.

发明公开
METHOD AND APPARATUS FOR DATA STRUCTURING OF TEXT 审中-公开

公开(公告)号：US20240331432A1

公开(公告)日：2024-10-03

申请号：US18741370

申请日：2024-06-12

申请人： 42Maru Inc.

发明人： Dong Hwan KIM , You Kyung KWON , So Young KO , Sook Jin ROE , Ki Beom KWON , Da Hea MOON

IPC分类号： G06V30/413 , G06F16/953 , G06F40/20 , G06V30/12 , G06V30/19 , G06V30/412 , G06V30/414 , G06V30/416

CPC分类号： G06V30/413 , G06F16/953 , G06F40/20 , G06V30/12 , G06V30/19093 , G06V30/412 , G06V30/414 , G06V30/416

摘要： Provided are method and apparatus for data structuring of text. The apparatus for data structuring of text includes a processor; and a memory storing instructions executable by the processor, wherein the processor is configured to execute the instructions to: extract text and location information of the text from an image, set text units for the extracted text, assigning a first tag and a second tag to at least one of the text units, connect text units with related tags among the text units allocated the first tag and the second tag, label the connected text units as first text, second text, and third text respectively corresponding to an item name, an item value, and others based on a natural language processing model, and structure the extracted text by mapping the second text to the first text.

10.

发明公开
FORM STRUCTURE SIMILARITY DETECTION 审中-公开

公开(公告)号：US20240330351A1

公开(公告)日：2024-10-03

申请号：US18190686

申请日：2023-03-27

申请人： Adobe Inc.

发明人： Abhinav Java , Surgan Jandial , Shripad Vilasrao Deshmukh , Milan Aggarwal , Mausoom Sarkar , Balaji Krishnamurthy , Arneh Jain

IPC分类号： G06F16/383 , G06F16/332 , G06V30/19 , G06V30/412

CPC分类号： G06F16/383 , G06F16/332 , G06V30/19147 , G06V30/412

摘要： Form structure similarity detection techniques are described. A content processing system, for instance, receives a query snippet that depicts a query form structure. The content processing system generates a query layout string that includes semantic indicators to represent the query form structure and generates candidate layout strings that represent form structures from a target document. The content processing system calculates similarity scores between the query layout string and the candidate layout strings. Based on the similarity scores, the content processing system generates a target snippet for display that depicts a form structure that is structurally similar to the query form structure. The content processing system is further operable to generate a training dataset that includes image pairs of snippets depicting form structures that are structurally similar. The content processing system utilizes the training dataset to train a machine learning model to perform form structure similarity matching.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类