-
公开(公告)号:US20240362941A1
公开(公告)日:2024-10-31
申请号:US18140143
申请日:2023-04-27
申请人: Adobe Inc.
发明人: Silky Singh , Surgan Jandial , Shripad Vilasrao Deshmukh , Milan Aggarwal , Mausoom Sarkar , Balaji Krishnamurthy , Arneh Jain , Abhinav Java
IPC分类号: G06V30/262 , G06V30/14 , G06V30/19 , G06V30/414
CPC分类号: G06V30/274 , G06V30/1444 , G06V30/19147 , G06V30/414
摘要: A corrective noise system receives an electronic version of a fillable form generated by a segmentation network and receives a correction to a segmentation error in the electronic version of the fillable form. The corrective noise system is trained to generate noise that represents the correction and superimpose the noise on the fillable form. The corrective noise system is further trained to identify regions in a corpus of forms that are semantically similar to a region that was subject to the correction. The generated noise is propagated to the semantically similar regions in the corpus of forms and the noisy corpus of forms is provided as input to the segmentation network. The noise causes the segmentation network to accurately identify fillable regions in the corpus of forms and output a segmented version of the corpus of forms having improved fidelity without retraining or otherwise modifying the segmentation network.
-
公开(公告)号:US20240362937A1
公开(公告)日:2024-10-31
申请号:US18766599
申请日:2024-07-08
发明人: Ankit Malviya , Shubhanshu Kumar Singh , Vishu Mittal , Anish Goswami , Chaithanya Manda , Saurabh Khanna , Sarika Pal
IPC分类号: G06V30/146 , G06V30/16 , G06V30/18 , G06V30/19
CPC分类号: G06V30/147 , G06V30/16 , G06V30/18 , G06V30/19147
摘要: A text recognition system causes a trained region encoder to determine a region of interest of an image file. The system modifies a first image associated with the first region of interest (e.g., parsed out from the first region) to generate a data augmentation entity that includes a modified image. Using a trained instance encoder, the system generates a first set of visual instances corresponding to the first region of interest image and a second set of visual instances corresponding to the data augmentation entity. The system generates the corresponding first and second sequences. By executing a self-supervised contrastive loss function on the first and second sequences, the system automatically updates a continual knowledge distillation model of the trained region encoder. The system provides the first sequence to an instance decoder to generate output text in response to the prompt.
-
3.
公开(公告)号:US12131469B2
公开(公告)日:2024-10-29
申请号:US18342032
申请日:2023-06-27
申请人: PAIGE.AI, Inc.
IPC分类号: G06T7/00 , G06F18/23213 , G06N20/00 , G06T7/11 , G06V10/762 , G06V30/19 , G16B40/00 , G16H50/20
CPC分类号: G06T7/0012 , G06F18/23213 , G06N20/00 , G06T7/11 , G06V10/763 , G06V30/19107 , G16B40/00 , G16H50/20 , G06T2207/20081 , G06T2207/30024 , G06T2207/30096
摘要: Systems and methods are disclosed for grouping cells in a slide image that share a similar target, comprising receiving a digital pathology image corresponding to a tissue specimen, applying a trained machine learning system to the digital pathology image, the trained machine learning system being trained to predict at least one target difference across the tissue specimen, and determining, using the trained machine learning system, one or more predicted clusters, each of the predicted clusters corresponding to a subportion of the tissue specimen associated with a target.
-
公开(公告)号:US20240355136A1
公开(公告)日:2024-10-24
申请号:US18239778
申请日:2023-08-30
IPC分类号: G06V30/414 , G06F40/169 , G06F40/186 , G06V10/94 , G06V20/62 , G06V30/19
CPC分类号: G06V30/414 , G06F40/169 , G06F40/186 , G06V10/945 , G06V20/62 , G06V30/19013 , G06V30/19147 , G06V30/1916
摘要: A method and system for relevant data extraction from a document is disclosed. The method includes determining first positional information corresponding to a key from a plurality of predefined keys in the document image based on a deep learning model. Further, second positional information corresponding to the key is determined based on OCR of the document image and an NLP model. Final positional information is determined based on the first positional information and the second positional information, in case a difference between the first positional information and the second positional information is minimal. Relevant data is extracted for the key in the OCR document image based on the final positional information.
-
公开(公告)号:US12125318B1
公开(公告)日:2024-10-22
申请号:US18635241
申请日:2024-04-15
IPC分类号: G06V10/00 , G06V30/19 , G06V30/226 , G06V40/30
CPC分类号: G06V40/33 , G06V30/19107 , G06V30/226
摘要: An apparatus for detecting fraudulent signature inputs is disclosed. The apparatus includes at least a processor and a memory. The memory instructs the processor to receive a plurality of image data from a user. The memory instructs the processor to identify a plurality of signature elements as a function of the plurality of signature inputs. The memory instructs the processor to determine a plurality of signature scores as a function of the plurality of signature elements, wherein the plurality of signature scores comprises a first set of signature scores and a second set of signature scores. The memory instructs the processor to generate an accuracy threshold as a function of the first set of signature scores. The memory instructs the processor to determine one or more fraudulent signature inputs from the plurality of signature inputs as a function of a comparison of signature score to an accuracy threshold.
-
公开(公告)号:US12125265B2
公开(公告)日:2024-10-22
申请号:US17809798
申请日:2022-06-29
申请人: Google LLC
IPC分类号: G06V10/774 , G06F18/21 , G06F18/2115 , G06F18/214 , G06N3/006 , G06N3/02 , G06N3/045 , G06N3/084 , G06N3/088 , G06N5/01 , G06N5/045 , G06N7/01 , G06N20/20 , G06V30/19
CPC分类号: G06V10/774 , G06F18/2115 , G06F18/2148 , G06F18/2193 , G06N3/006 , G06N3/02 , G06N3/045 , G06N3/084 , G06N3/088 , G06N5/045 , G06N7/01 , G06V30/19147 , G06N5/01 , G06N20/20
摘要: A method for training a locally interpretable model includes obtaining a set of training samples and training a black-box model using the set of training samples. The method also includes generating, using the trained black-box model and the set of training samples, a set of auxiliary training samples and training a baseline interpretable model using the set of auxiliary training samples. The method also includes training, using the set of auxiliary training samples and baseline interpretable model, an instance-wise weight estimator model. For each auxiliary training sample in the set of auxiliary training samples, the method also includes determining, using the trained instance-wise weight estimator model, a selection probability for the auxiliary training sample. The method also includes selecting, based on the selection probabilities, a subset of auxiliary training samples and training the locally interpretable model using the subset of auxiliary training samples.
-
7.
公开(公告)号:US12118814B2
公开(公告)日:2024-10-15
申请号:US17581133
申请日:2022-01-21
申请人: MEDIAMACROS, INC.
发明人: Charles Neal
IPC分类号: G06V30/00 , G06F3/04842 , G06F40/242 , G06F40/279 , G06V30/19 , G06V30/41 , G10L21/10
CPC分类号: G06V30/41 , G06F3/04842 , G06F40/242 , G06F40/279 , G06V30/19013 , G10L21/10
摘要: An interactive system for identifying and correcting inconsistencies between a written work, an audio reading of the written work, and a resulting transcription of the audio reading. The system stores on a computing device connected to a network a manuscript, an audio version of the manuscript, and a transcription of the audio version of the manuscript. Via a transcription engine, difference and comparison engine, and a user device having a visual interface, a user is visually presented via the display the inconsistencies between the transcript and the manuscript, the user can amend the manuscript and/or the transcript to reconcile the works, the user can listen to a corresponding section of the corresponding audio file, and the user can interact with collaborators in a context aware interface. Upon the user processing, the manuscript may be read and listened to simultaneously as an enhanced e-book through a separate software tool.
-
公开(公告)号:US12112532B2
公开(公告)日:2024-10-08
申请号:US18313631
申请日:2023-05-08
发明人: Shane Paul Springer
CPC分类号: G06V10/945 , G06T7/13 , G06V20/41 , G06V30/19 , H04L12/1813 , H04N5/77 , G06T2200/24 , G06T2207/10016
摘要: Selections of content shared from a remote device during a video conference are copied to a destination of a computing device connected to the video conference live or at which a recording of the video conference is viewed. The content shared from the remote device during the video conference is output at a display of the computing device. A portion of the content is selected according to an instruction received from a user of the computing device while output at the display of the computing device to copy to a destination associated with software running at the computing device. The portion of the content is identified using a machine vision process performed against the content while output at the display of the computing device. The portion of the content is then copied to the destination.
-
公开(公告)号:US20240331432A1
公开(公告)日:2024-10-03
申请号:US18741370
申请日:2024-06-12
申请人: 42Maru Inc.
发明人: Dong Hwan KIM , You Kyung KWON , So Young KO , Sook Jin ROE , Ki Beom KWON , Da Hea MOON
IPC分类号: G06V30/413 , G06F16/953 , G06F40/20 , G06V30/12 , G06V30/19 , G06V30/412 , G06V30/414 , G06V30/416
CPC分类号: G06V30/413 , G06F16/953 , G06F40/20 , G06V30/12 , G06V30/19093 , G06V30/412 , G06V30/414 , G06V30/416
摘要: Provided are method and apparatus for data structuring of text. The apparatus for data structuring of text includes a processor; and a memory storing instructions executable by the processor, wherein the processor is configured to execute the instructions to: extract text and location information of the text from an image, set text units for the extracted text, assigning a first tag and a second tag to at least one of the text units, connect text units with related tags among the text units allocated the first tag and the second tag, label the connected text units as first text, second text, and third text respectively corresponding to an item name, an item value, and others based on a natural language processing model, and structure the extracted text by mapping the second text to the first text.
-
公开(公告)号:US20240330351A1
公开(公告)日:2024-10-03
申请号:US18190686
申请日:2023-03-27
申请人: Adobe Inc.
发明人: Abhinav Java , Surgan Jandial , Shripad Vilasrao Deshmukh , Milan Aggarwal , Mausoom Sarkar , Balaji Krishnamurthy , Arneh Jain
IPC分类号: G06F16/383 , G06F16/332 , G06V30/19 , G06V30/412
CPC分类号: G06F16/383 , G06F16/332 , G06V30/19147 , G06V30/412
摘要: Form structure similarity detection techniques are described. A content processing system, for instance, receives a query snippet that depicts a query form structure. The content processing system generates a query layout string that includes semantic indicators to represent the query form structure and generates candidate layout strings that represent form structures from a target document. The content processing system calculates similarity scores between the query layout string and the candidate layout strings. Based on the similarity scores, the content processing system generates a target snippet for display that depicts a form structure that is structurally similar to the query form structure. The content processing system is further operable to generate a training dataset that includes image pairs of snippets depicting form structures that are structurally similar. The content processing system utilizes the training dataset to train a machine learning model to perform form structure similarity matching.
-
-
-
-
-
-
-
-
-