-
Publication No.: US12125125B2
Publication date: 2024-10-22
Application No.: US17344484
Filing date: 2021-06-10
Inventors: Yuchuan Gou, Qiancheng Wu, Minghao Li, Bo Gong, Mei Han
IPC classification: G06K9/62, G06F18/213, G06F40/126, G06F40/20, G06F40/279, G06F40/289, G06F40/30, G06N3/02, G06N3/045, G06N3/0475, G06T11/00, G06V10/70, G06V10/82, H04N21/81
CPC classification: G06T11/00, G06F18/213, G06F40/126, G06F40/20, G06F40/279, G06F40/289, G06F40/30, G06N3/02, G06N3/045, G06N3/0475, G06V10/70, G06V10/768, G06V10/82, H04N21/8153, G06T2207/20084
Abstract: A method and device for image generation are provided. The method includes: obtaining a text describing a content of an image to be generated; extracting, using a text encoder, a text feature vector from the text; determining a semantic mask as spatial constraints of the image to be generated; and automatically generating the image using a generative adversarial network (GAN) model according to the semantic mask and the text feature vector.
-
Publication No.: US12112131B2
Publication date: 2024-10-08
Application No.: US17588043
Filing date: 2022-01-28
Applicant: Salesforce, Inc.
IPC classification: G06F40/30, G06F3/08, G06F40/126, G06F40/279, G06N3/044
CPC classification: G06F40/279, G06F40/126, G06N3/044
Abstract: Embodiments described herein provide a system and method for extracting factual information. The system transforms a query into a natural language prompt in a format of a query subject and a queried relation. The system encodes, via an embedding layer of a pre-trained language model, the natural language prompt into a first embedding. The system encodes, via the adapter model, the first embedding into a second embedding based on a probability that the second embedding returns the factual information when the second embedding is fed to the first attention layer of the pre-trained language model. The system decodes, by the first attention layer of the pre-trained language model, the second embedding into a response to the query. The system extracts the factual information from the decoded response to the query.
-
Publication No.: US12106591B2
Publication date: 2024-10-01
Application No.: US17661647
Filing date: 2022-05-02
Applicant: Truist Bank
Inventor: Raphael Fitzgerald
IPC classification: G06V30/22, G06F40/126, G06T3/4046, G06V10/82, G06V30/14, G06V30/19, G06V30/42
CPC classification: G06V30/22, G06F40/126, G06T3/4046, G06V10/82, G06V30/1452, G06V30/19107, G06V30/19173, G06V30/42
Abstract: A system and method for identifying handwritten characters on an image using a classification model that employs a neural network. The system includes a computer having a processor and a memory device that stores data and executable code that, when executed, causes the processor to read and convert typed text on the image to machine encoded text to identify locations of the typed text on the image; identify a location on the image that includes handwritten text based on the location of predetermined typed text on the image; identify clusters of non-white pixels in the image at the location having the handwritten text; generate an individual and separate cluster image for each identified cluster; classify each cluster image using machine learning and at least one neural network to determine the likelihood that the cluster is a certain character; and determine what character each cluster image is based on the classification.
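The cluster-identification and cluster-image steps in this abstract can be illustrated with a minimal sketch. It assumes a grayscale image stored as a list of rows, treats a pixel value of 255 as white, and groups pixels by 4-connectivity; none of these choices comes from the patent, and the function names `find_clusters` and `crop_cluster` are hypothetical. The OCR and neural-network classification steps are not shown.

```python
from collections import deque


def find_clusters(image, white=255):
    """Group non-white pixels into 4-connected clusters.

    `image` is a list of rows of grayscale values (an assumption).
    Returns a list of clusters, each a sorted list of (row, col) pairs.
    """
    rows = len(image)
    cols = len(image[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    clusters = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] == white or seen[r][c]:
                continue
            # Breadth-first flood fill starting from an unvisited ink pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            cluster = []
            while queue:
                y, x = queue.popleft()
                cluster.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx] and image[ny][nx] != white):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            clusters.append(sorted(cluster))
    return clusters


def crop_cluster(image, cluster, white=255):
    """Build a separate image that contains only the given cluster."""
    ys = [y for y, _ in cluster]
    xs = [x for _, x in cluster]
    top, left = min(ys), min(xs)
    height = max(ys) - top + 1
    width = max(xs) - left + 1
    out = [[white] * width for _ in range(height)]
    for y, x in cluster:
        out[y - top][x - left] = image[y][x]
    return out
```

Each cropped cluster image would then be passed to whatever classifier is in use; the choice of connectivity affects whether touching strokes merge into one cluster.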
-
Publication No.: US12099811B2
Publication date: 2024-09-24
Application No.: US18088583
Filing date: 2022-12-25
IPC classification: G06F40/35, G06F16/242, G06F16/31, G06F16/332, G06F16/951, G06F16/955, G06F40/123, G06F40/126, G06F40/20, G06F40/205, G06F40/211, G06F40/216, G06F40/226, G06F40/242, G06F40/279, G06F40/289, G06F40/30, G06F40/44, G06F40/45, G06F40/47, G06F40/58, G06N3/0442, G06N3/0455, G06N3/0499, G06N3/08, G06N5/02, G06N5/04, G06N20/00, G06Q10/1053, G06Q30/0251, G06Q30/0601, G10L15/16, G10L15/18, G10L15/22, G10L15/26, G10L25/63, G16H10/60, H04L51/02, G06N3/091, G10L15/08
CPC classification: G06F40/35, G06F16/243, G06F16/322, G06F16/3329, G06F16/951, G06F40/123, G06F40/126, G06F40/20, G06F40/205, G06F40/211, G06F40/226, G06F40/242, G06F40/279, G06F40/30, G06F40/45, G06F40/47, G06F40/58, G06N3/0442, G06N3/0455, G06N3/0499, G06N3/08, G06N5/02, G06Q10/1053, G06Q30/0255, G06Q30/0257, G06Q30/0631, G10L15/16, G10L15/1815, G10L15/22, G10L15/26, G10L25/63, G16H10/60, H04L51/02, G06N3/091, G10L2015/088
Abstract: There is provided a computer implemented method for the automated analysis or use of data, to answer questions, comprising the steps of: (a) storing in a non-transitory storage medium a structured, machine-readable representation of data that conforms to a machine-readable language, in which the machine-readable language uses a shared syntax across factual statements, queries and reasoning, and uses nesting of nodes and passages, as an unambiguous syntax; where the data relates to parts of documents stored in a document store; (b) automatically processing the structured, machine-readable representation of data to answer questions, in which a user's query is automatically translated into the machine-readable language and a system responds to the user's query by utilising the machine-readable language translation of the query.
-
Publication No.: US20240284148A1
Publication date: 2024-08-22
Application No.: US18170943
Filing date: 2023-02-17
Applicant: T-Mobile USA, Inc.
IPC classification: H04W4/16, G06F40/126, G06F40/205
CPC classification: H04W4/16, G06F40/126, G06F40/205
Abstract: This document describes techniques, apparatuses, and systems for communication of text data through a voice call. Provided is a voice call between a first user equipment and a second user equipment, during which packets are communicated through a voice channel. Information is requested from the first user equipment. Text conveying the requested information is received at the first user equipment. The text is encoded into data indicative of the text and organized into packets that are marked to indicate the text data within. The packets are transmitted to the second user equipment, where the packets are determined to include text data, and the data indicative of the text is decoded using a decoder associated with text data. In doing so, text data can be communicated through the voice call.
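The encode, mark, and decode flow in this abstract can be sketched as follows. This is a toy illustration only: the one-byte marker values, the UTF-8 encoding, the payload size, and the function names are all assumptions made for the example, and nothing here reflects an actual voice-channel packet format.

```python
TEXT_MARKER = 0x01   # hypothetical marker: payload carries text data
VOICE_MARKER = 0x00  # hypothetical marker: payload carries voice data


def packetize_text(text, payload_size=8):
    """Encode text as UTF-8 and split it into marked packets."""
    data = text.encode("utf-8")
    return [
        bytes([TEXT_MARKER]) + data[i:i + payload_size]
        for i in range(0, len(data), payload_size)
    ]


def extract_text(packets):
    """Pick out the text-marked packets from a stream and decode them."""
    data = b"".join(p[1:] for p in packets if p and p[0] == TEXT_MARKER)
    return data.decode("utf-8")
```

On the receiving side, packets without the text marker would continue to the voice decoder; the sketch simply skips them. Payloads are joined before decoding, so a multi-byte character split across two packets is still recovered.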
-
Publication No.: US12067367B2
Publication date: 2024-08-20
Application No.: US18517720
Filing date: 2023-11-22
IPC classification: G06F17/00, G06F16/242, G06F16/31, G06F16/332, G06F16/951, G06F40/123, G06F40/126, G06F40/20, G06F40/205, G06F40/211, G06F40/226, G06F40/242, G06F40/279, G06F40/30, G06F40/35, G06F40/45, G06F40/47, G06F40/58, G06N3/0442, G06N3/0455, G06N3/0499, G06N3/08, G06N5/02, G06Q10/1053, G06Q30/0251, G06Q30/0601, G10L15/16, G10L15/18, G10L15/22, G10L15/26, G10L25/63, G16H10/60, H04L51/02, G06N3/091, G10L15/08
CPC classification: G06F40/35, G06F16/243, G06F16/322, G06F16/3329, G06F16/951, G06F40/123, G06F40/126, G06F40/20, G06F40/205, G06F40/211, G06F40/226, G06F40/242, G06F40/279, G06F40/30, G06F40/45, G06F40/47, G06F40/58, G06N3/0442, G06N3/0455, G06N3/0499, G06N3/08, G06N5/02, G06Q10/1053, G06Q30/0255, G06Q30/0631, G10L15/16, G10L15/1815, G10L15/22, G10L15/26, G10L25/63, G16H10/60, H04L51/02, G06N3/091, G10L2015/088
Abstract: A computer implemented method for the automated analysis or use of data is implemented by a voice assistant. The method comprises the steps of: (a) storing in a memory a structured, machine-readable representation of data that conforms to a machine-readable language (‘machine representation’); the machine representation including representations of user speech or text input to a human/machine interface; and (b) automatically processing the machine representations to analyse the user speech or text input.
-
Publication No.: US20240273290A1
Publication date: 2024-08-15
Application No.: US18168450
Filing date: 2023-02-13
Applicant: SAP SE
Inventors: Manuel Zeise, Marius Lehne
IPC classification: G06F40/279, G06F40/126, G06F40/263
CPC classification: G06F40/279, G06F40/126, G06F40/263, G06N3/088
Abstract: A method for multi-language document field extraction may include determining, based on a received document including a plurality of key fields and a plurality of value fields, a plurality of key-value pairs. The method also includes determining whether an encoding of a key field is within a threshold distance from a predetermined encoding of a predefined key field associated with a predefined field type. The method further includes assigning, based on determining the encoding of the key field is within the threshold distance, the predefined field type to the corresponding key-value pair. The method also includes performing a document processing operation based on each key-value pair and the predefined field type assigned to each key-value pair. Related systems and methods are provided.
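The threshold-distance test in this abstract can be sketched as below. The sketch assumes encodings are plain numeric vectors compared by Euclidean distance, and it picks the nearest predefined key when several fall within the threshold; the abstract specifies neither the distance metric nor a tie-breaking rule, and `assign_field_type` and the example vectors are hypothetical.

```python
import math


def assign_field_type(key_encoding, predefined, threshold):
    """Return the predefined field type whose encoding is closest to
    `key_encoding`, provided it lies within `threshold`; otherwise None.

    `predefined` maps a field-type name to its reference encoding.
    Euclidean distance is an assumption made for this sketch.
    """
    best_type, best_dist = None, threshold
    for field_type, reference in predefined.items():
        dist = math.dist(key_encoding, reference)
        if dist <= best_dist:
            best_type, best_dist = field_type, dist
    return best_type


# Hypothetical two-dimensional encodings, for illustration only.
predefined = {"invoice_number": [1.0, 0.0], "total": [0.0, 1.0]}
print(assign_field_type([0.9, 0.1], predefined, threshold=0.5))
```

In a multi-language setting, the key encodings would come from a model that maps equivalent labels in different languages to nearby vectors, so a single threshold check can cover all of them.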
-
Publication No.: US20240265198A1
Publication date: 2024-08-08
Application No.: US18597135
Filing date: 2024-03-06
Inventor: Xiaoshuai CHEN
IPC classification: G06F40/169, G06F40/126, G06F40/194, G06F40/279
CPC classification: G06F40/169, G06F40/126, G06F40/194, G06F40/279
Abstract: A reply content processing method including obtaining to-be-replied interactive content for media content, performing encoding processing on description content of the media content and the to-be-replied interactive content to obtain a vectorized representation of each word in the description content and the to-be-replied interactive content, and performing style recognition based on the vectorized representation of each word in the description content and the to-be-replied interactive content to obtain a first style category set, performing style recognition based on release party information of the to-be-replied interactive content to obtain a second style category set, determining a third style category set to which the to-be-replied interactive content belongs, determining a style category vector corresponding to each style category in the third style category set, and performing reply word prediction based on the description content, the to-be-replied interactive content, and the style category vector to generate reply content corresponding to each style category.
-
Publication No.: US12026473B2
Publication date: 2024-07-02
Application No.: US18517720
Filing date: 2023-11-22
IPC classification: G06F17/00, G06F16/242, G06F16/31, G06F16/332, G06F16/951, G06F40/123, G06F40/126, G06F40/20, G06F40/205, G06F40/211, G06F40/226, G06F40/242, G06F40/279, G06F40/30, G06F40/35, G06F40/45, G06F40/47, G06F40/58, G06N3/0442, G06N3/0455, G06N3/0499, G06N3/08, G06N5/02, G06Q10/1053, G06Q30/0251, G06Q30/0601, G10L15/16, G10L15/18, G10L15/22, G10L15/26, G10L25/63, G16H10/60, H04L51/02, G06N3/091, G10L15/08
CPC classification: G06F40/35, G06F16/243, G06F16/322, G06F16/3329, G06F16/951, G06F40/123, G06F40/126, G06F40/20, G06F40/205, G06F40/211, G06F40/226, G06F40/242, G06F40/279, G06F40/30, G06F40/45, G06F40/47, G06F40/58, G06N3/0442, G06N3/0455, G06N3/0499, G06N3/08, G06N5/02, G06Q10/1053, G06Q30/0255, G06Q30/0631, G10L15/16, G10L15/1815, G10L15/22, G10L15/26, G10L25/63, G16H10/60, H04L51/02, G06N3/091, G10L2015/088
Abstract: A computer implemented method for the automated analysis or use of data is implemented by a voice assistant. The method comprises the steps of: (a) storing in a memory a structured, machine-readable representation of data that conforms to a machine-readable language (‘machine representation’); the machine representation including representations of user speech or text input to a human/machine interface; and (b) automatically processing the machine representations to analyse the user speech or text input.
-
Publication No.: US20240211678A1
Publication date: 2024-06-27
Application No.: US18145412
Filing date: 2022-12-22
Inventors: Timothy Andrew LARGE, Se Hoon LIM
IPC classification: G06F40/126, G06F3/01, G06F21/84
CPC classification: G06F40/126, G06F3/013, G06F21/84
Abstract: A computing system for generating and displaying an encoded document for peripheral privacy is provided. The computing system includes a processor executing a program using portions of memory to determine an initial document, generate an encoded document based on the initial document by modifying letters in a text portion of the initial document, and output the encoded document. In another example, a computing system for displaying an encoded document for peripheral privacy is provided. The computing system includes a processor executing a program using portions of memory to receive an initial document and an encoded document, determine a gaze location of a user using a camera, generate a mask based on the determined gaze location, and display an alpha-blended document by alpha-blending the initial document and the encoded document using the mask.
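The mask-and-blend step in this abstract can be sketched as follows. The sketch assumes both documents are rendered as same-sized grayscale images (lists of rows), and it uses a circular mask that is fully opaque within `radius` of the gaze point and fades linearly to zero over `falloff` pixels. The mask shape, the parameter names, and both function names are assumptions; the abstract does not describe how the mask is shaped, and gaze tracking itself is not shown.

```python
import math


def gaze_mask(width, height, gaze, radius, falloff):
    """Build a per-pixel alpha mask centred on the gaze location.

    Alpha is 1.0 within `radius` of `gaze` (x, y) and falls linearly
    to 0.0 over the next `falloff` pixels (falloff must be > 0).
    """
    gx, gy = gaze
    mask = []
    for y in range(height):
        row = []
        for x in range(width):
            dist = math.hypot(x - gx, y - gy)
            alpha = 1.0 - (dist - radius) / falloff
            row.append(min(1.0, max(0.0, alpha)))
        mask.append(row)
    return mask


def alpha_blend(initial, encoded, mask):
    """Blend per pixel: alpha * initial + (1 - alpha) * encoded."""
    return [
        [m * a + (1.0 - m) * b for a, b, m in zip(row_a, row_b, row_m)]
        for row_a, row_b, row_m in zip(initial, encoded, mask)
    ]
```

With this blend, the readable initial document shows through where the user is looking, while the encoded document is displayed elsewhere on the screen.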