-
公开(公告)号:US11934788B2
公开(公告)日:2024-03-19
申请号:US17350752
申请日:2021-06-17
IPC分类号: G06F40/30 , G06F40/126 , G06F40/279 , G06F40/151 , G10L15/16
CPC分类号: G06F40/30 , G06F40/126 , G06F40/279 , G06F40/151 , G10L15/16
摘要: Embodiments of this disclosure include an encoding method and apparatus. The encoding may include obtaining a target paragraph and a context sentence of the target paragraph, and inputting the target paragraph and the context sentence into a memory encoding model. The encoding may further include obtaining an original vector set and a memory vector set in the input layer and obtaining a first target sentence matrix of the original vector set in the memory layer according to the original vector set and the memory vector set. The encoding may further include obtaining a paragraph vector of the target paragraph in the output layer according to the first target sentence matrix and performing processing based on the paragraph vector.
-
公开(公告)号:US20240086637A1
公开(公告)日:2024-03-14
申请号:US17940525
申请日:2022-09-08
申请人: Tencent America LLC
IPC分类号: G06F40/295 , G06F40/151 , G06N5/02
CPC分类号: G06F40/295 , G06F40/151 , G06N5/025
摘要: Methods and devices to efficiently normalize text by processing inputted text based on a text normalization model that includes processing the input text in a first stage including a statistical model as a first output, processing the first output in a second stage including a rule based model as a normalized text, and outputting the normalized text.
-
公开(公告)号:US20240086619A1
公开(公告)日:2024-03-14
申请号:US18384322
申请日:2023-10-26
发明人: Pengcheng HE , Xiaodong Liu , Jianfeng Gao , Weizhu Chen
IPC分类号: G06F40/126 , G06F40/151 , G06N3/08
CPC分类号: G06F40/126 , G06F40/151 , G06N3/08
摘要: Generally discussed herein are devices, systems, and methods for generating an embedding that is both local string dependent and global string dependent. The generated embedding can improve machine learning (ML) model performance. A method can include converting a string of words to a series of tokens, generating a local string-dependent embedding of each token of the series of tokens, generating a global string-dependent embedding of each token of the series of tokens, combining the local string dependent embedding the global string dependent embedding to generate an n-gram induced embedding of each token of the series of tokens, obtaining a masked language model (MLM) previously trained to generate a masked word prediction, and executing the MLM based on the n-gram induced embedding of each token to generate the masked word prediction.
-
公开(公告)号:US11886800B1
公开(公告)日:2024-01-30
申请号:US18104258
申请日:2023-01-31
申请人: INTUIT INC.
发明人: Jing Wang , John Matthew Mastin, Jr. , Sowmyanka Andalam , Piyasa Molly Paul , Dallas Leigh Taylor , Andres Castro
IPC分类号: G06F40/151 , G06F40/166 , G06F40/253 , G06F40/284
CPC分类号: G06F40/151 , G06F40/166 , G06F40/253 , G06F40/284
摘要: A method includes detecting, in a written electronic communication, an input sentence satisfying a readability metric threshold, and processing, by a sentence transformer model responsive to the input sentence satisfying the readability metric threshold, the input sentence to output a suggested set of sentences. The method further includes evaluating the first suggested set of sentences along a set of acceptability criteria, and determining, based on the evaluating, that the set of acceptability criteria is satisfied. The method further includes modifying, based on determining that the set of acceptability criteria is satisfied, the written electronic communication with the suggested set of sentences to obtain a modified written electronic communication, and storing the modified written electronic communication.
-
公开(公告)号:US11836438B2
公开(公告)日:2023-12-05
申请号:US17229140
申请日:2021-04-13
发明人: Pengcheng He , Xiaodong Liu , Jianfeng Gao , Weizhu Chen
IPC分类号: G06F40/126 , G06N3/08 , G06F40/151
CPC分类号: G06F40/126 , G06F40/151 , G06N3/08
摘要: Generally discussed herein are devices, systems, and methods for generating an embedding that is both local string dependent and global string dependent. The generated embedding can improve machine learning (ML) model performance. A method can include converting a string of words to a series of tokens, generating a local string-dependent embedding of each token of the series of tokens, generating a global string-dependent embedding of each token of the series of tokens, combining the local string dependent embedding the global string dependent embedding to generate an n-gram induced embedding of each token of the series of tokens, obtaining a masked language model (MLM) previously trained to generate a masked word prediction, and executing the MLM based on the n-based induced embedding of each token to generate the masked word prediction.
-
公开(公告)号:US20230367954A1
公开(公告)日:2023-11-16
申请号:US18347949
申请日:2023-07-06
申请人: Open Text SA ULC
发明人: Gregory R. Petti
IPC分类号: G06F40/143 , G06F8/40 , G06F40/151
CPC分类号: G06F40/143 , G06F8/40 , G06F40/151 , G06F9/44
摘要: A template built by a user may be converted by a Server Script Generation Engine (SSGE) into script code. In converting, the SSGE may load and parse a framework file containing static script syntax to locate insertion points, each associated with an iteration number, and may iteratively parse the template, utilizing the iteration number to resolve, in order, tags and sub-tags contained in the template. If a tag is set to respond to the iteration number, a function of the tag is invoked to process any related sub-tags and return a script associated therewith at the appropriate insertion point. The framework file (with the appropriate script code inserted) is compiled and stored in a compiled script object which can be run multiple times to perform all of the output functions expected by the user in lieu of the need to reconvert the template.
-
公开(公告)号:US20230359837A1
公开(公告)日:2023-11-09
申请号:US17735384
申请日:2022-05-03
申请人: Spotify AB
发明人: Edgar Tanaka , Ann Clifton
IPC分类号: G06F40/58 , G06F40/263 , G06F40/284 , G06F40/151 , G06F40/166 , G06F40/197 , G06N20/10
CPC分类号: G06F40/58 , G06F40/263 , G06F40/284 , G06F40/151 , G06F40/166 , G06F40/197 , G06N20/10
摘要: A full attention mechanism of a multilingual transformer model is converted into a Longformer attention mechanism to generate a Longformer multilingual transformer model. The Longformer multilingual transformer model is finetuned to perform a summarization task based on episode-description:episode-transcript pairs, thereby generating a finetuned Longformer multilingual transformer model. The Longformer multilingual transformer model also can further be finetuned to perform a summarization task based on article-summary:full-original-article pairs. A summary of a query episode transcript can be generated using the single-finetuned Longformer multilingual transformer model and/or the double-finetuned Longformer multilingual transformer model. The multilingual transformer-based model enables systems, methods and computer products to be capable of generating multilingual abstractive summaries.
-
18.
公开(公告)号:US20230306189A1
公开(公告)日:2023-09-28
申请号:US17885530
申请日:2022-08-10
发明人: Toru MORITA
IPC分类号: G06F40/151 , G06F40/134
CPC分类号: G06F40/151 , G06F40/134
摘要: An information processing apparatus includes a processor configured to: acquire information from an external link destination inserted into a document in a document format that can be viewed without depending on a software environment; in a case where the acquired information is a web page, convert the web page into a file in the document format; and store the file obtained from the external link destination in an offline environment in association with the document.
-
公开(公告)号:US11741293B2
公开(公告)日:2023-08-29
申请号:US17558070
申请日:2021-12-21
申请人: Open Text SA ULC
发明人: Gregory R. Petti
IPC分类号: G06F40/00 , G06F40/143 , G06F8/40 , G06F40/151 , G06F9/44 , G06F8/41 , G06F9/455 , G06F40/154
CPC分类号: G06F40/143 , G06F8/40 , G06F40/151 , G06F8/41 , G06F9/44 , G06F9/45512 , G06F40/154
摘要: A template built by a user may be converted by a Server Script Generation Engine (SSGE) into script code. In converting, the SSGE may load and parse a framework file containing static script syntax to locate insertion points, each associated with an iteration number, and may iteratively parse the template, utilizing the iteration number to resolve, in order, tags and sub-tags contained in the template. If a tag is set to respond to the iteration number, a function of the tag is invoked to process any related sub-tags and return a script associated therewith at the appropriate insertion point. The framework file (with the appropriate script code inserted) is compiled and stored in a compiled script object which can be run multiple times to perform all of the output functions expected by the user in lieu of the need to reconvert the template.
-
公开(公告)号:US11734268B2
公开(公告)日:2023-08-22
申请号:US17358114
申请日:2021-06-25
申请人: Pryon Incorporated
发明人: David Nahamoo , Igor Roditis Jablokov , Vaibhava Goel , Etienne Marcheret , Ellen Eide Kislal , Steven John Rennie , Marie Wenzel Meteer , Neil Rohit Mallinar , Soonthorn Ativanichayaphong , Joseph Allen Pruitt , John Pruitt , Bryan Dempsey , Chui Sung
IPC分类号: G06F16/2452 , G06F40/151 , G06F40/137 , G06F16/957 , G06F16/332 , G06F16/338 , G06F16/335 , G06F16/9032 , G06F16/93 , G06F40/30 , G06F16/33 , G06F40/247 , G06N5/022 , G06N5/04 , G06F40/131 , G06F40/20 , G06F40/284 , G06N3/08 , G06N3/006 , G06N3/044 , G06N3/045
CPC分类号: G06F16/24522 , G06F16/335 , G06F16/338 , G06F16/3328 , G06F16/3329 , G06F16/3349 , G06F16/9032 , G06F16/93 , G06F16/9574 , G06F40/131 , G06F40/137 , G06F40/151 , G06F40/20 , G06F40/247 , G06F40/284 , G06F40/30 , G06N5/022 , G06N5/04 , G06N3/006 , G06N3/044 , G06N3/045 , G06N3/08
摘要: Disclosed are methods, systems, devices, apparatus, media, design structures, and other implementations, including a method that includes receiving a source document, applying one or more pre-processes to the source document to produce contextual information representative of the structure and content of the source document, and transforming the source document, based on the contextual information, to generate a question-and-answer searchable document.
-
-
-
-
-
-
-
-
-