-
公开(公告)号:US20240241886A1
公开(公告)日:2024-07-18
申请号:US18561925
申请日:2021-05-18
发明人: Yuichi SUTO , Ryosuke SATO , Kei TAKESHITA , Yuichiro ISHIZUKA
IPC分类号: G06F16/25 , G06F16/35 , G06F40/166 , G06F40/30
CPC分类号: G06F16/258 , G06F16/355 , G06F40/166 , G06F40/30
摘要: Provided is a data conversion device 1 that converts log data into structured data, the device including:
a determination unit 21 configured to determine, based on an appearance frequency of natural or non-natural language characters appearing in a document, whether the log data is first log data written in a natural language or second log data output mechanically output from a device;
a classification unit 25 configured to generate a classifier for classifying first log data into a category based on several pieces of first log data, as well as a plurality of categories, classify each piece of first log data into one of the plurality of categories using the classifier, and assign a vector obtained by vectorizing the meaning of a word contained in the several pieces of first log data;
a generation unit 26 configured to replace a plurality of words with a specific word, wherein the plurality of words have a vector similarity not less than a threshold and are regarded as the same word, among a plurality of words contained in the several pieces of first log data, for each category, and to generate log data composed of sentences shared by the several pieces of post-replacement first log data as a category template; and
a second extraction unit 22 configured to specify, in a case where it is determined that to-be-converted log data is the first log data, a category into which the to-be-converted log data will be classified using the classifier, to extract a unique variable for the to-be-converted log data by comparing the to-be-converted log data and a category template of the specified category, and to output the category template and the unique variable as structured data of the to-be-converted log data.-
52.
公开(公告)号:US12039264B2
公开(公告)日:2024-07-16
申请号:US17851738
申请日:2022-06-28
发明人: Rahul Prasad , Sumegha Yadav , Achyut Saxena
IPC分类号: G06F17/00 , G06F40/166 , G06F40/242 , G06F40/274 , G06F40/284
CPC分类号: G06F40/274 , G06F40/166 , G06F40/242 , G06F40/284
摘要: The present invention discloses a system and for sentence completion in an edge device. The method comprises receiving input from a user through a user interface (202); scrutinizing a length of the received input by a smart sentence composer; sanitizing the received input text and restricting the length of the input text to be within a preset threshold number; splitting the received input text from into letters/words (208); predicting several words and a cluster of most probable word's indices by the smart sentence composer (210), and wherein the word's indices have a similarity above the preset threshold number; converting the word's indices to words using index to word mapping (212); choosing the word/words in the cluster having highest score for prediction (214); and the words are continuously predicted until the end of token of word index (216), is determined for completing the sentence prediction.
-
公开(公告)号:US12039256B2
公开(公告)日:2024-07-16
申请号:US17969862
申请日:2022-10-20
申请人: Dell Products L.P.
IPC分类号: G06F40/166 , G06F40/289 , G06N20/00 , G06F40/205
CPC分类号: G06F40/166 , G06F40/289 , G06N20/00 , G06F40/205
摘要: An apparatus comprises a processing device configured to receive a request to generate a synthesized document comprising one or more search terms, and to extract, utilizing a first machine learning model, keywords from a set of documents. The processing device is also configured to select first content for inclusion in a first section of the synthesized document based on a similarity of the search terms and the extracted keywords from corresponding first sections of the set of documents, and to determine, utilizing a second machine learning model that takes as input the selected first content, a set of terms for a second section of the synthesized document. The processing device is further configured to select second content for inclusion in the second section of the synthesized document based on a similarity of the determined set of terms and the extracted keywords from corresponding sections of the set of documents.
-
公开(公告)号:US12039252B2
公开(公告)日:2024-07-16
申请号:US18295049
申请日:2023-04-03
申请人: Dropbox, Inc.
发明人: Nils Peter Welinder , Peter N Belhumeur , Ying Xiong , Jongmin Baek , Simon Kozlov , Thomas Berg , David J Kriegman
IPC分类号: G06F40/123 , G06F3/04842 , G06F3/12 , G06F16/93 , G06F40/103 , G06F40/106 , G06F40/166 , G06F40/169 , G06F40/197 , G06T5/90 , G06T7/194 , G06T11/00 , G06T11/60 , G06V10/10 , G06V10/30 , G06V10/44 , G06V20/40 , G06V30/40 , G06V30/413 , G06V30/414 , G06T5/70 , G06V10/24 , G06V30/418
CPC分类号: G06F40/123 , G06F16/93 , G06F40/106 , G06F40/166 , G06F40/197 , G06T5/90 , G06T7/194 , G06T11/001 , G06V10/10 , G06V10/44 , G06V20/46 , G06V30/40 , G06V30/413 , G06V30/414 , G06F3/04842 , G06F3/126 , G06F40/103 , G06F40/169 , G06T5/70 , G06T11/60 , G06T2207/20056 , G06T2207/20081 , G06T2207/20084 , G06T2207/30176 , G06T2210/22 , G06V10/242 , G06V10/30 , G06V30/418
摘要: The present disclosure is directed toward systems and methods that efficiently and effectively generate an enhanced document image of a displayed document in an image frame captured from a live image feed. For example, systems and methods described herein apply a document enhancement process to a displayed document in an image frame that result in an enhanced document image that is cropped, rectified, un-shadowed, and with dark text against a mostly white background. Additionally, systems and method described herein determine whether a stored digital content item includes a displayed document. In response to determining that a stored digital content item does include a displayed document, systems and methods described herein generate an enhanced document image of a displayed document included in the stored digital content item.
-
公开(公告)号:US12033008B2
公开(公告)日:2024-07-09
申请号:US18363031
申请日:2023-08-01
申请人: PagerDuty, Inc.
IPC分类号: G06N20/00 , G06F9/54 , G06F16/33 , G06F16/35 , G06F40/211 , G06F40/279 , G06F40/284 , G06F40/30 , G06F40/166
CPC分类号: G06F9/542 , G06F16/3347 , G06F16/353 , G06F40/211 , G06F40/279 , G06F40/284 , G06F40/30 , G06F40/166
摘要: A learner object that incorporates indications of agreements and disagreements with determinations obtained from a clustering engine of adding incoming events to one or more events groups is generated. An event is received based on monitored conditions. A determination is made not to add the event to an events group based on a first similarity score obtained from the clustering engine between the event and the events group not exceeding a threshold value. In response to determining not to add the event to the events group, a determination to add the event to the events group is obtained based on the learner object. In response to the determination obtained based on the learner object, the event is added with to the events group. A user interface configured to visually display and obtain feedback regarding additions of events to the events groups based on determinations of the clustering engine is generated.
-
公开(公告)号:US12032921B2
公开(公告)日:2024-07-09
申请号:US18153561
申请日:2023-01-12
申请人: AI21 LABS
发明人: Barak Peleg , Dan Padnos , Amnon Morag , Gilad Lumbroso , Yoav Shoham , Ori Goshen , Barak Lenz , Or Dagan , Guy Einy
IPC分类号: G06F17/00 , G06F3/0482 , G06F40/166 , G06F40/211 , G06F40/274 , G06F40/289 , G06F40/30 , G06F40/56 , G06F40/58 , G06F3/0486
CPC分类号: G06F40/56 , G06F3/0482 , G06F40/166 , G06F40/211 , G06F40/274 , G06F40/289 , G06F40/30 , G06F40/58 , G06F3/0486
摘要: Disclosed embodiments include a computer readable medium that may include instructions that when executed by one or more processing devices cause the one or more processing devices to perform a method. The method may include: identifying at least one reviewer-generated comment in an electronic document; based on analysis of the at least one reviewer-generated comment, generating one or more text output options each responsive to at least one aspect of the reviewer-generated comment; causing the one or more text output options to be displayed to a user; receiving an input from the user indicative of a selection of one of the one or more text output options; and automatically revising text implicated by the reviewer-generated comment in accordance with the selected one of the one or more text options.
-
公开(公告)号:US20240220084A1
公开(公告)日:2024-07-04
申请号:US18523829
申请日:2023-11-29
发明人: Huijing NIE , Le WANG , Dongling GAO
IPC分类号: G06F3/0483 , G06F16/28 , G06F40/166
CPC分类号: G06F3/0483 , G06F16/285 , G06F40/166 , G06F3/04842
摘要: The present disclosure provides an information display method, device, computer apparatus and storage medium, wherein the method comprises: receiving an access request for book encyclopedia information of a target book; acquiring the book encyclopedia information of the target book, wherein, the book encyclopedia information comprises a plurality of information modules, each information module corresponding to at least one book attribute dimension, and the book encyclopedia information belonging to each book attribute dimension is determined according to user's original innovative information and/or information obtained by automatically identifying book-related content of the target book; acquiring and displaying a book encyclopedia page matching book category of the target book, and displaying the book encyclopedia information of each information module in the book encyclopedia page.
-
公开(公告)号:US12026452B1
公开(公告)日:2024-07-02
申请号:US18351923
申请日:2023-07-13
发明人: Bryan J. Jakovcic
IPC分类号: G06F40/166 , G06F16/38 , G06F21/60
CPC分类号: G06F40/166 , G06F16/38 , G06F21/60
摘要: A method of enabling a remotely hosted AI to enhance a document while withholding sensitive information from the AI includes identifying sensitive terminology associated with the sensitive information and replacing by an exclusion filter of the sensitive terminology with redaction markers that cannot be interpreted as misspelled words or coined terms, thereby creating a redacted draft that is submitted to the AI. Upon receiving an enhanced, redacted draft from the AI, the original sensitive terminology is restored in place of the redaction markers, and the resulting enhanced document is delivered to a user. Sensitive terms can be local or global. Globally sensitive terms can be stored in databases directed to categories of sensitive information. Sensitive terms can include indicators directed to sensitive numerical quantities and/or other targets. In embodiments, the exclusion filter automatically corrects any grammatical errors arising from replacement of the redaction markers by the original sensitive terminology.
-
59.
公开(公告)号:US20240212715A1
公开(公告)日:2024-06-27
申请号:US18146590
申请日:2022-12-27
申请人: Dropbox, Inc.
发明人: Derrick Yee , LeeJun Park , Harry Twyford , Ernest Cheng , Isaac Leong
IPC分类号: G11B27/031 , G06F3/04842 , G06F40/166 , G11B27/06 , G11B27/34
CPC分类号: G11B27/031 , G06F3/04842 , G06F40/166 , G11B27/06 , G11B27/34
摘要: The present disclosure is directed toward systems, methods, and non-transitory computer-readable media for editing and collaborating with digital videos through interactions with video transcripts. For example, the disclosed systems can provide a user interface for interacting with a video transcript associated with a digital video. Based on interacting with the video transcript, the disclosed systems can perform editing operations and/or collaborating operations in relation to the digital video. For instance, the disclosed systems can edit a digital video at a video portion corresponding to transcript location where a user interaction occurs within a video transcript.
-
公开(公告)号:US12019992B2
公开(公告)日:2024-06-25
申请号:US17657530
申请日:2022-03-31
申请人: FUJITSU LIMITED
发明人: Mehdi Bahrami , Wei-Peng Chen
IPC分类号: G06F8/60 , G06F8/30 , G06F8/36 , G06F8/41 , G06F8/65 , G06F8/73 , G06F11/36 , G06F16/951 , G06F18/20 , G06F18/22 , G06F18/23213 , G06F40/166 , G06F40/211 , G06F40/216 , G06F40/242 , G06F40/30 , G06F40/40 , G06F40/44 , G06N3/04 , G06N3/08
CPC分类号: G06F40/30 , G06F8/30 , G06F8/36 , G06F8/42 , G06F8/436 , G06F8/65 , G06F8/73 , G06F11/3624 , G06F16/951 , G06F18/22 , G06F18/23213 , G06F18/285 , G06F40/166 , G06F40/211 , G06F40/216 , G06F40/242 , G06F40/40 , G06F40/44 , G06N3/04 , G06N3/08
摘要: According to an aspect of an embodiment, operations for code enrichment for training language models on tasks related to computer programming are provided. The operations include receiving source code data including a computer-executable code and a natural language (NL) text. The operations further include determining blocks of code from the computer-executable code. The operations further include extracting a set of features related to components of the source code data from the blocks of code. The extraction is performed by parsing the blocks of code using Abstract Syntax Tree (AST) data of the blocks of code. The operations further include revising the AST data. The operations further include updating the source code data based on the revised AST data and generating a dataset of NL and abstracted code features as training data based on the updated source code data and further training a language model on a sequence-to-sequence generation task.
-
-
-
-
-
-
-
-
-