-
公开(公告)号:US12197412B2
公开(公告)日:2025-01-14
申请号:US18763909
申请日:2024-07-03
Applicant: TUNGSTEN AUTOMATION CORPORATION
Inventor: Steve Thompson , Veronika Levdik , Iurii Vymenets , Donghan Lee
IPC: G06F16/22 , G06V10/70 , G06V30/412 , G06V30/413 , G06V30/414
Abstract: Recent developments in machine learning (commonly coined “artificial intelligence” or “AI”) have vastly expanded applications for this technology, such as myriad “chat” agents adept at understanding natural human language. While state of the art generative models can parse text queries from a user and provide comprehensive, accurate responses (including generating images depicting desired content), current implementations struggle with understanding all information present in images of documents, especially images of business documents. In particular, generative models fail to understand structured and semi-structured information, e.g., as indicated by graphical information such as lines, geometric relationships (e.g., indicated by tables, graphs, figures, etc.), formatting, and other contextual information that human readers easily and implicitly understand. The disclosed inventive concepts transform structured and semi-structured information along with textual content into a textual representation that allows generative models to better understand textual content and non-textual structured information present in document images.
-
2.
公开(公告)号:US20240362197A1
公开(公告)日:2024-10-31
申请号:US18763909
申请日:2024-07-03
Applicant: TUNGSTEN AUTOMATION CORPORATION
Inventor: Steve Thompson , Veronika Levdik , Iurii Vymenets , Donghan Lee
IPC: G06F16/22 , G06V10/70 , G06V30/412 , G06V30/413 , G06V30/414
CPC classification number: G06F16/2282 , G06V10/70 , G06V30/412 , G06V30/413 , G06V30/414
Abstract: Recent developments in machine learning (commonly coined “artificial intelligence” or “AI”) have vastly expanded applications for this technology, such as myriad “chat” agents adept at understanding natural human language. While state of the art generative models can parse text queries from a user and provide comprehensive, accurate responses (including generating images depicting desired content), current implementations struggle with understanding all information present in images of documents, especially images of business documents. In particular, generative models fail to understand structured and semi-structured information, e.g., as indicated by graphical information such as lines, geometric relationships (e.g., indicated by tables, graphs, figures, etc.), formatting, and other contextual information that human readers easily and implicitly understand. The disclosed inventive concepts transform structured and semi-structured information along with textual content into a textual representation that allows generative models to better understand textual content and non-textual structured information present in document images.
-
3.
公开(公告)号:US12205058B2
公开(公告)日:2025-01-21
申请号:US18378580
申请日:2023-10-10
Applicant: TUNGSTEN AUTOMATION CORPORATION
Inventor: Jiyong Ma , Stephen Michael Thompson , Jan W. Amtrup
IPC: G06F3/01 , G06F18/23213 , G06Q10/0633
Abstract: According to one aspect, a computer-implemented method of discovering processes for robotic process automation (RPA) includes: recording a plurality of event streams, each event stream corresponding to a human user interacting with a computing device to perform one or more tasks; concatenating the event streams; segmenting some or all of the concatenated event streams to generate one or more application traces performed by the user interacting with the computing device, each application trace corresponding to one of the one or more tasks performed by the user; clustering the traces according to a task type; identifying, from among some or all of the clustered traces, one or more candidate processes for robotic automation; prioritizing the candidate processes; and selecting at least one of the prioritized candidate processes for robotic automation. Corresponding systems and computer program products are also described.
-
-