-
公开(公告)号:US20230132061A1
公开(公告)日:2023-04-27
申请号:US17508117
申请日:2021-10-22
发明人: Birgit Monika Pfitzmann , Christoph Auer , Kasper Dinkla , Michele Dolfi , Peter Willem Jan Staar
IPC分类号: G06N5/04 , G06F40/284 , G06F40/295 , G06K9/00 , G06N5/02
摘要: Information extraction systems and computer-implemented methods for producing a searchable representation of information contained in a corpus of documents by generating a document structure graph for each document, the graph indicating a structural hierarchy of document items in that document based on a predefined hierarchy of predetermined item-types, and linking document items to a parent document item in the structural hierarchy, for each document, generating a knowledge graph including first nodes, representing document items in the corpus and second nodes representing language items identified in those document items, interconnecting the first nodes and second nodes by edges representing a defined relation between items represented by the nodes interconnected by that edge, storing the knowledge graph in a knowledge graph database, and producing the searchable representation by traversing edges of the graph in response to input search queries.
-
公开(公告)号:US11532174B2
公开(公告)日:2022-12-20
申请号:US16701191
申请日:2019-12-03
发明人: Changhua Sun , HongLei Guo , Birgit Monika Pfitzmann , Dorothea Wiesmann Rothuizen , Lynette Yvonne Mitchell , Brent Alan Goebel
IPC分类号: G06V30/414 , G06Q30/06
摘要: In an approach for automatically extracting product baseline information from a request for proposal document, a processor receives the document. A processor detects a table in the document. A processor identifies a table header on the table. The table header is associated with a name and an associated volume of the product. A processor extracts context based on the table header from the table. The context includes the name and the associated volume of the product. A processor maps the extracted context with the name of the product in the table to an associated name of the product based on a pre-defined product ontology.
-
公开(公告)号:US20210166016A1
公开(公告)日:2021-06-03
申请号:US16701191
申请日:2019-12-03
发明人: Changhua Sun , HongLei Guo , Birgit Monika Pfitzmann , Dorothea Wiesmann Rothuizen , Lynette Yvonne Mitchell , Brent Alan Goebel
摘要: In an approach for automatically extracting product baseline information from a request for proposal document, a processor receives the document. A processor detects a table in the document. A processor identifies a table header on the table. The table header is associated with a name and an associated volume of the product. A processor extracts context based on the table header from the table. The context includes the name and the associated volume of the product. A processor maps the extracted context with the name of the product in the table to an associated name of the product based on a pre-defined product ontology.
-
公开(公告)号:US11265288B2
公开(公告)日:2022-03-01
申请号:US16529927
申请日:2019-08-02
IPC分类号: H04L29/12 , H04L29/08 , H04L12/24 , H04L29/06 , H04L61/2514 , H04L41/0853 , H04L41/12 , H04L67/1004 , H04L67/1038
摘要: Various embodiments manage the migration of servers. In one embodiment, a set of server-level dependency information is obtained for servers to be migrated from a source computing environment to a target computing environment. A set of network configuration data is obtained for a plurality of network devices associated with the servers. The set of server-level dependency information is updated to include one or more additional dependencies of at least one of the servers based on the set of network configuration data. Updating the set of server-level dependency information generates an updated set of dependency information. The servers are assigned to multiple migration groups based on the updated set of dependency information. The migration groups optimize cross-group dependencies among the migration groups.
-
公开(公告)号:US20210350274A1
公开(公告)日:2021-11-11
申请号:US16868565
申请日:2020-05-07
摘要: A method, a computer system, and a computer program product for managing a dataset of training samples, labeled by class, during training of a machine learning model is provided. Embodiments of the present invention may include training the model on a sequence of increasing-sized sets of the training samples and testing performance of the model after training with each set to obtain class-specific performance metrics corresponding to each set size. Embodiments of the present invention may include generating class-specific learning curves from the performance metrics for the plurality of classes. Embodiments of the present invention may include extrapolating the learning curves. Embodiments of the present invention may include optimizing a function of the predicted performance metrics to identify a set of augmentation actions to augment the dataset for further training of the model. Embodiments of the present invention may include providing an output indicative of the set of augmentation actions.
-
公开(公告)号:US20210004372A1
公开(公告)日:2021-01-07
申请号:US16502802
申请日:2019-07-03
IPC分类号: G06F16/2455 , G06F16/28 , G06F16/248 , G06N20/00
摘要: A method, apparatus, and non-transitory computer readable medium for performing joins on data from hierarchical databases are described. The method, apparatus, and non-transitory computer readable medium may provide for receiving one or more search results from each of a plurality of hierarchical databases, identifying one or more matching fields from each of the search results, joining the search results according to a set of rules for processing related fields to the matching fields, wherein the related fields comprise sibling fields, neighbor fields, ancestor fields, descendant fields, or any combination thereof, and generating one or more combined search results based on the joining.
-
公开(公告)号:US11822892B2
公开(公告)日:2023-11-21
申请号:US17124451
申请日:2020-12-16
IPC分类号: G06F40/35 , G06N5/02 , G06F40/205
CPC分类号: G06F40/35 , G06F40/205 , G06N5/02
摘要: Splitting a natural language sentence into primitive phrases retaining relations of terms includes receiving a natural language sentence, building a parse tree from the natural language sentence using a natural language parser, and recursively identifying discourse markers in subtrees of the parse tree, starting with the highest ranking discourse marker in the parse tree, thereby separating each of the respective subtrees at the respective discourse marker using a set of predefined rules until a set of basic subtrees remains. The recursive identification includes looking-ahead for identifying long ranging discourse markers before identifying local discourse markers.
-
公开(公告)号:US20230252309A1
公开(公告)日:2023-08-10
申请号:US17650086
申请日:2022-02-07
发明人: Birgit Monika Pfitzmann , Christoph Auer , Kasper Dinkla , Michele Dolfi , Peter Willem Jan Staar
IPC分类号: G06N5/02 , G06F40/279
CPC分类号: G06N5/022 , G06F40/279
摘要: A computer-implemented method, a computer program product, and a computer system for building a knowledge graph. A computer system converts user inputs as to a partial topology of a knowledge graph that a user wants to build into one or more initial nodes corresponding to respective natural language descriptions. A computer system interprets the respective natural language descriptions using natural language processing to match the one or more initial nodes against reference data. A computer system, based on matched reference data, obtains a valid topology of nodes and edges, wherein the nodes and edges are mapped onto the matched reference data. A computer system, based on the valid topology, generates a data flow linking to the matched reference data via associations of the nodes and edges and the matched reference data. A computer system builds an executable knowledge graph from the data flow.
-
公开(公告)号:US11687700B1
公开(公告)日:2023-06-27
申请号:US17649597
申请日:2022-02-01
发明人: Birgit Monika Pfitzmann , Christoph Auer , Michele Dolfi , Peter Willem Jan Staar , Ahmed Samy Nassar
IPC分类号: G06F40/00 , G06F40/103 , G06N3/08 , G06V30/412 , G06V30/414
CPC分类号: G06F40/103 , G06N3/08 , G06V30/412 , G06V30/414
摘要: The present disclosure relates to a method for generating a structure of a PDF-document, wherein the PDF-document comprises elements. The method comprises detecting document cells of the PDF-document dependent on commands of a page description language for printing the elements of the PDF-document. The method comprises determining parts of the PDF-document dependent on the PDF-document by a machine learning module. The determining of the respective part comprises associating a respective portion of the elements of the PDF-document with the respective part. Furthermore, a respective label may be assigned to the respective part. The method may further comprise using a symbolic artificial intelligence module, wherein rules of the symbolic AI-module for reconciling the document cells with the parts may be applied. The elements of the structure of the PDF-document may be generated and labelled dependent on a result of the reconciling and dependent on the respective label to the respective part.
-
公开(公告)号:US20210027315A1
公开(公告)日:2021-01-28
申请号:US16522527
申请日:2019-07-25
摘要: A computer-implemented method of automatically identifying a product offering for a customer using a generated decision tree from a directed acyclic graph knowledge base is described. The method includes, by a processor, identifying a set of product offerings, where each product offering is described by a file. The method converts each file into a Directed Acyclic Graph (DAG) and clusters the DAGs. For each cluster, the processor creates a decision tree to distinguish between the product offerings.
-
-
-
-
-
-
-
-
-