Abstract:
An accumulated information data store may include topic nodes, each having a text description of limited length and (in some cases) one or more attributes. A particular topic node may be associated as a parent topic node other child topic nodes such that the topic nodes form at least one data tree. An information processing engine may access information in the accumulated information data store and determine that a plurality of topic node text descriptions are similar and classify them as similar topic nodes. At least a part of the text description associated with one of the similar topic nodes may be selected as a favorable text description for the similar topic nodes. The system may also unify the similar topic nodes as identical topic nodes when they are currently grouped together as having the same upper tree hierarchy.
Abstract:
A computerized method for searching and organizing a healthcare textual data set is provided. The method includes receiving, by a server including a processor and a memory, an input of a user of a selected healthcare treatment product delimiting a subject data set; displaying, in response to the input subject data set received by the server, an intertopic distance map on a topic modeler graphical user interface displaying topics of the input subject data set as raw uncategorized data, the topic modeler graphical user interface displaying icons each representing a corresponding topic within the data set, the icons illustrating a prevalence of the topics in the data set by sizes of the icons and an interrelatedness of the topics by spacing and/or overlap of the icons; modifying, in response to a selection of one of the icons, a terms graph of the topic modeler graphical user interface to display representative keywords of the topic represented by the selected icon; receiving an input of a pattern in a taxonomy modifier graphical user interface generated by the server; displaying, in response to the input pattern, text of the data set corresponding to the input pattern on the taxonomy modifier graphical user interface; and adding, in response to a request of the user, the input pattern to one or more existing levels of a taxonomy of the data set to alter the structure of the taxonomy and provide a modified healthcare treatment taxonomy on the memory of the server.
Abstract:
The present invention relates to a computer-implemented method for locating regulatory information. The method including the steps of constructing a corporate profile; identifying regulatory information from a plurality of sources; and mapping the corporate profile to the identified regulatory information to locate relevant regulatory information. A system is also disclosed.
Abstract:
The present invention relates to a computer-implemented method of defining an entity for a graph-based information system. The method includes the steps of receiving user input at one or more processors to create or select one of a plurality of options for a node in a graph; receiving user input at one or more processors to create or select one of a plurality of options for one or more child nodes in the graph; defining a value for the created/selected option for one of the child nodes at one or more processors in response to user input; and storing the value in at least one memory. The node and one or more child nodes are displayed within a user interface at a user device. A system for defining an entity for a graph-based information system is also disclosed.
Abstract:
A method of automatically generating a lemma dictionary from a web resource may include extracting a plurality of tokens from text-based documents within the web resource, and generating a plurality of N-grams from the plurality of tokens. The method may additionally include receiving one or more filter definitions that identify valid N-grams, and filtering the plurality of N-grams using the one or more filter definitions to generate a lemma dictionary. The method may further include generating an ontology that comprises the lemma dictionary.
Abstract:
A configuration system and method adapted to perform a configuration or reconfiguration of applications run by an automation system, said configuration system comprising; a processing unit adapted to process at least one natural language statement of a user requirement input by a user concerning a control and/or monitoring functionality of the automation system based on a user ontology of the user and/or an automation system ontology of the automation system to generate a formal requirements specification; and a matching unit adapted to match the generated formal requirements specification and formal component specifications read from a component library to derive a configuration deployment comprising one or several suitable components with configurations fulfilling the input user requirement.
Abstract:
Systems and methods are provided for improving language models for speech recognition by personalizing knowledge sources utilized by the language models to specific users or user-population characteristics. A knowledge source, such as a knowledge graph, is personalized for a particular user by mapping entities or user actions from usage history for the user, such as query logs, to the knowledge source. The personalized knowledge source may be used to build a personal language model by training a language model with queries corresponding to entities or entity pairs that appear in usage history. In some embodiments, a personalized knowledge source for a specific user can be extended based on personalized knowledge sources of similar users.
Abstract:
A system and method for searching and presenting data may include receiving a search request, storing data in an in memory database, accessing that data by one or more 64 bit processors, determining exact or approximate matches among the data to the search request, building a machine-readable data file including the search results, and serving the data file to a user's machine, where it is visible through a portal interface, which may be local or remotely served to the user's machine. The system may employ 64 bit technology in order to access and serve significant amounts of information, including mappings between code set and other data that may include mappings to one or more code sets. The system also may employ raw TCP/IP sockets to reduce latency and increase search speed.
Abstract:
Techniques for entity detection include matching a token from at least a portion of a text string with a matching concept in an ontology. A first concept may be identified as being hierarchically related to the matching concept within the ontology, and a second concept may be identified as being hierarchically related to the first concept within the ontology. The first and second concepts may be included in a set of features of the token. Based at least in part on the set of features of the token, a measure related to a likelihood that the at least a portion of the text string corresponds to a particular entity type may be determined.