Abstract:
Methods, systems and computer readable media for network-based identification of significant molecules, in which at least one biological network is provided that includes the significant molecules to be identified. A node in the network is identified. A member-specific sub-network containing the nodes connected to the identified node is identified for L levels of nearest neighbors, wherein L is a positive integer, and a connectivity score is calculated for the molecule represented by the identified node based on the significance scores of the nodes contained in the member-specific sub-network. These steps are repeated for other nodes in the network. Also provided are methods, systems and computer readable media for network-based identification of significant molecules in which, in addition to the at least one biological network, a data set including data values characterizing the molecules experimented on is provided, and an interesting list of molecules is provided as a subset of the molecules from the data set, the interesting list including significance scores for the molecules in the list. Such identification includes identifying a node in the network; identifying a member-specific sub-network containing the nodes connected to the identified node for L levels of nearest neighbors, wherein L is a positive integer; extracting the member-specific sub-network from the network; and repeating these steps for each of the other nodes in the network that corresponds to a molecule in the interesting list.
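The per-node procedure described in this abstract can be illustrated with a short sketch. The abstract does not say how the significance scores are combined, so the example below assumes a plain sum and assumes the identified node counts as part of its own sub-network. The names `member_subnetwork`, `connectivity_scores`, and the toy network are hypothetical and chosen only for illustration.

```python
from collections import deque


def member_subnetwork(adjacency, start, levels):
    """Return the nodes within `levels` hops of `start` (breadth-first search).

    Assumption: the identified node itself is included in its sub-network.
    """
    seen = {start}
    frontier = deque([(start, 0)])
    while frontier:
        node, depth = frontier.popleft()
        if depth == levels:
            continue  # do not expand beyond L levels of nearest neighbors
        for neighbor in adjacency.get(node, ()):
            if neighbor not in seen:
                seen.add(neighbor)
                frontier.append((neighbor, depth + 1))
    return seen


def connectivity_scores(adjacency, significance, levels):
    """Score every node from the significance scores in its sub-network.

    Assumption: the connectivity score is the sum of significance scores;
    the abstract only says the score is "based on" them.
    """
    scores = {}
    for node in adjacency:
        sub = member_subnetwork(adjacency, node, levels)
        scores[node] = sum(significance.get(n, 0.0) for n in sub)
    return scores


# Toy undirected network (hypothetical molecules A to D).
adjacency = {
    "A": ["B", "C"],
    "B": ["A", "D"],
    "C": ["A"],
    "D": ["B"],
}
significance = {"A": 1.0, "B": 2.0, "C": 0.5, "D": 4.0}
scores = connectivity_scores(adjacency, significance, levels=1)
```

With L = 1, node B scores highest in this toy network because its sub-network contains the highly significant node D. Restricting the loop in `connectivity_scores` to nodes on an interesting list would give the second variant described above.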
Abstract:
Systems, methods and computer readable media for performing a domain-specific metasearch, and obtaining search results therefrom. A metasearch engine capable of accessing generic, web-based search engines and domain-relevant search engines is provided to receive one or more queries inputted by a user, and to search for documents relevant to the queries on at least one of the generic, web-based search engines and domain-relevant search engines. Raw data search results are fetched in the form of text documents. Relevant data including semantic information are extracted from the raw data search results, and converted to a local format. The relevant data having been converted to the local format may be visualized as a network visualization. Additionally or alternatively, the raw data search results may be ranked and/or filtered based on the linking of the relevant data. Visualization of the raw data having been ranked and/or filtered may be performed in addition to, or as an alternative to, visualization of the network.
Abstract:
Methods, systems and computer readable media for correlating data from data sets to higher level categories of characterization of the data. Data from a first set of data is analyzed to determine where members of the first set map to an ontology. Data from a second set of data is analyzed to determine where members of the second set map to the ontology. From such analysis a subset of the first set of data is identified and a subset of the second set of data is identified. The subset of the first set of data is statistically analyzed with regard to its mapping to the ontology, and a first set of ontology terms is identified that are statistically differentiated by members of the subset of the first set of data. The subset of the second set of data is statistically analyzed with regard to its mapping to the ontology, and a second set of ontology terms is identified that are statistically differentiated by members of the subset of the second set of data. Correlation of the first set of ontology terms with the second set of ontology terms may further be performed.
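One way to picture the statistical step is an over-representation test per ontology term. The abstract names no particular statistic, so the sketch below assumes a one-sided hypergeometric test with a fixed cutoff and no multiple-testing correction, and it treats "correlation" of the two term sets as their intersection. The function names, the `alpha` threshold, and the toy annotations are hypothetical.

```python
from math import comb


def hypergeom_tail(k, K, n, N):
    """P(X >= k) when n members are drawn from N, K of which carry the term."""
    upper = min(K, n)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / comb(N, n)


def enriched_terms(subset, annotations, alpha=0.05):
    """Return {term: p-value} for terms over-represented in `subset`.

    `annotations` maps every member of the full data set to its ontology terms.
    Assumption: a hypergeometric test stands in for the unspecified analysis.
    """
    total = len(annotations)
    term_members = {}
    for member, terms in annotations.items():
        for term in terms:
            term_members.setdefault(term, set()).add(member)
    result = {}
    for term, members in term_members.items():
        overlap = len(members & subset)
        p_value = hypergeom_tail(overlap, len(members), len(subset), total)
        if p_value <= alpha:
            result[term] = p_value
    return result


# Toy data: two data sets mapped to the same (hypothetical) ontology terms.
genes = {f"g{i}": ({"apoptosis"} if i <= 4 else {"metabolism"}) for i in range(1, 11)}
proteins = {f"p{i}": ({"apoptosis"} if i <= 4 else {"transport"}) for i in range(1, 11)}

first_terms = enriched_terms({"g1", "g2", "g3", "g4"}, genes)
second_terms = enriched_terms({"p1", "p2", "p3", "p4"}, proteins)

# Simplest possible correlation: terms differentiated in both subsets.
shared_terms = set(first_terms) & set(second_terms)
```

In this toy case only "apoptosis" is differentiated by both subsets, so it is the single shared term. A realistic analysis would also account for the ontology hierarchy and correct for testing many terms at once.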
Abstract:
Systems, methods and recordable media for interactively importing, creating, and manipulating biological diagrams. Such diagrams may be used for linking and navigating to other sources of biological information. Such diagrams may also be used interactively with other diagrams or other views of biological knowledge.
Abstract:
A method for feature selection is provided. The method includes the steps of selecting a predictor set of features, adding at least one complementary feature to the predictor set based on a quality of prediction, checking to see if all of the features of the predictor set are repeated, and if not, removing at least one feature from the predictor set. The algorithm and method repeat the steps of adding complements, checking the predictor set and removing features until the features of the predictor set are repeated. Once the features of the predictor set are repeated the proper number of times, the algorithm and method terminate.
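The add, check, and remove loop can be sketched as follows. This is one reading of the abstract, not the claimed method: "repeated" is taken to mean that the predictor set produced by a round has already been seen once, the complement added is the single feature that most improves a caller-supplied quality function, and the feature removed is the one whose removal costs the least quality. The names `select_features`, `quality`, and `weights` are hypothetical.

```python
def select_features(all_features, quality, initial, max_rounds=100):
    """Alternate adding and removing features until a predictor set repeats.

    `quality` maps a frozenset of features to a number (higher is better).
    Assumption: termination after a single repeat; the abstract mentions
    a "proper number of times" without giving the count.
    """
    current = frozenset(initial)
    seen = {current}
    for _ in range(max_rounds):
        # Add the complementary feature that yields the best prediction quality.
        candidates = sorted(f for f in all_features if f not in current)
        grown = current
        if candidates:
            best = max(candidates, key=lambda f: quality(current | {f}))
            grown = current | {best}
        # Remove the feature whose removal leaves the highest quality.
        shrunk = grown
        if len(grown) > 1:
            drop = max(sorted(grown), key=lambda f: quality(grown - {f}))
            shrunk = grown - {drop}
        # Check whether this predictor set has been produced before.
        if shrunk in seen:
            return shrunk
        seen.add(shrunk)
        current = shrunk
    return current


# Toy quality function: each feature contributes a fixed, made-up weight.
weights = {"a": 0.1, "b": 0.5, "c": 0.9, "d": 0.3}


def quality(features):
    return sum(weights[f] for f in features)


selected = select_features(weights, quality, initial={"a", "d"})
```

Starting from the weak pair {a, d}, the loop swaps in c, then b, and stops when the next round reproduces {b, c}. With a real classifier, `quality` would typically be a cross-validated accuracy rather than a sum of weights.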
Abstract:
Systems, tools, methods and recordable media for providing a visual grammar to be associated with a local format for creating interactive biological diagrams. Stencils are provided to represent higher-level representations of associated entities, and interactions are displayed descriptively and unambiguously. Additionally, information may be overlaid on and compared with existing biological diagrams.