Abstract:
Methods and systems are provided that enable text in various sections of data records to be separately catalogued, indexed, or vectorized for analysis in a text visualization and mining system. A text processing system receives a plurality of data records, where each data record has one or more attribute fields associated with it. The attribute fields containing textual information are identified. The specific textual content of each attribute field is identified. An index is generated that associates the textual content contained in each attribute field with the attribute field containing the textual content. The index is operable for use in text processing. The plurality of data records may be located in a data table and the textual information may be contained within cells of the data table. In another aspect, a plurality of data records is received, where at least some of the data records contain text terms. A first method is applied to weight text terms of the data records in a first manner to aid in distinguishing records from each other in response to selection of the first method. A second method is applied to weight text terms of the data records in a second manner to aid in distinguishing records from each other in response to selection of the second method. A vector is generated to distinguish each of the data records based on the text terms weighted by either the first or second method.
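The field-scoped index and the selectable weighting described in this abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the abstract does not name the two weighting methods, so raw term frequency and TF-IDF stand in for them, and `tokenize`, `build_field_index`, and `weight_terms` are hypothetical helper names.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_field_index(records):
    """Map each textual attribute field to {term: set of record positions}."""
    index = defaultdict(lambda: defaultdict(set))
    for pos, record in enumerate(records):
        for field, value in record.items():
            if isinstance(value, str):  # assumption: string-valued fields are the textual ones
                for term in tokenize(value):
                    index[field][term].add(pos)
    return index


def weight_terms(records, field, method="tf"):
    """Return one {term: weight} vector per record, using the selected method."""
    docs = []
    for record in records:
        value = record.get(field)
        docs.append(Counter(tokenize(value if isinstance(value, str) else "")))
    if method == "tf":  # stand-in for the "first method": raw term counts
        return [dict(c) for c in docs]
    if method == "tfidf":  # stand-in for the "second method": counts scaled by rarity
        n = len(docs)
        df = Counter(term for c in docs for term in c)
        return [{t: f * math.log(n / df[t]) for t, f in c.items()} for c in docs]
    raise ValueError(f"unknown weighting method: {method}")


# Hypothetical sample records with two textual fields and one numeric field.
records = [
    {"title": "Kinase binding assay", "notes": "binding observed", "score": 0.9},
    {"title": "Kinase expression", "notes": "no binding", "score": 0.2},
]
index = build_field_index(records)
tf_vectors = weight_terms(records, "title", method="tf")
tfidf_vectors = weight_terms(records, "title", method="tfidf")
```

Because the index is keyed by field first, the term "binding" resolves to record 0 only under `title` but to both records under `notes`. With the TF-IDF stand-in, "kinase" receives weight zero in the `title` field because it appears in every record and so does not help tell them apart.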
Abstract:
A data import system enables access to data of multiple types from multiple data sources of different formats and provides an interface for importing data into a data analysis system. The interface enables a user to customize the formatting of the data as the data is being imported into a data analysis system. A user may select first user defined options for operating on a first data set received during a data importation process. An intermediate representation of the data set is generated based on the first user defined options. A user may specify second user defined options based on the intermediate representation during the data importation process. The second user defined options are processed to produce a final data representation of the data set to be used for analysis of the data. The intermediate representation may be a data table. The processing of a data set may include merging a first and second data set to produce the final data representation. The second user defined options may enable a user to select a basic operation for merging the data sets or to select a non-basic operation for merging the data sets. The basic operation may combine data sets in response to a user's selection of a first graphical interface control, and the non-basic operation may combine the data sets based on user selection of at least two graphical interface controls from a group of graphical interface controls.
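The two-stage import flow in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the first options are taken to be a delimiter and a header flag, the second options a column selection and renaming, and the basic merge an inner join on a shared key. The function names `to_intermediate`, `merge_on_key`, and `finalize` are hypothetical and do not come from the source.

```python
import csv
import io


def to_intermediate(raw_text, delimiter=",", has_header=True):
    """Stage 1: apply the first options and return an intermediate table (list of row dicts)."""
    rows = list(csv.reader(io.StringIO(raw_text), delimiter=delimiter))
    if has_header:
        header, body = rows[0], rows[1:]
    else:
        header = [f"col{i}" for i in range(len(rows[0]))]
        body = rows
    return [dict(zip(header, row)) for row in body]


def merge_on_key(left, right, key):
    """Assumed 'basic' merge: inner join of two intermediate tables on one key column."""
    lookup = {row[key]: row for row in right}
    return [{**row, **lookup[row[key]]} for row in left if row[key] in lookup]


def finalize(table, keep=None, rename=None):
    """Stage 2: apply the second options (column selection and renaming)."""
    rename = rename or {}
    final = []
    for row in table:
        columns = keep if keep is not None else list(row)
        final.append({rename.get(c, c): row[c] for c in columns})
    return final


# Two hypothetical sources in different formats.
first = to_intermediate("id;name\n1;alpha\n2;beta\n", delimiter=";")
second = to_intermediate("id,value\n1,3.5\n3,7.1\n")

# The user inspects the intermediate tables, then chooses the second options.
final = finalize(
    merge_on_key(first, second, "id"),
    keep=["id", "value"],
    rename={"value": "measurement"},
)
```

Keeping the intermediate table as a plain list of row dictionaries lets the second set of options be chosen after the user has seen what the first set produced, which is the ordering the abstract describes.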
Abstract:
A system or method consistent with an embodiment of the present invention is useful in analyzing large volumes of different types of data, such as textual data, numeric data, categorical data, or sequential string data, for use in identifying relationships among the data types or different operations that have been performed on the data. A system or method consistent with the present invention determines and displays the relative content and context of related information and is operative to aid in identifying relationships among disparate data types. Various data types, such as numerical data, protein and DNA sequence data, categorical information, and textual information, such as annotations associated with the numerical data or research papers, may be correlated for visual analysis. A variety of user-selectable views may be correlated for user interaction to identify relationships that exist among the different types of data or various operations performed on the data. Furthermore, the user may explore the information contained in sets of records and their associated attributes through the use of interactive 2-D line charts and interactive summary miniplots.
Abstract:
Systems and methods provide several enhancements for the viewing, analysis, and generation of landscape views in a data analysis system, including: allowing a user to select from multiple methods to generate a landscape view, providing labels for peaks of a landscape, enabling the user to replace labels displayed on the landscape view, enabling a landscape view to be recalculated based on the replacement labels, and allowing a user to switch or morph between two landscape views generated by different methods. Such methods or systems generate graphical landscape map visualizations from a set of data records.
Abstract:
Systems for creating high-dimensional vectors representing sequence strings and biopolymer materials are provided. A first system divides respective sequence strings into blocks of at least three units to create a vocabulary of blocks. A second system selects predefined domains of a plurality of items of biopolymer materials. A third system defines each item of biopolymer material in a data set of biopolymer materials as a surface using descriptors of at least one of structure and function. A fourth system compares information regarding each biopolymer material of a plurality of biopolymer materials to information regarding each other biopolymer material.
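The first system above, which divides sequence strings into blocks to build a vocabulary, can be sketched as below. The abstract does not say whether blocks overlap or how vector components are computed, so this sketch assumes overlapping windows and plain occurrence counts. The names `blocks` and `vectorize` are hypothetical.

```python
from collections import Counter


def blocks(sequence, k=3):
    """Split a sequence into overlapping blocks of k units (k must be at least 3)."""
    if k < 3:
        raise ValueError("blocks must contain at least three units")
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def vectorize(sequences, k=3):
    """Build a shared block vocabulary and one count vector per sequence."""
    counts = [Counter(blocks(s, k)) for s in sequences]
    vocabulary = sorted(set().union(*counts))
    vectors = [[c[b] for b in vocabulary] for c in counts]
    return vocabulary, vectors


# Hypothetical DNA fragments.
vocabulary, vectors = vectorize(["ATGCA", "TGCAT"])
# vocabulary == ['ATG', 'CAT', 'GCA', 'TGC']
# vectors    == [[1, 0, 1, 1], [0, 1, 1, 1]]
```

Each distinct block becomes one dimension, so the vector length grows with the vocabulary, which is one way the representation can become high-dimensional for real sequence sets.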
Abstract:
Methods and apparatus allow a user to explore the information contained in sets of records and their associated attributes through the use of interactive surface maps. The records may contain various types of attributes, including text, numeric, categoric, and sequence data.
Abstract:
A protective glove is formed of composite material for close fitting over the fingers and the back and palm of the hand. The composite material includes a layer of flexible elastic material for tactile sensitivity through the layer. Optically reflective and dispersive particles are distributed and embedded within the layer for dispersing incident laser light thereby preventing laser burn injuries to the hand of a wearer.
Abstract:
A microwave antenna structure couples electromagnetic microwave energy from a microwave transmission line into a sample contained in a sample container without invasion of the sample. The microwave antenna structure is formed by a bifilar helix of conducting first and second helical elements. The first and second helical elements are arranged in a parallel relationship defining a double helix with alternating spaced apart helical turns from the respective first and second helical elements. The double helix forms a holder for receiving and holding a sample container within the turns of the double helix. The first and second helical elements are formed with coupling extensions for coupling to opposite polarity conductors of a microwave transmission line. Electromagnetic microwave energy propagating along the transmission line is coupled into sample material within the sample container. A multiple sample microwave irradiation system provides an oil bath in the form of a reservoir containing relatively low dielectric constant fluid oil. A temperature regulator and circulator is immersed in the oil bath for uniform temperature control throughout the reservoir. Multiple microwave antenna structures and sample containers are suspended and immersed in the oil bath. Multiple microwave branch transmission lines couple microwave energy into the respective antenna structures.
Abstract:
A term variant discernment system identifies terms in content and executes one or more discernment processes to determine a meaning for each term. An ID is assigned to each term based on its meaning, with terms and their variant terms being assigned a distinct ID when they have different meanings and with terms and their variant terms being assigned the same ID when they have the same meaning. The terms and variants can then be individually queried via a query even though the terms and their variants may have the same spelling, abbreviation, or other characteristics.
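The ID assignment rule in this abstract (same meaning, same ID; different meaning, distinct ID) can be illustrated with a small sketch. The abstract does not describe the discernment processes themselves, so a simple context-cue overlap stands in for them here. The `SENSES` table, the `SenseRegistry` class, and the `discern` function are all hypothetical.

```python
from itertools import count

# Hypothetical sense inventory: each term maps to (meaning, context cue words) pairs.
SENSES = {
    "cold": [
        ("common cold", {"virus", "symptoms", "cough"}),
        ("low temperature", {"weather", "temperature", "winter"}),
    ],
    "chilly": [("low temperature", set())],
}


class SenseRegistry:
    """Hands out one stable ID per distinct meaning."""

    def __init__(self):
        self._ids = {}
        self._next = count(1)

    def id_for(self, meaning):
        if meaning not in self._ids:
            self._ids[meaning] = next(self._next)
        return self._ids[meaning]


def discern(term, context, registry):
    """Pick the sense whose cue words overlap the context most, then return its ID."""
    words = set(context.lower().split())
    candidates = SENSES[term.lower()]
    meaning = max(candidates, key=lambda sense: len(sense[1] & words))[0]
    return registry.id_for(meaning)


registry = SenseRegistry()
illness_id = discern("cold", "patient has cough and cold symptoms", registry)
weather_id = discern("cold", "cold winter weather", registry)
variant_id = discern("chilly", "a chilly morning", registry)
```

Here the two uses of "cold" receive different IDs despite identical spelling, while "chilly" shares the ID of the weather sense, so a query by ID can retrieve a meaning regardless of which surface form was used.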
Abstract:
The invention encompasses purified and isolated Salmonella nucleotide fragments, methods of expressing and isolating polypeptides coded for by a Salmonella nucleotide sequence, methods for detecting the presence of Salmonella nucleotide sequences, methods for blocking transcription or translation of Salmonella nucleotide sequences, methods for blocking production or activity of polypeptide sequences from Salmonella nucleotide sequences, DNA chips containing Salmonella nucleotide sequences, and purified polypeptides expressed by Salmonella nucleotide sequences.