摘要:
Sets of genetic markers for specific tumor classes are described, as well as methods of identifying a biological sample based on these markers. Also described are diagnostic, prognostic, and therapeutic screening uses for these markers, as well as oligonucleotide arrays comprising these markers.
摘要:
Methods and apparatus for classifying or predicting the classes for samples based on gene expression are described. Also described are methods and apparatus for ascertaining or discovering new, previously unknown classes based on gene expression.
摘要:
Methods and apparatus for classifying or predicting the classes for samples based on gene expression are described. Also described are methods and apparatus for ascertaining or discovering new, previously unknown classes based on gene expression. Methods, computer systems and apparatus for classifying or predicting whether a sample is treatment sensitive (e.g., chemosensitive) or treatment resistant (e.g., chemoresistant) are also provided. Classification occurs based on analysis of gene expression data from samples that have been subjected to one or more compounds.
摘要:
Methods and apparatus for classifying or predicting the classes for samples based on gene expression are described. Also described are methods and apparatus for ascertaining or discovering new, previously unknown classes based on gene expression.
摘要:
An implementation of NMF functionality integrated into a relational database management system provides the capability to apply NMF to relational datasets and to sparse datasets. A database management system comprises a multi-dimensional data table operable to store data and a processing unit operable to perform non-negative matrix factorization on data stored in the multi-dimensional data table and to generate a plurality of data tables, each data table being smaller than the multi-dimensional data table and having reduced dimensionality relative to the multi-dimensional data table. The multi-dimensional data table may be a relational data table.
摘要:
A method, system, and computer program product for generating a representation of a data mining model that improves the transparency of data mining models so as to be more easily interpretable by human users. The method comprises the steps of: receiving a dataset, generating a callable version of the data mining model, and generating a tree representing decisional logic of the data mining model using the dataset.
摘要:
An implementation of NMF functionality integrated into a relational database management system provides the capability to apply NMF to relational datasets and to sparse datasets. A database management system comprises a multi-dimensional data table operable to store data and a processing unit operable to perform non-negative matrix factorization on data stored in the multi-dimensional data table and to generate a plurality of data tables, each data table being smaller than the multi-dimensional data table and having reduced dimensionality relative to the multi-dimensional data table. The multi-dimensional data table may be a relational data table.
摘要:
A method for projection mining comprises performing a first projection on a first data object of a first type comprising a plurality of data entries and a second data object of a second type comprising a plurality of data entries to create definitions of attributes of the first data object and definitions of attributes of the second data object, performing a second projection of the definitions of the attributes of the first data object and the definitions of the attributes of the second data object into a space of meta-attributes based on semantic relationships among the attributes of the first data object and the second data object, learning relationships between the space of meta-attributes formed by the projections of the first data object and the second data object and a space of meta-attributes relating to new data not included in the first data object and the second data object, and generating at least one new data object of the first or second type based on the new data using the learned relationships.