摘要:
A method and system correlate candidate information and provide batch classification of a number of related candidates. The batch of candidates may be identified from a single data set. There may be internal correlations and/or differences among the candidates. The candidates may be classified taking into consideration the internal correlations and/or differences. The locations and descriptive features of a batch of candidates may be determined. In turn, the locations and/or descriptive features determined may used to enhance the accuracy of the classification of some or all of the candidates within the batch. In one embodiment, the single data set analyzed is associated with an internal image of patient and the distance between candidates is accounted for. Two different algorithms may each simultaneously classify all of the samples within a batch, one being based upon probabilistic analysis and the other upon a mathematical programming approach. Alternate algorithms may be used.
摘要:
A method and system correlate candidate information and provide batch classification of a number of related candidates. The batch of candidates may be identified from a single data set. There may be internal correlations and/or differences among the candidates. The candidates may be classified taking into consideration the internal correlations and/or differences. The locations and descriptive features of a batch of candidates may be determined. In turn, the locations and/or descriptive features determined may used to enhance the accuracy of the classification of some or all of the candidates within the batch. In one embodiment, the single data set analyzed is associated with an internal image of patient and the distance between candidates is accounted for. Two different algorithms may each simultaneously classify all of the samples within a batch, one being based upon probabilistic analysis and the other upon a mathematical programming approach. Alternate algorithms may be used.
摘要:
Automatic mapping of semantics in healthcare is provided. Data sets have different semantics (e.g., Gender designated with M and F in one system and Sex designated with 1 or 2 in another system). For semantic interoperability, the semantic links between the semantic systems of different healthcare entities are created (e.g., Gender=Sex and/or 1=F and 2=M) by a processor from statistics of the data itself. The distribution of variables, values, or variables and values, with or without other information and/or logic, is used to create a map from one semantic system to another. Similar distributions of other variable and/or values are likely to be for variables and/or values with the same meaning.
摘要:
Inclusion of a patient in a medical category is determined by triggering an analysis of an electronic medical record of the patient in response to an input of data into the electronic medical record. Identifying characteristics that indicate inclusion in the medical category with the analysis, and determining a probability the patient belongs to the medical category based on the identified characteristics.
摘要:
A predictive model of medical knowledge is trained from patient data of multiple different medical centers. The predictive model is machine learnt from routine patient data from multiple medical centers. Distributed learning avoids transfer of the patient data from any of the medical centers. Each medical center trains the predictive model from the local patient data. The learned statistics, and not patient data, are transmitted to a central server. The central server reconciles the statistics and proposes new statistics to each of the local medical centers. In an iterative approach, the predictive model is developed without transfer of patient data but with statistics responsive to patient data available from multiple medical centers. To assure comfort with the process, the transmitted statistics may be in a human readable format.
摘要:
A method for finding a ranking function ƒ that classifies feature points in an n-dimensional space includes providing a plurality of feature points xk derived from tissue sample regions in a digital medical image, providing training data A comprising training samples Aj where A = ⋃ j = 1 S ( A j = { x i j } i = 1 m j ) , providing an ordering E={(P,Q)|APAQ} of at least some training data sets where all training samples xiεAP are ranked higher than any sample xjεAQ, solving a mathematical optimization program to determine the ranking function ƒ that classifies said feature points x into sets A. For any two sets Ai, Aj, AiAj, and the ranking function ƒ satisfies inequality constraints ƒ(xi)≦ƒ(xj) for all xiεconv(Ai) and xjεconv(Aj), where conv(A) represents the convex hull of the elements of set A.
摘要:
A method of training a classifier for computer aided detection of digitized medical images, includes providing a plurality of bags, each bag containing a plurality of feature samples of a single region-of-interest in a medical image, wherein said features include texture, shape, intensity, and contrast of said region-of-interest, wherein each region-of-interest has been labeled as either malignant or healthy, and training a classifier on said plurality of bags of feature samples, subject to the constraint that at least one point in a convex hull of each bag, corresponding to a feature sample, is correctly classified according to the labeled of the associated region-of-interest.
摘要:
A method, including receiving a data source selection from a user or software application, the data source including medical information of a plurality of patients, receiving, from the user or software application, a data pattern that is related to a concept to be explored in the data source, querying the data source to find information that approximately matches the data pattern; and receiving the information from the data source, wherein the information includes unstructured data, assigning a classification to individual parts of the information based on the part's relationship to the data pattern, and outputting the classified information to the user or software application.
摘要:
A method, including receiving a data source selection from a user or software application, the data source including medical information of a plurality of patients, receiving, from the user or software application, a data pattern that is related to a concept to be explored in the data source, querying the data source to find information that approximately matches the data pattern; and receiving the information from the data source, wherein the information includes unstructured data, assigning a classification to individual parts of the information based on the part's relationship to the data pattern, and outputting the classified information to the user or software application.
摘要:
A method of training a classifier for computer aided detection of digitized medical image, includes providing a plurality of bags, each bag containing a plurality of feature samples of a single region-of-interest in a medical image, where each region-of-interest has been labeled as either malignant or healthy. The training uses candidates that are spatially adjacent to each other, modeled by a “bag”, rather than each candidate by itself. A classifier is trained on the plurality of bags of feature samples, subject to the constraint that at least one point in a convex hull of each bag, corresponding to a feature sample, is correctly classified according to the label of the associated region-of-interest, rather than a large set of discrete constraints where at least one instance in each bag has to be correctly classified.