Abstract:
A system, method, and computer program product to automatically eliminate the distribution information available for reconstruction from a disguised dataset. The invention flattens input numerical values into a substantially uniformly distributed dataset, then maps the uniformly distributed dataset into equivalent data in a target distribution. The invention allows the incremental encryption of new values in an encrypted database while leaving existing encrypted values unchanged. The flattening comprises (1) partitioning, (2) mapping, and (3) saving auxiliary information about the data processing, which is encrypted and not updated. The partitioning is MDL based, and includes a growth phase for dividing a space into fine partitions and a prune phase for merging some partitions together.
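The flatten-then-map idea can be illustrated with a minimal sketch. It is not the patented method: equal-count quantile buckets stand in for the MDL-based partitioning, the bucket boundaries (the "auxiliary information") are kept in plaintext rather than encrypted, and the names `quantile_bounds`, `piecewise_linear`, and `make_encoder` are hypothetical. Values outside the sampled range are simply clamped here.

```python
from bisect import bisect_right

def quantile_bounds(sample, k):
    """Return k+1 boundaries at roughly equal-count positions of the sample."""
    ordered = sorted(sample)
    n = len(ordered)
    bounds = [ordered[round(i * (n - 1) / k)] for i in range(k + 1)]
    if any(b >= c for b, c in zip(bounds, bounds[1:])):
        raise ValueError("bucket boundaries must be strictly increasing")
    return bounds

def piecewise_linear(x, xs, ys):
    """Monotone interpolation through (xs[i], ys[i]); xs strictly increasing."""
    if x <= xs[0]:
        return ys[0]
    if x >= xs[-1]:
        return ys[-1]
    i = bisect_right(xs, x) - 1
    t = (x - xs[i]) / (xs[i + 1] - xs[i])
    return ys[i] + t * (ys[i + 1] - ys[i])

def make_encoder(source_sample, target_sample, k=4):
    """Build an order-preserving map: source -> uniform [0, 1] -> target."""
    src = quantile_bounds(source_sample, k)
    tgt = quantile_bounds(target_sample, k)
    uniform = [i / k for i in range(k + 1)]

    def encode(x):
        u = piecewise_linear(x, src, uniform)    # flatten step
        return piecewise_linear(u, uniform, tgt)  # map to the target distribution
    return encode

# Skewed source values, evenly spread target values (both made up for illustration).
source = [1, 2, 3, 4, 100, 200, 300, 1000, 5000]
target = list(range(0, 90, 10))
encode = make_encoder(source, target, k=4)
encoded = [encode(v) for v in source]
```

Because the boundaries are fixed once they are built, a new value can be encoded later without touching values that were already encoded, which mirrors the incremental property the abstract describes.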
Abstract:
A system and method for exploring a web-accessible database includes providing a GUI that a user can manipulate to quickly modify the results of a query to expand or contract the results set, without requiring additional querying. Attribute controls can be manipulated to impose restrictions on the results set, including by designating example records whose attributes are used to restrict the records displayed to the user. Only records that can be displayed are instantiated, to further increase the speed of the system.
Abstract:
A system and method for mining data while preserving a user's privacy includes perturbing user-related information at the user's computer and sending the perturbed data to a Web site. At the Web site, perturbed data from many users is aggregated, and from the distribution of the perturbed data, the distribution of the original data is reconstructed, although individual records cannot be reconstructed. Based on the reconstructed distribution, a decision tree classification model or a Naive Bayes classification model is developed. The model is then provided back to the users, who can apply it to their individual data to generate classifications that are sent back to the Web site, so that the Web site can display a page appropriately configured for the user's classification. Alternatively, the classification model need not be provided to users; instead, the Web site can use the model to, e.g., send search results and a ranking model to a user, with the ranking model being used at the user computer to rank the search results based on the user's individual classification data.
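As a rough illustration of perturbation followed by distribution reconstruction, the sketch below adds uniform integer noise on the "user" side and then estimates the original distribution with an iterative Bayesian update on the "site" side. The abstract does not specify the noise model or the reconstruction algorithm, so both are assumptions, as are the names `perturb` and `reconstruct` and all sample data.

```python
import random

def perturb(value, noise_radius, rng):
    """User side: add uniform integer noise in [-noise_radius, noise_radius]."""
    return value + rng.randint(-noise_radius, noise_radius)

def reconstruct(perturbed, num_levels, noise_radius, iterations=30):
    """Site side: estimate P(original = a) for a in 0..num_levels-1.

    Each pass redistributes every perturbed observation over the original
    levels that could have produced it, weighted by the current estimate.
    The uniform noise density is constant and cancels out of the ratio.
    """
    n = len(perturbed)
    estimate = [1.0 / num_levels] * num_levels
    for _ in range(iterations):
        updated = [0.0] * num_levels
        for w in perturbed:
            lo = max(0, w - noise_radius)
            hi = min(num_levels - 1, w + noise_radius)
            denom = sum(estimate[lo:hi + 1])
            for a in range(lo, hi + 1):
                updated[a] += estimate[a] / denom
        estimate = [v / n for v in updated]
    return estimate

# Made-up data: most users have level 2, some have level 7.
rng = random.Random(0)
original = [rng.choice([2, 2, 2, 7]) for _ in range(500)]
perturbed = [perturb(x, noise_radius=3, rng=rng) for x in original]
estimate = reconstruct(perturbed, num_levels=10, noise_radius=3)
```

The site only ever sees `perturbed`; the estimate describes the population as a whole and does not recover any individual's value.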
Abstract:
The process removes from a diluted gas one or more fluorine compounds present at a first concentration in the diluted gas and concentrates the fluorine compounds. This is done by directly contacting the diluted gas with a cryogenic liquid having a temperature less than or equal to about −150° F., thereby producing a vapor lean in the fluorine compounds and liquid enriched with the fluorine compounds at a second concentration. The second concentration is greater than the first concentration.
Abstract:
An apparatus and process for separating perfluorocarbon compounds from a gas mixture passes an incoming stream of a gas into a cold trap, the gas stream including a plurality of perfluorocarbon compounds. The gas mixture is cooled within the cold trap to a temperature below −100° C. to produce a condensate that is enriched in at least one perfluorocarbon compound and a non-condensed stream from which the condensate was separated. The condensate is withdrawn from the cold trap. The condensate may be withdrawn by warming the cold trap to vaporize the condensate and thereafter flowing the vaporized condensate into a storage vessel. The non-condensed stream may be vented to the atmosphere, re-circulated into the cold trap or flowed through subsequent separation processes to extract additional perfluorocarbon compounds.
Abstract:
A method of gas liquefaction wherein the refrigeration to cool and liquefy an essentially water-free feed gas is provided by a single recirculating mixed refrigerant cycle in which refrigeration is provided by the vaporization of two mixed refrigerant streams of different compositions at lower and higher pressure levels, respectively. A lower pressure level vaporizing refrigerant cools the feed gas stream in a first cooling zone and a higher pressure level vaporizing refrigerant further cools and condenses the cooled gas in a second cooling zone to provide the final liquid product. The lower pressure level vaporizing refrigerant is provided by one or more liquids obtained by ambient cooling of compressed mixed refrigerant vapor. The vaporized lower pressure level refrigerant can be returned to the refrigerant compressor at a temperature below ambient, without further warming, and this cool refrigerant is compressed and combined with the vaporized higher pressure level refrigerant, which is returned at about ambient temperature.
Abstract:
The process recovers NF3 from a multicomponent fluid containing NF3, one or more components less volatile than NF3, and one or more components more volatile than NF3. The process uses a first distillation column and a second distillation column, each distillation column having a top and a bottom. The process includes the following steps: feeding the multicomponent fluid to the first distillation column at a first feed location below the top of the first distillation column; feeding a cryogenic liquid to the first or second distillation column adjacent the top of the first or second distillation column; withdrawing a mixture containing NF3 from the first distillation column; feeding the mixture to the second distillation column at a second feed location; separating NF3 from the mixture in the second distillation column; and removing a stream of NF3 from the second distillation column.
Abstract:
An efficient multieffect distillation process separates multicomponent mixtures containing three or more components into product streams each enriched in one of the components. The distillation process consists of at least two distillation columns that are thermally linked. A mixture stream that can be the given feed or a stream recovered from another distillation column and containing three or more components is fed to one of the thermally linked distillation columns and at least two submixture streams are produced from this distillation column and transferred to other distillation columns for further distillation. At least one of the submixture streams is transferred to the other thermally linked distillation column. One submixture stream is lean in the least volatile component of the mixture feed and the other submixture stream is lean in the most volatile component of the mixture feed.
Abstract:
A process separates a multicomponent feed by distillation into at least one stream enriched in the most volatile component, one stream enriched in the least volatile component, and one stream enriched in a component of intermediate volatility. In one embodiment, the process produces an argon-enriched stream from a cryogenic air separation process. The distillation system includes at least one distillation column that produces a nitrogen-enriched stream from the top of the column, an oxygen product stream from the bottom of the column, and an argon-enriched stream from a partitioned section. The geometry of the partitioned section minimizes vapor and liquid maldistribution, and includes a vertical separating element and an end separating element to isolate the partitioned section from the intermediate distillation section. The equivalent diameter (De) of the partitioned section is at least 60% of the ideal diameter (Di) of the partitioned section, wherein the equivalent diameter (De) is defined as four times the cross sectional flow area enclosed by the vertical separating element divided by the perimeter formed by the vertical separating element, and the ideal diameter (Di) is the diameter of a circle which has the same cross sectional flow area as that which is enclosed by the vertical separating element.
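The diameter criterion can be checked numerically. The sketch below encodes the two definitions from the abstract, De = 4A/P and Di = sqrt(4A/π), and tests the 60% threshold; the function names and the rectangular example sections are illustrative assumptions and do not come from the patent.

```python
import math

def equivalent_diameter(area, perimeter):
    """De: four times the enclosed flow area divided by the perimeter."""
    return 4.0 * area / perimeter

def ideal_diameter(area):
    """Di: diameter of a circle with the same cross-sectional flow area."""
    return math.sqrt(4.0 * area / math.pi)

def diameter_ratio(area, perimeter):
    return equivalent_diameter(area, perimeter) / ideal_diameter(area)

def meets_criterion(area, perimeter, min_ratio=0.60):
    """True when De is at least min_ratio of Di (60% in the abstract)."""
    return diameter_ratio(area, perimeter) >= min_ratio

# Hypothetical rectangular sections, given as (area, perimeter).
compact = (4.0, 10.0)     # 1 x 4 rectangle, ratio about 0.71
elongated = (10.0, 22.0)  # 1 x 10 rectangle, ratio about 0.51
```

A circle gives a ratio of exactly 1, so the ratio measures how far a section departs from the most compact shape: the 1 x 4 rectangle passes the 60% test, while the 1 x 10 rectangle does not.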
Abstract:
A method and system are disclosed for generating a decision-tree classifier in parallel in a multi-processor system, from a training set of records. The method comprises the steps of: partitioning the records among the processors, each processor generating an attribute list for each attribute, and the processors cooperatively generating a decision tree by repeatedly partitioning the records using the attribute lists. For each node, each processor determines its best split test and, along with other processors, selects the best overall split for the records at that node. Preferably, the gini-index and class histograms are used in determining the best splits. Also, each processor builds a hash table using the attribute list of the split attribute and shares it with other processors. The hash tables are used for splitting the remaining attribute lists. The created tree is then pruned based on the MDL principle, which encodes the tree and split tests in an MDL-based code, and determines whether to prune and how to prune each node based on the code length of the node.
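The split evaluation for a single numeric attribute can be sketched as follows. This is a single-process illustration under stated assumptions: it scans one pre-sorted attribute list while maintaining two class histograms (records below and above the candidate split point) and scores each candidate with the weighted gini index. The function names and the sample records are hypothetical, and the parallel coordination, hash tables, and MDL pruning from the abstract are not shown.

```python
from collections import Counter

def gini(histogram):
    """Gini index of a class histogram: 1 - sum of squared class fractions."""
    total = sum(histogram.values())
    if total == 0:
        return 0.0
    return 1.0 - sum((count / total) ** 2 for count in histogram.values())

def best_split(attribute_list):
    """Find the best threshold for a list of (value, class_label) pairs.

    The list must be sorted by value. Returns (weighted_gini, threshold),
    where records with value < threshold go to the left child.
    """
    n = len(attribute_list)
    above = Counter(label for _, label in attribute_list)
    below = Counter()
    best = (float("inf"), None)
    for i in range(n - 1):
        value, label = attribute_list[i]
        below[label] += 1   # move record i from "above" to "below"
        above[label] -= 1
        next_value = attribute_list[i + 1][0]
        if value == next_value:
            continue        # cannot split between equal values
        n_below = i + 1
        score = (n_below / n) * gini(below) + ((n - n_below) / n) * gini(above)
        if score < best[0]:
            best = (score, (value + next_value) / 2)
    return best

# Made-up attribute list: (age, risk class), already sorted by age.
ages = [(17, "high"), (20, "high"), (23, "high"),
        (32, "low"), (43, "high"), (68, "low")]
score, threshold = best_split(ages)
```

For this sample the lowest weighted gini is 2/9, at a threshold of 27.5. In the multi-processor setting described above, each processor would presumably compute such a local best and the minimum across processors would be taken as the overall split.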