Abstract:
A method and system are presented for automatically suggesting rules for data stored in a table, with the table comprising a plurality of columns. The table is profiled to identify a content type for each of one or more of the plurality of columns. A rule knowledge base is accessed to locate rules specified for identified content types. Then, one or more of the located rules specified for identified content types are presented as suggestions. Acceptance of one or more of the suggested rules is received from a user, and the received validations are stored in the rule knowledge base. The accepted rules are applied to data for quality detection and monitoring. Embodiments are also described where columns are suggested based on a given rule.
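The profile-then-look-up flow described above can be sketched as follows. This is a minimal illustration only: the content types, the regular expressions, and the contents of RULE_KB are assumptions made for the example and are not taken from the abstract.

```python
import re

# Hypothetical rule knowledge base: content type -> candidate rules.
RULE_KB = {
    "email": ["must contain exactly one '@'", "must not be empty"],
    "integer": ["must be numeric", "must be non-negative"],
}

# Deliberately simple pattern, used only for this illustration.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def profile_column(values):
    """Guess a content type from the values; return None when unsure."""
    non_empty = [v for v in values if v != ""]
    if not non_empty:
        return None
    if all(EMAIL_RE.match(v) for v in non_empty):
        return "email"
    if all(re.fullmatch(r"-?\d+", v) for v in non_empty):
        return "integer"
    return None


def suggest_rules(table):
    """Map each column name to the rules located for its profiled type."""
    suggestions = {}
    for name, values in table.items():
        content_type = profile_column(values)
        if content_type in RULE_KB:
            suggestions[name] = list(RULE_KB[content_type])
    return suggestions


table = {
    "contact": ["a@example.com", "b@example.org"],
    "age": ["31", "47"],
    "notes": ["call back", "n/a"],
}
suggestions = suggest_rules(table)
```

Columns whose content type cannot be identified (here, "notes") receive no suggestions, which matches the abstract's wording that only one or more of the columns need be typed.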
Abstract:
In an example embodiment, a method of automatically generating data validation rules from data stored in a column of a table is provided. Outliers for the data are determined by analyzing a profiling statistic for the data, the profiling statistic having a type. Then it is determined if a predefined limit is exceeded, based on a quantity of the outliers determined for the data through the analysis of the profiling statistic. A data validation rule is then automatically generated based on non-outliers detected in the data through the analysis of the profiling statistic, the generated data validation rule also being based on the type of the profiling statistic. The data validation rule can then be applied to data subsequently entered for the column, causing at least a portion of the data subsequently entered for the column to be rejected.
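One possible reading of these steps is sketched below. The abstract does not name a specific profiling statistic or outlier test, so the interquartile-range fence, the 10% limit, and the choice to generate no rule when the limit is exceeded are all assumptions made for illustration.

```python
import statistics


def generate_range_rule(values, max_outlier_fraction=0.1, k=1.5):
    """Return (low, high) bounds built from non-outliers, or None.

    Assumed policy: if the number of outliers exceeds the predefined
    limit, no rule is generated.
    """
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    fence_low, fence_high = q1 - k * iqr, q3 + k * iqr
    inliers = [v for v in values if fence_low <= v <= fence_high]
    outlier_count = len(values) - len(inliers)
    if outlier_count > max_outlier_fraction * len(values):
        return None
    # The rule type follows the statistic type: a numeric value
    # statistic yields a numeric range rule.
    return (min(inliers), max(inliers))


def validate(value, rule):
    """Accept a subsequently entered value only if it is inside the range."""
    low, high = rule
    return low <= value <= high


history = [10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 500]
rule = generate_range_rule(history)
```

With this history, the single extreme value is treated as an outlier, the rule is derived from the remaining values, and a later entry of 500 would be rejected by validate.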
Abstract:
According to some embodiments, a method and an apparatus for enriching search results with metadata are provided. A plurality of metadata associated with an entity is received and stored in a repository. A search request associated with the entity is received, and search results that comprise a portion of the plurality of metadata stored in the repository are determined.
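As a rough sketch of the store-then-search flow, assuming an in-memory repository keyed by entity name (the class name MetadataRepository, its methods, and the sample fields are hypothetical):

```python
from collections import defaultdict


class MetadataRepository:
    """Hypothetical in-memory repository keyed by entity name."""

    def __init__(self):
        self._by_entity = defaultdict(dict)

    def store(self, entity, metadata):
        """Receive metadata for an entity and keep it in the repository."""
        self._by_entity[entity].update(metadata)

    def search(self, entity, fields=None):
        """Return stored metadata for the entity, optionally only some fields."""
        stored = self._by_entity.get(entity, {})
        if fields is None:
            return dict(stored)
        return {k: v for k, v in stored.items() if k in fields}


repo = MetadataRepository()
repo.store("orders_table", {"owner": "finance", "row_count": 1200})
repo.store("orders_table", {"last_profiled": "2020-01-01"})
result = repo.search("orders_table", fields={"owner", "row_count"})
```

The fields argument models the abstract's "portion of the plurality of metadata": a search result need not carry everything stored for the entity.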
Abstract:
A computer-implemented method of calculating a cost impact is provided. The method includes associating cost amounts with various rules, using the rules to identify bad data, and calculating an aggregate cost of the bad data. In this manner, a data steward can prioritize various data quality improvement projects.
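The calculation can be illustrated with a short sketch. The rule names, predicates, and per-violation cost amounts below are invented for the example; the abstract does not specify how costs are assigned.

```python
# Hypothetical rules: (name, predicate flagging bad data, cost per violation).
RULES = [
    ("missing_email", lambda r: not r.get("email"), 5.0),
    ("negative_balance", lambda r: r.get("balance", 0) < 0, 20.0),
]


def cost_impact(records, rules):
    """Return (cost per rule, aggregate cost) for records that violate rules."""
    per_rule = {}
    for name, is_bad, cost in rules:
        violations = sum(1 for record in records if is_bad(record))
        per_rule[name] = violations * cost
    return per_rule, sum(per_rule.values())


records = [
    {"email": "a@example.com", "balance": 10},
    {"email": "", "balance": -5},
    {"email": None, "balance": 3},
]
per_rule, total = cost_impact(records, RULES)
```

Sorting per_rule by value would give one simple way to rank data quality projects by their estimated cost.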
Abstract:
According to particular embodiments, determining paths in a network with asymmetric switches includes receiving a graph representing the network. Each asymmetric switch has defined degree connectivity between one or more pairs of degrees of the asymmetric switch. The graph is transformed to yield a transformed graph that accounts for the asymmetric switches. A routing process is applied to the transformed graph to yield one or more paths through the network.
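The abstract does not say how the graph is transformed. One common technique, shown here purely as an assumption, is to split each node into per-degree "in" and "out" vertices and add internal edges only for the degree pairs the switch actually connects; an ordinary breadth-first search then serves as the routing process. All names and the sample topology are hypothetical.

```python
from collections import deque


def transform(links, connectivity):
    """Build a directed graph whose vertices are (node, degree, direction)."""
    adj = {}

    def add(u, v):
        adj.setdefault(u, []).append(v)

    # A physical link carries traffic both ways between two degrees.
    for (n1, d1), (n2, d2) in links:
        add((n1, d1, "out"), (n2, d2, "in"))
        add((n2, d2, "out"), (n1, d1, "in"))
    # Inside a switch, only the listed degree pairs are connected.
    for node, pairs in connectivity.items():
        for d1, d2 in pairs:
            add((node, d1, "in"), (node, d2, "out"))
            add((node, d2, "in"), (node, d1, "out"))
    return adj


def route(adj, src, dst):
    """Breadth-first search on the transformed graph; returns node names."""
    starts = [v for v in adj if v[0] == src and v[2] == "out"]
    prev = {v: None for v in starts}
    queue = deque(starts)
    while queue:
        v = queue.popleft()
        if v[0] == dst and v[2] == "in":
            hops = []
            while v is not None:
                if not hops or hops[-1] != v[0]:
                    hops.append(v[0])
                v = prev[v]
            return hops[::-1]
        for nxt in adj.get(v, ()):
            if nxt not in prev:
                prev[nxt] = v
                queue.append(nxt)
    return None


links = [
    (("A", 1), ("S", 1)),
    (("S", 3), ("B", 1)),
    (("S", 2), ("C", 1)),
    (("C", 2), ("B", 2)),
]
# Switch S is asymmetric: degree 1 cannot reach degree 3 directly.
asymmetric = {"S": [(1, 2), (2, 3)], "C": [(1, 2)]}
path = route(transform(links, asymmetric), "A", "B")
```

Because each internal edge runs from an "in" vertex to an "out" vertex, two internal edges cannot be chained inside one switch, so a disallowed degree pair cannot be reached indirectly. In this topology the route from A to B is forced through C.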
Abstract:
The present invention provides an anti-frost coating, which forms a coat having a hydrophilic and hydrophobic composite structure after being applied on a substrate, and contains a hydrophobic polymer and a hydrophilic polymer. The application method of the anti-frost coating comprises: dissolving the hydrophilic polymer and the hydrophobic polymer in a solvent to form a homogeneous solution, coating the solution on a substrate to form a film, and drying and curing the film to form an anti-frost coat with a hydrophilic and super-hydrophobic composite structure including a super-hydrophobic surface layer and a hydrophilic inner layer. Water droplets can roll off easily, and dust and impurities deposited on the surface can be easily removed. The anti-frost effect is desirable.
Abstract:
Disclosed is a user interface on a display for editing data transformations comprising an ETL process. A first display area presents a data representation of a data transformation. A second display area presents a view of input data, and a third display area presents a view of output data. User input to modify the data transformation is received. In response to receiving the user input, the third display area is updated with output data generated by applying the modified data transformation to the input data.
Abstract:
A system for real-time data surveillance in an oilfield operation includes a data collection system deployed at a well site for collecting oilfield data; an office-based computer system for storing and analyzing the oilfield data; a portable device, wherein the portable device includes at least one program for communicating with the office-based computer system, for surveying the oilfield data stored in the office-based computer system, and for causing the office-based computer system to perform a process implemented on the office-based computer system; and a communication system providing communication links among the data collection system, the office-based computer system, and the portable device.
Abstract:
Stable rare earth tris (organophosphate) solutions comprise a rare earth tris (organophosphate) and a hydrocarbon solvent. From about 2% to about 10% rare earth element, preferably from about 3% to about 8%, is present in the solutions. The rare earth tris (organophosphate) solutions are stable against precipitation of the rare earth tris (organophosphate) for at least about fifteen (15) days, preferably for at least about twenty (20) days, and most preferably for at least about thirty (30) days. A process for preparing these solutions is described herein. A stabilizing additive, such as an acid, a glycol, or mixtures thereof, is utilized to inhibit precipitation. The molar ratios of free acid, glycol and/or water to the rare earth element are controlled to inhibit precipitation.
Abstract:
The invention provides a biosynthetic gene cluster for mitomycin, for example, a mitomycin biosynthetic cluster from organisms such as Streptomyces, for instance, S. lavendulae, as well as methods of using gene(s) within the cluster to alter antibiotic biosynthesis and to prepare a polyketide synthase.