摘要:
A system, method, and computer-readable medium for performing an analytics workflow generation operation. The analytics workflow generation operation enables generation of targeted analytics workflows (e.g., via a data scientist (i.e., an expert in data modeling)) that are then published to a workflow storage repository so that the targeted analytics workflows can be used by domain experts and self-service business end-users to solve specific classes of analytics operations.
摘要:
A system, method, and computer-readable medium are disclosed for identifying sources of variation in complex manufacturing processes via a variation identification operation. In certain embodiments, the variation identification operation is performed via a variation identification system. The variation identification operation addresses special class of analytic problems, namely the estimation of variance components and related statistics from very large (big data) hierarchically nested designs of random factors. These types of data structures occur frequently across various industries, and in particular in automated and batch manufacturing where the variability in product quality as measured in final product testing should be related to batches, lots, wafers, suppliers, etc. upstream of the process.
摘要:
A system, method, and computer-readable medium are disclosed for analysis and characterization of manufacturing information such as process trees or genealogies using graph theory. More specifically, using graph theory to analyze manufacturing information of a manufacturing operation allows for deep analysis of relationships between batches or units in a process tree and their closeness or distance, to identify clusters associated with specific quality characteristics or problems, to identify common antecedents of specifically labeled batches (e.g., problem batches), and/or to detect overall desirable or undesirable characteristics of the process tree (e.g., centrality, etc.).
摘要:
A system, method, and computer-readable medium are disclosed performing an optimization operation. The optimization operation optimizes continuous processes by identifying process states associated with specific ranges for a limited subset of control parameter inputs. In certain embodiments, the optimization operation states comprise clear, stable, and robust process states. Such an optimization operation provides a simpler and cost effective means to optimize continuous processes. Additionally, such an optimization operation is applicable more rapidly to a wider range of real-world operational issues as they occur regularly in continuous process scenarios.
摘要:
A system, method, and computer-readable medium for performing a variable and interaction detection operation. In certain embodiments, the variable and interaction detection operation detects important variables and as well as interactions among certain important variables in big data. In certain embodiments, the variable and interaction detection operation includes a robust automated process flow (i.e., an analytic recipe) which identifies important variables and their interactions in large data sets.
摘要:
A system, method, and computer-readable medium for performing in-database operations, comprising: presenting an automation interface to a user, the user interface automation interface enabling a user to select one or more key performance indicators; instantiating an in-database processing operation, the in-database processing operation performing feature selection from a high dimensional parameter space; executing at least one database statement within the storage system to derive a subset of diagnostic parameters from the high dimensional parameter space.
摘要:
A system, method, and computer-readable medium for performing a distributed analytics operation. The distributed analytics operation uses interface technologies to de-couple an actual data storage technology from an implementation of distributed analytics. Such a distributed analytics operation obviates requirements to deploy specific computer code onto a data storage platform to specifically target that platform for distributed predictive analytics computations.
摘要:
A system, method, and computer-readable medium are disclosed for analysis and characterization of manufacturing information such as process trees or genealogies using graph theory. More specifically, using graph theory to analyze manufacturing information of a manufacturing operation allows for deep analysis of relationships between batches or units in a process tree and their closeness or distance, to identify clusters associated with specific quality characteristics or problems, to identify common antecedents of specifically labeled batches (e.g., problem batches), and/or to detect overall desirable or undesirable characteristics of the process tree (e.g., centrality, etc.).
摘要:
A system, method, and computer-readable medium for performing an auto-query construction operation for use with a distributed analytics operation. More specifically, in certain embodiments, the auto-query construction operation provides automatically generates SQL code instructions via an auto-query construction user interface (UI) settings in a computational system, such as the Dell Statistica computational system. The auto-query construction operation allows a user to interact with a common interface to provide query information including decision variables, parameters of an analysis and convergence criteria. The query information provided via the UI is automatically transformed to database queries and subsequent computation system operations. Thus, the user experience remains intact whether the analytics is performed in database or within the computation system.
摘要:
A system, method, and computer-readable medium for extracting the samples from big data to extract most information about the relationships of interest between dimensions and variables in the data repository. More specifically, extracting information from large data repositories follows an adaptive process that uses systematic sampling procedures derived from optimal experimental designs to target from a large data set specific observations with information value of interest for the analytic task under consideration. The application of adaptive optimal design to guide exploration of large data repositories provides advantages over known big data technologies.