Abstract:
Systems and methods are disclosed to consolidate workload on cloud-based computers by co-locating one or more high-penalty tenants on one or more cloud-based servers; reducing service level agreement (SLA) violations by over-provisioning the cloud-based server; and maximizing an SLA profit.
Abstract:
Systems and methods are disclosed to detect communities of a social network by receiving linked documents from the social network; generating one or more conditional link models and one or more discriminative content models from the linked documents; creating a discriminative model by combining the one or more conditional link models and the one or more discriminative content models; and applying the discriminative model to the social network.
Abstract:
Systems and methods are disclosed to analyze a social network by generating a data tensor from social networking data; applying a non-negative tensor factorization (NTF) with user prior knowledge and preferences to generate a core tensor and facet matrices; and rendering information to social networking users based on the core tensor and facet matrices.
Abstract:
Systems and methods are disclosed for extracting characteristics from a corpus of linked documents by deriving a content-link model that explicitly captures direct and indirect relations represented by the links, and extracting document topics and the topic distributions for all the documents in the corpus using the content-link model.
Abstract:
A ruthenium complex is provided. The ruthenium complex is represented by the following Formula (I): in which, X is a monodentate anion ligand, R1, R2, R4 and R5 are the same or different substituents and represent alkyl, alkoxy, aminoalkyl, haloalkanes or substituted phenyl group, carboxylic acid group or acid radical salt thereof, sulfonic acid group or acid radical salt thereof, phosphoric acid group or acid radical salt thereof or hydrogen atom. R3 represents perhalogenated alkyl group, alkoxy, alkyl, amino, halogens, or hydrogen atom. The ruthenium complexes are suitable for being used as dye-sensitizers for fabricating dye-sensitized solar cells.
Abstract:
To mine closed frequent itemsets over a sliding window using limited memory space, a synopsis data structure is used to monitor transactions in the sliding window so that the current closed frequent itemsets can be output at any time. Due to time and memory constraints, the synopsis data structure cannot monitor all possible itemsets, but monitoring only frequent itemsets makes it difficult to detect new itemsets when they become frequent. Herein, there is introduced a compact data structure, the closed enumeration tree (CET), to maintain a dynamically selected set of itemsets over a sliding window. The selected itemsets include a boundary between closed frequent itemsets and the rest of the itemsets. Because the boundary is relatively stable, the cost of mining closed frequent itemsets over a sliding window is dramatically reduced to that of mining transactions that can possibly cause boundary movements in the CET.
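To make the term "closed frequent itemset" concrete, the following is a minimal brute-force sketch in Python. It is not the CET: it recomputes everything from the current window, which is exactly the cost the CET is meant to avoid. The function name `closed_frequent_itemsets`, the `min_support` parameter, and the use of a bounded `deque` as the sliding window are assumptions made for illustration.

```python
from collections import deque
from itertools import combinations


def closed_frequent_itemsets(window, min_support):
    """Brute-force baseline (NOT the CET): return {itemset: support}.

    An itemset is frequent if at least `min_support` transactions in the
    window contain it, and closed if no proper superset has the same support.
    """
    items = sorted(set().union(*window))
    support = {}
    for size in range(1, len(items) + 1):
        for combo in combinations(items, size):
            candidate = frozenset(combo)
            count = sum(1 for transaction in window if candidate <= transaction)
            if count >= min_support:
                support[candidate] = count
    # A superset with equal support is itself frequent, so it is enough to
    # look for such supersets among the frequent itemsets.
    return {
        itemset: count
        for itemset, count in support.items()
        if not any(itemset < other and count == support[other] for other in support)
    }


# Hypothetical sliding window holding the 4 most recent transactions.
window = deque(maxlen=4)
for transaction in [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"a", "b", "c"}]:
    window.append(frozenset(transaction))
before = closed_frequent_itemsets(window, min_support=2)

# A new arrival evicts the oldest transaction and shifts the result.
window.append(frozenset({"b"}))
after = closed_frequent_itemsets(window, min_support=2)
```

The enumeration is exponential in the number of distinct items, so the sketch is only usable on toy inputs.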
Abstract:
The invention is directed generally to providing methods and systems for trend extraction and analysis. Embodiments include methods and systems for trend extraction and analysis of information extracted from dynamically changing data included in computer systems and/or networks. Various exemplary embodiments are provided that may generate characteristic indicators for trend(s) and/or distribution(s) for one or more data sources by use of, for example, temporal indicators derived through analysis of the difference in contribution of separate portions of the data to the whole data set being considered, the contribution of individual sources, and/or the interaction of the separate portions of the data with one another. Some exemplary approaches may include the use of singular value decomposition (SVD) and higher-order singular value decomposition (HOSVD) data extraction and analysis techniques. One use of these techniques is in the analysis of the dynamic data contained in Weblogs and the blogosphere.
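As a rough illustration of how an SVD can expose a dominant trend, the sketch below extracts the leading singular triplet of a small matrix by power iteration. It assumes a hypothetical layout in which rows are data sources (for example, keywords) and columns are time slots, so the right singular vector `v` can be read as the dominant temporal pattern and the left singular vector `u` as each source's contribution to it. The function name and the layout are illustrative assumptions, and HOSVD is not covered.

```python
import math


def leading_singular_triplet(matrix, iterations=200):
    """Approximate the largest singular value and its singular vectors.

    Uses power iteration on A^T A. Assumes a non-zero matrix whose leading
    right singular vector is not orthogonal to the uniform starting vector.
    """
    rows, cols = len(matrix), len(matrix[0])
    v = [1.0 / math.sqrt(cols)] * cols
    for _ in range(iterations):
        # u = A v
        u = [sum(matrix[i][j] * v[j] for j in range(cols)) for i in range(rows)]
        # w = A^T u, then renormalise to get the next estimate of v
        w = [sum(matrix[i][j] * u[i] for i in range(rows)) for j in range(cols)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    u = [sum(matrix[i][j] * v[j] for j in range(cols)) for i in range(rows)]
    sigma = math.sqrt(sum(x * x for x in u))
    u = [x / sigma for x in u]
    return sigma, u, v


# Hypothetical counts: 2 sources observed over 3 time slots.
counts = [
    [3.0, 4.0, 0.0],
    [6.0, 8.0, 0.0],
]
sigma, source_weights, temporal_trend = leading_singular_triplet(counts)
```

In practice a numerical library's SVD routine would replace the hand-written iteration; the pure-Python version is used here only to keep the sketch dependency-free.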
Abstract:
Load shedding schemes for mining data streams are disclosed. A scoring function is used to rank the importance of stream elements, and those elements with high importance are investigated. Because the exact feature values of a data stream are not known, the use of a Markov model is proposed herein for predicting the feature distribution of a data stream. Based on the predicted feature distribution, one can make classification decisions to maximize the expected benefits. In addition, there is proposed herein the employment of a quality of decision (QoD) metric to measure the level of uncertainty in decisions and to guide load shedding. A load shedding scheme such as presented herein assigns available resources to multiple data streams to maximize the quality of classification decisions. Furthermore, such a load shedding scheme is able to learn and adapt to changing data characteristics in the data streams.
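The idea of predicting a feature distribution and spending limited resources on the least certain decisions can be sketched as follows. The abstract does not define the QoD metric, so this sketch substitutes a placeholder: the margin between the two best expected benefits. That margin, the discrete feature states, and all function names are assumptions for illustration, not the scheme from the source.

```python
def predict_distribution(current, transition):
    """One-step Markov prediction: next[j] = sum_i current[i] * transition[i][j]."""
    states = len(transition[0])
    return [
        sum(current[i] * transition[i][j] for i in range(len(current)))
        for j in range(states)
    ]


def expected_benefits(distribution, benefit):
    """benefit[d][s] is the benefit of decision d when the true state is s."""
    return [sum(p * b for p, b in zip(distribution, row)) for row in benefit]


def decision_margin(distribution, benefit):
    """Placeholder certainty score (NOT the source's QoD): gap between the
    best and second-best expected benefit. A small gap means an uncertain decision."""
    ranked = sorted(expected_benefits(distribution, benefit), reverse=True)
    return ranked[0] - ranked[1]


def streams_to_observe(predicted, benefit, budget):
    """Spend the observation budget on the streams with the smallest margin."""
    ranked = sorted(predicted, key=lambda name: decision_margin(predicted[name], benefit))
    return ranked[:budget]


# Hypothetical two-state feature with a sticky transition matrix.
transition = [[0.9, 0.1], [0.2, 0.8]]
benefit = [[1.0, 0.0], [0.0, 1.0]]  # a decision pays off only if it matches the state
predicted = {
    "stream_a": predict_distribution([1.0, 0.0], transition),
    "stream_b": predict_distribution([0.5, 0.5], transition),
}
chosen = streams_to_observe(predicted, benefit, budget=1)
```

Streams that are not chosen are classified from their predicted distribution alone, which is where the load is shed.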
Abstract:
An efficient cost-based scheduling method called incremental cost-based scheduling (iCBS) maps each job, based on its arrival time and SLA function, to a fixed point in the dual space of linear functions. Because of this mapping, jobs do not change their locations in the dual space over time. Instead, when selecting the next, highest-priority job to execute, a line with an appropriate angle in the query space is used to locate the job that currently has the highest CBS score in logarithmic time. Because only those points that are located on the convex hull in the dual space can be chosen, a dynamic convex hull maintenance method incrementally maintains the job with the highest CBS score over time.
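The geometric fact behind this approach is that a linear score over a fixed set of points is always maximized at a vertex of their convex hull. The sketch below illustrates only that fact. It assumes each job has already been mapped to a hypothetical 2D point and that the score at query time is the projection onto a direction given by an angle `theta`. The actual mapping from arrival time and SLA function is not given in the abstract and is not reproduced here. The hull is built statically with Andrew's monotone chain and scanned linearly, not maintained dynamically or searched in logarithmic time.

```python
import math


def cross(o, a, b):
    """Z-component of (a - o) x (b - o); positive for a counter-clockwise turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points):
    """Andrew's monotone chain; returns hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def score(point, theta):
    """Assumed linear score: projection of the point onto the direction theta."""
    return point[0] * math.cos(theta) + point[1] * math.sin(theta)


def best_job(points, theta):
    """Linear scan for the highest-scoring point (a stand-in for the log-time search)."""
    return max(points, key=lambda p: score(p, theta))


# Hypothetical dual-space points, one per pending job.
jobs = [(0, 0), (4, 0), (0, 4), (4, 4), (2, 2), (1, 3)]
hull = convex_hull(jobs)
```

Because interior points such as `(2, 2)` can never win for any angle, only the hull needs to be examined at each scheduling decision.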
Abstract:
An admission control system for a cloud database includes a machine learning prediction module to estimate a predicted probability for a newly arrived query with a deadline, if admitted into the cloud database, to finish its execution before said deadline, wherein the prediction considers query characteristics and current system conditions. The system also includes a decision module applying the predicted probability to admit a query into the cloud database with a target of profit maximization with an expected profit determined using one or more service level agreements (SLAs).
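A minimal sketch of the decision step, assuming a simple step-function SLA in which a query earns a fixed revenue if it meets its deadline and incurs a fixed penalty otherwise. The predicted probability is taken as given (the machine learning prediction module is not modeled), and the names `expected_profit` and `admit`, as well as the zero-profit threshold, are illustrative assumptions. The sketch also ignores any effect the admitted query has on queries already in the system.

```python
def expected_profit(p_on_time, revenue, penalty):
    """Expected SLA profit of admitting one query.

    p_on_time: predicted probability of finishing before the deadline.
    revenue:   amount earned if the deadline is met (assumed step-function SLA).
    penalty:   amount paid if the deadline is missed.
    """
    if not 0.0 <= p_on_time <= 1.0:
        raise ValueError("p_on_time must be a probability in [0, 1]")
    return p_on_time * revenue - (1.0 - p_on_time) * penalty


def admit(p_on_time, revenue, penalty, threshold=0.0):
    """Admit the query only if its expected profit exceeds the threshold."""
    return expected_profit(p_on_time, revenue, penalty) > threshold


# Hypothetical SLA: earn 10 on time, pay 50 when late.
likely_on_time = admit(0.9, revenue=10.0, penalty=50.0)
coin_flip = admit(0.5, revenue=10.0, penalty=50.0)
```

With these numbers the break-even probability is 50 / 60, so a query needs roughly an 83% chance of meeting its deadline before admitting it is worthwhile.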